Designed from the ground up to distribute analytical NLP tasks, it takes maximal advantage from available hardware.
You can create language-sensitive analyses for a wide range of supported languages, focusing on corner cases for each of them.
With an integrated full-text search engine (with a rich query language) you can search through all your corpora and documents.
Explore documents and corpora visualy on your browser, with snappy dynamic visualizations which can be exported easily.
PyPLN is a platform for processing and extracting useful information from text. It was conceived to run in the cloud, scale quickly and be easy to use. It integrates many text mining and natural language processing tools, which can be acessed via an easy-to-use Web interface, where you can manage documents, corpora and interact with its analysis/visualizations.
As its main feature, you can visualize analysis like part-of-speech tags, word frequency statistics and other useful information. It also offers a full-text search on your corpora so you can easily find information and then visualize its analysis.
PyPLN is developed by a research group called Núcleo de Análise e Modelagem de Dados (aka NAMD) located on Applied Mathematics School at Fundação Getúlio Vargas (in Rio de Janeiro, Brazil).
For more information, please check the project documentation.
Do you want to try PyPLN without needing to install it? Register a username and start using it free (as in free beer) right now:
Note that this is just a demo installation and sometimes we need to migrate or delete data for testing new features (the project is in active development). So, do not rely on this demo to store your documents and create analysis on top of it (we advise you to install a instance in your own infrastructure).
One of the most valuable characteristics in free/libre software projects is the collaboration that comes from the community. Join our discussions, suggest new features and stay in touch through:
PyPLN is free (as in free speech) software. You can download, submit bug reports and contribute through GitHub. Feel free to fork our repositories and submit pull requests:
If you have skills on programming, linguistics or design and want to help this project, please see our contributing guidelines.
Our work is sponsored by
Fundação Getúlio Vargas (a brazilian
think-tank university) and its
Applied Mathematics School.
We have a multidisciplinary team focused on creating the best free/libre platform for natural language processing: there are engineers, computer programmers, mathematicians and linguists working together in this project.
If you are using PyPLN in your institution please, tell us more about your experience! Join our mail list, share your story and give us feedback so we can keep improving.