Kadot, unsupervised natural language processing.
Project description
Unsupervised natural language processing library.
Kadot just lets you process a text easily.
>>> hello_world = Text("Kadot just lets you process a text easily.") >>> hello_world.ngrams(n=2)
[(‘Kadot’, ‘just’), (‘just’, ‘lets’), (‘lets’, ‘you’), (‘you’, ‘process’), (‘process’, ‘a’), (‘a’, ‘text’), (‘text’, ‘easily’)]
🔋 What’s included ?
Kadot includes tokenizers, text generators, classifiers, word-level and document-level vectorizers as well as a spell checker, a fuzzy string matching utility or a stopwords detector.
The philosophy of Kadot is “never hardcode the language rules” : use unsupervised solutions to support most languages. So it will never includes Treebank based algorithms (like a POS Tagger).
🔨 Installation
Use the “pip” command that refair to the Python 3.x interpreter. In my case :
$ pip3 install kadot
It actually require the Python’s standard library, Numpy, Scipy and Scikit-Learn.
⚖️ License
Kadot is under MIT license.
More
Check the GitHub repository : https://github.com/the-new-sky/Kadot