easily bult LDA Topic Models with just a list of docs (e.g. a list of twitter posts in CSV/TXT
Project description
easyLDA
--------
|PyPI version|
easyLDA is a library that easily build LDA Topic Models with just a list of docs (e.g. a list of twitter posts in CSV/ TXT)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
github: https://github.com/shichaoji/easyLDA
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- If you have a collection of documents, and what to explore the
relationship & topics of the docs, easyLDA is a very handy library to
use. Simply run the commend and you'll get a trained LDA model with
results visualized
The library pipeline text preprocessing, such as tf-idf, n-grams from Gensim library
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Credit to:
https://radimrehurek.com/gensim/
http://pyldavis.readthedocs.io/en/latest/readme.html
.. |PyPI version| image:: https://badge.fury.io/py/easyLDA.svg
:target: https://badge.fury.io/py/easyLDA
installation
~~~~~~~~~~~~
``$ pip install easyLDA``
usage example
~~~~~~~~~~~~~
simple need a text file (.csv/ .txt) with each row represents a document (a post, comment, short article etc.), with only one column which is the text
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
text file (csv) sample view
^^^^^^^^^^^^^^^^^^^^^^^^^^^
.. image:: https://user-images.githubusercontent.com/20619704/35779561-dba715a0-099c-11e8-8519-09d6164e63ae.jpg
:width: 60%
:alt: Demo 1
:align: left
easy to use, just in a shell window, type: easyLDA, then specify the location of the text document
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1. then choose how many topics you want the model to fit
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2. choose the topic contains only single word (1) or can be phases (2/3) as well
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
the program will be starting to train
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- in shell $ easyLDA
.. image:: https://user-images.githubusercontent.com/20619704/35779521-49237200-099c-11e8-8cb2-ed916040a526.jpg
:width: 70%
:alt: Demo 2
:align: left
model result
~~~~~~~~~~~~
models folder created by program contains the trained model
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xx.html file is the interactive visulization of the model result
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
.. image:: https://user-images.githubusercontent.com/20619704/35779593-cfe800c0-099d-11e8-8db5-d3431f155496.jpg
:width: 60%
:alt: Demo 3
:align: left
visualization live example
~~~~~~~~~~~~~~~~~~~~~~~~~~
http://shichaoji.com/2016/02/04/easylda-live-example/
.. image:: https://user-images.githubusercontent.com/20619704/36183178-386dfde8-10fb-11e8-88f0-9eb51b94bc27.jpg
:width: 60%
:alt: Demo 4
:align: left
--------
|PyPI version|
easyLDA is a library that easily build LDA Topic Models with just a list of docs (e.g. a list of twitter posts in CSV/ TXT)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
github: https://github.com/shichaoji/easyLDA
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- If you have a collection of documents, and what to explore the
relationship & topics of the docs, easyLDA is a very handy library to
use. Simply run the commend and you'll get a trained LDA model with
results visualized
The library pipeline text preprocessing, such as tf-idf, n-grams from Gensim library
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Credit to:
https://radimrehurek.com/gensim/
http://pyldavis.readthedocs.io/en/latest/readme.html
.. |PyPI version| image:: https://badge.fury.io/py/easyLDA.svg
:target: https://badge.fury.io/py/easyLDA
installation
~~~~~~~~~~~~
``$ pip install easyLDA``
usage example
~~~~~~~~~~~~~
simple need a text file (.csv/ .txt) with each row represents a document (a post, comment, short article etc.), with only one column which is the text
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
text file (csv) sample view
^^^^^^^^^^^^^^^^^^^^^^^^^^^
.. image:: https://user-images.githubusercontent.com/20619704/35779561-dba715a0-099c-11e8-8519-09d6164e63ae.jpg
:width: 60%
:alt: Demo 1
:align: left
easy to use, just in a shell window, type: easyLDA, then specify the location of the text document
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1. then choose how many topics you want the model to fit
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2. choose the topic contains only single word (1) or can be phases (2/3) as well
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
the program will be starting to train
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- in shell $ easyLDA
.. image:: https://user-images.githubusercontent.com/20619704/35779521-49237200-099c-11e8-8cb2-ed916040a526.jpg
:width: 70%
:alt: Demo 2
:align: left
model result
~~~~~~~~~~~~
models folder created by program contains the trained model
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xx.html file is the interactive visulization of the model result
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
.. image:: https://user-images.githubusercontent.com/20619704/35779593-cfe800c0-099d-11e8-8db5-d3431f155496.jpg
:width: 60%
:alt: Demo 3
:align: left
visualization live example
~~~~~~~~~~~~~~~~~~~~~~~~~~
http://shichaoji.com/2016/02/04/easylda-live-example/
.. image:: https://user-images.githubusercontent.com/20619704/36183178-386dfde8-10fb-11e8-88f0-9eb51b94bc27.jpg
:width: 60%
:alt: Demo 4
:align: left
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
easyLDA-0.2.8.4.tar.gz
(6.5 kB
view hashes)
Built Distributions
easyLDA-0.2.8.4-py2.7.egg
(13.6 kB
view hashes)
Close
Hashes for easyLDA-0.2.8.4-py2-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5bb54ec22e2d982079024197e8354d33abc559d00e47ac2ffa0a385d84310579 |
|
MD5 | c411bf814e5a393980a49f50515bf789 |
|
BLAKE2b-256 | 76ae7cb7bd4a505ef03e02a6d0543a03bb92db4cf370dadba973157052fceccb |