Sentiment analysis library for russian language

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Dostoevsky

Sentiment analysis library for russian language

Install

Please note that Dostoevsky supports only Python 3.6+

$ pip install dostoevsky

Social networks comment model

This model was trained on RuSentiment dataset and achieves up to ~0.70 F1 score

Usage

First of all, you'll need to download pretrained word embeddings and model:

$ dostoevsky download vk-embeddings cnn-social-network-model

Then, we can build our pipeline: text -> tokenizer -> word embeddings -> CNN

from dostoevsky.tokenization import UDBaselineTokenizer, RegexTokenizer
from dostoevsky.embeddings import SocialNetworkEmbeddings
from dostoevsky.models import SocialNetworkModel

tokenizer = UDBaselineTokenizer() or RegexTokenizer()
tokens = tokenizer.split('всё очень плохо')  # [('всё', 'ADJ'), ('очень', 'ADV'), ('плохо', 'ADV')]

embeddings_container = SocialNetworkEmbeddings()

vectors = embeddings_container.get_word_vectors(tokens)
vectors.shape  # (3, 300) - three words/vectors with dim=300

model = SocialNetworkModel(
  tokenizer=tokenizer,
  embeddings_container=embeddings_container,
  lemmatize=False,
)

messages = [
    'наступили на ногу',
    'всё суперски',
]

results = model.predict(messages)

for message, sentiment in zip(messages, results):
    print(message, '->', sentiment)  # наступили на ногу -> negative

License

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.6.0

Jan 11, 2021

0.5.0

Feb 28, 2020

0.4.0

Dec 9, 2019

0.3.0

Sep 5, 2019

This version

0.2.1

Aug 12, 2019

0.2.0

Jul 18, 2019

0.1.2

Dec 5, 2018

0.1.1

Dec 4, 2018

0.1.0

Dec 4, 2018

0.0.1

May 9, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dostoevsky-0.2.1.tar.gz (8.0 kB view hashes)

Uploaded Aug 12, 2019 Source

Built Distribution

dostoevsky-0.2.1-py2.py3-none-any.whl (12.5 kB view hashes)

Uploaded Aug 12, 2019 Python 2 Python 3

Hashes for dostoevsky-0.2.1.tar.gz

Hashes for dostoevsky-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`03a1a2ad9bf6363733ae1a1a752e1360a8bce410cab49480d6f87992f9a289db`
MD5	`a5d5a5836eee62d6a4bce3b990d6e559`
BLAKE2b-256	`7f74d1219651dcca278c0a5a824229c40451d434fb8b2fe31dcacdd0d2570c93`

Hashes for dostoevsky-0.2.1-py2.py3-none-any.whl

Hashes for dostoevsky-0.2.1-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`0417e7fa552b67a5dc988ef182e23eda446918fe333cd653bf39a6d6af24f6e5`
MD5	`90c603b9e41dbea9a5becc44bc11c64b`
BLAKE2b-256	`c81aaf50d8aa216bbefd5b3b617de7f336f3bcd3fbaa86f21836c6bdb07b64f9`