pymystem3 0.1.5

Python wrapper for the Yandex MyStem 3

Introduction

This module contains a wrapper for Yandex Mystem 3.0, an excellent morphological analyzer for the Russian language released in June 2014. A morphological analyzer can lemmatize text and derive a set of morphological attributes for each token. For more details about the algorithm, see I. Segalovich, «A fast morphological algorithm with unknown word guessing induced by a dictionary for a web search engine», MLMTA-2003, Las Vegas, Nevada, USA.

Python is the language of choice for many computational linguists, including those working with the Russian language. The main motivation for this development was the absence of any Python wrapper for Mystem, one of the most popular morphological analyzers for Russian, along with PyMorphy2, TreeTagger and AOT.

The third version of Mystem introduces several important improvements, most importantly part-of-speech (POS) disambiguation. Our wrapper runs Mystem in the mode that performs POS disambiguation.

This wrapper is open source under the MIT license. However, please note that Yandex Mystem itself is not open source and is licensed under the conditions of the Yandex License.

System Requirements

The wrapper works with CPython 2.6+/3.3+ and PyPy 1.9+.

The wrapper was tested on Ubuntu Linux 12.04+, Mac OS X 10.9+ and Windows 7+.

Installation

  1. Stable version: https://pypi.python.org/pypi/pymystem3. You can install it using pip:

    pip install pymystem3
    
  2. Latest version: https://github.com/Digsolab/pymystem3

A Quick Example

>>> from pymystem3 import Mystem
>>> text = "Красивая мама красиво мыла раму"
>>> m = Mystem()
>>> lemmas = m.lemmatize(text)
>>> print(''.join(lemmas))
красивый мама красиво мыть рама
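
The example above joins the lemmas with an empty string because, as far as we understand the wrapper, the list returned by lemmatize() also contains the whitespace between words. The morphological attributes mentioned in the introduction are available through the analyze() method. The sketch below shows one way to post-process both results. The helper names content_lemmas, summarize and demo are hypothetical, and the dictionary keys ("text", "analysis", "lex", "gr") reflect our assumption about the shape of the analyze() output, so check them against the version you have installed.

```python
def content_lemmas(lemmas):
    """Drop whitespace-only tokens from a list of lemma strings."""
    return [lemma for lemma in lemmas if lemma.strip()]


def summarize(analysis):
    """Return (token, lemma, grammar) tuples for tokens that carry an analysis.

    Assumes each item is a dict with a "text" key and, for analysed words,
    an "analysis" list whose entries hold "lex" (lemma) and "gr" (grammar).
    """
    rows = []
    for item in analysis:
        variants = item.get("analysis") or []
        if not variants:
            continue  # whitespace, punctuation, or a token with no analysis
        first = variants[0]  # take the first listed variant
        rows.append((item.get("text"), first.get("lex"), first.get("gr")))
    return rows


def demo():
    """Run both helpers on real output; needs pymystem3 and the Mystem binary."""
    from pymystem3 import Mystem  # imported here so the helpers work without it

    m = Mystem()
    text = "Красивая мама красиво мыла раму"
    print(content_lemmas(m.lemmatize(text)))
    for row in summarize(m.analyze(text)):
        print(row)
```

Calling demo() prints the lemmas without the separating whitespace, followed by one tuple per analysed word. The two helpers are plain functions over lists and dictionaries, so they can be reused or tested without starting Mystem.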

Issues

The current version should be considered an alpha version, so please let us know if something does not work as expected. Please report any bugs or requests using the GitHub issue tracker (https://github.com/Digsolab/pymystem3/issues).

Authors

 
File                             Type        Py Version  Uploaded on  Size
pymystem3-0.1.5-py2.7.egg (md5)  Python Egg  2.7         2015-04-20   16KB
pymystem3-0.1.5.tar.gz (md5)     Source                  2015-04-20   18KB