Skip to main content

Python port of Igo Japanese morphological analyzer

Project description

About

Igo is a Japanese morphological analyzer written in Java and Common Lisp. This software is Python port of Igo(Java version).

Notice

Dictionary builder is not provided. You need to use Igo Java version to build the dictionary for Igo. From igo-python 0.9.7, pre-built IPA dictionary (2.7.0-20070801) is included for ease of use.

How To Use

You can use Igo Python easily:

>>> from igo.Tagger import Tagger
>>> t = Tagger() # use bundled dictionary
>>> for m in t.parse(u'すもももももももものうち'):
...     print m.surface, m.feature
...
すもも 名詞,一般,*,*,*,*,すもも,スモモ,スモモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
の 助詞,連体化,*,*,*,*,の,ノ,ノ
うち 名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ
>>>

Using command line parser:

$ igo # or python -m igo.parse
私の名前は中野です
私      名詞,代名詞,一般,*,*,*,私,ワタシ,ワタシ
の      助詞,連体化,*,*,*,*,の,ノ,ノ
名前    名詞,一般,*,*,*,*,名前,ナマエ,ナマエ
は      助詞,係助詞,*,*,*,*,は,ハ,ワ
中野    名詞,固有名詞,地域,一般,*,*,中野,ナカノ,ナカノ
です    助動詞,*,*,*,特殊・デス,基本形,です,デス,デス
EOS

Changelog for Igo-Python

2018-09-22 – 1.0.0
  • dropped Python 3.3 from supported Python version.

2017-02-09 – 0.9.9
  • fixed few small issues and add unit test.

2016-07-08 – 0.9.8
  • fixed a surrogate pair character handling issue

2015-12-16 – 0.9.7.1
  • Tagger can be pickled on Python2 env

2015-12-14 – 0.9.7
  • include pre-built IPA dictionary

2015-07-03 – 0.9.6
  • fixed an issue of dictionary size in mmap mode

2015-04-14 – 0.9.5
  • use mmap to load files of a dictionary if it is possible

2013-07-05 – 0.9.3
  • support characters in Supplementary Planes

2012-02-27 – 0.9.2
  • apply changes in Igo(java) 0.4.4

2012-02-27 – 0.9.1
  • synchronize with Igo(java) 0.4.5

2011-09-01 – 0.9
  • compatibility with python3.2

  • update the code base to igo-0.4.3

  • add some enhances and clean up.

2011-08-29 – 0.4
  • add some enhances and clean up.

2010-11-28 – 0.3a
  • fix bugs(if a phrase ends with ‘た’ will causes error)

2010-11-27 – 0.3
  • drop mmap related code

  • reduce memory footprint

2010-11-27 – 0.2
  • support Google App Engine(maybe)

2010-11-27 – 0.1a
  • update package info only.

2010-11-25 – 0.1
  • first release.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

igo-python-1.0.0.tar.gz (8.8 MB view hashes)

Uploaded Source

Built Distribution

igo_python-1.0.0-py2.py3-none-any.whl (8.9 MB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page