Skip to main content

Pure Python module to hyphenate text

Project description

Pyphen is a pure Python module to hyphenate text using existing Hunspell hyphenation dictionaries.

https://github.com/Kozea/Pyphen

This module is a fork of python-hyphenator, written by Wilbert Berendsen.

https://code.google.com/p/python-hyphenator/

Many dictionaries are included in pyphen, they come from the LibreOffice git repository and are distributed under various licenses, including the LGPL version 2.1 also used for pyphen.

See: http://cgit.freedesktop.org/libreoffice/dictionaries/tree/

Usage:

>>> import pyphen
>>> 'nl_NL' in pyphen.LANGUAGES
True
>>> dic = pyphen.Pyphen(lang='nl_NL')
>>> dic.inserted('lettergrepen')
'let-ter-gre-pen'
>>> dic.wrap('autobandventieldopje', 11)
('autoband-', 'ventieldopje')
>>> for pair in dic.iterate('Amsterdam'):
...     print(pair)
...
('Amster', 'dam')
('Am', 'sterdam')
>>>

Features:

  • 100% pure Python with no dependencies

  • a lot of included dictionaries

  • caches dict files and hyphenated words

  • supports nonstandard hyphenation patterns

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Pyphen-0.6.tar.gz (878.4 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page