Unidecode 0.04.1
US-ASCII transliterations of Unicode text
Latest Version: 0.04.9
ASCII transliterations of Unicode text
Example Use
from unidecode import unidecode print unidecode(u"\u5317\u4EB0") # That prints: Bei Jing
Description
It often happens that you have non-Roman text data in Unicode, but you can't display it -- usually because you're trying to show it to a user via an application that doesn't support Unicode, or because the fonts you need aren't accessible. You could represent the Unicode characters as "???????" or " BA A0q0...", but that's nearly useless to the user who actually wants to read what the text says.
What Unidecode provides is a function, 'unidecode(...)' that takes Unicode data and tries to represent it in ASCII characters (i.e., the universally displayable characters between 0x00 and 0x7F). The representation is almost always an attempt at transliteration -- i.e., conveying, in Roman letters, the pronunciation expressed by the text in some other writing system. (See the example above)
This is a Python port of Text::Unidecode Perl module by Sean M. Burke <sburke@cpan.org>.
| File | Type | Py Version | Uploaded on | Size | # downloads |
|---|---|---|---|---|---|
| Unidecode-0.04.1-py2.6.egg (md5) | Python Egg | 2.6 | 2009-06-17 | 387KB | 23378 |
| Unidecode-0.04.1.tar.gz (md5) | Source | 2009-06-17 | 163KB | 11559 | |
| Unidecode-0.04.5.tar.gz (md5) | Source | 2010-10-15 | 182KB | 5061 | |
- Author: Tomaz Solc
- Home Page: http://code.zemanta.com/tsolc/unidecode/
- Provides unidecode
- Package Index Owner: bbangert, avian
- DOAP record: Unidecode-0.04.1.xml
