Skip to main content

A Python parser for the CC-CEDICT Chinese-English dictionary

Project description

A Python parser for the CC-CEDICT Chinese-English dictionary

Travis Pypi


A parser for the CC-CEDICT Chinese-English dictionary.

Install

PIP

https://pypi.org/project/cedict-utils/

pip install cedict-utils

REPO

  1. Create virtualenv.
python3 -m venv ~/.virtualenvs/cedict-utils
  1. Activate venv.
source ~/.virtualenvs/cedict-utils/bin/activate
  1. Install requirements.
pip install -r requirements.txt

Dictionary

Download dictionary

wget -O - https://www.mdbg.net/chinese/export/cedict/cedict_1_0_ts_utf-8_mdbg.txt.gz | gunzip > data/cedict_ts.u8

Usage

$ python
>>> from cedict_utils.cedict import CedictParser
>>> parser = CedictParser()
>>> parser.read_file()
>>> entries = parser.parse()
>>> for e in entries:
...     print(e)
... 
..
龟缩 (龜縮) - gui1 suo1
龟背竹 (龜背竹) - gui1 bei4 zhu2
龟船 (龜船) - gui1 chuan2
..
>>> entries[200].simplified
'敦'
>>> entries[200].traditional
'㪟'
>>> entries[200].pinyin
'dun1'
>>> entries[200].raw_line
'㪟 敦 [dun1] /variant of 敦[dun1]/'
>>> entries[200].meanings
['variant of 敦[dun1]']
>>> 

Tests

Running tests

$ make test
pytest
================================================================ test session starts ================================================================
platform linux -- Python 3.6.8, pytest-3.2.3, py-1.4.34, pluggy-0.4.0
rootdir: /Development/cedict-utils, inifile:
collected 8 items                                                                                                                                    

tests/test_cedict.py ........

============================================================= 8 passed in 0.09 seconds ==============================================================

Resources

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page