Python API for the Turkish Language Foundation
Project description
Python API for the Turkish Language Foundation
tdk-py is a Python package that allows for simple access to Turkish dictionaries made available by the TDK, the Turkish Language Society. tdk-py aims to be easy to use and internally queries the TDK and parses its response into easy to use Python class objects.
Sample usage
tdk.gts is used to access TDK’s GTS, the up-to-date Turkish dictionary (Güncel Türkçe Sözlük).
>>> import tdk.gts >>> list(tdk.gts.search("merkeziyetçilik")) [<Entry 41635 (merkeziyetçilik)>]
tdk-py’s querying functions return iterators. The query is made and its response is held in memory. Then, its elements are parsed one-by-one when requested. This makes sure that you don’t wait for a long time to parse all search results at the start of a loop.
>>> for number, entry in enumerate(tdk.gts.search("bar")): # "bar" is searched, all responses are held in memory. ... # They are parsed one by one. ... for meaning in entry.meanings: ... print(number+1, entry.entry, meaning.meaning) ... 1 bar Anadolu'nun doğu ve kuzey bölgesinde, en çok Artvin ve Erzurum yörelerinde el ele tutuşularak oynanan, ağır ritimli bir halk oyunu 2 bar Danslı, içkili eğlence yeri 2 bar Ayaküstü içki içilen eğlence yeri 2 bar Amerikan bar 3 bar Hava basıncı birimi 4 bar Ateşten, mide bozukluğundan, ağızda, dil ve dişlerde meydana gelen acılık, pas 5 bar Halter sporunda ağırlığı oluşturan kiloları birbirine bağlayan metal çubuk
You can query suggestions for misspelt words or for other similar words.
>>> tdk.gts.get_suggestions("feldispat") ['feldspat', 'felekiyat', 'ispat'] >>> tdk.gts.get_suggestions("feldspat") ['espas', 'felah', 'felaket', 'felekiyat', 'fellah', 'felsefe', 'felsefi']
You can perform complex analyses very easily. Let’s see the distribution of entries by the number of maximum consecutive consonants.
>>> from tdk.tools import max_streak >>> from tdk.alphabet import CONSONANTS >>> annotated_list = [max_streak(word=x, targets=CONSONANTS) for x in tdk.gts.get_index()] >>> for i in set(annotated_list): ... print(i, annotated_list.count(i)) 0 19 1 15199 2 73511 3 3605 4 68 5 5
License
tdk-py’s source code is provided under the MIT License.
Copyright © 2021 Emre Özcan
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for tdk_py-0.1.0.post1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8d8c64274c0348cbe9908034ea4b697cbbbc9239ebbe6db0bde041c4aba83baf |
|
MD5 | 41af28f9226f750162c3c03ad9254119 |
|
BLAKE2b-256 | 00b9fa389763c023b32eb02265874f830bae6928b3d6f7e17f48bf980c0f0d90 |