A collection of Korean Text Datasets ready to use using Tensorflow-Datasets.
Project description
tfds-korean
A collection of Korean Text Datasets ready to use using Tensorflow-Datasets.
Usage
Installation
pip install tfds-korean
Loading dataset
import tensorflow_datasets as tfds
import tfds_korean.nsmc # register nsmc dataset
ds = tfds.load('nsmc')
train_ds = ds['train'].batch(32)
test_ds = ds['test'].batch(128)
# define model
# ....
# ....
model.fit(train_ds)
model.evaluate(test_ds)
Licenses
The license for this repository and licenses for datasets are applied separately. It is recommended to use each dataset after checking the dataset's website.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
tfds-korean-0.0.1a3.tar.gz
(17.7 kB
view hashes)
Built Distribution
Close
Hashes for tfds_korean-0.0.1a3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b76ac52ff4271256edd1930e814a8fd8f479374054e4a061002c9f1920b97e93 |
|
MD5 | f5441072f501bd671f6ba7d496574221 |
|
BLAKE2b-256 | b92571c3d0ff919546409def30dfa4af7152b8aa1d1156a16576d4cf19c63dde |