A library for loading, manipulating, and saving corpora in IOB2 format.

Project description

English description is under construction.

Description (translated from Japanese)

import iob2
# Load an iob2 corpus (from a file) [iob2]
corpus = iob2.load("./test_corpus.iob2")
# Split the corpus into sentences
div_ls = [".", "?", "!"]  # list of sentence-delimiter characters
# Write an iob2 corpus (to a file) [iob2]
iob2.dump(sent_corpus, "./sent_corpus.iob2")
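
The snippet above defines div_ls and later writes sent_corpus, but the call that produces sent_corpus is not shown. As a rough illustration of what sentence splitting on delimiter tokens could look like, here is a minimal pure-Python sketch. It does not use the iob2 package. The representation of a corpus as a flat list of (token, tag) pairs and the helper name split_sentences are assumptions made for this example, not the library's documented API.

```python
from typing import List, Sequence, Tuple

# Assumed representation (not taken from the iob2 package):
# each token is a (surface form, IOB2 tag) pair.
Token = Tuple[str, str]


def split_sentences(corpus: Sequence[Token],
                    delimiters: Sequence[str]) -> List[List[Token]]:
    """Hypothetical helper: close a sentence after each delimiter token."""
    delimiter_set = set(delimiters)
    sentences: List[List[Token]] = []
    current: List[Token] = []
    for token, tag in corpus:
        current.append((token, tag))
        if token in delimiter_set:
            sentences.append(current)
            current = []
    if current:
        # Keep trailing tokens that are not followed by a delimiter.
        sentences.append(current)
    return sentences


corpus = [
    ("Alice", "B-PER"), ("lives", "O"), ("in", "O"),
    ("New", "B-LOC"), ("York", "I-LOC"), (".", "O"),
    ("Really", "O"), ("?", "O"),
    ("Yes", "O"),
]
div_ls = [".", "?", "!"]  # list of sentence-delimiter characters
sent_corpus = split_sentences(corpus, div_ls)

for sentence in sent_corpus:
    print(" ".join(f"{tok}/{tag}" for tok, tag in sentence))
```

The sketch keeps the delimiter as the last token of its sentence and leaves tags untouched, so it does not repair a sequence in which a delimiter falls inside a tagged entity.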

A detailed description is being written.

License note

This software is released under the Creative Commons CC0 license, but internally it uses seqeval, a PyPI tool distributed under the MIT license. The seqeval license notice follows.

@misc{seqeval,
  title={{seqeval}: A Python framework for sequence labeling evaluation},
  url={https://github.com/chakki-works/seqeval},
  note={Software available from https://github.com/chakki-works/seqeval},
  author={Hiroki Nakayama},
  year={2018},
}

Download files

Source Distribution

iob2-1.1.1.tar.gz (4.5 kB)

Uploaded Source

Built Distribution

iob2-1.1.1-py3-none-any.whl (5.1 kB)

Uploaded Python 3