nlp2 · PyPI

Tool for NLP - handle file and text

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language
Topic
- Software Development :: Build Tools

Project description

🔨 nlp2 🔧
========

Tools for NLP using Python

This repertory used to handle file io and string cleaning/parsing

Usage
-----

Install:

::

pip install nlp2

Before using :

::

from nlp2 import *

Features
========

File Handling
~~~~~~~~~~~~~

get\_folders\_form\_dir(path)
-----------------------------

Arguments - ``path(String)`` : getting all folders under this path
(string) Returns - ``path(String)(generator)`` : path of folders under
arguments path ## get\_files\_from\_dir(path) Arguments -
``path(String)`` : getting all files under this path (string) Returns -
``path(String)(generator)`` : path of files under arguments path ##
read\_dir\_files\_into\_lines(path) Arguments - ``path(String)`` :
getting all files line by lines under this path (string) Returns -
``line(String)(generator)`` : files line under arguments path ##
read\_files\_into\_lines(path) Arguments - ``path(String)`` : getting
content in input file path (string) Returns -
``path(String)(generator)`` : file line under arguments path

String cleaning/parsing
~~~~~~~~~~~~~~~~~~~~~~~

lines\_into\_sentence(lines)
----------------------------

Arguments - ``lines(Array(String))`` : lines array Returns -
``path(String)(generator)`` : split all line base on punctuations ##
split\_sentence\_to\_ngram(text) Arguments - ``path(String)`` : sentence
to ngram

Returns - ``ngrams(Array)`` : ngrams array

Examples

::

split_sentence_to_ngram("加州旅館")
return ['加','加州',"加州旅","加州旅館","州","州旅","州旅館","旅","旅館","館"]

split\_sentence\_to\_ngram\_inpart(text)
----------------------------------------

| Arguments - ``path(String)`` : sentence to ngram Returns -
``path(String)(generator)`` : multiple ngrams array in different start
character
| Examples

::

split_sentence_to_ngram("加州旅館")
return [['加','加州',"加州旅","加州旅館"],["州","州旅","州旅館"],["旅","旅館"],["館"]]

spilt\_text\_to\_combine\_ways(text)
------------------------------------

Arguments - ``text(String)`` : input text Returns -
``path(String)(generator)`` : all of the text combines ways Examples

::

spilt_text_to_combine_ways("加州旅館")
return ['加州旅館', '加州旅館', '加州旅館', '加州旅館', '加州旅館', '加州旅館', '加州旅館']

spilt\_sentence\_to\_array(sentence)
------------------------------------

Arguments - ``sentence(String)`` : input text Returns -
``sentencearray(Array)`` : sentence array ## is\_all\_english(text)
Arguments - ``text(String)`` : input text Returns - ``result(Boolean)``
: whether the text is all English or not ## is\_contain\_number(text)
Arguments - ``text(String)`` : input text Returns - ``result(Boolean)``
: whether the text contain number or not ## is\_contain\_english(text)
Arguments - ``text(String)`` : input text Returns - ``result(Boolean)``
: whether the text contain english or not ## full2half(text) Arguments -
``string(String)`` : input string which needs turn to half Returns -
``(String)`` : a half-string ## half2full(text) Arguments -
``text(String)`` : input string which needs turn to full Returns -
``(String)`` : a full-string

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language
Topic
- Software Development :: Build Tools

Release history Release notifications | RSS feed

1.8.53

Apr 12, 2024

1.8.52

Jun 5, 2023

1.8.51

May 25, 2023

1.8.50

May 25, 2023

1.8.49

May 1, 2023

1.8.48

Aug 23, 2022

1.8.47

Jun 27, 2022

1.8.46

Jun 19, 2022

1.8.45

Jun 19, 2022

1.8.44

May 30, 2022

1.8.43

Mar 12, 2022

1.8.42

Mar 9, 2022

1.8.41

Feb 9, 2022

1.8.40

Jan 11, 2022

1.8.39

Dec 27, 2021

1.8.38

Sep 24, 2021

1.8.36

Jun 15, 2021

1.8.35

Jun 13, 2021

1.8.34

May 20, 2021

1.8.33

May 20, 2021

1.8.32

May 8, 2021

1.8.31

Apr 10, 2021

1.8.30

Apr 8, 2021

1.8.29

Nov 1, 2020

1.8.28

Oct 30, 2020

1.8.27

Oct 17, 2020

1.8.26

Oct 16, 2020

1.8.25

Oct 16, 2020

1.8.25.dev0 pre-release

Oct 14, 2020

1.8.24

Oct 14, 2020

1.8.23

Oct 3, 2020

1.8.22

Oct 1, 2020

1.8.21

Oct 1, 2020

1.8.20

Sep 18, 2020

1.8.19

Sep 3, 2020

1.8.18

Sep 2, 2020

1.8.17

Aug 26, 2020

1.8.16

Aug 26, 2020

1.8.15

Aug 24, 2020

1.8.14

Aug 24, 2020

1.8.13

Aug 7, 2020

1.8.13.dev6 pre-release

Aug 7, 2020

1.8.13.dev5 pre-release

Aug 7, 2020

1.8.13.dev4 pre-release

Aug 7, 2020

1.8.13.dev3 pre-release

Aug 7, 2020

1.8.13.dev2 pre-release

Aug 7, 2020

1.8.13.dev1 pre-release

Aug 7, 2020

1.8.12

Jul 20, 2020

1.8.11

Jul 20, 2020

1.8.10

Jul 19, 2020

1.8.9

Jul 18, 2020

1.8.8

Jul 15, 2020

1.8.7

Jul 15, 2020

1.8.6

Jul 15, 2020

1.8.5

Jul 12, 2020

1.8.4

Jul 8, 2020

1.8.3

Jul 8, 2020

1.8.2

Jul 6, 2020

1.8.1

Jul 6, 2020

1.8.0

Jul 6, 2020

1.7.10

Jul 6, 2020

1.7.9

Jul 6, 2020

1.7.8

Jul 6, 2020

1.7.7

Jul 6, 2020

1.7.6

Jul 6, 2020

1.7.5

Jul 6, 2020

1.7.4

Jul 6, 2020

1.7.3

Jul 6, 2020

1.7.2

Jul 6, 2020

1.7.1

Jul 5, 2020

1.7.0

Jul 5, 2020

1.6.9

Jul 5, 2020

1.6.8

Jul 4, 2020

1.6.7

Jul 4, 2020

1.6.6

Jul 3, 2020

1.6.5

Jul 3, 2020

1.6.2

Jun 23, 2020

1.6.1

Jun 18, 2020

1.6.0

Jun 18, 2020

1.5.9

Nov 17, 2019

1.5.8

Nov 9, 2019

1.5.7

Nov 9, 2019

1.5.6

Oct 11, 2019

1.5.5

Sep 30, 2019

1.5.0

Mar 5, 2019

1.4.5

Sep 30, 2019

1.4.0

Feb 26, 2019

1.3.9

Feb 26, 2019

1.3.8

Feb 2, 2019

1.3.5

Feb 2, 2019

1.3.0

Jan 22, 2019

1.2.5

Jan 17, 2019

1.2.0

Dec 30, 2018

1.1.3

Dec 15, 2018

1.1.2

Nov 29, 2018

1.1.1

Nov 18, 2018

1.1.0

Oct 3, 2018

1.0.9

Oct 3, 2018

1.0.8

Oct 3, 2018

1.0.7

Oct 3, 2018

1.0.6

Oct 3, 2018

1.0.5

Oct 3, 2018

1.0.4

Oct 3, 2018

1.0.3

Oct 3, 2018

This version

1.0.2

Jun 7, 2018

1.0.1

Jun 7, 2018

1.0.0

Mar 14, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nlp2-1.0.2.tar.gz (3.7 kB view hashes)

Uploaded Jun 7, 2018 Source

Built Distribution

nlp2-1.0.2-py3.6.egg (8.7 kB view hashes)

Uploaded Oct 3, 2018 Source

Hashes for nlp2-1.0.2.tar.gz

Hashes for nlp2-1.0.2.tar.gz
Algorithm	Hash digest
SHA256	`59d639476853ab253f885f502c1942a29e742a640a8625e9ac73befacb3a0ee1`
MD5	`519344c7a1611f3611d8bda7a1fd01f7`
BLAKE2b-256	`e0de441f5e0ae8e8993fff1fd88ff82b2a9865a165840ff8607354689bce9ede`

Hashes for nlp2-1.0.2-py3.6.egg

Hashes for nlp2-1.0.2-py3.6.egg
Algorithm	Hash digest
SHA256	`3f875ef9b2ec874e6ef1d3491c454db0ad7799775dad77c7f2d4090e1006de59`
MD5	`e47428f9dbdd09779a30dd809099d7a9`
BLAKE2b-256	`1da7d61ea0887895e79453f2c47b78b572e603575ac614daa7933523a750f476`