Skip to main content

Stopwords in NLP 🦜

Project description

Stopwords in NLP

Version License pypi downloads GitHub last commit Stars

Installation

pip install stopwds

you may want to checkout the version

stopwds version

Usage

stopwords

from stopwds import stopwords

'''
baidu: 百度停用词表
hit: 哈工大停用词表
scu: 四川大学机器智能实验室停用词表
cn: 中文停用词表
'''
for stopword in stopwords('baidu'):
    print(stopword)

and you can use like this:

from stopwds import stopwords
import jieba

text = ('医美产业崛起的同时,我国医美行业也形成了一条清晰且完整的产业链,上游医美产品生产企业占据了产业链核心环节。')
cut_sent = [word for word in jieba.cut(text) if word and word not in stopwords()]

Acknowlegements

  • stopwords

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

stopwds-0.1.5.tar.gz (5.2 kB view hashes)

Uploaded Source

Built Distribution

stopwds-0.1.5-py2.py3-none-any.whl (19.8 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page