A Python package for indexing text datasets using prime number factorisation for fast word frequency analysis.

primetext

Python package for indexing text datasets for fast word frequency analysis.

Usage

from primetext import primetext

data = ["black cat on mat",
        "black hat for you",
        "cat sat on you"]

# create a primetext instance
pt = primetext.primetext()

# index the data (one record per string)
pt.index(data)

# finding words
recordsWithCat = pt.find(['cat'])
# returns boolean vector : [True,False,True]

recordsWithCatAndSat = pt.find(['cat','sat'])
# returns boolean vector : [False,False,True]
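
To give a feel for how prime number factorisation can answer queries like the ones above, here is a minimal sketch. It is not the primetext implementation: the class `PrimeIndexSketch`, the helper `prime_stream`, and the whitespace tokenisation are all hypothetical and chosen only for illustration. It assumes each distinct word is assigned its own prime and each record is stored as the product of its words' primes, so a record contains every queried word exactly when the product of the query primes divides the record's product.

```python
from math import prod  # Python 3.8+


def prime_stream():
    """Yield 2, 3, 5, ... by trial division (adequate for a small vocabulary)."""
    found = []
    candidate = 2
    while True:
        if all(candidate % p for p in found):
            found.append(candidate)
            yield candidate
        candidate += 1


class PrimeIndexSketch:
    """Hypothetical illustration only; not the primetext API or its internals."""

    def __init__(self):
        self._primes = prime_stream()
        self.word_to_prime = {}   # word -> its assigned prime
        self.record_codes = []    # one integer product per indexed record

    def index(self, records):
        for record in records:
            # Assumption: words are split on whitespace; duplicates are dropped.
            words = list(dict.fromkeys(record.split()))
            for word in words:
                if word not in self.word_to_prime:
                    self.word_to_prime[word] = next(self._primes)
            self.record_codes.append(prod(self.word_to_prime[w] for w in words))

    def find(self, words):
        # A word that was never indexed cannot appear in any record.
        if any(w not in self.word_to_prime for w in words):
            return [False] * len(self.record_codes)
        query = prod(self.word_to_prime[w] for w in set(words))
        # Divisibility by the query product means all query words are present.
        return [code % query == 0 for code in self.record_codes]


data = ["black cat on mat",
        "black hat for you",
        "cat sat on you"]

sketch = PrimeIndexSketch()
sketch.index(data)
print(sketch.find(["cat"]))         # [True, False, True]
print(sketch.find(["cat", "sat"]))  # [False, False, True]
```

Python integers have arbitrary precision, so the products do not overflow, but they grow with record length and vocabulary size; a real index would have to manage that cost somehow.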
