Package for indexing text datasets using prime number factorisation for fast word frequency analysis
Project description
primetext
A Python package for indexing text datasets for fast word frequency analysis.
Usage
from primetext import primetext

data = ["black cat on mat",
        "black hat for you",
        "cat sat on you"]

# create a primetext instance
pt = primetext.primetext()

# index the data
pt.index(data)

# find the records that contain every listed word
recordsWithCat = pt.find(['cat'])
# returns boolean vector: [True, False, True]

recordsWithCatAndSat = pt.find(['cat', 'sat'])
# returns boolean vector: [False, False, True]
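
The description mentions prime number factorisation but does not show how it is used. The sketch below is one plausible reading, not the package's actual implementation: each distinct word is assigned its own prime, each record is stored as the product of its words' primes, and a record contains a set of words exactly when the product of their primes divides the record's number. The names `prime_stream`, `build_index`, and `find_records` are hypothetical and are not part of the primetext API.

```python
from itertools import count


def prime_stream():
    """Yield 2, 3, 5, ... by trial division (adequate for a small vocabulary)."""
    found = []
    for n in count(2):
        if all(n % p for p in found):
            found.append(n)
            yield n


def build_index(records):
    """Hypothetical helper: map words to primes and records to prime products."""
    primes = prime_stream()
    word_to_prime = {}
    encoded = []
    for record in records:
        product = 1
        # dict.fromkeys drops repeated words while keeping first-seen order
        for word in dict.fromkeys(record.split()):
            if word not in word_to_prime:
                word_to_prime[word] = next(primes)
            product *= word_to_prime[word]
        encoded.append(product)
    return word_to_prime, encoded


def find_records(words, word_to_prime, encoded):
    """Hypothetical helper: True where a record contains every queried word."""
    query = 1
    for word in set(words):
        prime = word_to_prime.get(word)
        if prime is None:
            # a word that was never indexed cannot appear in any record
            return [False] * len(encoded)
        query *= prime
    # distinct primes: the product divides a record iff each prime does
    return [value % query == 0 for value in encoded]


data = ["black cat on mat",
        "black hat for you",
        "cat sat on you"]
word_to_prime, encoded = build_index(data)
print(find_records(['cat'], word_to_prime, encoded))
print(find_records(['cat', 'sat'], word_to_prime, encoded))
```

Under this scheme, counting the `True` entries in a result gives the number of records that contain the queried words. Python integers have arbitrary precision, so the products do not overflow, although they grow quickly for long records or large vocabularies.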