Skip to main content

A new way to encode words and similarity calculate.

Project description

DOI

WordFP

This application consists of a python package made to encode words and compare them through similarity calculation. The words are encoded in a matrix of 0's and 1's called "WordFP", where the first column refers to all the letters present in a word and the second column to the last refers to the position of a certain letter in a word. The search for similar words is calculated based on the metrics: geometric, arithmetic, tanimoto and tversky. A jupyter-notebook with an example of using this package is in the examples/how_to_use.ipynb directory

Another way to use this package is through of web app WordFP. It is possible to run locally too following the steps below.

Install

Via pip

$ pip install wordfp

or

Via github

$ git clone https://github.com/jeffrichardchemistry/WordFP
$ cd WordFP
$ python3 setup.py install

Install and Run WebAPP Locally

The web application is in the "app/app.py" folder. Install dependencies:

$ pip install streamlit wordfp

To run:

$ cd .../app/
$ streamlit run app.py

Considerations

This project was an idea I came up with at a random moment while studying my PhD work, I hope it can help someone someday in areas like natural language processing (NLP).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

WordFP-1.0.2.linux-x86_64.tar.gz (1.8 MB view hashes)

Uploaded Source

Built Distribution

WordFP-1.0.2-py3-none-any.whl (1.8 MB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page