A list of similar sounding words to help disambiguate voice coding
Project description
Similar sounding words This is a list of similar sounding words that I have collected from various sources on the web and added to as I find new pairs.
Unlike most homophone, homograph, and homonym resources this list is not targeting ESL or educational use. Instead it is designed for finding common errors in speech recognition texts. Specifically I use it with Caster for voice programming.
I currently have five different sources. I've downloaded their contents into the data directory as text files, or in one case HTML and parsed appropriately. I have also linked to the original location of these files both inside the files and in the headings between Jupyter cells in the notebook.
Unfortunately I wasn't thinking about reproducibility when I started this project, so most of the text files have had a bit of light preprocessing in a text editor.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for similar-sounding-words-0.0.1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6e45c73ea761748df375919aedeee7ef0c5a7f3384492c9fbcd80b6a99d92ff4 |
|
MD5 | 73a69c72f75c55ac7811e30b3c6a32d0 |
|
BLAKE2b-256 | ed125e727bf6c3500d29b12d33b41c9acf0645799228b840d8fb3adf87f8b51d |
Hashes for similar_sounding_words-0.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b2e59da40589d489f0bd3556526bd67eb4614cee5c18b37b5ae372fd6c99a645 |
|
MD5 | c02dc7f9a4a3c0ad8b87cbe0d28ba32f |
|
BLAKE2b-256 | 5bd569893708a2f66053aad166135a4f2a67458f8ffd5e012a5ba5d58b2db948 |