Read and use many word embedding formats

🐍 snakefusion

Introduction

snakefusion is a Python package for reading, writing, and using finalfusion, fastText, floret, GloVe, and word2vec embeddings. This package is a thin wrapper around the Rust finalfusion crate.

snakefusion supports the same types of embeddings as finalfusion:

  • Vocabulary:
    • No subwords
    • Subwords
  • Embedding matrix:
    • Array
    • Memory-mapped
    • Quantized
  • Format:
    • fastText
    • finalfusion
    • floret
    • GloVe
    • word2vec
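The two plain-text formats in this list differ mainly in their first line: word2vec text files begin with a header giving the number of words and the number of dimensions, while GloVe text files start directly with the first word. The following is a minimal pure-Python sketch of that difference. It is not snakefusion's implementation (which lives in the Rust finalfusion crate), and the function name `read_text_embeddings` and its `has_header` parameter are hypothetical, chosen only for illustration.

```python
import io


def read_text_embeddings(stream, has_header):
    """Parse whitespace-separated text embeddings into a dict of word -> vector.

    Hypothetical helper for illustration; not part of the snakefusion API.
    """
    expected_dims = None
    if has_header:
        # word2vec text files start with "<number of words> <dimensions>".
        header = stream.readline().split()
        expected_dims = int(header[1])

    embeddings = {}
    for line in stream:
        fields = line.split()
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        word = fields[0]
        vector = [float(value) for value in fields[1:]]
        if expected_dims is not None and len(vector) != expected_dims:
            raise ValueError(f"{word!r}: expected {expected_dims} dimensions")
        embeddings[word] = vector
    return embeddings


# The same two vectors, once without a header (GloVe style)
# and once with a header (word2vec text style).
glove_like = io.StringIO("cat 0.1 0.2 0.3\ndog 0.4 0.5 0.6\n")
word2vec_like = io.StringIO("2 3\ncat 0.1 0.2 0.3\ndog 0.4 0.5 0.6\n")

glove_vectors = read_text_embeddings(glove_like, has_header=False)
w2v_vectors = read_text_embeddings(word2vec_like, has_header=True)
```

Both calls yield the same mapping; only the handling of the first line differs. Binary formats such as fastText and finalfusion carry more structure (subword information, quantization, memory-mappable chunks) and are not covered by this sketch.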

Building from source

Building snakefusion from source requires a Rust toolchain installed through rustup, as well as setuptools-rust:

$ pip install --upgrade setuptools-rust

You can then build and install snakefusion in your environment:

$ pip install .

Documentation

Read the snakefusion documentation for a quickstart and API reference.

You can use finalfrontier to train new embeddings, or download pretrained embeddings.

Download files

Source Distribution

snakefusion-0.1.6.tar.gz (86.5 kB)

Uploaded Source

Built Distributions

snakefusion-0.1.6-cp36-abi3-win_amd64.whl (657.9 kB)

Uploaded CPython 3.6+, Windows x86-64

snakefusion-0.1.6-cp36-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (5.4 MB)

Uploaded CPython 3.6+, manylinux: glibc 2.17+ x86-64

snakefusion-0.1.6-cp36-abi3-macosx_10_9_x86_64.macosx_11_0_arm64.macosx_10_9_universal2.whl (1.5 MB)

Uploaded CPython 3.6+, macOS 10.9+ universal2 (ARM64, x86-64), macOS 10.9+ x86-64, macOS 11.0+ ARM64
