HiDi · PyPI

High-dimensional embedding generation library

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

HiDi is a library for high-dimensional embedding generation for collaborative filtering applications.

How Do I Use It?

This will get you started.

from hidi import inout, clean, matrix, pipeline


# CSV file with link_id and item_id columns
in_files = ['hidi/examples/data/user-item.csv']

# File to write output data to
outfile = 'embeddings.csv'

transforms = [
    inout.ReadTransform(in_files),      # Read data from disk
    clean.DedupeTransform(),            # Dedupe it
    matrix.SparseTransform(),           # Make a sparse user*item matrix
    matrix.SimilarityTransform(),       # To item*item similarity matrix
    matrix.SVDTransform(),              # Perform SVD dimensionality reduction
    matrix.ItemsMatrixToDFTransform(),  # Make a DataFrame with an index
    inout.WriteTransform(outfile)       # Write results to csv
]

pl = pipeline.Pipeline(transforms)
pl.run()

Setup

Requirements

HiDi is tested against CPython 2.7, 3.4, 3.5, and 3.6. It may work with different version of CPython.

Installation

To install HiDi, simply run

$ pip install hidi

Run the Tests

$ pip install tox
$ tox

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.0.3

Apr 27, 2017

This version

0.0.2

Apr 19, 2017

0.0.1

Apr 19, 2017

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

HiDi-0.0.2.tar.gz (7.0 kB view hashes)

Uploaded Apr 19, 2017 Source

Hashes for HiDi-0.0.2.tar.gz

Hashes for HiDi-0.0.2.tar.gz
Algorithm	Hash digest
SHA256	`5cf791aa2a4507b80201bc43c6f65d0cfe4689a92e1939932b105751a914b4a6`
MD5	`f5e24b4a4ad412583e9b9f662d8ec4bd`
BLAKE2b-256	`80a68aeef4eb53093f598335f6d6cb31045f6cafa4c9aaa62269d6fcf345919a`