A simple speaker recognition library in JAX and Flax.

Project description

FlaxSpeaker

Overview

A simple speaker recognition library in JAX and Flax.

For the PyTorch version, see: SpeakerRecognitionFromScratch

Installation

pip install flaxspeaker

Tutorial

Experiment config

All settings for an experiment are kept in a single YAML file.

See myconfig.yml as an example.

Hook up with data

The configuration file must point to your downloaded dataset.

For example, if you have downloaded LibriSpeech to your own machine, set these two fields to the training and testing directories:

data:
  train_librispeech_dir: "YOUR LIBRISPEECH TRAINING SET PATH"
  test_librispeech_dir: "YOUR LIBRISPEECH TESTING SET PATH"

If you are using a dataset other than LibriSpeech, you need to represent your dataset as CSV files, then set these two fields:

data:
  train_csv: "YOUR TRAINING SET CSV"
  test_csv: "YOUR TESTING SET CSV"
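Before launching a long run, it can help to confirm that these fields point at something real. Below is a minimal sketch of such a check, not part of flaxspeaker itself. The function name find_data_problems is hypothetical, and the rule that a pair of fields is either fully set or left empty is an assumption of this sketch. The dictionary is assumed to be the parsed YAML config (for example, from PyYAML's yaml.safe_load).

```python
from pathlib import Path

# Field names come from the YAML snippets above. Treating each pair as
# "both set or both empty" is an assumption of this sketch.
DIR_FIELDS = ("train_librispeech_dir", "test_librispeech_dir")
CSV_FIELDS = ("train_csv", "test_csv")


def find_data_problems(config):
    """Return a list of human-readable problems in the `data` section."""
    data = config.get("data") or {}
    problems = []
    for fields, kind in ((DIR_FIELDS, "directory"), (CSV_FIELDS, "file")):
        values = [data.get(name) for name in fields]
        if not any(values):
            continue  # this pair of fields is not in use
        for name, value in zip(fields, values):
            if not value:
                problems.append(f"data.{name} is not set")
            elif kind == "directory" and not Path(value).is_dir():
                problems.append(f"data.{name} is not a directory: {value}")
            elif kind == "file" and not Path(value).is_file():
                problems.append(f"data.{name} is not a file: {value}")
    if not any(data.get(name) for name in DIR_FIELDS + CSV_FIELDS):
        problems.append("no dataset configured under data")
    return problems


if __name__ == "__main__":
    example = {"data": {"train_librispeech_dir": ".", "test_librispeech_dir": "."}}
    for problem in find_data_problems(example):
        print(problem)
```

An empty result means every configured path exists. The check says nothing about whether the directory actually contains a usable dataset.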

Generate dataset CSV

To represent your downloaded datasets as CSV files, you can use the generate_csv mode of the flaxspeaker command.

For example, the following command generates a CSV file CN-Celeb.csv for a CN-Celeb dataset downloaded to "${HOME}/Downloads/CN-Celeb_flac/data":

python -m flaxspeaker \
--mode generate_csv \
--path_to_dataset "${HOME}/Downloads/CN-Celeb_flac/data" \
--audio_format ".flac" \
--speaker_label_index -2 \
--output_csv "CN-Celeb.csv"
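To make the role of --speaker_label_index more concrete, here is a minimal sketch of one plausible reading of it: the path of each audio file is split into components, and the component at the given index is taken as the speaker label. With -2, that is the directory that directly contains the file. This is an illustration under that assumption, not flaxspeaker's implementation. The helper names and the two-column CSV layout (speaker, path) are hypothetical; run --help for the authoritative flag descriptions.

```python
import csv
from pathlib import Path


def speaker_label(audio_path, speaker_label_index=-2):
    """Pick the speaker label from the components of a file path."""
    return Path(audio_path).parts[speaker_label_index]


def collect_rows(dataset_dir, audio_format=".flac", speaker_label_index=-2):
    """Return (speaker, path) pairs for every matching audio file."""
    root = Path(dataset_dir)
    return [
        (speaker_label(path, speaker_label_index), str(path))
        for path in sorted(root.rglob(f"*{audio_format}"))
    ]


def write_csv(rows, output_csv):
    # Two columns and no header row: an assumed layout for illustration,
    # not a format documented by flaxspeaker.
    with open(output_csv, "w", newline="") as f:
        csv.writer(f).writerows(rows)
```

Under this reading, a file stored as data/id00010/singing-01-001.flac would get the label id00010, because index -1 is the file name and index -2 is its parent directory.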

Use --help to see a description of each flag:

python -m flaxspeaker --help

Training

Once you have the config file ready (e.g. your_config.yml), you can launch your training with this command:

python -m flaxspeaker --mode train --config your_config.yml

Evaluation

After training has finished, you can evaluate the Equal Error Rate (EER) of the model you just trained with:

python -m flaxspeaker --mode eval --config your_config.yml
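For readers unfamiliar with the metric: the EER is the error rate at the decision threshold where the false accept rate (different-speaker trials accepted) equals the false reject rate (same-speaker trials rejected). The sketch below approximates it by sweeping thresholds over the observed scores. It is a plain-Python illustration of the definition, not the code flaxspeaker runs in eval mode, and the function name compute_eer is hypothetical.

```python
def compute_eer(scores, labels):
    """Approximate the Equal Error Rate from similarity scores.

    scores: higher means "more likely the same speaker".
    labels: 1 for same-speaker trials, 0 for different-speaker trials.
    """
    num_pos = sum(labels)
    num_neg = len(labels) - num_pos
    if num_pos == 0 or num_neg == 0:
        raise ValueError("need at least one trial of each kind")

    best_gap, eer = float("inf"), None
    for threshold in sorted(set(scores)):
        # A trial is accepted when its score reaches the threshold.
        false_accepts = sum(s >= threshold and y == 0 for s, y in zip(scores, labels))
        false_rejects = sum(s < threshold and y == 1 for s, y in zip(scores, labels))
        far = false_accepts / num_neg
        frr = false_rejects / num_pos
        # Keep the threshold where the two error rates are closest.
        if abs(far - frr) < best_gap:
            best_gap, eer = abs(far - frr), (far + frr) / 2
    return eer


if __name__ == "__main__":
    scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3]
    labels = [1, 1, 0, 1, 0, 0]
    print(f"EER: {compute_eer(scores, labels):.3f}")
```

With a finite set of trials the two rates rarely cross exactly, so the sketch reports the average of the two at the closest point. A lower EER means the model separates same-speaker and different-speaker pairs more cleanly.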

Release history

0.0.2 (this version): Jul 24, 2023

0.0.1: Jul 23, 2023

Download files

Source Distribution

flaxspeaker-0.0.2.tar.gz (13.9 kB), uploaded Jul 24, 2023

Built Distribution

flaxspeaker-0.0.2-py3-none-any.whl (15.4 kB), uploaded Jul 24, 2023, Python 3

Hashes for flaxspeaker-0.0.2.tar.gz
Algorithm	Hash digest
SHA256	`1d4e2b879c9bc4890851805a00074cf60d544354ea6f27c8a2ce549eb4dcfae8`
MD5	`f227f4a84b3464809ca1c2647c2648c6`
BLAKE2b-256	`a9cddb98afb922ee66ccb1db1097b1fea496ad9cec2cda469396acaaabf0fb5e`

Hashes for flaxspeaker-0.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7640f65e8e56a48ff42351041ae89ced852d2d3ae4afb31270e40be4cda3cb8a`
MD5	`77f97f003a4e2255aec0a981c435a9db`
BLAKE2b-256	`27bf61b279d342ae0dc176742efff7167f72c6270790a6fe97553308a6c561bc`