DNN-based singing voice synthesis library

NN-SVS

Neural network-based singing voice synthesis library for research.

Demo

Neural network-based singing voice synthesis demo using kiritan_singing database (Japanese)

Audio samples

Kiritan samples: https://soundcloud.com/r9y9/sets/dnn-based-singing-voice

Installation

python setup.py develop
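
After running the command above from the repository root, a quick sanity check is to confirm that the package can be found by the Python import system. The snippet below is a minimal sketch, assuming the installed top-level package is named nnsvs (matching the core library directory listed under "Repository structure"); the helper name is_installed is hypothetical and not part of the library.

```python
import importlib.util


def is_installed(package_name: str) -> bool:
    """Return True if the import system can locate the named top-level package."""
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    # Assumption: the package installs under the name "nnsvs".
    print("nnsvs importable:", is_installed("nnsvs"))
```

If the check prints False, the install step most likely ran in a different Python environment than the one running this script.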

Repository structure

Core library: nnsvs/
Command line programs: nnsvs/bin/ and its configurations nnsvs/bin/conf/
Recipes: egs/

Python docstring style

https://sphinxcontrib-napoleon.readthedocs.io/en/latest/example_google.html

Recipes

A recipe is a set of scripts and configurations used to reproduce experiments. All the steps needed to conduct an experiment are provided in a self-contained way. Please have a look at the egs directory if you want to build your own singing voice synthesis systems.

Background

As of Feb. 2020, NEUTRINO, a DNN-based singing voice synthesis tool, has been gaining popularity in Japan. Because of the powerful DNN-based approach, users can create expressive and natural singing voices even without the manual tuning that is typically required to achieve satisfactory quality with existing tools.

While NEUTRINO is a great tool for creative purposes, it is not open-source software. In fact, to the best of our knowledge, there are only a few open-source toolkits. To advance singing voice synthesis research, we aim to provide a modern DNN-based singing voice synthesis tool for researchers and developers.

That being said, I was just curious to see if I could make a better one than NEUTRINO. We’ll see :)

History

See HISTORY.md

Related projects

English support for nnsvs: https://github.com/DynamiVox/nnsvs-english-support
Tool for using NNSVS models in UTAU (UTAU plugin software powered by NNSVS): https://github.com/oatsu-gh/ENUNU
Vocal2lab, a tool that supports creating training data for NNSVS: https://github.com/148nasuka/Vocal2lab

References

Y. Hono et al., "Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System," IEEE/ACM TASLP. https://arxiv.org/abs/2108.02776
Y. Hono et al., "Recent Development of the DNN-based Singing Voice Synthesis System — Sinsy," Proc. of APSIPA, 2017.
A fork of sinsy: https://github.com/r9y9/sinsy
Python wrapper for sinsy: https://github.com/r9y9/pysinsy
NEUTRINO: https://n3utrino.work/

Release history

0.1.1: Oct 9, 2023
0.1.0: Oct 9, 2023
0.0.3: Oct 15, 2022
0.0.2: Apr 29, 2022
0.0.1 (this version): Mar 10, 2022

Download files

Source Distribution

nnsvs-0.0.1.tar.gz (38.3 kB), uploaded Mar 10, 2022

Hashes for nnsvs-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`c54fd8a48f932d7635c38c32a5f6d6eb3dfcdf81badbc64ca8fe38977bbd479b`
MD5	`9d741a346d6daa76a2d6d90e7e12d9cd`
BLAKE2b-256	`4aaeb3b7a774ec042d7ff87b728af7bc5bd89405480a84bcd74d1c85db9c4781`
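
The digests above can be used to check that a downloaded archive is intact. The following is a minimal sketch using only the Python standard library; the helper names sha256_of_file and matches_expected are hypothetical, and the archive is assumed to have been saved in the current working directory under its original file name.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for nnsvs-0.0.1.tar.gz.
EXPECTED_SHA256 = "c54fd8a48f932d7635c38c32a5f6d6eb3dfcdf81badbc64ca8fe38977bbd479b"


def sha256_of_file(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumption: the archive was downloaded into the current directory.
    archive = Path("nnsvs-0.0.1.tar.gz")
    if archive.exists():
        print("SHA256 OK" if matches_expected(archive) else "SHA256 mismatch")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use constant regardless of archive size. The same pattern works for the MD5 digest by swapping in hashlib.md5.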