TimeSide

open web audio processing framework

Project description

TimeSide is a set of Python components enabling low- and high-level audio analysis, imaging, transcoding and streaming. Its high-level API is designed to enable complex processing on large audio or video datasets of any format. Its simple plugin architecture can be adapted to various use cases.

TimeSide also includes a smart interactive HTML5 player which provides various streaming playback functions, format selectors, fancy audio visualizations, segmentation and semantic labelling synchronized with audio events. It is embeddable in any web application.

Goals

Do asynchronous and fast audio processing with Python,
Decode audio frames from any audio or video media format into numpy arrays,
Analyze audio content with some state-of-the-art audio feature extraction libraries,
Organize, serialize and save analysis metadata through various formats,
Draw various fancy waveforms, spectrograms and other cool visualizers,
Transcode audio data in various media formats and stream them through web apps,
Playback and interact on demand through a smart high-level HTML5 extensible player,
Index, tag and organize semantic metadata (see Telemeta, which embeds TimeSide).

Architecture

The streaming architecture of TimeSide relies on two main parts: a processing engine including various plugin processors written in pure Python, and a user interface providing web-based visualization and playback tools in pure HTML5.

https://raw.github.com/yomguy/TimeSide/master/doc/slides/img/timeside_schema.png
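
To make the streaming idea concrete, here is a minimal sketch (not TimeSide's actual engine) of how audio samples can be cut into fixed-size blocks and pushed through a chain of processors. The names `iter_blocks`, `PeakMeter` and `run_stream` are hypothetical, and plain Python lists stand in for the numpy arrays that TimeSide decodes into.

```python
def iter_blocks(samples, blocksize):
    """Yield (block, eod) pairs; eod is True for the last block only."""
    total = len(samples)
    for start in range(0, total, blocksize):
        block = samples[start:start + blocksize]
        eod = start + blocksize >= total
        yield block, eod


class PeakMeter(object):
    """Hypothetical processor: tracks the largest absolute sample seen."""

    def __init__(self):
        self.peak = 0.0

    def process(self, block, eod):
        if block:
            self.peak = max(self.peak, max(abs(x) for x in block))
        return block, eod  # pass the block on unchanged


def run_stream(samples, processors, blocksize=4):
    """Push every block through each processor in order; return block count."""
    count = 0
    for block, eod in iter_blocks(samples, blocksize):
        for processor in processors:
            block, eod = processor.process(block, eod)
        count += 1
    return count
```

Because each processor only ever sees one block at a time, memory use stays bounded by the block size rather than by the length of the file.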

Processors

IEncoder

VorbisEncoder [gst_vorbis_enc]

WavEncoder [gst_wav_enc]

Mp3Encoder [gst_mp3_enc]

FlacEncoder [gst_flac_enc]

AacEncoder [gst_aac_enc]

WebMEncoder [gst_webm_enc]

AudioSink [gst_audio_sink_enc]

IDecoder

FileDecoder [gst_dec]

ArrayDecoder [array_dec]

LiveDecoder [gst_live_dec]

IGrapher

Waveform [waveform_simple]

WaveformCentroid [waveform_centroid]

WaveformTransparent [waveform_transparent]

WaveformContourBlack [waveform_contour_black]

WaveformContourWhite [waveform_contour_white]

SpectrogramLog [spectrogram_log]

SpectrogramLinear [spectrogram_lin]

Displayaubio_pitch.pitch [grapher_aubio_pitch]

Displayodf [grapher_odf]

Displaywaveform_analyzer [grapher_waveform]

Displayirit_speech_4hz.segments [grapher_irit_speech_4hz_segments]

IAnalyzer

AubioTemporal [aubio_temporal]

AubioPitch [aubio_pitch]

AubioMfcc [aubio_mfcc]

AubioMelEnergy [aubio_melenergy]

AubioSpecdesc [aubio_specdesc]

Yaafe [yaafe]

Spectrogram [spectrogram_analyzer]

Waveform [waveform_analyzer]

VampSimpleHost [vamp_simple_host]

IRITSpeechEntropy [irit_speech_entropy]

IRITSpeech4Hz [irit_speech_4hz]

OnsetDetectionFunction [odf]

LimsiSad [limsi_sad]

IValueAnalyzer

Level [level]

MeanDCShift [mean_dc_shift]

News

0.5.4

Encoder : transcoded streams were broken. Now fixed with some smart thread controls.

Analyzer : update VAMP plugin example in sandbox

Analyzer : new experimental plugin : Limsi Speech Activity Detection Systems (limsi_sad)

Decoder : process any media in streaming mode giving its URL

Install : fix some setup requirements

0.5.3

Make Analyzer rendering more generic and easy to implement

Analyzer : implement rendering capability for event and segment + add some more analyzer graphers

Analyzer : refactor the results rendering method and add the capability to use matplotlib in environments with no display

Decoder : Add a Live decoder to get data from the soundcard

Decoder : add support for 96kHz sampling rate

Encoder: live AudioSink encoder, encoder that plays the audio stream through the soundcard

Grapher : add a generic Class to display Analyzers through their ‘render’ method. Add the new grapher file

Grapher : add a generic Class to display Analyzers through their ‘render’ method. For now, it only supports FrameValueResult analyzers

Core : add a condition to catch signal only if a LiveDecoder source is used

Various bugfixes

0.5.2

Add a general launch script “timeside-launch” (see “Shell interface”)

Add some decorators to filter the inputs of processes (see analyzer.waveform for ex)

Add a “stack” option to the FileDecoder to accumulate audio data allowing multipass processes

Add beat confidence to aubio_temporal

Add AAC encoder (gstreamer voaacenc plugin needed)

Add UUIDs to the file URI and to all processors

Add a Debian repository with all dependencies for i386 and amd64 architectures

Fix buggy WebM encoder

Fix buggy MP3 muxing

Fix various minor bugs

0.5.1

Add parent processor list to Processor

Simplify and optimize the grapher system

Add Grapher abstract generic class

Add a UUID property to Processor

Add a SpectrogramLinear grapher

Add WaveformTransparent grapher

Fix some assignment issues regarding immutable types in Analyzer Result

Simplify analyzer results implementation by introducing a Factory and multiple classes and subclasses to handle the 8 different kinds of results

Add doctests and improve the unit tests

Add an OnsetDetectionFunction analyzer

Update documentation

Various cleanups

Various bugfixes

0.5.0

Deep refactoring of the analyzer API to handle various new usecases, specifically audio feature extraction

Add serializable global result container (NEW dependency to h5py, json, yaml)

Add new audio feature extraction analyzers thanks to the Aubio library providing beat & BPM detection, pitch detection and other cool stuff (NEW dependency on aubio)

Add new audio feature extraction analyzers thanks to the Yaafe library (NEW dependency on yaafe)

Add new IRIT speech detection analyzers (NEW dependency on scipy)

EXPERIMENTAL : add new audio feature extraction thanks to the VAMP plugin library (NEW dependency on some vamp tools)

Add new documentation : http://files.parisson.com/timeside/doc/

New Debian repository for instant install

Various bugfixes

Compatible with Python >=2.7

WARNING : no longer compatible with Telemeta 1.4.5

0.4.5

(re)fix Pillow support (#12)

fix some Python package rules

add a Debian package directory (thanks to piem, in git repo only)

0.4.4

Only setup bugfixes

Last compatible version with Python 2.6

Next version 0.5 will integrate serious new analyzer features (aubio, yaafe and more)

0.4.3

finally fix decoder leaks and de-synchronizations (thanks to piem)

this also fixes bad variable encoder file lengths

fix OGG and FLAC encoders (closes: #8)

fix multi-channels streaming (closes: #13)

add support for Pillow (closes: #12)

temporarily deactivate AAC and WebM encoders (need to add some limits for them)

WARNING : we now need to add overwrite=True to encoder kwargs instances in order to overwrite the destination file, i.e. e=Mp3Encoder(path, overwrite=True)

0.4.2

many releases these days, but some patches are really worth a HOT release: we just need them in production.

finally fix FFT window border leaks in the streaming spectrum process for really better spectrograms and smoother spectral centroid waveforms

mv gstutils to timeside.gstutils

cleanup various processes

Ogg, Aac and Flac encoders not really working now (some frames missing) :( Will be fixed in next release.

0.4.1

move UI static files from ui/ to static/timeside/ (for better django compatibility)

upgrade js scripts from telemeta 1.4.4

upgrade SoundManager2 to v297a-20120916

0.4.0

finally fixed an old decoder bug to prevent memory leaks during hard process (thanks to piem)

add blocksize property to the processor API

add many unit tests (check tests/alltests.py)

re-add UI files (sorry, was missing in the last packages)

various bugfixes

not all encoders have been well tested on big files, please test!

piem is now preparing some aubio analyzers :P

0.3.3

mostly a transitional developer and maintainer version, no new cool features

but add “ts-waveforms” script for waveform batching

fix some tests

removed but download audio samples

fix setup

update README

0.3.2

move mainloop to its own thread to avoid memory hogging on large files

add condition values to prepare running gst mainloop in a thread

add experimental WebM encoder

duration analysis goes to decoder.duration property

bugfixes

Dive in

Define some processors:

>>> import timeside
>>> decoder  =  timeside.decoder.FileDecoder('sweep.wav')
>>> grapher  =  timeside.grapher.Waveform()
>>> analyzer =  timeside.analyzer.Level()
>>> encoder  =  timeside.encoder.VorbisEncoder('sweep.ogg')

then, the magic pipeline:

>>> (decoder | grapher | analyzer | encoder).run()

get the results:

>>> grapher.render(output='waveform.png')
>>> print 'Level:', analyzer.results
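
The `|` chaining used above can be understood through Python operator overloading. The sketch below shows one way such a pipeline could be built. It assumes nothing about TimeSide internals: `Processor`, `Pipe`, `Gain` and `Collector` are hypothetical, and unlike TimeSide (where the decoder is the source and `run()` takes no argument) the items are passed to `run()` directly.

```python
class Processor(object):
    """Hypothetical base class: `a | b` builds a Pipe of both."""

    def __or__(self, other):
        return Pipe([self, other])

    def process(self, item):
        return item


class Pipe(object):
    """An ordered chain of processors."""

    def __init__(self, processors):
        self.processors = list(processors)

    def __or__(self, other):
        # Extending a pipe returns a new, longer pipe.
        return Pipe(self.processors + [other])

    def run(self, items):
        """Feed each item through every processor, left to right."""
        for item in items:
            for processor in self.processors:
                item = processor.process(item)


class Gain(Processor):
    """Multiplies every item by a constant factor."""

    def __init__(self, factor):
        self.factor = factor

    def process(self, item):
        return item * self.factor


class Collector(Processor):
    """Stores every item it receives, then passes it on."""

    def __init__(self):
        self.results = []

    def process(self, item):
        self.results.append(item)
        return item


collector = Collector()
(Gain(2) | Gain(10) | collector).run([1, 2, 3])
```

After the run, `collector.results` holds the processed items, in the same spirit as `analyzer.results` in the example above.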

API / Documentation

General : http://files.parisson.com/timeside/doc/
Tutorial : http://files.parisson.com/timeside/doc/tutorial/index.html
API : http://files.parisson.com/timeside/doc/api/index.html
Player / UI : https://github.com/yomguy/TimeSide/wiki/Ui-Guide (see also “Web Interface”)
Examples:

Install

The TimeSide engine is intended to work on all Unix / Linux platforms. Mac OS X and Windows versions will be explored soon.

TimeSide needs some other Python modules to run. The following methods explain how to install all dependencies on various Linux based systems.

On Debian, Ubuntu, etc:

$ echo "deb http://debian.parisson.com/debian/ stable main" | sudo tee -a /etc/apt/sources.list
$ sudo apt-get update
$ sudo apt-get install python-timeside

On Fedora and Red-Hat:

$ sudo yum install gcc python python-devel gstreamer pygobject2 \
                  gstreamer-python gstreamer gstreamer-plugins-bad-free \
                  gstreamer-plugins-bad-free-extras \
                  gstreamer-plugins-base gstreamer-plugins-good

$ sudo pip install timeside

On other Linux platforms, you can also install all dependencies and then use pip:

$ sudo pip install timeside

Dependencies

python (>=2.7), python-setuptools, python-gst0.10, gstreamer0.10-plugins-good, gstreamer0.10-gnonlin, gstreamer0.10-plugins-ugly, python-aubio, python-yaafe, python-simplejson, python-yaml, python-h5py, python-scipy, python-matplotlib
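
When installing by hand, it can help to check which Python modules are importable before running TimeSide. The helper below is a small sketch written to run on both Python 2.7 and Python 3. The import names in `CANDIDATES` are assumptions derived from some of the package names above and may differ on your system.

```python
import importlib


def missing_modules(names):
    """Return the module names from `names` that fail to import."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


# Assumed import names for some of the packages listed above.
CANDIDATES = ["scipy", "matplotlib", "h5py", "yaml", "simplejson", "aubio"]
```

Calling `missing_modules(CANDIDATES)` returns the subset that still needs to be installed. A module that is present but broken may raise an error other than `ImportError`, which this sketch does not catch.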

Shell Interface

Of course, TimeSide can be used in any Python environment. A shell script is also provided to enable preset-based and recursive processing from the command line:

$ timeside-launch -h
Usage: scripts/timeside-launch [options] -c file.conf file1.wav [file2.wav ...]
 help: scripts/timeside-launch -h

Options:
 -h, --help            show this help message and exit
 -v, --verbose         be verbose
 -q, --quiet           be quiet
 -C <config_file>, --conf=<config_file>
                       configuration file
 -s <samplerate>, --samplerate=<samplerate>
                       samplerate at which to run the pipeline
 -c <channels>, --channels=<channels>
                       number of channels to run the pipeline with
 -b <blocksize>, --blocksize=<blocksize>
                       blocksize at which to run the pipeline
 -a <analyzers>, --analyzers=<analyzers>
                       analyzers in the pipeline
 -g <graphers>, --graphers=<graphers>
                       graphers in the pipeline
 -e <encoders>, --encoders=<encoders>
                       encoders in the pipeline
 -R <formats>, --results-formats=<formats>
                       list of results output formats for the analyzers
                       results
 -I <formats>, --images-formats=<formats>
                       list of graph output formats for the analyzers results
 -o <outputdir>, --ouput-directory=<outputdir>
                       output directory
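
For batch jobs driven from Python, the command line can be assembled as an argument list and handed to `subprocess`. The helper below is a sketch that only uses flags shown in the help text above. It follows the options table, where `-C` selects the configuration file. `build_launch_command` is a hypothetical name, and joining several analyzers, graphers or encoders with commas is an assumption, since the help text does not state the expected list format.

```python
def build_launch_command(conf, files, analyzers=None, graphers=None,
                         encoders=None, outputdir=None):
    """Build an argument list for timeside-launch (it is not executed here)."""
    cmd = ["timeside-launch", "-C", conf]
    # Assumption: multiple processor ids are passed as one comma-separated value.
    if analyzers:
        cmd += ["-a", ",".join(analyzers)]
    if graphers:
        cmd += ["-g", ",".join(graphers)]
    if encoders:
        cmd += ["-e", ",".join(encoders)]
    if outputdir:
        cmd += ["-o", outputdir]
    cmd += list(files)
    return cmd
```

The resulting list can be passed to `subprocess.call(cmd)`; building a list rather than a single string avoids shell quoting problems with file names that contain spaces.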

Web Interface (the player)

TimeSide comes with a smart and pure HTML5 audio player.

Features:

embed it in any audio web application
stream, playback and download various audio formats on the fly
synchronize sound with text, bitmap and vectorial events
seek through various semantic, analytic and time synced data
fully skinnable with CSS style

Screenshot:

https://raw.github.com/yomguy/TimeSide/master/doc/slides/img/timeside_player_01.png

Examples of the player embedded in the Telemeta open web audio CMS:

Development documentation:

https://github.com/yomguy/TimeSide/wiki/Ui-Guide

TODO list:

embed a light http server to get commands through something like JSON RPC
zoom
layers

The player should work on any modern HTML5 enabled browser. Flash is needed for MP3 if the browser doesn’t support it.

Development

For versions >=0.5 on Debian Stable 7.0 Wheezy:

$ echo "deb http://debian.parisson.com/debian/ stable main" | sudo tee -a /etc/apt/sources.list
$ echo "deb-src http://debian.parisson.com/debian/ stable main" | sudo tee -a /etc/apt/sources.list
$ sudo apt-get update
$ sudo apt-get install git
$ sudo apt-get build-dep python-timeside

$ git clone https://github.com/yomguy/TimeSide.git
$ cd TimeSide
$ git checkout dev
$ export PYTHONPATH=$PYTHONPATH:`pwd`
$ python tests/run_all_tests

Sponsors and Partners

Parisson

CNRS (National Center for Scientific Research, France)

Huma-Num (big data equipment for digital humanities, ex TGE Adonis)

CREM (French National Center of Ethnomusicology Research, France)

Université Pierre et Marie Curie (UPMC Paris, France)

ANR (CONTINT 2012 project : DIADEMS)

MNHN : Museum National d’Histoire Naturelle (Paris, France)

Copyrights

License

TimeSide is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 2 of the License, or (at your option) any later version.

TimeSide is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

See LICENSE for more details.

Project details

Release history

0.9.6 (Nov 27, 2020)
0.9.5 (Nov 27, 2020)
0.8 (Feb 18, 2016)
0.7.1 (Jan 29, 2015)
0.7 (Jan 25, 2015)
0.6.2 (Jan 18, 2015)
0.6.1 (Nov 6, 2014)
0.6 (Oct 24, 2014)
0.5.6.3 (Jul 4, 2014)
0.5.6 (Jun 11, 2014)
0.5.5 (May 20, 2014)
0.5.4-1 (Mar 3, 2014), this version
0.5.4 (Feb 20, 2014)
0.5.3 (Feb 3, 2014)
0.5.2 (Dec 16, 2013)
0.5.1 (Nov 11, 2013)
0.5.0 (Oct 16, 2013)
0.4.5 (Sep 3, 2013)
0.4.4 (Jul 26, 2013)
0.4.3 (Feb 12, 2013)
0.4.2 (Oct 8, 2012)
0.4.1 (Oct 2, 2012)
0.4.0 (Sep 28, 2012)
0.3.3 (Sep 19, 2012)
0.3.2 (Jul 29, 2012)
0.3.1 (May 30, 2011)
0.3 (May 30, 2011)

Download files

Source Distribution

TimeSide-0.5.4-1.tar.gz (515.7 kB)

Uploaded Mar 3, 2014 Source

Hashes for TimeSide-0.5.4-1.tar.gz

Algorithm	Hash digest
SHA256	`92e806800b42761cc4dba5ea5fc0111834e0250b067f350de0864b7004aff0d3`
MD5	`25ffb1577840a8694063ea11895cffac`
BLAKE2b-256	`8d33af43eb62954fd499c3bdbff78f428492d00b7b246cdad041f31b45cee118`
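
A downloaded archive can be checked against the SHA256 digest listed above before installing it. The following is a small sketch using only the standard library. The function names are hypothetical, and the archive path is whatever location you saved the file to.

```python
import hashlib

# SHA256 digest listed above for TimeSide-0.5.4-1.tar.gz.
EXPECTED_SHA256 = "92e806800b42761cc4dba5ea5fc0111834e0250b067f350de0864b7004aff0d3"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_expected_archive(path, expected=EXPECTED_SHA256):
    """True if the file at `path` matches the expected digest."""
    return sha256_of(path) == expected
```

For example, `is_expected_archive("TimeSide-0.5.4-1.tar.gz")` should return `True` for an intact download of this release.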
