inaFaceAnalyzer is a Python toolbox for large-scale face-based analysis of image and video streams. It provides fast API and command line programs allowing to perform face detection, face tracking, gender and age prediction, and export to CSV or rich ASS subtitles

Project description

inaFaceAnalyzer: a Python toolbox for large-scale face-based description of gender representation in media with limited gender, racial and age biases

inaFaceAnalyzer is a Python toolbox designed for large-scale analysis of faces in image or video streams. It provides a modular processing pipeline allowing to predict age and gender from faces. Results can be exported to tables, augmented video streams, or rich ASS subtitles. inaFaceAnalyzer is designed with speed in mind to perform large-scale media monitoring campaigns. The trained age and gender classification model provided is based on a ResNet50 architecture. Evaluation results are highly competitive with respect to the current state-of-the-art, and appear to reduce gender, age and racial biases.

Should you need further details regarding this work, please refer to the following paper:

@journal{doukhan2022joss,
  author = {David Doukhan and Thomas Petit},
  title = {inaFaceAnalyzer: a Python toolbox for large-scale face-based description of gender representation in media},
  journal = {JOSS - The journal of Open Source Software (currently being reviewed)},
  year = {submission in progress}
}

Have a look to sibling project inaSpeechSegmenter.

Installation

apt-get install cmake ffmpeg libgl1-mesa-glx
git clone https://github.com/ina-foss/inaFaceAnalyzer.git
cd inaFaceAnalyzer
pip install .
./test_inaFaceAnalyzer.py # to check that the installation is ok

Using inaFaceAnalyzer command line program

Most common processings can be done using the script ina_face_analyzer.py provided with the distribution. Some quick-starters commands are detailled bellow :

Displaying detailed manual

A detailed listing of all options available from the command line can be obtained using the following command. We guess you don't want to read the whole listing at this point, but you can have a look at it later 😉.

ina_face_analyzer.py -h

Process all frames from a list of video (without tracking)

Video processing requires a list of input video paths, together with a directory used to store results in CSV. Program initialization time requires several seconds, and we recommend using large list of files instead of calling the program for each file to process.

# directory storing result must exist
mkdir my_output_directory
# -i is followed by the list of video to analyze, and -o is followed by the name of the output_directory
ina_face_analyzer.py -i ./media/pexels-artem-podrez-5725953.mp4 -o ./my_output_directory
# displaying the first 2 lines of the resulting CSV
head -n 2 ./my_output_directory/pexels-artem-podrez-5725953.csv 
frame,bbox,detect_conf,sex_decfunc,age_decfunc,sex_label,age_label
0,"(945, -17, 1139, 177)",0.999998927116394,8.408014,3.9126961,m,34.12696123123169

Resulting CSV contain several columns:

frame: frame position in the video (here we have 5 lines corresponding to frame 0 - so 5 detected faces)
bbox: face bounding box
detect_conf: face detection confidence (dependent on the detection system used)
sex_decfunc and age_decfunc: raw classifier output. Can be used to smooth results or ignored.
sex_label: m for male and f for female
age_label: age prediction

Faster processing of a video

It computation time is an issue, we recommend using --fps 1 which will process a single frame per second, instead of the whole amount of video frames. When using GPU architectures, we also recommend setting large batch_size values.

# here we process a single frame per second, which is 25/30 faster than processing the whole video
ina_face_analyzer.py --fps 1 --batch_size 128 -i ./media/pexels-artem-podrez-5725953.mp4 -o ./my_output_directory

Using Tracking

Tracking allows to lower computation time, since it is less costly than a face detection procedure. It also allows to smooth prediction results associated to a tracked face and obtain more robust estimates.

# Process 5 frames per second, use face detection for 1/3 and face tracking for 2/3 frames
ina_face_analyzer.py --fps 5 --tracking 3 -i ./media/pexels-artem-podrez-5725953.mp4 -o ./my_output_directory

Exporting results

Result visualization allows to validate if a give processing pipeline is suited to a specific material. --mp4_export generate a new video with embeded bounding boxes and classification information. --ass_subtitle_export generate a ASS subtitle file allowing to display bounding boxes and classification results in vlc or ELAN, and which is more convenient to share..

# Process 10 frames per second, use face detection for 1/2 and face tracking for 1/2 frames
# results are exported to a newly generated MP4 video and ASS subtitle
ina_face_analyzer.py --fps 10 --tracking 2 --mp4_export --ass_subtitle_export  -i ./media/pexels-artem-podrez-5725953.mp4 -o ./my_output_directory
# display the resulting video
vlc ./my_output_directory/pexels-artem-podrez-5725953.mp4
# display the original video with the resulting subtitle files
vlc media/pexels-artem-podrez-5725953.mp4 --sub-file my_output_directory/pexels-artem-podrez-5725953.ass

Processing list of images

The processing of list of images can be speed up using --type image. A single resulting csv will be generated with entries for each detected faces, together with a reference to its original filename path.

# process all images stored in directory media, outputs a single csv file
ina_face_analyzer.py -i media/*.jpg -o ./myresults.csv --type image

Using inaFaceAnalyzer API

CREDITS

This work has been partially funded by the French National Research Agency (project GEM : Gender Equality Monitor : ANR-19-CE38-0012) and by European Union's Horizon 2020 research and innovation programme (project MeMAD : H2020 grant agreement No 780069).

We acknowledge contributions from Zohra Rezgui who trained first models and wrote the first piece of code that lead to inaFaceAnalyzer during her internship at INA.

@techreport{rezgui2019carthage,
  type = {Msc. Thesis},
  author = {Zohra Rezgui},
  title = {Détection et classification de visages pour la description de l’égalité femme-homme dans les archives télévisuelles},
  submissiondate = {2019/11/19},
  year = {2019},
  url = {https://www.researchgate.net/publication/337635267_Rapport_de_stage_Detection_et_classification_de_visages_pour_la_description_de_l'egalite_femme-homme_dans_les_archives_televisuelles},
  institution = {Higher School of Statistics and Information Analysis, University of Carthage}
}

Project details

Release history Release notifications | RSS feed

0.6.1

May 21, 2022

0.6.0

May 20, 2022

0.5.7

Feb 27, 2022

0.5.6

Feb 22, 2022

0.5.3

Feb 21, 2022

0.5.2

Feb 19, 2022

0.5.1

Feb 19, 2022

0.5.0

Feb 18, 2022

This version

0.4.0

Feb 15, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inaFaceAnalyzer-0.4.0.tar.gz (52.0 kB view hashes)

Uploaded Feb 15, 2022 Source

Built Distribution

inaFaceAnalyzer-0.4.0-py3-none-any.whl (48.8 kB view hashes)

Uploaded Feb 15, 2022 Python 3

Hashes for inaFaceAnalyzer-0.4.0.tar.gz

Hashes for inaFaceAnalyzer-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`7d8cb06bed249fb8e4eadd700de0a7cd751b3581c19fffe1bed692cf509cbb48`
MD5	`c52142892e673d84d0a808d9a72a3114`
BLAKE2b-256	`37eece3d1166343eca39c1c1cf62ee2d39b47c55aa3e5e5dbc0850fb9077b27f`

Hashes for inaFaceAnalyzer-0.4.0-py3-none-any.whl

Hashes for inaFaceAnalyzer-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2d5f9e89caf3b30010ab3388180c6a7a1965ade2de57e83cea6963660626f53a`
MD5	`0e2ed256499e3c1ab52607862443a8ae`
BLAKE2b-256	`b0430d101416ce56add212a4b4c34beb5fb9b85debd9f61eb78a0fe1eb312aea`