FAIM (FAir Interpolation Method), described in "Beyond Incompatibility: Interpolation between Mutually

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

FAIM

FAIM (FAir Interpolation Method), described in Beyond Incompatibility: Interpolation between Mutually Exclusive Fairness Criteria in Classification Problems, is a post-processing algorithm for achieving a combination of group-fairness criteria (equalized false positive rates, equalized false negative rates, group calibration).

This README.md is under construction!

Installation

Environment

Ensure you have a environment with Python>=3.7 and pip>=2.21, preferably by creating a virtual environment.

One way to do this is using miniconda. Install miniconda following the instructions on this page and create a python 3.10 environment:

conda create -n faim python=3.10

Activate the environment

conda activate faim

Check that versions of python are >=3.7 and >=2.21, respectively:

python --version
pip --version

Python Package

If you intend to develop the package and/or contribute, follow the install instructions in the Development Environment section below instead. Otherwise, follow these instructions.

The package and experiment CLI can be installed with pip:

pip install "faim[experiment]"

Note the [experiment] notation is required for now since, for the moment, the algorithm can only be run in experiment mode for recreating experimental results in the paper. In the future, faim will be made available for post-processing classifier scores (given ground truth and group information), going beyond reproducing paper experiments.

Removal

From the environment where you installed the package, run

pip uninstall faim

Usage

Installing faim also (currently) installs one command line interface (CLI) tool, faim-experiment which can be used to reproduce the work in the paper.

[A general API will be added soon]

Experiments

This section contains information for reproducing experiments in our paper.

Ensure the package has been installed with [experiment] extra requirements before continuing (see Installation | Python Package)!

Prepare Data

The CLI can be used to prepare any of the three datasets used in the paper:

faim-experiment --prepare-data DATASET

where DATASET is one of:

synthetic-from-paper
compas
zalando [waiting for permission to release, contact us for more information]

The dataset will be downloaded, and prepared to a folder called prepared-data.

The following sections include info about each dataset:

Synthetic data

The raw dataset in the GitHub repo corresponds to synthetic prediction and ground truth scores for two groups, for each group sampling from a corresponding binormal distribution.

COMPAS data

The raw data was obtained from ProPublica's COMPAS Analysis repository.

Zalando data

Under construction, more information to follow!

Run Experiment

Having prepared data following the instruction above, you are ready to run a FAIM experiment:

faim-experiment --run PREPARED-DATASET LOW_SCORE_VAL,HIGH_SCORE_VAL THETAS PREPARED_DATA_FILEPATH

PREPARED-DATASET is now one of the following options (depending on what has been prepared):

syntheticTwoGroups (prepared using --prepare-data synthetic)
compasGender (prepared using --prepare-data compas)
compasRace (prepared using --prepare-data compas)
compasAge (prepared using --prepare-data compas)
zalando (prepared using --prepare-data zalando) [waiting for permission to release, contact us for more information]

LOW_SCORE_VAL,HIGH_SCORE_VAL are two numbers that define the score range.

THETAS correspond to the fairness compromise you want. There are three thetas per group corresponding to the desired amount of the three fairness criteria that the system should achieve:

group calibration
equalized false negative rates
equalized false positive rates

Note, as discussed in the paper, thetas = 1,1,1 does not indicate that the system will simultaneously achieve all three (mutually incompatible) fairness criteria, but rather the result will be a compromise between all three.

See the paper for more details.

Finally, PREPARED_DATA_FILEPATH corresponds to the filepath of the prepared data.

Examples

Run all of the following from the same folder where faim-experiment --prepare-data was run.

In each example, a FAIM post-processor is trained and evaluated with results saved under the results folder:

Train FAIM model on synthetic dataset with callibration as fairness correction

faim-experiment --run syntheticTwoGroups 0.1 1,0,0,1,0,0 prepared-data/synthetic/2groups/2022-01-12/dataset.csv

Train FAIM model on synthetic dataset to achieve a combination of all three fairness criteria.

faim-experiment --run syntheticTwoGroups 0.1 1,1,1,1,1,1 prepared-data/synthetic/2groups/2022-01-12/dataset.csv

Visualize Results

Needs documentation!

Development Environment

To develop and/or contribute, clone the repository

git clone <this repo URL>

From the root directory of the git repository, install the package with pip in editable mode (-e) with extra requirements for experiments (experiment) and development (dev):

pip install -e ".[experiment,dev]"

Don't confuse the [] to mean optional. The ".[experiment, dev]" notation tells pip to install extra "experiment" and "dev" requirements including things like pytest and pre-commit.

When contributing, be sure to install (and use) our pre-commit hooks:

pre-commit install -t pre-commit -t pre-push

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.0.3

Jan 4, 2023

0.0.2

Dec 5, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

faim-0.0.3.tar.gz (29.7 kB view hashes)

Uploaded Jan 4, 2023 Source

Built Distribution

faim-0.0.3-py3-none-any.whl (33.3 kB view hashes)

Uploaded Jan 4, 2023 Python 3

Hashes for faim-0.0.3.tar.gz

Hashes for faim-0.0.3.tar.gz
Algorithm	Hash digest
SHA256	`fda5e15aabfb152711f52cfdb4b0ccd22bdc5946725ac1a623b3998a368ee427`
MD5	`30a70abd778d31efc5d8ccdf90dc8046`
BLAKE2b-256	`6459dffb5559d170cdd70cac7a30e1fbdf6175c4ed5435d086785fb96d4d0efc`

Hashes for faim-0.0.3-py3-none-any.whl

Hashes for faim-0.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1e17f2bb4087a72fbd6317639fff775c06dcd55aa0732aa8bc86ea7a87bbfb65`
MD5	`047acccad17069e26bbcc17be294cd8d`
BLAKE2b-256	`4346a5414b37891109a115372c607bd1cfd874da77c2d9ae3d0eb24cb9e37800`