Python Package for Uplift Modeling and Causal Inference with Machine Learning Algorithms

These details have been verified by PyPI

Maintainers

jeongyoonlee jing.pan paullo0106 uber

These details have not been verified by PyPI

Project links

Homepage

Project description

Disclaimer

This project is stable and being incubated for long-term support. It may contain new experimental code, for which APIs are subject to change.

Causal ML: A Python Package for Uplift Modeling and Causal Inference with ML

Causal ML is a Python package that provides a suite of uplift modeling and causal inference methods using machine learning algorithms based on recent research. It provides a standard interface that allows users to estimate the Conditional Average Treatment Effect (CATE), also called the Individual Treatment Effect (ITE), from experimental or observational data. Essentially, it estimates the causal impact of an intervention T on an outcome Y for users with observed features X, without strong assumptions on the model form. Typical use cases include:

Campaign targeting optimization: An important lever for increasing ROI in an advertising campaign is to target the ad at the customers who are likely to respond favorably on a given KPI, such as engagement or sales. CATE identifies these customers by estimating the effect of ad exposure on the KPI at the individual level, from A/B experiment data or historical observational data.
Personalized engagement: A company has multiple options for interacting with its customers, such as different product choices in up-sell or different messaging channels for communications. One can use CATE to estimate the heterogeneous treatment effect for each combination of customer and treatment option, and use those estimates to drive a personalized recommendation system.
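
For reference, the CATE mentioned above is usually written in potential-outcomes notation. The symbols Y(1) and Y(0), the outcomes a user would show with and without the intervention T, are notation assumed here for a single binary treatment; they do not come from the package documentation.

```latex
\tau(x) = \mathbb{E}\left[\, Y(1) - Y(0) \mid X = x \,\right],
\qquad
\mathrm{ATE} = \mathbb{E}\left[\, \tau(X) \,\right]
```

Under this reading, campaign targeting amounts to selecting users whose estimated \tau(x) is large, and the Average Treatment Effect (ATE) is the average of \tau(x) over the population.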

The package currently supports the following methods:

Tree-based algorithms
- Uplift tree/random forests on KL divergence, Euclidean Distance, and Chi-Square
- Uplift tree/random forests on Contextual Treatment Selection
Meta-learner algorithms
- S-learner
- T-learner
- X-learner
- R-learner
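
To give a feel for what a divergence-based uplift tree optimizes, here is a simplified sketch of a KL-divergence split gain. It assumes a binary outcome and a single treatment, and it omits the normalization terms that full uplift-tree algorithms typically apply. The function names (kl_divergence, node_divergence, split_gain) are hypothetical and this is not Causal ML's implementation.

```python
import math


def kl_divergence(p_treated, p_control, eps=1e-6):
    """KL divergence between two Bernoulli outcome distributions."""
    # Clip probabilities away from 0 and 1 so the logarithms stay finite.
    p = min(max(p_treated, eps), 1 - eps)
    q = min(max(p_control, eps), 1 - eps)
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))


def node_divergence(rows):
    """Divergence between treated and control response rates in one node.

    rows is a list of (treatment, outcome) pairs, both coded as 0 or 1.
    """
    treated = [y for t, y in rows if t == 1]
    control = [y for t, y in rows if t == 0]
    if not treated or not control:
        return 0.0  # no comparison is possible without both groups
    return kl_divergence(sum(treated) / len(treated),
                         sum(control) / len(control))


def split_gain(left, right):
    """Weighted divergence of the two children minus the parent's divergence."""
    n = len(left) + len(right)
    after = (len(left) * node_divergence(left)
             + len(right) * node_divergence(right)) / n
    return after - node_divergence(left + right)


# Toy data: the treatment helps in the left child and hurts in the right one,
# so the parent node shows no overall difference between the two groups.
left = [(1, 1)] * 3 + [(1, 0)] + [(0, 0)] * 3 + [(0, 1)]
right = [(1, 0)] * 3 + [(1, 1)] + [(0, 1)] * 3 + [(0, 0)]
print('Split gain: {:.4f}'.format(split_gain(left, right)))
```

A split scores highly when the treated and control outcome distributions differ more inside the children than they do in the parent, which is how the tree isolates subgroups with different treatment effects.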

Installation

Prerequisites

Install dependencies:

$ pip install -r requirements.txt

Install from pip:

$ pip install causalml

Install from source:

$ git clone https://github.com/uber-common/causalml.git
$ cd causalml
$ python setup.py build_ext --inplace
$ python setup.py install
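
After installing, one way to confirm which version ended up in the environment is to ask the packaging metadata. This is a minimal sketch that assumes Python 3.8 or later (where importlib.metadata is part of the standard library); installed_version is a hypothetical helper name.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of a package, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Prints the installed causalml version, or None if it is not installed.
print(installed_version('causalml'))
```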

Quick Start

Average Treatment Effect Estimation with S, T, and X Learners

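import logging

from sklearn.linear_model import LogisticRegression
from xgboost import XGBRegressor

from causalml.inference import LinearRegressionSLearner
from causalml.inference import XGBTLearner, MLPTLearner
from causalml.inference import BaseXLearner
from causalml.dataset import synthetic_data

# The snippet reports results through a logger, so set one up first.
logging.basicConfig(level=logging.INFO)
logger = logging.getLogger('causalml')

# Simulate outcomes y, features X, and a binary treatment indicator.
y, X, treatment, _ = synthetic_data(mode=1, n=1000, p=5, sigma=1.0)

# S-learner with linear regression as the base model.
lr = LinearRegressionSLearner()
te, lb, ub = lr.estimate_ate(X, treatment, y)
logger.info('Average Treatment Effect (Linear Regression): {:.2f} ({:.2f}, {:.2f})'.format(te, lb, ub))

# T-learner with XGBoost as the base model.
xg = XGBTLearner(random_state=42)
te, lb, ub = xg.estimate_ate(X, treatment, y)
logger.info('Average Treatment Effect (XGBoost): {:.2f} ({:.2f}, {:.2f})'.format(te, lb, ub))

# T-learner with a multi-layer perceptron as the base model.
nn = MLPTLearner(hidden_layer_sizes=(10, 10),
                 learning_rate_init=.1,
                 early_stopping=True,
                 random_state=42)
te, lb, ub = nn.estimate_ate(X, treatment, y)
logger.info('Average Treatment Effect (Neural Network (MLP)): {:.2f} ({:.2f}, {:.2f})'.format(te, lb, ub))

# X-learner: estimate_ate() also takes propensity scores p.
# Assumption: the original snippet never defines p, so it is estimated here
# with a scikit-learn logistic regression of treatment on X.
p = LogisticRegression().fit(X, treatment).predict_proba(X)[:, 1]
xl = BaseXLearner(learner=XGBRegressor(random_state=42))
te, lb, ub = xl.estimate_ate(X, p, treatment, y)
logger.info('Average Treatment Effect (X-Learner with XGBoost): {:.2f} ({:.2f}, {:.2f})'.format(te, lb, ub))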
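
The Quick Start above treats the learners as black boxes. To illustrate the idea behind a T-learner, the following standard-library sketch fits one outcome model per treatment arm and takes the difference of their predictions as the CATE estimate. It assumes a single feature and ordinary least squares as the base model; the names fit_line, t_learner, and t_learner_ate are hypothetical, and this is not how Causal ML implements its learners.

```python
import random


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x (one feature)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def t_learner(xs, treatment, ys):
    """Fit one outcome model per arm; return a function x -> estimated CATE."""
    treated = [(x, y) for x, t, y in zip(xs, treatment, ys) if t == 1]
    control = [(x, y) for x, t, y in zip(xs, treatment, ys) if t == 0]
    a1, b1 = fit_line(*zip(*treated))
    a0, b0 = fit_line(*zip(*control))
    # CATE estimate: predicted treated outcome minus predicted control outcome.
    return lambda x: (a1 + b1 * x) - (a0 + b0 * x)


def t_learner_ate(xs, treatment, ys):
    """Average the estimated CATE over the observed feature values."""
    cate = t_learner(xs, treatment, ys)
    return sum(cate(x) for x in xs) / len(xs)


# Simulated randomized experiment whose true CATE is 0.5 + x.
rng = random.Random(42)
xs = [rng.random() for _ in range(2000)]
treatment = [rng.randint(0, 1) for _ in xs]
ys = [1 + 2 * x + t * (0.5 + x) + rng.gauss(0, 0.1)
      for x, t in zip(xs, treatment)]
print('Estimated ATE: {:.2f}'.format(t_learner_ate(xs, treatment, ys)))
```

Because x is drawn uniformly from [0, 1), the true ATE in this simulation is close to 1.0, which gives a reference point for the printed estimate. An S-learner would instead fit a single model with the treatment indicator as an extra feature.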

Contributing

We welcome community contributors to the project. Before you start, please read our code of conduct and our contributing guidelines.

Versioning

We document versions and changes in our changelog.

License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.

References

Papers

Nicholas J Radcliffe and Patrick D Surry. Real-world uplift modelling with significance based uplift trees. White Paper TR-2011-1, Stochastic Solutions, 2011.
Yan Zhao, Xiao Fang, and David Simchi-Levi. Uplift modeling with multiple treatments and general response types. Proceedings of the 2017 SIAM International Conference on Data Mining, SIAM, 2017.
Sören R. Künzel, Jasjeet S. Sekhon, Peter J. Bickel, and Bin Yu. Metalearners for estimating heterogeneous treatment effects using machine learning. Proceedings of the National Academy of Sciences, 2019.
Xinkun Nie and Stefan Wager. Quasi-Oracle Estimation of Heterogeneous Treatment Effects. Atlantic Causal Inference Conference, 2018.

Related projects

uplift: uplift models in R
grf: generalized random forests that include heterogeneous treatment effect estimation in R
rlearner: An R package that implements R-Learner
DoWhy: Causal inference in Python based on Judea Pearl's do-calculus
EconML: A Python package that implements heterogeneous treatment effect estimators from econometrics and machine learning methods

Project details

Release history

Version	Release date
0.15.1	Apr 19, 2024
0.15.0	Feb 22, 2024
0.14.1	Aug 28, 2023
0.14.0	Jul 10, 2023
0.13.0	Sep 2, 2022
0.12.3	Mar 14, 2022
0.12.2	Feb 18, 2022
0.12.1	Feb 5, 2022
0.12.0	Jan 14, 2022
0.11.1	Aug 2, 2021
0.11.0	Jul 29, 2021
0.10.0	Feb 19, 2021
0.9.0	Oct 23, 2020
0.8.0	Jul 17, 2020
0.7.1	May 7, 2020
0.7.0	Feb 28, 2020
0.6.0	Jan 1, 2020
0.5.0	Nov 26, 2019
0.4.0	Oct 21, 2019
0.3.0	Sep 17, 2019
0.2.3	Aug 14, 2019
0.2.2	Aug 14, 2019 (this version)
0.2.1	Aug 13, 2019
0.2.0	Aug 12, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

causalml-0.2.2.tar.gz (119.8 kB)

Uploaded Aug 14, 2019 Source

Built Distributions

causalml-0.2.2-py3.6-macosx-10.7-x86_64.egg (178.1 kB)

Uploaded Aug 14, 2019 Source

causalml-0.2.2-cp36-cp36m-macosx_10_7_x86_64.whl (104.6 kB)

Uploaded Aug 14, 2019 CPython 3.6m macOS 10.7+ x86-64

Hashes for causalml-0.2.2.tar.gz

Algorithm	Hash digest
SHA256	`f39b7f33af52493f1d586dca58a05169ca47bd4aaf30f762a1a49738bde9a717`
MD5	`ed7e8f0d137b5e9ad5f05bb51ce20c12`
BLAKE2b-256	`f855f4641b6287ef0425c495a8e846bc29a2a5ee7263d51aa0144fa5a9cf77a3`

Hashes for causalml-0.2.2-py3.6-macosx-10.7-x86_64.egg

Algorithm	Hash digest
SHA256	`0e453fba07be1068667bcc3a4d4ff2bf456cec5ca8dbeee9d6ff036feac2cc07`
MD5	`2f9a370e1f5c4472390d1de37ec8816f`
BLAKE2b-256	`2bfd5622b6abc96efbeb19d038d48d4f686c14ca245bfeedcbdf28ad9b0a85ff`

Hashes for causalml-0.2.2-cp36-cp36m-macosx_10_7_x86_64.whl

Algorithm	Hash digest
SHA256	`5c352b014cdac73f02763feaf0c2f34f0017a03c832990c8822ab3e496f1cf57`
MD5	`7bd2cfbda89a47ae7d9d6dc2df8ae0c4`
BLAKE2b-256	`3157559db2b2dbb5f4add9624d9d7159ef87414dfad4ed223ad3f911df6c48df`
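
The digests listed above can be used to check a downloaded file before installing it. The following is a minimal sketch using Python's standard hashlib module; the helper names sha256_of and matches are hypothetical, and the file path in the usage comment assumes the archive was saved to the current directory.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, 'rb') as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b''):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected value."""
    return sha256_of(path) == expected_hex.strip().lower()


# Usage, with the SHA256 value listed for the source distribution:
# matches('causalml-0.2.2.tar.gz',
#         'f39b7f33af52493f1d586dca58a05169ca47bd4aaf30f762a1a49738bde9a717')
```

Reading in chunks keeps memory use flat regardless of file size. A mismatch means the download is corrupt or is not the file that was published.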