concordia

Automated monitoring of machine learning models in production. Tracks and finds discrepancies in features, predictions, and labels

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Concordia is a part of a suite of open-source machine learning packages that allow organizations to more rapidly develop and deploy machine learning models.

Installation

pip install concordia

Description

Concordia is a tracking and analytics tool for machine learning models running in production.

Using Concordia, you should be able to rapidly have confidence in your shipped ML models.

If everything’s working as expected, you should have quick proof that you’re in a position to scale up this model.

If things are not going according to plan, you should be able to see that rapidly, and have a suite of information to hone in on the root cause of those discrepancies.

Basic Setup

from concordia import Concordia
concord = Concordia()

ml_predictor = load_ml_model()
concord.add_model(model=model, model_id='model123')

Basic Usage

In your training environment

The goal here is to save the features and predictions as you calcate them in your training environment. Then, we can compare these to the features and predictions coming from your live environment. We use the row_ids to match rows across the two environments.

df = load_my_data()
ml_predictor = train_ml_model()

predictions = ml_predictor.predict(df)

concord.add_data_and_predictions(model_id='model123', features=df, predictions=predictions, row_ids=df['my_row_identifier'])

In your live environment

from concordia import load_concordia
concord = load_concordia()

data = get_live_data()

prediction = concord.predict(model_id='model123', features=data, row_id=data['my_row_identifier'])

In your analytics environment

from Concordia import load_concordia
concord = load_concordia()

concord.analyze_prediction_discrepancies(model_id='model123')

concord.analyze_feature_discrepancies(model_id='model123')

Infrastructure Assumptions

Concordia relies on MongoDB and Redis. These can be either local, or in the cloud. You can specify DB credentials and connection options when creating and loading Concordia.

Database Configuration

You can easily specify your own DB connection. You’ll need to do this both when creating the Concordia instance in the first place, as well as when you load_concordia() to get access to that same Concordia instance later.

persistent_db_config = {
    'db': '__concordia_test_env'
    , 'host': 'localhost'
    , 'port': 27017
}

in_memory_db_config = {
    'db': 8
    , 'host': 'localhost'
    , 'port': 6379
}

concord = Concordia(in_memory_db_config=in_memory_db_config, persistent_db_config=persistent_db_config)

# To load this instance of Concordia later, use that same persistent_db_config
concord = load_concordia(persistent_db_config=persistent_db_config)

What does Concordia do, under the hood?

It ensures a consistent db schema
It ensures a consistent way of saving data (predictions and features), so that data can be matched up and compared later
It builds in a suite of analytics tools for analyzing discrepancies
All of this happens automatically for each model that you ship

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.1.1

Feb 27, 2018

0.1.0

Feb 7, 2018

0.0.7

Jan 23, 2018

0.0.6

Jan 22, 2018

0.0.4

Jan 21, 2018

0.0.3

Jan 19, 2018

0.0.2

Jan 19, 2018

0.0.1

Jan 18, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

concordia-0.1.1.tar.gz (12.2 kB view hashes)

Uploaded Feb 27, 2018 Source

Built Distribution

concordia-0.1.1-py2.py3-none-any.whl (19.2 kB view hashes)

Uploaded Feb 27, 2018 Python 2 Python 3

Hashes for concordia-0.1.1.tar.gz

Hashes for concordia-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`b7f65ee43dbe38718b0702fede8383d321401d3d489d47c8929ef2b5dbd238a8`
MD5	`af4582ac75a747003ed23f9b2a2b91cb`
BLAKE2b-256	`4a07ceea2b4efa997445d5b2fd32e515505e4d2298b1846c82a39e74ba1eee52`

Hashes for concordia-0.1.1-py2.py3-none-any.whl

Hashes for concordia-0.1.1-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`c9309a95cb3944cfe4ece2d281fc750de6cab6e6b5b016cac8674c80f39319f3`
MD5	`8feaec42fad1e59f15387faac6d4cc69`
BLAKE2b-256	`0b0c7a8a04dde866f4a78dbd61beaee793f468b34965fdb80a95a9303684c828`