
A Python package for supervised and unsupervised machine learning.


## pycaret

pycaret is a free and open-source machine learning library for the Python programming language. It is built around several popular machine learning libraries in Python. Its primary objective is to reduce the cycle time from hypothesis to insight by providing an easy-to-use, high-level, unified API. pycaret's vision is to become the de facto standard for teaching machine learning and data science. Our strength is an easy-to-use, unified interface for both supervised and unsupervised machine learning problems. It saves the time and effort that citizen data scientists, students and researchers spend on coding, or on learning to code across different interfaces, so that they can focus on the business problem and value creation.

Key Features

  • Ease of Use
  • Focus on Business Problem
  • 10x Efficiency
  • Collaboration
  • Business Ready
  • Cloud Ready

Current Release

The current release is beta 0.0.4 (as of 23/12/2019). A full public release is targeted to be available by 31/12/2020.

Installation

Dependencies

Please read requirements.txt for the list of requirements. They are installed automatically when pycaret is installed using pip.

User Installation

The easiest way to install pycaret is using pip.

pip install pycaret

Quick Start

As of beta 0.0.4, the classification, regression and nlp modules are available. Future releases will include Anomaly Detection, Association Rules, Clustering, Recommender Systems and Time Series.
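
Each task lives in its own module, and the examples below call that module's functions (setup, create_model, tune_model, and so on) directly. The following is a minimal sketch of the import pattern, assuming the per-task functions are exposed at module level exactly as the quick start uses them; the classification examples that follow assume the first import.

from pycaret.classification import *  # exposes setup(), create_model(), tune_model(), etc. for classification
# from pycaret.regression import *    # same pattern for the regression module
# from pycaret.nlp import *           # same pattern for the nlp module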

Classification

Getting data from the pycaret repository

from pycaret.datasets import get_data
juice = get_data('juice')
  1. Initializing the pycaret environment
exp1 = setup(juice, 'Purchase')
  2. Creating a simple logistic regression (includes fitting, CV and metric evaluation)
lr = create_model('lr')

List of available estimators (the code in parentheses is the ID passed to create_model; see the sketch after this list):

Logistic Regression (lr)
K Nearest Neighbour (knn)
Naive Bayes (nb)
Decision Tree (dt)
Support Vector Machine - Linear (svm)
SVM Radial Function (rbfsvm)
Gaussian Process Classifier (gpc)
Multi Layer Perceptron (mlp)
Ridge Classifier (ridge)
Random Forest (rf)
Quadratic Discriminant Analysis (qda)
Adaboost (ada)
Gradient Boosting Classifier (gbc)
Linear Discriminant Analysis (lda)
Extra Trees Classifier (et)
Extreme Gradient Boosting - xgboost (xgboost)
Light Gradient Boosting - Microsoft LightGBM (lightgbm)
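
Because these IDs are what create_model accepts, several estimators can be trained in one pass. A minimal sketch, assuming the environment has already been initialized with setup() as in step 1:

models = {}
for model_id in ['lr', 'dt', 'rf', 'xgboost']:
    models[model_id] = create_model(model_id)  # fitting, CV and metric evaluation for each estimator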

  3. Tuning a model using built-in grids.
tuned_xgb = tune_model('xgboost')
  4. Ensembling a model
dt = create_model('dt')
dt_bagging = ensemble_model(dt, method='Bagging')
dt_boosting = ensemble_model(dt, method='Boosting')
  5. Creating a voting classifier
voting_all = blend_models() #creates voting classifier for entire library

#create voting classifier for specific models
lr = create_model('lr')
svm = create_model('svm')
mlp = create_model('mlp')
xgboost = create_model('xgboost')

voting_clf2 = blend_models( [ lr, svm, mlp, xgboost ] )
  6. Stacking Models in a Single Layer
#create individual classifiers
lr = create_model('lr')
svm = create_model('svm')
mlp = create_model('mlp')
xgboost = create_model('xgboost')

stacker = stack_models( [lr,svm,mlp], meta_model = xgboost )
  7. Stacking Models in Multiple Layers
#create individual classifiers
lr = create_model('lr')
svm = create_model('svm')
mlp = create_model('mlp')
gbc = create_model('gbc')
nb = create_model('nb')
lightgbm = create_model('lightgbm')
knn = create_model('knn')
xgboost = create_model('xgboost')

stacknet = create_stacknet( [ [lr,svm,mlp], [gbc, nb], [lightgbm, knn] ], meta_model = xgboost )
# if meta_model is not specified, the default is Logistic Regression
  8. Plot Models
lr = create_model('lr')
plot_model(lr, plot='auc')

List of available plots (the code in parentheses is the value passed to the plot argument of plot_model; see the sketch after this list):

Area Under the Curve (auc)
Discrimination Threshold (threshold)
Precision Recall Curve (pr)
Confusion Matrix (confusion_matrix)
Class Prediction Error (error)
Classification Report (class_report)
Decision Boundary (boundary)
Recursive Feature Selection (rfe)
Learning Curve (learning)
Manifold Learning (manifold)
Calibration Curve (calibration)
Validation Curve (vc)
Dimension Learning (dimension)
Feature Importance (feature)
Model Hyperparameter (parameter)
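
Because these codes are what the plot argument accepts, several diagnostics can be rendered for the same trained model. A minimal sketch, reusing the lr model created in step 8:

for plot_type in ['auc', 'confusion_matrix', 'feature']:
    plot_model(lr, plot=plot_type)  # one diagnostic per call for the same trained classifier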

  9. Evaluate Model
lr = create_model('lr')
evaluate_model(lr) #displays user interface for interactive plotting
  10. Interpret Tree-Based Models
xgboost = create_model('xgboost')
interpret_model(xgboost)
  11. Saving Model for Deployment
lr = create_model('lr')
save_model(lr, 'lr_23122019')
  12. Saving Entire Experiment Pipeline
save_experiment('expname1')
  13. Loading Model / Experiment
m = load_model('lr_23122019')
e = load_experiment('expname1')
  14. AutoML
aml1 = automl()

Documentation

Documentation work is in progress. It will be uploaded to our website http://www.pycaret.org as soon as it is available. (Target availability: 21/01/2020)

Contributions

Contributions are most welcome. To contribute, please reach out to moez.ali@queensu.ca.
