Skip to main content

Coarse approximation linear function with cross validation

Project description

CircleCI ReadTheDocs

Calf, CalfCV

A binomial classifier that implements the Coarse Approximation Linear Function (CALF).

Contact

Rolf Carlson hrolfrc@gmail.com

Install

Use pip to install calfcv.

pip install calfcv

Introduction

This is a python implementation of the Coarse Approximation Linear Function (CALF). The implementation is based on the greedy forward selection algorithm described in the paper referenced below.

Two classes are provided: Calf, and CalfCV. Calf provides classification and prediction for two classes, the binomial case. Multinomial classification with more than two cases is not implemented. Calf provides a transform method that can be used for feature selection and dimensionality reduction of data sets. Calf requires that the feature matrix be scaled to have zero mean and unit variance. CalfCV provides the same functionality as Calf, but CalfCV includes built in data scaling and cross-validation. Choose Calf over CalfCV if you are optimizing hyperparameters over a grid using cross-validation.

Both Calf and CalfCV are designed for use with scikit-learn pipelines and composite estimators.

Example

from calfcv import CalfCV
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
Make a classification problem
seed = 42
X, y = make_classification(
    n_samples=30,
    n_features=5,
    n_informative=2,
    n_redundant=2,
    n_classes=2,
    random_state=seed
)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=seed)
Train the classifier
cls = CalfCV().fit(X_train, y_train)
Get the score on unseen data
cls.score(X_test, y_test)
0.875

Authors

The CALF algorithm was designed by Clark D. Jeffries, John R. Ford, Jeffrey L. Tilson, Diana O. Perkins, Darius M. Bost, Dayne L. Filer and Kirk C. Wilhelmsen. This python implementation was written by Rolf Carlson.

References

Jeffries, C.D., Ford, J.R., Tilson, J.L. et al. A greedy regression algorithm with coarse weights offers novel advantages. Sci Rep 12, 5440 (2022). https://doi.org/10.1038/s41598-022-09415-2

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

calfcv-0.3.17.tar.gz (12.1 kB view hashes)

Uploaded Source

Built Distribution

calfcv-0.3.17-py3-none-any.whl (11.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page