Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

SparseZoo

Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

Overview

SparseZoo is a constantly-growing repository of sparsified (pruned and pruned-quantized) models with matching sparsification recipes for neural networks. It simplifies and accelerates your time-to-value in building performant deep learning models with a collection of inference-optimized models and recipes to prototype from. Read more about sparsification.

Available via API and hosted in the cloud, the SparseZoo contains both baseline and models sparsified to different degrees of inference performance vs. baseline loss recovery. Recipe-driven approaches built around sparsification algorithms allow you to use the models as given, transfer-learn from the models onto private datasets, or transfer the recipes to your architectures.

The GitHub repository contains the Python API code to handle the connection and authentication to the cloud.

🚨 New SparseZoo Models 🚨

🎃 October 2023 🎃

Generative AI

Sparse MPT Models - 21 variants
- ⚡ Highlighted Model ⚡ : mpt-7b-gsm8k_mpt_pretrain-pruned80_quantized
Sparse OPT Models - 12 variants
- ⚡ Highlighted Model ⚡ : opt-6.7b-opt_pretrain-pruned50_quantW8A8
Sparse Codegen (mono,multi) Models - 10 variants
- ⚡ Highlighted Model ⚡ : codegen_multi-350m-bigquery_thepile-pruned50_quantized

Highlights

Installation

This repository is tested on Python 3.8-3.11, and Linux/Debian systems. It is recommended to install in a virtual environment to keep your system in order.

Install with pip using:

pip install sparsezoo

Quick Tour

The SparseZoo Python API enables you to search and download sparsified models. Code examples are given below. We encourage users to load SparseZoo models by copying a stub directly from a model page.

Introduction to Model Class Object

The Model is a fundamental object that serves as a main interface with the SparseZoo library. It represents a SparseZoo model, together with all its directories and files.

Creating a Model Class Object From SparseZoo Stub

from sparsezoo import Model

stub = "zoo:cv/classification/resnet_v1-50/pytorch/sparseml/imagenet/pruned95_quant-none"

model = Model(stub)
print(str(model))

>> Model(stub=zoo:cv/classification/resnet_v1-50/pytorch/sparseml/imagenet/pruned95_quant-none)

Creating a Model Class Object From Local Model Directory

from sparsezoo import Model

directory = ".../.cache/sparsezoo/eb977dae-2454-471b-9870-4cf38074acf0"

model = Model(directory)
print(str(model))

>> Model(directory=.../.cache/sparsezoo/eb977dae-2454-471b-9870-4cf38074acf0)

Manually Specifying the Model Download Path

Unless specified otherwise, the model created from the SparseZoo stub is saved to the local sparsezoo cache directory. This can be overridden by passing the optional download_path argument to the constructor:

from sparsezoo import Model

stub = "zoo:cv/classification/resnet_v1-50/pytorch/sparseml/imagenet/pruned95_quant-none"
download_directory = "./model_download_directory"

model = Model(stub, download_path = download_directory)

Downloading the Model Files

Once the model is initialized from a stub, it may be downloaded either by calling the download() method or by invoking a path property. Both pathways are universal for all the files in SparseZoo. Invoking the path property will always trigger file download unless the file has already been downloaded.

# method 1
model.download() 

# method 2 
model_path = model.path

Inspecting the Contents of the SparseZoo Model

We call the available_files method to inspect which files are present in the SparseZoo model. Then, we select a file by calling the appropriate attribute:

model.available_files

>> {'training': Directory(name=training), 
>> 'deployment': Directory(name=deployment), 
>> 'sample_inputs': Directory(name=sample_inputs.tar.gz), 
>> 'sample_outputs': {'framework': Directory(name=sample_outputs.tar.gz)}, 
>> 'sample_labels': Directory(name=sample_labels.tar.gz), 
>> 'model_card': File(name=model.md), 
>> 'recipes': Directory(name=recipe), 
>> 'onnx_model': File(name=model.onnx)}

Then, we might take a closer look at the contents of the SparseZoo model:

model_card = model.model_card
print(model_card)

>> File(name=model.md)

model_card_path = model.model_card.path
print(model_card_path)

>> .../.cache/sparsezoo/eb977dae-2454-471b-9870-4cf38074acf0/model.md

Model, Directory, and File

In general, every file in the SparseZoo model shares a set of attributes: name, path, URL, and parent:

name serves as an identifier of the file/directory
path points to the location of the file/directory
URL specifies the server address of the file/directory in question
parent points to the location of the parent directory of the file/directory in question

A directory is a unique type of file that contains other files. For that reason, it has an additional files attribute.

print(model.onnx_model)

>> File(name=model.onnx)

print(f"File name: {model.onnx_model.name}\n"
      f"File path: {model.onnx_model.path}\n"
      f"File URL: {model.onnx_model.url}\n"
      f"Parent directory: {model.onnx_model.parent_directory}")
      
>> File name: model.onnx
>> File path: .../.cache/sparsezoo/eb977dae-2454-471b-9870-4cf38074acf0/model.onnx
>> File URL: https://models.neuralmagic.com/cv-classification/...
>> Parent directory: .../.cache/sparsezoo/eb977dae-2454-471b-9870-4cf38074acf0

print(model.recipes)

>> Directory(name=recipe)

print(f"File name: {model.recipes.name}\n"
      f"Contains: {[file.name for file in model.recipes.files]}\n"
      f"File path: {model.recipes.path}\n"
      f"File URL: {model.recipes.url}\n"
      f"Parent directory: {model.recipes.parent_directory}")
      
>> File name: recipe
>> Contains: ['recipe_original.md', 'recipe_transfer-classification.md']
>> File path: /home/user/.cache/sparsezoo/eb977dae-2454-471b-9870-4cf38074acf0/recipe
>> File URL: None
>> Parent directory: /home/user/.cache/sparsezoo/eb977dae-2454-471b-9870-4cf38074acf0

Selecting Checkpoint-Specific Data

A SparseZoo model may contain several checkpoints. The model may contain a checkpoint that had been saved before the model was quantized - that checkpoint would be used for transfer learning. Another checkpoint might have been saved after the quantization step - that one is usually directly used for inference.

The recipes may also vary depending on the use case. We may want to access a recipe that was used to sparsify the dense model (recipe_original) or the one that enables us to sparse transfer learn from the already sparsified model (recipe_transfer).

There are two ways to access those specific files.

Accessing Recipes (Through Python API)

available_recipes = model.recipes.available
print(available_recipes)

>> ['original', 'transfer-classification']

transfer_recipe = model.recipes["transfer-classification"]
print(transfer_recipe)

>> File(name=recipe_transfer-classification.md)

original_recipe = model.recipes.default # recipe defaults to `original`
original_recipe_path = original_recipe.path # downloads the recipe and returns its path
print(original_recipe_path)

>> .../.cache/sparsezoo/eb977dae-2454-471b-9870-4cf38074acf0/recipe/recipe_original.md

Accessing Checkpoints (Through Python API)

In general, we are expecting the following checkpoints to be included in the model:

checkpoint_prepruning
checkpoint_postpruning
checkpoint_preqat
checkpoint_postqat

The checkpoint that the model defaults to is the preqat state (just before the quantization step).

from sparsezoo import Model

stub = "zoo:nlp/question_answering/bert-base/pytorch/huggingface/squad/pruned_quant_3layers-aggressive_84"

model = Model(stub)
available_checkpoints = model.training.available
print(available_checkpoints)

>> ['preqat']

preqat_checkpoint = model.training.default # recipe defaults to `preqat`
preqat_checkpoint_path = preqat_checkpoint.path # downloads the checkpoint and returns its path
print(preqat_checkpoint_path)

>> .../.cache/sparsezoo/0857c6f2-13c1-43c9-8db8-8f89a548dccd/training

[print(file.name) for file in preqat_checkpoint.files]

>> vocab.txt
>> special_tokens_map.json
>> pytorch_model.bin
>> config.json
>> training_args.bin
>> tokenizer_config.json
>> trainer_state.json
>> tokenizer.json

Accessing Recipes (Through Stub String Arguments)

You can also directly request a specific recipe/checkpoint type by appending the appropriate URL query arguments to the stub:

from sparsezoo import Model

stub = "zoo:cv/classification/resnet_v1-50/pytorch/sparseml/imagenet/pruned95_quant-none?recipe=transfer"

model = Model(stub)

# Inspect which files are present.
# Note that the available recipes are restricted
# according to the specified URL query arguments
print(model.recipes.available)

>> ['transfer-classification']

transfer_recipe = model.recipes.default # Now the recipes default to the one selected by the stub string arguments
print(transfer_recipe)

>> File(name=recipe_transfer-classification.md)

Accessing Sample Data

The user may easily request a sample batch of data that represents the inputs and outputs of the model.

sample_data = model.sample_batch(batch_size = 10)

print(sample_data['sample_inputs'][0].shape)
>> (10, 3, 224, 224) # (batch_size, num_channels, image_dim, image_dim)

print(sample_data['sample_outputs'][0].shape)
>> (10, 1000) # (batch_size, num_classes)

Model Search

The function search_models enables the user to quickly filter the contents of SparseZoo repository to find the stubs of interest:

from sparsezoo import search_models

args = {
    "domain": "cv",
    "sub_domain": "segmentation",
    "architecture": "yolact",
}

models = search_models(**args)
[print(model) for model in models]

>> Model(stub=zoo:cv/segmentation/yolact-darknet53/pytorch/dbolya/coco/pruned82_quant-none)
>> Model(stub=zoo:cv/segmentation/yolact-darknet53/pytorch/dbolya/coco/pruned90-none)
>> Model(stub=zoo:cv/segmentation/yolact-darknet53/pytorch/dbolya/coco/base-none)

Environmental Variables

Users can specify the directory where models (temporarily during download) and its required credentials will be saved in your working machine. SPARSEZOO_MODELS_PATH is the path where the downloaded models will be saved temporarily. Default ~/.cache/sparsezoo/ SPARSEZOO_CREDENTIALS_PATH is the path where credentials.yaml will be saved. Default ~/.cache/sparsezoo/

Console Scripts

In addition to the Python APIs, a console script entry point is installed with the package sparsezoo. This enables easy interaction straight from your console/terminal.

Downloading

Download command help

sparsezoo.download -h

Download ResNet-50 Model

sparsezoo.download zoo:cv/classification/resnet_v1-50/pytorch/sparseml/imagenet/base-none

Download pruned and quantized ResNet-50 Model

sparsezoo.download zoo:cv/classification/resnet_v1-50/pytorch/sparseml/imagenet/pruned_quant-moderate

Searching

Search command help

sparsezoo search -h

Searching for all classification MobileNetV1 models in the computer vision domain

sparsezoo search --domain cv --sub-domain classification --architecture mobilenet_v1

Searching for all ResNet-50 models

sparsezoo search --domain cv --sub-domain classification \
    --architecture resnet_v1 --sub-architecture 50

For a more in-depth read, check out SparseZoo documentation.

Resources

Learning More

Documentation: SparseML, SparseZoo, Sparsify, DeepSparse
Neural Magic: Blog, Resources

Release History

Official builds are hosted on PyPI

stable: sparsezoo
nightly (dev): sparsezoo-nightly

Additionally, more information can be found via GitHub Releases.

License

The project is licensed under the Apache License Version 2.0.

Community

Contribute

We appreciate contributions to the code, examples, integrations, and documentation as well as bug reports and feature requests! Learn how here.

Join

For user help or questions about SparseZoo, sign up or log in to our Neural Magic Community Slack. We are growing the community member by member and happy to see you there. Bugs, feature requests, or additional questions can also be posted to our GitHub Issue Queue.

You can get the latest news, webinar and event invites, research papers, and other ML Performance tidbits by subscribing to the Neural Magic community.

For more general questions about Neural Magic, please fill out this form.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

1.8.0.20240401

Apr 1, 2024

1.8.0.20240325

Mar 25, 2024

1.7.0.20240304

Mar 5, 2024

1.7.0.20240131

Jan 31, 2024

1.7.0.20240124

Jan 24, 2024

1.7.0.20240103

Jan 3, 2024

1.7.0.20231210

Dec 11, 2023

1.7.0.20231206

Dec 6, 2023

1.6.0.20231201

Dec 1, 2023

1.6.0.20231120

Nov 20, 2023

1.6.0.20231110

Nov 10, 2023

1.6.0.20231031

Oct 31, 2023

1.6.0.20231020

Oct 20, 2023

1.6.0.20231019

Oct 19, 2023

1.6.0.20231012

Oct 12, 2023

1.6.0.20231011

Oct 11, 2023

1.6.0.20231007

Oct 9, 2023

1.6.0.20230928

Sep 28, 2023

1.6.0.20230923

Sep 23, 2023

1.6.0.20230906

Sep 6, 2023

1.6.0.20230829

Aug 29, 2023

1.6.0.20230825

Aug 25, 2023

1.6.0.20230823

Aug 23, 2023

1.6.0.20230815

Aug 16, 2023

1.6.0.20230811

Aug 14, 2023

1.6.0.20230809

Aug 9, 2023

1.6.0.20230801

Aug 1, 2023

1.6.0.20230723

Jul 24, 2023

1.6.0.20230720

Jul 20, 2023

1.6.0.20230717

Jul 17, 2023

1.6.0.20230707

Jul 10, 2023

1.6.0.20230629

Jun 30, 2023

1.6.0.20230623

Jun 23, 2023

1.6.0.20230616

Jun 16, 2023

1.6.0.20230608

Jun 8, 2023

1.6.0.20230604

Jun 5, 2023

1.5.0.20230531

May 31, 2023

1.5.0.20230521

May 24, 2023

1.5.0.20230520

May 24, 2023

1.5.0.20230516

May 16, 2023

1.5.0.20230509

May 9, 2023

1.5.0.20230502

May 2, 2023

1.5.0.20230420

Apr 20, 2023

1.5.0.20230418

Apr 18, 2023

1.5.0.20230412

Apr 12, 2023

1.5.0.20230404

Apr 4, 2023

1.5.0.20230330

Mar 30, 2023

1.5.0.20230329

Mar 29, 2023

1.5.0.20230320

Mar 20, 2023

1.5.0.20230301

Mar 2, 2023

1.5.0.20230228

Feb 28, 2023

1.5.0.20230224

Feb 27, 2023

1.5.0.20230216

Feb 16, 2023

1.4.0.20230210

Feb 13, 2023

1.4.0.20230208

Feb 8, 2023

1.4.0.20230202

Feb 2, 2023

1.4.0.20230124

Jan 24, 2023

1.4.0.20230120

Jan 20, 2023

1.4.0.20230117

Jan 17, 2023

1.4.0.20230114

Jan 16, 2023

1.4.0.20230105

Jan 5, 2023

1.4.0.20221230

Dec 31, 2022

1.3.0.20221216

Dec 17, 2022

1.3.0.20221129

Nov 29, 2022

1.3.0.20221121

Nov 21, 2022

1.3.0.20221118

Nov 18, 2022

1.3.0.20221115

Nov 15, 2022

1.3.0.20221108

Nov 8, 2022

1.3.0.20221104

Nov 4, 2022

1.2.0.20221024

Oct 24, 2022

1.2.0.20221012

Oct 12, 2022

1.2.0.20221003

Oct 3, 2022

1.2.0.20220927

Sep 28, 2022

1.2.0.20220916

Sep 16, 2022

1.2.0.20220903

Sep 6, 2022

1.2.0.20220826

Aug 26, 2022

1.2.0.20220825

Aug 25, 2022

1.1.0.20220819

Aug 19, 2022

1.1.0.20220810

Aug 10, 2022

1.1.0.20220804

Aug 4, 2022

1.1.0.20220803

Aug 3, 2022

1.1.0.20220722

Jul 22, 2022

1.1.0.20220715

Jul 15, 2022

1.1.0.20220713

Jul 13, 2022

1.1.0.20220707

Jul 7, 2022

1.1.0.20220701

Jul 1, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

sparsezoo_nightly-1.8.0.20240401-py3-none-any.whl (178.2 kB view hashes)

Uploaded Apr 1, 2024 Python 3

Hashes for sparsezoo_nightly-1.8.0.20240401-py3-none-any.whl

Hashes for sparsezoo_nightly-1.8.0.20240401-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d2c65d06ae7b960f2311adb6f8b37be28713f53de4f9e19994cf2b111d12344f`
MD5	`4da0d48ec96bb2357afa030d07f194ed`
BLAKE2b-256	`a3fa65d3cc1595eb13a9edd4378445ecebc3f2f8c57f116a93b0625bec29638e`

sparsezoo-nightly 1.8.0.20240401

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Project description

SparseZoo

Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

Overview

🚨 New SparseZoo Models 🚨

🎃 October 2023 🎃

Highlights

Installation

Quick Tour

Introduction to Model Class Object

Creating a Model Class Object From SparseZoo Stub

Creating a Model Class Object From Local Model Directory

Manually Specifying the Model Download Path

Downloading the Model Files

Inspecting the Contents of the SparseZoo Model

Model, Directory, and File

Selecting Checkpoint-Specific Data

Accessing Recipes (Through Python API)

Accessing Checkpoints (Through Python API)

Accessing Recipes (Through Stub String Arguments)

Accessing Sample Data

Model Search

Environmental Variables

Console Scripts

Downloading

Searching

Resources

Learning More

Release History

License

Community

Contribute

Join

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution