
Repository of the low-precision inference toolkit


Intel Low Precision Inference Tool (iLiT)

Intel Low Precision Inference Tool (iLiT) is an open-source Python library that delivers a unified low-precision inference interface across multiple Intel-optimized DL frameworks on both CPU and GPU. It supports automatic, accuracy-driven tuning strategies, along with additional objectives such as performance, model size, and memory footprint. It is also easily extensible with new backends, tuning strategies, metrics, and objectives.
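
For orientation, here is a minimal usage sketch. It is hypothetical: the Tuner class name, the conf.yaml schema, and the tune() signature are assumptions for illustration, and the Tutorial linked below documents the actual API.

import ilit

# Hypothetical sketch, not the documented API: 'Tuner', the conf.yaml schema,
# and the tune() signature are assumptions for illustration only.
tuner = ilit.Tuner('./conf.yaml')     # YAML config: framework, tuning strategy, accuracy target
fp32_model = ...                      # a framework-native FP32 model (TensorFlow/PyTorch/MXNet)
int8_model = tuner.tune(fp32_model)   # accuracy-driven search for a low-precision model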

WARNING

GPU support is under development.

Currently supported Intel-optimized DL frameworks are TensorFlow, PyTorch, and MXNet.

Currently supported tuning strategies include MSE, among others; the Introduction below covers the tuning strategy implementations in detail.

Documentation

  • Introduction explains iLiT's infrastructure, design philosophy, supported functionality, details of the tuning strategy implementations, and tuning results on popular models.
  • Tutorial provides comprehensive step-by-step instructions on how to enable iLiT on sample models.

Install from source

git clone https://github.com/intel/lp-inference-kit.git
cd lp-inference-kit
python setup.py install

Install from binary

# install from pip
pip install ilit

# install from conda
conda config --add channels intel
conda install ilit
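
Either install method can be sanity-checked with a short Python snippet; the __version__ attribute is an assumption, so the check below falls back gracefully if it is absent.

import ilit  # a succeeding import confirms the installation

# __version__ is assumed to exist; getattr keeps the check safe if it does not.
print(getattr(ilit, '__version__', 'installed (version attribute unavailable)'))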

System Requirements

Hardware

iLiT supports systems based on Intel 64 architecture or compatible processors.

Software

iLiT requires the Intel-optimized versions of TensorFlow, PyTorch, and MXNet to be installed.

Tuning Zoo

The following examples are integrated with iLiT for auto-tuning, grouped here by framework:

  • MXNet: ResNet50 V1, MobileNet V1, MobileNet V2, SSD-ResNet50, SqueezeNet V1, ResNet18, Inception V3
  • PyTorch: DLRM, ResNet18, ResNet50 V1.5, ResNet101, BERT-Large (MRPC, SQUAD, RTE, QNLI, CoLA), BERT-Base (SST-2, RTE, STS-B, CoLA, MRPC)
  • TensorFlow: ResNet50 V1, ResNet50 V1.5, ResNet101, Inception V1, Inception V2, Inception V3, Inception V4, Inception ResNet V2, SSD ResNet50 V1

Known Issues

  1. The KL divergence algorithm is very slow on TensorFlow

    Because TensorFlow does not natively support dumping tensors, the current workaround is to add a Print op and dump the values to stdout. So if the model to tune is a TensorFlow model, please restrict calibration.algorithm.activation and calibration.algorithm.weight in the user YAML config file to minmax (a config sketch follows this list).

  2. The MSE tuning strategy doesn't work with the PyTorch adaptor layer

    The MSE tuning strategy needs to compare FP32 and INT8 tensors to decide which ops impact the final quantization accuracy. The PyTorch adaptor layer does not implement this tensor-inspection interface, so if the model to tune is a PyTorch model, please do not choose the MSE tuning strategy.
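
In YAML, the minmax restriction for issue 1 would look roughly like the following; only the dotted key names and the minmax value come from the issue itself, and the surrounding nesting is an assumption about the config schema.

calibration:
  algorithm:              # nesting assumed from the dotted key names above
    activation: minmax    # restrict activation calibration to minmax
    weight: minmax        # restrict weight calibration to minmax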

Support

Please submit your questions, feature requests, and bug reports on the GitHub issues page. You may also reach out to ilit.maintainers@intel.com.

Contributing

We welcome community contributions to iLiT. If you have an idea for improving the library, see the contribution guidelines for details on how to propose it.

This project is intended to be a safe, welcoming space for collaboration, and contributors are expected to adhere to the Contributor Covenant code of conduct.

License

iLiT is licensed under Apache License Version 2.0. This software includes components with separate copyright notices and license terms, distributed under Apache License Version 2.0 and the MIT License; your use of the source code for these components is subject to the terms and conditions of those licenses. See the accompanying LICENSE file for the full license text and copyright notices.


Legal Information

Citing

If you use iLiT in your research or wish to refer to the tuning results published in the Tuning Zoo, please use the following BibTeX entry.

@misc{iLiT,
  author =       {Feng Tian and Chuanqi Wang and Guoming Zhang and Penghui Cheng and Pengxin Yuan and Haihao Shen and Jiong Gong},
  title =        {Intel Low Precision Inference Tool},
  howpublished = {\url{https://github.com/intel/lp-inference-kit}},
  year =         {2020}
}
