Project description

nnieqat-pytorch

This is a quantize aware training package for Neural Network Inference Engine(NNIE) on pytorch, it uses hisilicon quantization library to quantize module's weight and input data as fake fp32 format. To train model which is more friendly to NNIE, just import nnieqat and replace torch.nn default modules with corresponding one.

Installation
Usage
Code Examples
Results
Todo
Reference

Installation

Supported Platforms: Linux
Accelerators and GPUs: NVIDIA GPUs via CUDA driver 10.
Dependencies:
- python >= 3.5, < 4
- llvmlite >= 0.31.0
- pytorch >= 1.0
- numba >= 0.42.0
- numpy >= 1.18.1
Install nnieqat via pypi: $ pip install nnieqat

Usage

Replace default module with NNIE quantization optimized one. include:
- torch.nn.modules.conv -> nnieqat.modules.conv
- torch.nn.modules.linear -> nnieqat.modules.linear
- torch.nn.modules.pooling -> nnieqat.modules.pooling
```
from nnieqat.modules import convert_layers
...
...
  model = convert_layers(model)
  print(model)  # Quantized layers have "Quantized" prefix.
...
```

Freeze bn after a few epochs of training

from nnieqat.gpu.quantize import freeze_bn
...
...
    if epoch > 2:
      net.apply(freeze_bn)
...

Unquantize weight before update it

from nnieqat.gpu.quantize import unquant_weight
...
...
    net.apply(unquant_weight)
    optimizer.step()
...

Dump weight quantized model

from nnieqat.gpu.quantize import quant_weight, unquant_weight
...
...
    net.apply(quant_weight)
    save_checkpoint(...)
    net.apply(unquant_weight)
...

Code Examples

Cifar10 quantization aware training example (add nnieqat into pytorch_cifar10_tutorial)

python test/test_cifar10.py
ImageNet quantization finetuning example (add nnieqat into pytorh_imagenet_main.py)

python test/test_imagenet.py --pretrained path_to_imagenet_dataset

Results

ImageNet

python test/test_imagenet.py /data/imgnet/ --arch squeezenet1_1  --lr 0.001 --pretrained --epoch 10   # nnie_lr_e-3_ft
python pytorh_imagenet_main.py /data/imgnet/ --arch squeezenet1_1  --lr 0.0001 --pretrained --epoch 10  # lr_e-4_ft
python test/test_imagenet.py /data/imgnet/ --arch squeezenet1_1  --lr 0.0001 --pretrained --epoch 10  # nnie_lr_e-4_ft

finetune result：

	trt_fp32	trt_int8	nnie
torchvision	0.56992	0.56424	0.56026
nnie_lr_e-3_ft	0.56600	0.56328	0.56612
lr_e-4_ft	0.57884	0.57502	0.57542
nnie_lr_e-4_ft	0.57834	0.57524	0.57730

Todo

Multiple GPU training support.
Other platforms and accelerators support.
Generate quantized model directly.

Reference

HiSVP 量化库使用指南

Quantizing deep convolutional networks for efficient inference: A whitepaper

8-bit Inference with TensorRT

Distilling the Knowledge in a Neural Network

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.1.0b0 pre-release

Aug 16, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nnieqat-0.1.0b0.tar.gz (812.0 kB view hashes)

Uploaded Aug 16, 2020 Source

Built Distribution

nnieqat-0.1.0b0-py3-none-any.whl (818.4 kB view hashes)

Uploaded Aug 16, 2020 Python 3

Hashes for nnieqat-0.1.0b0.tar.gz

Hashes for nnieqat-0.1.0b0.tar.gz
Algorithm	Hash digest
SHA256	`64eed27eef6fae519e01bea098d308b98864ac63c6c169bb14e271eff24bc0c3`
MD5	`89d1035b2debd377f12d17952b3ad741`
BLAKE2b-256	`70f8183569e2141a800f176d287faaf1bc4da31bb64f89874af0e0551d9b6508`

Hashes for nnieqat-0.1.0b0-py3-none-any.whl

Hashes for nnieqat-0.1.0b0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`024229a8c8de835da2e2d73c6689a16970a732dd21529ed0a7489f4b7a337913`
MD5	`bbaa8b7c5491a846cb918e00726118bc`
BLAKE2b-256	`494567489fefc1d9efa5ef5d6de0c7c9eafde86066e72681ed8ea3c5b2c496e0`