Tensorflow 2.0 Implementation of GCViT: Global Context Vision Transformer. https://github.com/awsaf49/gcvit-tf

GCViT: Global Context Vision Transformer

Tensorflow 2.0 Implementation of GCViT

This library implements GCViT in TensorFlow 2.0, specifically in the tf.keras.Model subclassing style, to give it a PyTorch flavor.

Model

Architecture:

Local Vs Global Attention:

Result

The official codebase had some issues, which have since been fixed (as of 27 July 2022). Here are the results of the ported weights on ImageNetV2-Test data:

Model	Acc@1	Acc@5	#Params
GCViT-XXTiny	63	85	12M
GCViT-XTiny	66	87	20M
GCViT-Tiny	69	89	28M
GCViT-Small	69	89	51M
GCViT-Base	71	90	90M
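
For readers unfamiliar with the Acc@1 and Acc@5 columns, here is a minimal sketch of how top-k accuracy is commonly computed. It assumes the table reports standard top-1 and top-5 accuracy in percent, which the table itself does not state. The function name `topk_accuracy` and the toy scores are illustrative and are not part of gcvit.

```python
def topk_accuracy(scores, labels, k=1):
    """Percentage of samples whose true label is among the k highest scores."""
    hits = 0
    for row, label in zip(scores, labels):
        # Indices of the k largest scores for this sample.
        topk = sorted(range(len(row)), key=lambda i: row[i], reverse=True)[:k]
        hits += label in topk
    return 100.0 * hits / len(labels)


# Toy example: 3 samples, 4 classes (made-up scores, not model outputs).
scores = [
    [0.1, 0.7, 0.1, 0.1],    # true class 1: top-1 hit
    [0.4, 0.3, 0.2, 0.1],    # true class 1: top-1 miss, top-2 hit
    [0.5, 0.3, 0.15, 0.05],  # true class 3: only a hit when k = 4
]
labels = [1, 1, 3]

print(round(topk_accuracy(scores, labels, k=1), 2))  # 33.33
print(round(topk_accuracy(scores, labels, k=2), 2))  # 66.67
```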

Installation

pip install -U gcvit
# or
# pip install -U git+https://github.com/awsaf49/gcvit-tf
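
To confirm which release was installed, a small check with the standard library may help. This is a sketch: the helper name `installed_version` is hypothetical, and it only assumes that the distribution is registered under the name `gcvit` used in the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Prints a version string if gcvit is installed, otherwise None.
print(installed_version("gcvit"))
```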

Usage

Load the model with the following code:

from gcvit import GCViTTiny
model = GCViTTiny(pretrain=True)

Simple code to check the model's prediction:

import tensorflow as tf
from skimage.data import chelsea
img = tf.keras.applications.imagenet_utils.preprocess_input(chelsea(), mode='torch') # Chelsea the cat
img = tf.image.resize(img, (224, 224))[None,] # resize & create batch
pred = model(img).numpy()
print(tf.keras.applications.imagenet_utils.decode_predictions(pred)[0])

Prediction:

[('n02124075', 'Egyptian_cat', 0.9194835),
('n02123045', 'tabby', 0.009686623), 
('n02123159', 'tiger_cat', 0.0061576385),
('n02127052', 'lynx', 0.0011503297), 
('n02883205', 'bow_tie', 0.00042479983)]
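
The `mode='torch'` argument in the preprocessing call above scales pixels to [0, 1] and then normalizes each channel with the ImageNet mean and standard deviation. The sketch below reproduces that arithmetic for a single RGB pixel in plain Python, as an illustration only. The helper name `torch_mode_pixel` is hypothetical, and real code should keep using `preprocess_input` on whole images.

```python
# Per-channel ImageNet statistics used by Keras for mode='torch' (RGB order).
IMAGENET_MEAN = (0.485, 0.456, 0.406)
IMAGENET_STD = (0.229, 0.224, 0.225)


def torch_mode_pixel(rgb):
    """Normalize one 8-bit RGB pixel: scale to [0, 1], then (x - mean) / std."""
    return tuple(
        (value / 255.0 - mean) / std
        for value, mean, std in zip(rgb, IMAGENET_MEAN, IMAGENET_STD)
    )


print(torch_mode_pixel((0, 0, 0)))        # black pixel: all channels negative
print(torch_mode_pixel((255, 255, 255)))  # white pixel: all channels positive
```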

For feature extraction:

model = GCViTTiny(pretrain=True)  # when pretrain=True, num_classes must be 1000
model.reset_classifier(num_classes=0, head_act=None)
feature = model(img)
print(feature.shape)

Feature:

(None, 512)

For feature map:

model = GCViTTiny(pretrain=True)  # when pretrain=True, num_classes must be 1000
feature = model.forward_features(img)
print(feature.shape)

Feature map:

(None, 7, 7, 512)
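
The 7 × 7 × 512 shape can be reproduced with a little arithmetic. The sketch below assumes a stem that reduces resolution by 4, followed by three stages that each halve the resolution and double the channel count, starting from 64 channels. These defaults are assumptions chosen to match the shape printed above for GCViT-Tiny and are not read from the library, and `feature_map_shape` is a hypothetical helper.

```python
def feature_map_shape(image_size=224, stem_stride=4, num_downsamples=3, base_dim=64):
    """Estimate (height, width, channels) of the final feature map.

    All defaults are assumptions picked to reproduce (7, 7, 512) at 224 x 224.
    """
    size = image_size // stem_stride
    dim = base_dim
    for _ in range(num_downsamples):
        size //= 2  # each downsampling step halves the spatial resolution
        dim *= 2    # and doubles the number of channels
    return (size, size, dim)


print(feature_map_shape(224))  # (7, 7, 512)
```

Under the same assumptions, the 512-dimensional vector from the feature-extraction example would be consistent with a spatial pooling of this map.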

Live-Demo

A live demo of image classification and Grad-CAM with ImageNet weights is available, powered by 🤗 Space and Gradio. Here's an example:

Example

For a working training example, check out these notebooks on Google Colab & Kaggle.

Here is the Grad-CAM result after training on the Flower Classification Dataset:

To Do

New updated weights have been added.
Working training example in Colab & Kaggle.
GradCAM showcase.
Gradio Demo.
Build model with tf.keras.Model.
Port weights from official repo.
Support for TPU.

Acknowledgement

Citation

@article{hatamizadeh2022global,
  title={Global Context Vision Transformers},
  author={Hatamizadeh, Ali and Yin, Hongxu and Kautz, Jan and Molchanov, Pavlo},
  journal={arXiv preprint arXiv:2206.09959},
  year={2022}
}

Release history

Version	Release date
1.1.6	Dec 24, 2023
1.1.5	Oct 16, 2023
1.1.4	May 18, 2023
1.1.3	May 7, 2023
1.1.2	May 7, 2023
1.1.1	Jan 14, 2023
1.1.0	Oct 4, 2022
1.0.9 (this version)	Aug 11, 2022
1.0.8	Jul 31, 2022
1.0.7	Jul 27, 2022
1.0.6	Jul 21, 2022
1.0.5	Jul 20, 2022
1.0.4	Jul 20, 2022
1.0.3	Jul 19, 2022
1.0.2	Jul 19, 2022
1.0.1	Jul 19, 2022
1.0.0	Jul 19, 2022

Download files

Source Distribution

gcvit-1.0.9.tar.gz (12.2 kB)

Uploaded Aug 11, 2022 Source

Built Distribution

gcvit-1.0.9-py3-none-any.whl (16.4 kB)

Uploaded Aug 11, 2022 Python 3

Hashes for gcvit-1.0.9.tar.gz
Algorithm	Hash digest
SHA256	`dea4064527df07e69b9a2f1ef66cb00d5c283d6c3b01ebce9db07ff98b75e1bf`
MD5	`7def172e80a6dec7add0912105cf19f8`
BLAKE2b-256	`9d2638cabd288f958bd71521cc46703b77c06dbef5f8c8656c499b2cc415656e`

Hashes for gcvit-1.0.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1c9b91d2b275c5fca111072c4eb555c4dabe847763c35d5db0dad68d9290526f`
MD5	`ff8187299fc3a48d0eb223a497e68709`
BLAKE2b-256	`b07981ac7d008b9a58c685b2c6b7ecbc6ff60f915828a2e6d2b43e410603cecb`
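
The digests listed above can be used to check a downloaded file. The following is a minimal sketch using Python's `hashlib`. The helper names `file_digests` and `matches_sha256` are hypothetical, and it assumes the BLAKE2b-256 value is a BLAKE2b hash with a 32-byte digest.

```python
import hashlib


def file_digests(path, chunk_size=1 << 20):
    """Compute SHA256, MD5 and BLAKE2b-256 hex digests of a file in one pass."""
    hashers = {
        "SHA256": hashlib.sha256(),
        "MD5": hashlib.md5(),
        "BLAKE2b-256": hashlib.blake2b(digest_size=32),  # 32 bytes = 256 bits
    }
    with open(path, "rb") as f:
        # Read in chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}


def matches_sha256(path, expected_sha256):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return file_digests(path)["SHA256"] == expected_sha256.strip().lower()


# Example usage (assumes the wheel was downloaded to the current directory):
# matches_sha256(
#     "gcvit-1.0.9-py3-none-any.whl",
#     "1c9b91d2b275c5fca111072c4eb555c4dabe847763c35d5db0dad68d9290526f",
# )
```

Comparing the SHA256 value is usually sufficient; the other two digests are computed here only to mirror the table.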