pytorch-optimizer

A collection of optimizer implementations in PyTorch with clean code and strict typing. Inspired by pytorch-optimizer.

Usage
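Install from PyPI with `pip install pytorch-optimizer`. Below is a minimal sketch of dropping one of the bundled optimizers into an ordinary PyTorch training loop. The top-level `from pytorch_optimizer import AdamP` path and the constructor arguments are assumptions about this early release; check the package source for the exact API.

```python
# Minimal sketch: assumes AdamP is importable from the pytorch_optimizer
# package top level; verify the import path and arguments against the source.
import torch
from pytorch_optimizer import AdamP

model = torch.nn.Linear(10, 2)
optimizer = AdamP(model.parameters(), lr=1e-3)

for _ in range(100):
    optimizer.zero_grad()
    loss = model(torch.randn(8, 10)).pow(2).mean()  # toy regression loss
    loss.backward()
    optimizer.step()
```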

Supported Optimizers

| Optimizer | Description | Official Code | Paper |
|---|---|---|---|
| AdamP | Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights | github | https://arxiv.org/abs/2006.08217 |
| Adaptive Gradient Clipping (AGC) | High-Performance Large-Scale Image Recognition Without Normalization | github | https://arxiv.org/abs/2102.06171 |
| Chebyshev LR Schedules | Acceleration via Fractal Learning Rate Schedules | github | https://arxiv.org/abs/2103.01338v1 |
| Gradient Centralization (GC) | A New Optimization Technique for Deep Neural Networks | github | https://arxiv.org/abs/2004.01461 |
| Lookahead | k steps forward, 1 step back (see the sketch after this table) | github | https://arxiv.org/abs/1907.08610v2 |
| RAdam | On the Variance of the Adaptive Learning Rate and Beyond | github | https://arxiv.org/abs/1908.03265 |
| Ranger | A synergistic optimizer combining RAdam and Lookahead, and now GC, in one optimizer | github | |
| Ranger21 | Integrating the latest deep learning components into a single optimizer | github | |
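
The Lookahead entry above states the rule only as a slogan. Below is a minimal, self-contained sketch of that outer-loop update (slow weights nudged toward the fast weights every k inner steps) in plain PyTorch; the `lookahead_sync` helper is illustrative only and is not this package's API.

```python
import torch

def lookahead_sync(slow_params, fast_params, alpha=0.5):
    # Lookahead outer update: slow <- slow + alpha * (fast - slow),
    # then the fast weights restart from the synced slow weights.
    with torch.no_grad():
        for slow, fast in zip(slow_params, fast_params):
            slow.add_(fast - slow, alpha=alpha)
            fast.copy_(slow)

# Toy usage: k inner steps with a base optimizer, then one outer sync.
model = torch.nn.Linear(10, 2)
base_opt = torch.optim.SGD(model.parameters(), lr=0.1)
slow_weights = [p.detach().clone() for p in model.parameters()]

k = 5
for step in range(1, 101):
    base_opt.zero_grad()
    loss = model(torch.randn(8, 10)).pow(2).mean()
    loss.backward()
    base_opt.step()
    if step % k == 0:
        lookahead_sync(slow_weights, list(model.parameters()))
```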

Citations

AdamP
@inproceedings{heo2021adamp,
  title={AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights},
  author={Heo, Byeongho and Chun, Sanghyuk and Oh, Seong Joon and Han, Dongyoon and Yun, Sangdoo and Kim, Gyuwan and Uh, Youngjung and Ha, Jung-Woo},
  booktitle={International Conference on Learning Representations (ICLR)},
  year={2021}
}
Adaptive Gradient Clipping (AGC)
@article{brock2021high,
  author={Andrew Brock and Soham De and Samuel L. Smith and Karen Simonyan},
  title={High-Performance Large-Scale Image Recognition Without Normalization},
  journal={arXiv preprint arXiv:2102.06171},
  year={2021}
}
Chebyshev LR Schedules
@article{agarwal2021acceleration,
  title={Acceleration via Fractal Learning Rate Schedules},
  author={Agarwal, Naman and Goel, Surbhi and Zhang, Cyril},
  journal={arXiv preprint arXiv:2103.01338},
  year={2021}
}
Gradient Centralization (GC)
@inproceedings{yong2020gradient,
  title={Gradient centralization: A new optimization technique for deep neural networks},
  author={Yong, Hongwei and Huang, Jianqiang and Hua, Xiansheng and Zhang, Lei},
  booktitle={European Conference on Computer Vision},
  pages={635--652},
  year={2020},
  organization={Springer}
}
Lookahead
@article{zhang2019lookahead,
  title={Lookahead optimizer: k steps forward, 1 step back},
  author={Zhang, Michael R and Lucas, James and Hinton, Geoffrey and Ba, Jimmy},
  journal={arXiv preprint arXiv:1907.08610},
  year={2019}
}
RAdam
@inproceedings{liu2019radam,
  title={On the Variance of the Adaptive Learning Rate and Beyond},
  author={Liu, Liyuan and Jiang, Haoming and He, Pengcheng and Chen, Weizhu and Liu, Xiaodong and Gao, Jianfeng and Han, Jiawei},
  booktitle={Proceedings of the Eighth International Conference on Learning Representations (ICLR 2020)},
  month={April},
  year={2020}
}

Author

Hyeongchan Kim / @kozistr

