
Releases: kozistr/pytorch_optimizer

pytorch-optimizer v2.11.1

19 Jul 12:00
aaaf303

pytorch-optimizer v2.11.0

27 Jun 13:48
42df19c

pytorch-optimizer v2.10.1

13 Jun 06:37
baa65c4

Change Log

Feature

Fix

  • perturb was not multiplied by -step_size in the SWATS optimizer. (#179)
  • the Chebyshev step had size T while the permutation has size 2^T. (#168, #181)

Diff

2.10.0...2.10.1

pytorch-optimizer v2.10.0

07 Jun 14:18
95eee86

Change Log

Feature

Diff

2.9.1...2.10.0

Contributions

thanks to @i404788

pytorch-optimizer v2.9.1

19 May 12:09
9427d3c

Change Log

Fix

  • fix weight decay in Ranger21 (#170)

Diff

2.9.0...2.9.1

pytorch-optimizer v2.9.0

06 May 08:07
4dbfc23

Change Log

Feature

Docs

  • Fix the readthedocs build issue, #156
  • Move citations into a table, #156

Refactor

  • Refactor validation logic, #149, #150
  • Rename the amsbound and amsgrad terms to ams_bound, #149 (see the sketch after this list)
  • Return the gradient instead of the parameter in AGC, #149
  • Refactor duplicated logic (e.g. rectified step size, AMSBound, AdamD, AdaNorm, weight decay) into reusable functions, #150
  • Move pytorch_optimizer.experimental under pytorch_optimizer.*.experimental
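
As a usage note for the ams_bound rename above, here is a minimal sketch; it assumes AdaBound is one of the optimizers exposing the renamed flag (the release note names the term, not the specific optimizers).

    import torch
    from pytorch_optimizer import AdaBound  # assumed to expose the renamed flag

    model = torch.nn.Linear(16, 4)
    # the keyword is now ams_bound (formerly amsbound / amsgrad), per the rename above
    optimizer = AdaBound(model.parameters(), lr=1e-3, ams_bound=True)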

Diff

2.8.0...2.9.0

pytorch-optimizer v2.8.0

29 Apr 08:51
cdfe807

pytorch-optimizer v2.7.0

26 Apr 06:31
7ded073

Change Log

Feature

Refactor

  • Rename adamd_debias_term to adam_debias, #133
  • Merge the rectified version with the original, #133
    • diffRGrad + diffGrad -> diffGrad
    • RaLamb + Lamb -> Lamb
    • the rectified update is now enabled by passing rectify=True (see the sketch after this list)
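
A minimal sketch of the merged interface described above; the rectify flag comes from this release note, and the top-level Lamb import is assumed.

    import torch
    from pytorch_optimizer import Lamb  # RaLamb was merged into Lamb in this release

    model = torch.nn.Linear(16, 4)
    # rectify=True enables the formerly separate rectified (RaLamb-style) update
    optimizer = Lamb(model.parameters(), lr=1e-3, rectify=True)

    loss = model(torch.randn(8, 16)).sum()
    loss.backward()
    optimizer.step()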

Fix

  • Fix the previous_grad deepcopy issue in the Adan optimizer, #134

pytorch-optimizer v2.6.1

22 Apr 12:14
be0351d

Change Log

Fix

  • variables were not located on the same device as the gradients, #132 (related to #131) (thanks to @Bing-su)
  • fix approximate_sq_grad() in the Adafactor optimizer, #132

pytorch-optimizer v2.6.0

22 Apr 07:56
19dcf2b

Change Log

Feature

  • Implement SM3 optimizer, #130
  • Tweak Scalable Shampoo optimizer, #128, #129
    • implement a new preconditioner type, OUTPUT.
    • optimize speed/memory usage of coupled Newton iteration and power iteration methods.
    • use in-place operations (SQRT-N Grafting).
    • clean up shampoo_utils for readability.
    • support a skip_preconditioning_rank_lt parameter to skip preconditioning for low-rank gradients.
    • set the default value of preconditioning_compute_steps to 1000.
    • set the default value of start_preconditioning_step to 25 (see the usage sketch after this list).
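
A minimal usage sketch for the tweaked defaults listed above; the ScalableShampoo class name and top-level import are assumptions, while the parameter names come from this release note.

    import torch
    from pytorch_optimizer import ScalableShampoo  # class name assumed; parameters per the notes above

    model = torch.nn.Linear(64, 8)
    optimizer = ScalableShampoo(
        model.parameters(),
        lr=1e-3,
        start_preconditioning_step=25,        # new default in this release
        preconditioning_compute_steps=1000,   # new default in this release
    )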