Contains keras callbacks and optimizers for training keras models
Finds optimal learning rate for model - paper (section 3.3)
Contain following schedulers:
Contains optimizers from official keras repo added with some optimization techniques.
- Weight decay
- Discriminative learning rates
- Weight decay normalization with wd_multi(below algo 2) and adam with restarts - paper