ADOPT, MARS, Prodigy #1

Andron00e · 2025-02-19T07:50:59Z

this pr brings three more methods.
i think there might be an issue with MARS (its mars-shampoo version), ill check today.

comment: for MARS, i changed train_step behaviour a bit: ref

since were using only approximate scheme of MARS, we should only track the past gradient in order to compute g_curr - g_prev, therefore i use this in the train_step:

if optimizer.__class__.__name__ == "MARS":
     optimizer.zero_grad(set_to_none=True) 
     optimizer.update_last_grad()
else:
    optimizer.zero_grad()

Wandb logging

Andron00e and others added 10 commits February 19, 2025 03:03

prodigy is ready, add mars and adopt in the morning

4bb541e

mars is here, adopt todo

116f37e

adopt is here

1f071c4

--fix step adopt

32a258b

--minor

c2e704b

wandb-entity

a3c80db

wandb-entity

9199106

Merge pull request #2 from Andron00e/wandb-logging

d299a20

Wandb logging

fixed adopt

bdf5057

fixed adopt

36ed862

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ADOPT, MARS, Prodigy #1

ADOPT, MARS, Prodigy #1

Uh oh!

Andron00e commented Feb 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ADOPT, MARS, Prodigy #1

Are you sure you want to change the base?

ADOPT, MARS, Prodigy #1

Uh oh!

Conversation

Andron00e commented Feb 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant