Offline learning tweak; adding gradients for `tool_choice` #113

sidnarayanan · 2024-10-22T23:25:20Z

Separated out logic for full-batch offline learning to make it easier on the user
LLMCallOp was missing a gradient for tool_choice; added

ldp/alg/runners.py

.pre-commit-config.yaml

…vements

sidnarayanan · 2024-10-23T17:44:33Z

ldp/graph/common_ops.py

-        # Trainable is metadata that an optimizer can use this. It enables things
-        # like (remote) fine-tuning with OpenAI
-        self.trainable: bool = False


Forgot to flag this earlier - removing unused flag.

ldp/alg/runners.py

jamesbraza

Nice little detail, wow

sidnarayanan added 2 commits October 21, 2024 16:12

changing full batch offline logic

31fbcd0

add tool_choice to LLMCallOp.backward

5c01487

sidnarayanan requested review from whitead, jamesbraza, Ryan-Rhys and albertbou92 October 22, 2024 23:25

jamesbraza reviewed Oct 23, 2024

View reviewed changes

ldp/alg/runners.py Outdated Show resolved Hide resolved

.pre-commit-config.yaml Outdated Show resolved Hide resolved

sidnarayanan added 2 commits October 23, 2024 17:40

Merge branch 'main' of github.com:Future-House/ldp into offline-impro…

bb0d464

…vements

pr comment

49830f9

albertbou92 approved these changes Oct 23, 2024

View reviewed changes

sidnarayanan requested a review from jamesbraza October 23, 2024 17:44

sidnarayanan commented Oct 23, 2024

View reviewed changes

jamesbraza reviewed Oct 23, 2024

View reviewed changes

ldp/alg/runners.py Show resolved Hide resolved

add comment

0138dbf

jamesbraza approved these changes Oct 23, 2024

View reviewed changes

sidnarayanan merged commit ea8733d into main Oct 23, 2024
5 of 6 checks passed

sidnarayanan deleted the offline-improvements branch October 23, 2024 18:42

jamesbraza mentioned this pull request Oct 23, 2024

Simplify ReAct agent #111

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Offline learning tweak; adding gradients for `tool_choice` #113

Offline learning tweak; adding gradients for `tool_choice` #113

sidnarayanan commented Oct 22, 2024

sidnarayanan Oct 23, 2024

jamesbraza left a comment

Offline learning tweak; adding gradients for tool_choice #113

Offline learning tweak; adding gradients for tool_choice #113

Conversation

sidnarayanan commented Oct 22, 2024

sidnarayanan Oct 23, 2024

Choose a reason for hiding this comment

jamesbraza left a comment

Choose a reason for hiding this comment

Offline learning tweak; adding gradients for `tool_choice` #113

Offline learning tweak; adding gradients for `tool_choice` #113