Separation of planning concerns in ReAct-style LLM agents. Planner fine-tuning on synthetic trajectories.
-
Updated
Jul 28, 2024 - Python
Separation of planning concerns in ReAct-style LLM agents. Planner fine-tuning on synthetic trajectories.
Python tools to sample randomly with dont pick closest n elements constraints. Also contains a batch generator for the same to sample with replacement and with repeats if necessary.
Add a description, image, and links to the constrained-sampling topic page so that developers can more easily learn about it.
To associate your repository with the constrained-sampling topic, visit your repo's landing page and select "manage topics."