Merge pull request #1 from JetBrains-Research/saridormi-dev

Bring current experiments to main
JetBrains-Research · Mar 29, 2024 · e76de6f · e76de6f
2 parents 6a6fa80 + 1d1d9fd
commit e76de6f
Show file tree

Hide file tree

Showing 93 changed files with 15,886 additions and 0 deletions.
diff --git a/.github/workflows/workflow.yaml b/.github/workflows/workflow.yaml
@@ -0,0 +1,34 @@
+name: Style and typing checks
+
+on: push
+
+jobs:
+  build:
+
+    runs-on: ubuntu-latest
+
+    steps:
+      - uses: actions/checkout@v3
+      - name: Set up Python 3.11
+        uses: actions/setup-python@v5
+        with:
+          python-version: "3.11"
+
+      - name: Install and configure Poetry
+        uses: snok/install-poetry@v1
+
+      - name: Install dependencies
+        run: |
+          poetry install --no-interaction
+
+      - name: Lint with ruff
+        run: |
+          poetry run ruff check
+
+      - name: Check formatting with ruff
+        run: |
+          poetry run ruff format --check
+
+      - name: Check types with mypy
+        run: |
+          poetry run mypy .
diff --git a/LICENSE b/LICENSE
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2024 JetBrains Research
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
diff --git a/README.md b/README.md
@@ -0,0 +1,149 @@
+# 🤖📝 Planning Library
+
+## Installation
+
+Currently, only installation from source is supported:
+
+```shell
+pip install git+https://github.com/JetBrains-Research/planning-library.git@saridormi-dev
+```
+
+## Quick Tour
+
+> :construction: Subject to change
+
+Currently, we have two types of strategies: **custom strategies** and
+**[LangGraph](https://github.com/langchain-ai/langgraph/tree/main) strategies**.
+
+### Custom strategies
+
+Custom strategies follow the interface provided
+by [`BaseCustomStrategy`](planning_library/strategies/base_strategy.py).
+
+**Example:** [Tree of Thoughts + DFS](planning_library/strategies/tot_dfs/tot_strategy.py)
+
+#### Initializing strategy
+
+Each custom strategy can be created by envoking a static method `create` with at least agent and tools.
+
+```python
+from planning_library.strategies import TreeOfThoughtsDFSStrategy
+
+agent = ...  # any runnable that follows either RunnableAgent or RunnableMultiActionAgent
+tools = [...]  # any sequence of tools
+strategy_executor = TreeOfThoughtsDFSStrategy.create(
+    agent=agent,
+    tools=tools,
+)
+```
+
+Some strategies contain other meaningful components (e.g., an evaluator, which is responsible for evaluating
+intermediate steps). :construction: We will provide some default implementations for such components, but they can also
+be redefined
+them with custom runnables tailored for specific tasks.
+
+#### Using strategy
+
+Each custom strategy is an instance of [`Chain`](https://python.langchain.com/docs/modules/chains/) and mostly can be
+used the same
+way as the default [`AgentExecutor`](https://python.langchain.com/docs/modules/agents/quick_start#create-the-agent) from
+LangChain.
+
+```python
+strategy_executor.invoke({"inputs": "Hello World"})
+```
+
+### LangGraph strategies
+
+Strategies powered by [LangGraph library](https://github.com/langchain-ai/langgraph) follow the interface provided
+by [`BaseLangGraphStrategy`](planning_library/strategies/base_strategy.py).
+
+**Example:** [Reflexion](planning_library/strategies/reflexion/reflexion_strategy.py)
+
+#### Initializing strategy
+
+Each LangGraph strategy can be created by envoking a static method `create` with (at least) agent and tools.
+
+```python
+from planning_library.strategies import ReflexionStrategy
+
+agent = ...  # any runnable that follows either RunnableAgent or RunnableMultiActionAgent
+tools = [...]  # any sequence of tools
+strategy_graph = ReflexionStrategy.create(agent=agent, tools=tools)
+```
+
+Some strategies contain other meaningful components (e.g., an evaluator, which is responsible for evaluating
+intermediate steps). :construction: We will provide some default implementations for such components, but they can also
+be redefined
+them with custom runnables tailored for specific tasks.
+
+#### Using strategy
+
+[`BaseLangGraphStrategy.create`](planning_library/strategies/base_strategy.py) returns a
+compiled [`StateGraph`](https://github.com/langchain-ai/langgraph?tab=readme-ov-file#stategraph) that exposes the same
+interface as any LangChain runnable.
+
+```python
+strategy_graph.invoke({"inputs": "Hello World"})
+```
+
+## Available Strategies
+
+|              Name              |                                   Implementation                                   |   Type    |                                                Paper                                                 |
+|:------------------------------:|:----------------------------------------------------------------------------------:|:---------:|:----------------------------------------------------------------------------------------------------:|
+| Tree of Thoughts + DFS / DFSDT | [`TreeOfThoughtsDFSStrategy`](planning_library/strategies/tot_dfs/tot_strategy.py) |  Custom   | [:scroll: ToT](https://arxiv.org/abs/2305.10601), [:scroll: DFSDT](https://arxiv.org/abs/2307.16789) |
+|           Reflexion            | [`ReflexionStrategy`](planning_library/strategies/reflexion/reflexion_strategy.py) | LangGraph |                             [:scroll:](https://arxiv.org/abs/2303.11366)                             |
+|             ADaPT              |       [`ADaPTStrategy`](planning_library/strategies/adapt/adapt_strategy.py)       |  Custom   |                             [:scroll:](https://arxiv.org/abs/2311.05772)                             |
+|          Simple/ReAct          |     [`SimpleStrategy`](planning_library/strategies/simple/simple_strategy.py)      |  Custom   |                             [:scroll:](https://arxiv.org/abs/2210.03629)                             |
+
+## Available Environments
+
+### :two::four: Game of 24
+
+> Game of 24 is a mathematical reasoning task. The goal is to reach the number 24 by applying arithmetical operations
+> to four given numbers. See :scroll: [Tree of Thoughts](https://arxiv.org/abs/2305.10601) paper for more details.
+
+Our implementation of Game of 24 is available under [`environments/game_of_24`](environments/game_of_24) folder. It
+includes a set of prompts, a set of tools and examples of running available strategies on Game of 24.
+
+* Common:
+    * [Gymnasium](https://gymnasium.farama.org/) env for Game of
+      24: [`environments/game_of_24/common/environment.py`](environments/game_of_24/common/environment.py)
+    * Tools for Game of 24: [`environments/game_of_24/common/tools.py`](environments/game_of_24/common/tools.py)
+* Strategies:
+    * Tree of Thoughts + DFS
+      example: [`environments/game_of_24/tot_dfs.ipynb`](environments/game_of_24/tot_dfs.ipynb)
+    * Reflexion example: [`environments/game_of_24/reflexion.ipynb`](environments/game_of_24/reflexion.ipynb)
+
+### :snowflake: FrozenLake
+
+> FrozenLake is a simple environment that requires crossing a frozen lake from start to goal without falling into any
+> holes.
+> See [Gymnasium docs](https://gymnasium.farama.org/environments/toy_text/frozen_lake/) for more details.
+
+Our implementation of FrozenLake is available under [`environments/frozen_lake`](environments/frozen_lake) folder.
+
+* Common:
+    * Env wrapper for
+      FrozenLake: [`environments/frozen_lake/common/environment.py`](environments/frozen_lake/common/environment.py)
+    * Tools for FrozenLake: [`environments/frozen_lake/common/tools.py`](environments/frozen_lake/common/tools.py)
+* Strategies:
+    * Tree of Thoughts + DFS
+      example: [`environments/frozen_lake/tot_dfs.ipynb`](environments/frozen_lake/tot_dfs.ipynb)
+    * Reflexion example: [`environments/frozen_lake/reflexion.ipynb`](environments/frozen_lake/reflexion.ipynb)
+
+### :house: ALFWorld
+
+> ALFWorld contains interactive TextWorld environments for household navigation. See :scroll: 
+> [ALFWorld](https://arxiv.org/abs/2010.03768) paper or [project website](https://alfworld.github.io/) for more
+> information.
+
+Our implementation of ALFWorld is available under [`environments/alfword`](environments/alfword) folder.
+
+* Common:
+    * Env wrapper for
+      ALFWorld: [`environments/alfworld/common/environment.py`](environments/alfworld/common/environment.py)
+    * Tools for ALFWorld: [`environments/alfworld/common/tools.py`](environments/alfworld/common/tools.py)
+* Strategies:
+    * Reflexion example: [`environments/alfworld/reflexion.ipynb`](environments/alfworld/reflexion.ipynb)
+    * ADaPT example: [`environments/alfworld/adapt.ipynb`](environments/alfworld/adapt.ipynb)
diff --git a/environments/alfworld/__init__.py b/environments/alfworld/__init__.py