This project implements an Agentic AI Reasoning System designed for structured, multi-step reasoning tasks such as logic-based question answering.
It autonomously decomposes problems, selects appropriate tools, executes subtasks, and generates transparent reasoning traces.
Large Language Models (LLMs) often hallucinate intermediate steps or skip verification during logical reasoning.
To address this, the system implements an agentic reasoning framework that:
- Decomposes logic problems into smaller subtasks.
- Selects tools (symbolic solver, calculator, or code execution).
- Executes and verifies sub-results to ensure reliability.
- Generates step-by-step reasoning traces along with the final answer.
The system is designed to run on smaller LLMs or base models and avoid heavy proprietary reasoning models such as GPT-4, GPT-5, Claude 3, or Gemini Ultra.
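The sketch below illustrates the intended control flow under simple assumptions: a naive sentence-level `decompose`, a SymPy-backed solver, and a placeholder `verify` check. All function names here are illustrative, not the project's final API.

```python
# Minimal illustration of the decompose -> solve -> verify loop (names are placeholders).
import sympy

def decompose(problem: str) -> list:
    # Naive decomposition: treat each sentence/clause as one subtask.
    return [part.strip() for part in problem.split(".") if part.strip()]

def solve_subtask(subtask: str):
    # Try the symbolic solver first; fall back to returning the text unchanged.
    try:
        return sympy.sympify(subtask)
    except (sympy.SympifyError, TypeError):
        return subtask

def verify(result) -> bool:
    # Placeholder check: a numeric result counts as verified.
    return isinstance(result, sympy.Basic) and bool(result.is_number)

def answer(problem: str) -> dict:
    trace, final = [], None
    for subtask in decompose(problem):
        result = solve_subtask(subtask)
        trace.append(f"{subtask} -> {result} (verified={verify(result)})")
        final = result
    return {"solution": " | ".join(trace), "answer": final}

print(answer("2 + 2*3"))
```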
```
Agentic-Reasoner/
│
├── data/
│   ├── train.csv
│   └── test.csv
│
├── src/
│   ├── __init__.py
│   ├── main.py
│   ├── data_loader.py
│   ├── reasoning_agent.py
│   ├── tool_selector.py
│   ├── solver.py
│   ├── verifier.py
│   └── utils.py
│
├── outputs/
│   └── output.csv
│
├── eval_runner.py
├── README.md
└── requirements.txt
```
`reasoning_agent.py` implements the agentic controller that:
- Decomposes the main problem into subtasks.
- Chooses appropriate tools for each subtask.
- Integrates all results into a coherent reasoning chain.
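A minimal sketch of what such a controller could look like; the class and method names (`ReasoningAgent`, `run`, `select`, `check`) are illustrative assumptions, not the exact interface of `reasoning_agent.py`.

```python
# Illustrative controller skeleton; the actual reasoning_agent.py interface may differ.
from dataclasses import dataclass, field

@dataclass
class ReasoningAgent:
    tool_selector: object            # exposes select(subtask) -> callable tool
    verifier: object                 # exposes check(subtask, result) -> bool
    trace: list = field(default_factory=list)

    def decompose(self, problem: str) -> list:
        # Placeholder: one subtask per non-empty line of the problem statement.
        return [line.strip() for line in problem.splitlines() if line.strip()]

    def run(self, problem: str):
        final = None
        for step, subtask in enumerate(self.decompose(problem), start=1):
            tool = self.tool_selector.select(subtask)      # pick a tool for this subtask
            result = tool(subtask)                         # execute it
            ok = self.verifier.check(subtask, result)      # verify before trusting it
            self.trace.append(f"Step {step}: {subtask} -> {result} (verified={ok})")
            final = result
        return final, self.trace
```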
`tool_selector.py` chooses among tools such as:
- Symbolic Solver (for algebra and logic)
- Arithmetic Calculator
- Code Execution Module (for programmable subtasks)
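A keyword-based routing sketch, assuming three callable tools; the real `tool_selector.py` may use richer heuristics or an LLM prompt for routing.

```python
# Keyword-based routing sketch; the real tool_selector.py may use richer heuristics.
import sympy

def calculator(expr: str):
    # Exact arithmetic that respects operator precedence.
    return sympy.sympify(expr)

def symbolic_solver(equation: str):
    # Solve a single-variable equation written as "lhs = rhs".
    lhs, rhs = equation.split("=", 1)
    x = sympy.symbols("x")
    return sympy.solve(sympy.Eq(sympy.sympify(lhs), sympy.sympify(rhs)), x)

def code_executor(snippet: str):
    # Run a small Python snippet that stores its answer in `result`
    # (sandboxing is assumed to be handled elsewhere).
    scope = {}
    exec(snippet, scope)
    return scope.get("result")

def select_tool(subtask: str):
    if "=" in subtask and "x" in subtask:
        return symbolic_solver
    if any(op in subtask for op in "+-*/"):
        return calculator
    return code_executor
```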
`solver.py` handles execution of mathematical or logical subtasks.
`verifier.py` checks subtask outputs for consistency and correctness.
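A minimal sketch of how execution and verification could fit together, using SymPy for exact arithmetic; the names `solve_arithmetic` and `verify_arithmetic` are assumptions for illustration.

```python
# Illustrative solver/verifier pair; solver.py and verifier.py may differ in detail.
import sympy

def solve_arithmetic(expr: str):
    # Exact evaluation that respects operator precedence, e.g. "2 + 2*3" -> 8.
    return sympy.sympify(expr)

def verify_arithmetic(expr: str, result) -> bool:
    # Independent re-check: re-parse the expression and compare symbolically.
    return sympy.simplify(sympy.sympify(expr) - result) == 0

value = solve_arithmetic("2 + 2*3")
print(value, verify_arithmetic("2 + 2*3", value))   # 8 True
```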
`utils.py` provides helper functions for logging, formatting reasoning traces, and CSV export.
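An illustrative pair of helpers, assuming pandas for CSV export; the actual `utils.py` may expose a different interface.

```python
# Illustrative helpers; the actual utils.py may expose a different interface.
import pandas as pd

def format_trace(steps: list) -> str:
    # Join numbered reasoning steps into one human-readable string.
    return " ".join(f"Step {i}: {s}." for i, s in enumerate(steps, start=1))

def export_results(rows: list, path: str = "outputs/output.csv") -> None:
    # Write reasoning traces and predictions in the expected column order.
    columns = ["topic", "problem_statement", "solution", "correct_option"]
    pd.DataFrame(rows, columns=columns).to_csv(path, index=False)
```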
```bash
# Clone the repository
git clone https://github.com/<your-username>/Agentic-Reasoner.git
cd Agentic-Reasoner

# Install dependencies
pip install -r requirements.txt
```

You can use `train.csv` to fine-tune a small model or to validate the reasoning pipeline.
Run inference on the test dataset:
```bash
python src/main.py
```

The system will:

- Read `test.csv`
- Decompose each problem
- Solve it step by step
- Output reasoning traces and predictions to `outputs/output.csv`
Output format:
| topic | problem_statement | solution | correct_option |
|---|---|---|---|
To generate predictions and evaluate them, run:
```bash
python eval_runner.py
```

This script compares predicted answers with the ground truth (if available) and computes metrics such as the Macro F1 Score.
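A minimal sketch of the metric computation with scikit-learn; the file paths and the `correct_option` column name are taken from the output format above, while the presence of ground-truth labels in `data/test.csv` is an assumption.

```python
# Minimal sketch of the metric computation; eval_runner.py may add more checks.
import pandas as pd
from sklearn.metrics import f1_score

pred = pd.read_csv("outputs/output.csv")
truth = pd.read_csv("data/test.csv")   # assumes the test set includes ground-truth labels

macro_f1 = f1_score(truth["correct_option"], pred["correct_option"], average="macro")
print(f"Macro F1: {macro_f1:.4f}")
```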
Create a requirements.txt file with:
```text
pandas
numpy
scikit-learn
sympy
```
Example row in outputs/output.csv:
| topic | problem_statement | solution | correct_option |
|---|---|---|---|
| Arithmetic | What is 2 + 2 × 3? | Step 1: Multiply 2 × 3 = 6. Step 2: Add 2 + 6 = 8. Final Answer: 8. | 2 |
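The arithmetic in this example can be reproduced directly with SymPy, which respects operator precedence:

```python
import sympy
print(sympy.sympify("2 + 2*3"))   # 8, because multiplication is applied before addition
```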
Evaluation criteria:

- Macro F1 Score (50%)
- Approach Creativity & Originality (35%)
- Report Quality (10%)
- Code Quality (5%)
- ✅ Transparent reasoning with trace logs
- ✅ Modular, reusable pipeline
- ✅ Verification for correctness
- ✅ Interpretable output for human validation
Developed by J. Adarsh and contributors for the Agentic Reasoning Challenge.