Project: Reasoning Agents - Condor Console

### Track

Reasoning Agents (Azure AI Foundry)

### Project Name

Condor Console

### GitHub Username

@alanmaizon

### Repository URL

https://github.com/alanmaizon/reasoning-agents

### Project Description

  <h2>Condor Console</h2>
  <p>
    Condor Console is a modular multi-agent reasoning system built to make complex AI workflows
    more reliable, auditable, and production-ready.
  </p>
  <p>
    Instead of a single-pass prompt, Condor uses explicit role-based orchestration to improve
    transparency and control across multi-step tasks.
  </p>

  <h3>Architecture</h3>
  <ul>
    <li><strong>Planner Agent:</strong> Decomposes high-level goals into structured steps.</li>
    <li><strong>Executor Agents:</strong> Perform scoped tasks independently.</li>
    <li><strong>Critic / Verifier Agent:</strong> Checks logical consistency and constraint adherence.</li>
    <li><strong>State &amp; Memory Layer:</strong> Preserves intermediate reasoning for traceability.</li>
  </ul>

  <h3>Why It Matters</h3>
  <p>
    By separating planning, execution, and verification, Condor improves determinism and reduces
    hallucinated reasoning chains in complex workflows.
  </p>

  <h3>Key Features</h3>
  <ul>
    <li>Explicit planner-executor-verifier orchestration</li>
    <li>Structured reasoning trace output</li>
    <li>Modular agent abstractions</li>
    <li>API-key based local execution</li>
    <li>Extensible architecture for new workflows</li>
  </ul>

  <h3>Technical Highlights</h3>
  <ul>
    <li>Implemented a reusable planner-executor-critic reasoning loop</li>
    <li>Designed modular orchestration for extensibility</li>
    <li>Built traceable logs for debugging and evaluation</li>
    <li>Separated reasoning logic from execution logic</li>
    <li>Prioritized architectural reliability over prompt complexity</li>
  </ul>

### Demo Video or Screenshots

<h3>Live Site</h3>

<p>
  <a href="https://mdtsc0222101105stan.swedencentral.cloudapp.azure.com/">
    🌐 Open Condor Console Live
  </a>
</p>

<p>
  The live site requires authentication. Please sign in with Google to test the full experience.
</p>

<h3>Demo Video</h3>

<p>
  <a href="https://vimeo.com/1167717294?share=copy&fl=sv&fe=ci">
    ▶ Watch Demo Video (Vimeo)
  </a>
</p>

<p>
  <a href="https://vimeo.com/1167717294?share=copy&fl=sv&fe=ci">
    <img src="https://raw.githubusercontent.com/alanmaizon/reasoning-agents/main/screenshots/session-setup.png" alt="Condor Console demo thumbnail" width="900" />
  </a>
</p>


### Primary Programming Language

Python

### Key Technologies Used

<section>
  <ul>
    <li><strong>Backend:</strong> Python, FastAPI, Uvicorn</li>
    <li><strong>Frontend:</strong> HTML5, CSS3, JavaScript (ES Modules)</li>
    <li><strong>AI Orchestration:</strong> Multi-agent planner-executor-critic architecture</li>
    <li><strong>LLM Integration:</strong> Azure AI Foundry / Azure OpenAI-compatible endpoints</li>
    <li><strong>Identity &amp; Access:</strong> Microsoft Entra External ID (CIAM), MSAL.js</li>
    <li><strong>Cloud Platform:</strong> Microsoft Azure (VM/App hosting, networking, identity)</li>
    <li><strong>DevOps:</strong> GitHub, GitHub Actions (CI/CD), SSH-based deployment workflows</li>
    <li><strong>Configuration &amp; Secrets:</strong> Environment variables (.env), GitHub Secrets</li>
    <li><strong>Observability:</strong> Structured logs and reasoning trace outputs</li>
  </ul>
</section>

### Submission Type

Individual

### Team Members

_No response_

### Submission Requirements

- [x] My project meets the track-specific challenge requirements
- [x] My repository includes a comprehensive README.md with setup instructions
- [x] My code does not contain hardcoded API keys or secrets
- [x] I have included demo materials (video or screenshots)
- [x] My project is my own work with proper attribution for any third-party code
- [x] I agree to the [Code of Conduct](https://github.com/microsoft/agentsleague/blob/main/CODE_OF_CONDUCT.md)
- [x] I have read and agree to the [Disclaimer](https://github.com/microsoft/agentsleague/blob/main/DISCLAIMER.md)
- [x] My submission does NOT contain any confidential, proprietary, or sensitive information
- [x] I confirm I have the rights to submit this content and grant the necessary licenses

### Quick Setup Summary

<section>
  <p>
    Quick setup is 3 steps: create a Python virtual environment and install dependencies,
    run either offline mode or local API mode, and open the built-in frontend.
  </p>
  <ol>
    <li>
      Set up environment:
      <code>python -m venv .venv &amp;&amp; source .venv/bin/activate &amp;&amp; pip install -r requirements.txt</code>
    </li>
    <li>
      Run the app:
      <code>python -m src.main --offline</code>
      or
      <code>uvicorn src.api:app --reload --port 8000</code>
    </li>
    <li>
      Open frontend:
      <code>http://127.0.0.1:8000/</code>
    </li>
  </ol>
  <p>
    For online mode with Azure AI Foundry, copy <code>.env.example</code> to <code>.env</code>,
    set <code>AZURE_AI_PROJECT_ENDPOINT</code> and <code>AZURE_AI_MODEL_DEPLOYMENT_NAME</code>,
    then run again.
  </p>
  <p>
    Note: if your shell maps commands differently, use <code>python3</code> and <code>pip3</code>.
  </p>
</section>


### Technical Highlights

<section>
  <p>
    The strongest part of this implementation is the explicit multi-agent orchestration model:
    a Planner, Examiner, Misconception Diagnoser, Grounding Verifier, and Coach working as
    separate components with clear responsibilities.
  </p>
  <ul>
    <li>
      <strong>Role-separated reasoning architecture:</strong> We replaced single-pass prompting with
      planner-executor-verifier flow to improve consistency on multi-step tasks.
    </li>
    <li>
      <strong>Schema-first contracts:</strong> Agent inputs/outputs are validated with strict models,
      reducing brittle prompt coupling and making failures easier to detect and recover from.
    </li>
    <li>
      <strong>Grounding-before-explaining design:</strong> Coaching content is tied to Microsoft Learn
      evidence through MCP tooling, with explicit fallback behavior when evidence is insufficient.
    </li>
    <li>
      <strong>Dual-mode execution path:</strong> Adaptive mode provides diagnosis and coaching depth,
      while mock-test mode prioritizes exam-like speed and scoring realism.
    </li>
    <li>
      <strong>Production-oriented delivery:</strong> The same core runs locally and in cloud deployment
      with CI/CD, auth controls, and runtime configuration via secrets rather than hardcoded values.
    </li>
    <li>
      <strong>Traceability and observability:</strong> The system keeps structured state and intermediate
      reasoning artifacts to support debugging, evaluation, and iterative improvement.
    </li>
  </ul>
  <p>
    Most importantly, I prioritized architectural reliability and debuggability over prompt complexity.
    That decision made the system easier to extend, test, and operate.
  </p>
</section>


### Challenges & Learnings

<section>
  <p>
    The biggest challenge was moving from a “single prompt” mindset to a reliable multi-agent system
    that behaves well in real deployment conditions.
  </p>
  <ul>
    <li>
      <strong>Challenge:</strong> Inconsistent output quality across multi-step reasoning.<br />
      <strong>Learning:</strong> Explicit planner-executor-verifier separation plus schema validation
      gives much more stable behavior than prompt tuning alone.
    </li>
    <li>
      <strong>Challenge:</strong> Grounding quality varied when evidence retrieval was weak or unavailable.<br />
      <strong>Learning:</strong> A strict grounding policy with clear fallback (“insufficient evidence”)
      is better than forcing low-confidence explanations.
    </li>
    <li>
      <strong>Challenge:</strong> Authentication complexity with Entra External ID and federated sign-in flows.<br />
      <strong>Learning:</strong> Keep auth configuration minimal, issuer/audience rules explicit, and
      environment-specific values isolated in secrets.
    </li>
    <li>
      <strong>Challenge:</strong> Balancing exam realism with user experience and response time.<br />
      <strong>Learning:</strong> Splitting into adaptive and mock-test modes provided a clean tradeoff:
      depth when needed, speed when needed.
    </li>
    <li>
      <strong>Challenge:</strong> Frontend state complexity (session lifecycle, submit locking, question navigation).<br />
      <strong>Learning:</strong> Deterministic UI state transitions and explicit loading/disabled states
      prevent accidental resets and reduce user confusion.
    </li>
    <li>
      <strong>Challenge:</strong> Operating within cloud quota and cost constraints while iterating quickly.<br />
      <strong>Learning:</strong> Build local/offline paths first, then add cloud hosting and CI/CD with
      controlled runtime scaling.
    </li>
  </ul>
  <p>
    Overall, the main takeaway was that reliability comes from system design decisions
    (contracts, orchestration, observability, and guardrails), not just stronger prompts.
  </p>
</section>


### Contact Information

[linkedin.com/in/maizonalan](https://linkedin.com/in/maizonalan/)

### Country/Region

Ireland

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Project: Reasoning Agents - Condor Console #38

Track

Project Name

GitHub Username

Repository URL

Project Description

Condor Console

Architecture

Why It Matters

Key Features

Technical Highlights

Demo Video or Screenshots

Live Site

Demo Video

Primary Programming Language

Key Technologies Used

Submission Type

Team Members

Submission Requirements

Quick Setup Summary

Technical Highlights

Challenges & Learnings

Contact Information

Country/Region

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Project: Reasoning Agents - Condor Console #38

Description

Track

Project Name

GitHub Username

Repository URL

Project Description

Condor Console

Architecture

Why It Matters

Key Features

Technical Highlights

Demo Video or Screenshots

Live Site

Demo Video

Primary Programming Language

Key Technologies Used

Submission Type

Team Members

Submission Requirements

Quick Setup Summary

Technical Highlights

Challenges & Learnings

Contact Information

Country/Region

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions