Documentation β’ Website β’ Agenta Cloud
Agenta is a platform for building production-grade LLM applications. It helps engineering and product teams create reliable LLM apps faster through integrated prompt management, evaluation, and observability.
Collaborate with Subject Matter Experts (SMEs) on prompt engineering and make sure nothing breaks in production.
- Interactive Playground: Compare prompts side by side against your test cases
- Multi-Model Support: Experiment with 50+ LLM models or bring-your-own models
- Version Control: Version prompts and configurations with branching and environments
- Complex Configurations: Enable SMEs to collaborate on complex configuration schemas beyond simple prompts
Evaluate your LLM applications systematically with both human and automated feedback.
- Flexible Test Sets: Create test cases from production data, playground experiments, or upload CSVs
- Pre-built and Custom Evaluators: Use LLM-as-judge, one of our 20+ pre-built evaluators, or you custom evaluators
- UI and API Access: Run evaluations via UI (for SMEs) or programmatically (for engineers)
- Human Feedback Integration: Collect and incorporate expert annotations
Explore evaluation frameworks β
Get visibility into your LLM applications in production.
- Cost & Performance Tracking: Monitor spending, latency, and usage patterns
- Tracing: Debug complex workflows with detailed traces
- Open Standards: OpenTelemetry native tracing compatible with OpenLLMetry, and OpenInference
- Integrations: Comes with pre-built integrations for most models and frameworks
The easiest way to get started is through Agenta Cloud. Free tier available with no credit card required.
- Clone Agenta:
git clone https://github.com/Agenta-AI/agenta && cd agenta
- Start Agenta services:
docker compose -f hosting/docker-compose/oss/docker-compose.gh.yml --env-file hosting/docker-compose/oss/.env.oss.gh --profile with-web up -d
- Access Agenta at
http://localhost
.
For deploying on a remote host, or using different ports refers to our self-hosting and remote deployment documentation.
Find help, explore resources, or get involved:
- π Documentation β Full guides and API reference
- π Changelog β Track recent updates
- π¬ Slack Community β Ask questions and get support
We welcome contributions of all kinds β from filing issues and sharing ideas to improving the codebase.
- π Report bugs β Help us by reporting problems you encounter
- π‘ Share ideas and feedback β Suggest features or vote on ideas
- π§ Contribute to the codebase β Read the guide and open a pull request
Consider giving us a star! It helps us grow our community and gets Agenta in front of more developers.
Thanks goes to these wonderful people (emoji key):
This project follows the all-contributors specification. Contributions of any kind are welcome!
By default, Agenta automatically reports anonymized basic usage statistics. This helps us understand how Agenta is used and track its overall usage and growth. This data does not include any sensitive information. To disable anonymized telemetry set TELEMETRY_ENABLED
to false
in your .env
file.