Skip to content
#

claudeopus45

Here are 4 public repositories matching this topic...

Prompt evaluation framework using AWS Bedrock to assess LLM outputs with code-based, model-based, and human grading approaches. Includes dataset generation, syntax validation, and AI-powered scoring for Python, JSON, and Regex tasks.

  • Updated Mar 31, 2026
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the claudeopus45 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the claudeopus45 topic, visit your repo's landing page and select "manage topics."

Learn more