AI Red Teaming playground labs to run AI Red Teaming trainings including infrastructure.
-
Updated
Feb 13, 2026 - TypeScript
AI Red Teaming playground labs to run AI Red Teaming trainings including infrastructure.
A security scanner for your LLM agentic workflows
A collection of servers which are deliberately vulnerable to learn Pentesting MCP Servers.
Whistleblower is a offensive security tool for testing against system prompt leakage and capability discovery of an AI application exposed through API. Built for AI engineers, security researchers and folks who want to know what's going on inside the LLM-based app they use daily
AspGoat is an intentionally vulnerable ASP.NET Core application for learning and practicing web application security.
Code scanner to check for issues in prompts and LLM calls
A comprehensive guide to adversarial testing and security evaluation of AI systems, helping organizations identify vulnerabilities before attackers exploit them.
Open-source LLM Prompt-Injection and Jailbreaking Playground
AI security and prompt injection payload toolkit
A diagnostic methodology for bypassing LLM defense layers — from input filters to persistent memory exploitation.
AI red teaming, jailbreaking, and all forms of adversarial attacks for security purposes
The ultimate OWASP MCP Top 10 security checklist and pentesting framework for Model Context Protocol (MCP), AI agents, and LLM-powered systems.
Basilisk — Open-source AI red teaming framework with genetic prompt evolution. Automated LLM security testing for GPT-4, Claude, Grok, Gemini. OWASP LLM Top 10 coverage. 32 attack modules.
Comprehensive taxonomy of AI security vulnerabilities, LLM adversarial attacks, prompt injection techniques, and machine learning security research. Covers 71+ attack vectors including model poisoning, agentic AI exploits, and privacy breaches.
A repository for your Garak runs, as well as a modern visualizer.
🛡️ Safe AI Agents through Action Classifier
Awesome LLM security tools, research, and documents
Projet issu du codelab Devfest Nantes 2025 “La guerre des prompts” : atelier de 2h pour apprendre à pirater des IA et comment les protéger via des frameworks open source
LLM Attack Testing Toolkit is a structured methodology and mindset framework for testing Large Language Model (LLM) applications against logic abuse, prompt injection, jailbreaks, and workflow manipulation.
Agentic AI Security Bootcamp is a hands-on, research-driven training environment for analysing, attacking, and securing autonomous AI systems. The repository provides structured labs, adversarial evaluation frameworks, and red-teaming exercises covering multi-agent observability, prompt injection..
Add a description, image, and links to the ai-red-teaming topic page so that developers can more easily learn about it.
To associate your repository with the ai-red-teaming topic, visit your repo's landing page and select "manage topics."