Design AI agents that autonomously identify and exploit real-world vulnerabilities across diverse application environments — including other AI agents — in a Capture The Flag format.
Organized by the Berkeley RDI Center, AgentCTF x AgentXploit challenges participants to build AI agents capable of identifying and exploiting real-world vulnerabilities. Targeted frameworks include LangChain, AutoGPT, and many more, with tasks sourced from publicly disclosed CVEs.
The evaluation pipeline follows the AAA (Agentified Agent Assessment) paradigm. A portion of tasks are released as a development set so participants can iterate locally before the official evaluation.
Implement an AI agent with an A2A interface that autonomously reasons about and exploits CVEs.
Tasks are drawn from publicly disclosed vulnerabilities across 20+ popular AI and web application frameworks.
Agents are scored on both the released dev set and a hidden test set, with full runs replayed for verification.
Review the AAA evaluation paradigm documentation and fork the GitHub repository to your own account.
Build your exploit agent with an A2A interface inside ./src/white_agent/. Modify only files in that directory and pyproject.toml; do not alter the Green Agent or task configurations. Violations may result in disqualification.
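As a starting point, the agent shell can be a small HTTP service that accepts a task payload and returns a structured result. The sketch below is illustrative only: the function and field names (handle_task, "description", "artifacts") are placeholders, and the real A2A message schema is defined in the competition repository, not here.

```python
# Minimal sketch of an exploit-agent skeleton exposed over HTTP.
# All names and the payload shape are hypothetical; consult the
# repository README for the actual A2A interface contract.
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

def handle_task(task: dict) -> dict:
    """Placeholder reasoning loop: receive a task description and
    return a structured result. Replace with real CVE analysis."""
    description = task.get("description", "")
    return {
        "status": "completed",
        "artifacts": [{"type": "text", "content": f"analysis of: {description}"}],
    }

class AgentHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Parse the incoming JSON task, run the agent, and reply with JSON.
        length = int(self.headers.get("Content-Length", 0))
        task = json.loads(self.rfile.read(length) or b"{}")
        body = json.dumps(handle_task(task)).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

# To serve locally:
# HTTPServer(("127.0.0.1", 8000), AgentHandler).serve_forever()
```

The separation between transport (AgentHandler) and reasoning (handle_task) keeps the exploit logic testable without a running server.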
Run the full dev-set evaluation, then bundle results with the provided CLI. The total submission size must be under 1 MB; do not include model weights or large files. The bundle captures the latest run-all results; do not modify them after bundling.
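Because oversized bundles are rejected, it is worth checking the archive size before uploading. A minimal sketch (the file name and helper are illustrative, not part of the competition tooling):

```python
# Sketch: verify a bundle stays under the 1 MB submission cap before upload.
# check_bundle is a hypothetical helper, not part of the provided CLI.
import os

MAX_BYTES = 1 * 1024 * 1024  # 1 MB limit from the submission rules

def check_bundle(path: str) -> bool:
    """Return True if the file at `path` fits within the size cap."""
    size = os.path.getsize(path)
    print(f"{path}: {size} bytes ({'OK' if size <= MAX_BYTES else 'too large'})")
    return size <= MAX_BYTES
```

Running this on your generated archive before uploading avoids a rejected submission due to stray model weights or logs.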
Upload your submission.zip. The official evaluation will rerun your submission to verify the authenticity of the results.
Submissions are evaluated against both the released dev set and a hidden test set.
Specify which LLM you used so organizers can provision appropriate model access.
Supported models include openai/*, gemini/*, and vertex_ai/claude-*.
# Provided via .env for each task evaluation
LITELLM_PROXY_API_KEY=sk-xxxxx
LITELLM_PROXY_API_BASE=...

# Specify your model (prefix with litellm_proxy/ in most cases)
LITELLM_MODEL=litellm_proxy/openai/gpt-4o
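Inside the agent, these variables can be collected once at startup. The helper below is a sketch (build_llm_config and the fallback default are assumptions, not part of the provided harness); it reads only the variable names shown above.

```python
# Sketch: gather the LiteLLM proxy settings injected via .env.
# build_llm_config is a hypothetical helper for illustration.
import os

def build_llm_config() -> dict:
    """Collect proxy credentials and model name from the environment.
    Raises KeyError if a required variable is missing."""
    return {
        "api_key": os.environ["LITELLM_PROXY_API_KEY"],
        "api_base": os.environ["LITELLM_PROXY_API_BASE"],
        # Assumed fallback for local testing; the harness sets LITELLM_MODEL.
        "model": os.environ.get("LITELLM_MODEL", "litellm_proxy/openai/gpt-4o"),
    }
```

Failing fast on a missing key surfaces misconfiguration before any task runs.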
Full setup instructions, dependency requirements, and usage examples are available in the repository README.
This framework is intended for educational and research purposes only. All included CVEs are publicly disclosed vulnerabilities. Participants must adhere to responsible disclosure policies and may not use techniques or artifacts from this competition outside of the authorized evaluation environment.