CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities

Published: 2025, Last Modified: 25 Jan 2026, ICML 2025, CC BY-SA 4.0