SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks

Published in NeurIPS 2025

Recommended citation: Hwiwon Lee, Ziqi Zhang, Hanxiao Lu, and Lingming Zhang. SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks. NeurIPS 2025.