SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks
Published in NeurIPS 2025, 2025
Recommended citation: Hwiwon Lee, Ziqi Zhang, Hanxiao Lu, and Lingming Zhang. SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks. NeurIPS 2025.
