Show HN: I benchmarked LLM agents on fixing real-world security vulnerabilities

Vulnerability Detection, AI Model Comparison, Security Benchmarking(giovannigatti.github.io)view on HackerNews
AIvulnerability detectionsecurity benchmarkingLLMmodel comparisoncost analysis

Author: ggattip

Date: 6/5/2026

Article Summary:
The author built a benchmark to compare the performance of 5 LLM agents in fixing security vulnerabilities in Python projects, finding that cost and model training data are significant differentiators.