Show HN: I benchmarked LLM agents on fixing real-world security vulnerabilities

Vulnerability Detection, AI Model Comparison, Security Benchmarking(giovannigatti.github.io)view on HackerNews

AIvulnerability detectionsecurity benchmarkingLLMmodel comparisoncost analysis

Author: ggattip

Date: 6/5/2026

Article Summary:

The author built a benchmark to compare the performance of 5 LLM agents in fixing security vulnerabilities in Python projects, finding that cost and model training data are significant differentiators.