Loading...
Loading...
Browse all stories on DeepNewz
VisitWill XBOW AI pentester achieve a perfect score on a web pentesting benchmark by June 2025?
Yes • 50%
No • 50%
Official benchmark results published by PentesterLab or PortSwigger
XBOW AI Pentester Matches Human Experts, Achieves 85% Success in 28 Minutes
Aug 5, 2024, 02:25 PM
The team behind GitHub Copilot has launched XBOW, an AI-powered penetration tester that rivals human experts. XBOW, led by Oege de Moor, has demonstrated remarkable capabilities by matching the performance of a 20-year veteran pentester in just 28 minutes, achieving 85% success in identifying vulnerabilities. The AI tool scored an unprecedented 75% on renowned web pentesting benchmarks from PentesterLab and PortSwigger. In a head-to-head competition, XBOW solved 88 out of 104 challenges, matching the performance of human experts given 40 hours. XBOW's performance has shown that AI can significantly accelerate cybersecurity tasks.
View original story
Nemotron 70B • 25%
ChatGPT4o • 25%
Sonnet 3.5 • 25%
Other • 25%
Yes • 50%
No • 50%
Yes • 50%
No • 50%
Yes • 50%
No • 50%
Top 1 in a benchmark • 25%
Top 5 in a benchmark • 25%
Top 10 in a benchmark • 25%
No significant milestone • 25%
Less than 80 challenges • 25%
More than 100 challenges • 25%
90-100 challenges • 25%
80-90 challenges • 25%
30%+ • 25%
20%-30% • 25%
0-10% • 25%
10%-20% • 25%