How many challenges will the XBOW AI pentester solve in its next head-to-head competition by March 2025?
Less than 80 challenges • 25%
80-90 challenges • 25%
90-100 challenges • 25%
More than 100 challenges • 25%
Resolution source: official competition results published by GitHub or relevant cybersecurity organizations
XBOW AI Pentester Matches Human Experts, Achieves 85% Success in 28 Minutes
Aug 5, 2024, 02:25 PM
The team behind GitHub Copilot has launched XBOW, an AI-powered penetration tester that rivals human experts. Led by Oege de Moor, XBOW matched the performance of a pentester with 20 years of experience in just 28 minutes, achieving an 85% success rate in identifying vulnerabilities. The tool scored an unprecedented 75% on well-known web pentesting benchmarks from PentesterLab and PortSwigger. In a head-to-head competition, XBOW solved 88 of 104 challenges, equaling human experts who were given 40 hours. These results suggest AI can significantly accelerate cybersecurity tasks.
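For context, the story's figures are internally consistent: the sketch below (plain Python) checks the 88-of-104 arithmetic against the reported ~85% success rate and maps that result onto this market's answer buckets. The `bucket` helper and its boundary handling are illustrative assumptions, not part of the market's resolution rules.

```python
# A minimal sketch checking the story's arithmetic: 88 solved out of 104
# challenges is ~84.6%, consistent with the reported ~85% success rate.
solved, total = 88, 104
rate = solved / total
print(f"Success rate: {rate:.1%}")  # -> Success rate: 84.6%

# Hypothetical helper mapping a challenge count onto this market's buckets.
# Where exactly 80, 90, and 100 fall is an assumption here; the option
# labels above overlap at those boundary values.
def bucket(n: int) -> str:
    if n < 80:
        return "Less than 80 challenges"
    if n <= 90:
        return "80-90 challenges"
    if n <= 100:
        return "90-100 challenges"
    return "More than 100 challenges"

print(bucket(solved))  # -> 80-90 challenges (XBOW's previous result)
```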