DeepNewz Markets

Market

Will any AI solve all ARC-AGI tasks by November 10, 2024?

New York University•NYU•Mechanical Turk

Resolution / Starting Odds

Yes • 50%

No • 50%

Official ARC-AGI competition results

Story

Independent NYU Study Finds 98.7% of ARC-AGI Tasks Solvable by Humans Ahead of November 10 Competition Deadline

Sep 4, 2024, 05:53 PM

Researchers at New York University (NYU) conducted an independent study on the ARC-AGI tasks, revealing that 98.7% of the public tasks are solvable by humans. The study found that 790 out of 800 tasks could be completed by at least one Mechanical Turk worker. This finding underscores the gap between human and AI performance on these tasks. The ARC-AGI competition, which challenges participants to develop AI capable of solving these tasks, will end on November 10, 2024. Researchers aim for future iterations to achieve 100% solvability and to establish human baselines on the private test set. Many high-scoring entries in the competition currently rely on basic brute-force program search.

View original story

Similar markets

Will OpenAI's AGI system surpass human-level problem-solving in a public demonstration by end of 2024?

Yes • 33%

No • 33%

Unclear/Disputed • 34%

Will the AI factory for xAI's Grok be completed by Dec 31, 2024?

Yes • 50%

No • 50%

Which AI model will be the best at reasoning tasks on December 31, 2024?

OpenAI o1-preview • 25%

Anthropic Claude 3.5 Sonnet • 25%

OpenAI o1 mini • 25%

Other • 25%

o3 ties with another model • 25%

No new models tested • 25%

Will the Army's Cyber AI tool pilot be successfully completed by Nov 18, 2025?

Yes • 50%

No • 50%

Which AI lab will be the first to announce the achievement of AGI by the end of 2025?

OpenAI • 25%

DeepMind • 25%

Anthropic • 25%

Other • 25%

Market

Story

Similar markets

Will OpenAI's AGI system surpass human-level problem-solving in a public demonstration by end of 2024?

Will the AI factory for xAI's Grok be completed by Dec 31, 2024?

Which AI model will be the best at reasoning tasks on December 31, 2024?

Will DeepMind's AI solve all problems in a future competition by the end of 2025?

Will OpenAI or AgentLayer announce the development of AGI by September 22, 2025?

Will AI-enhanced quantum computing solve a complex real-world problem by December 31, 2025?

Will LATS-integrated AI agents outperform non-LATS agents in a major AI competition by June 30, 2024?

Will LiquidAI's 3B model achieve SOTA performance in ARC benchmark by June 30, 2024?

Will OpenAI's 'o3' model exceed 90% on ARC-AGI benchmark by end of 2025?

How will OpenAI's 'o3' perform on ARC-AGI benchmark compared to others by 2025?

Will the Army's Cyber AI tool pilot be successfully completed by Nov 18, 2025?

Which AI lab will be the first to announce the achievement of AGI by the end of 2025?

Will future ARC-AGI iterations achieve 100% solvability by end of 2025?

Will the top-scoring ARC-AGI entry use brute-force program search by November 10, 2024?

How many ARC-AGI tasks will the top 3 AI entries solve by November 10, 2024?

What method will the top-scoring ARC-AGI entry use by November 10, 2024?

Will future ARC-AGI iterations achieve 100% solvability by end of 2025?

Will the top-scoring ARC-AGI entry use brute-force program search by November 10, 2024?

How many ARC-AGI tasks will the top 3 AI entries solve by November 10, 2024?

What method will the top-scoring ARC-AGI entry use by November 10, 2024?