DeepNewz Markets

Market

What percentage of ARC-AGI tasks will the top AI entry solve by November 10, 2024?

New York University•NYU•Mechanical Turk

Resolution / Starting Odds

90-94.9% • 25%

95-97.9% • 25%

98-99.9% • 25%

100% • 25%

Official ARC-AGI competition results

Story

Independent NYU Study Finds 98.7% of ARC-AGI Tasks Solvable by Humans Ahead of November 10 Competition Deadline

Sep 4, 2024, 05:53 PM

Researchers at New York University (NYU) conducted an independent study on the ARC-AGI tasks, revealing that 98.7% of the public tasks are solvable by humans. The study found that 790 out of 800 tasks could be completed by at least one Mechanical Turk worker. This finding underscores the gap between human and AI performance on these tasks. The ARC-AGI competition, which challenges participants to develop AI capable of solving these tasks, will end on November 10, 2024. Researchers aim for future iterations to achieve 100% solvability and to establish human baselines on the private test set. Many high-scoring entries in the competition currently rely on basic brute-force program search.

View original story

Similar markets

o3 ties with another model • 25%

No new models tested • 25%

Will LiquidAI's 3B model achieve SOTA performance in ARC benchmark by June 30, 2024?

Yes • 50%

No • 50%

What will be OpenAI's AGI progress level by end of 2024?

Level 1: Chatbots • 25%

Level 2: Reasoners • 25%

Level 3: Agents • 25%

Level 4 or higher • 25%

Will AWM-enhanced AI agents achieve a 55% improvement in success rates on major benchmarks by the end of 2024?

Yes • 50%

No • 50%

How will Chai-1 perform in a major AI competition by December 31, 2024?

Outperforms all models • 25%

Outperforms AlphaFold3 but not ESM3 • 25%

Outperforms ESM3 but not AlphaFold3 • 25%

Does not outperform either • 25%

Will OpenAI's AGI system surpass human-level problem-solving in a public demonstration by end of 2024?

Yes • 33%

No • 33%

Unclear/Disputed • 34%

Economic growth • 25%

Other • 25%

Market

Story

Similar markets

Will OpenAI's 'o3' model exceed 90% on ARC-AGI benchmark by end of 2025?

Will OpenAI's o3 score 90%+ on ARC-AGI by end of 2025?

How will OpenAI's 'o3' perform on ARC-AGI benchmark compared to others by 2025?

Will LiquidAI's 3B model achieve SOTA performance in ARC benchmark by June 30, 2024?

What will be OpenAI's AGI progress level by end of 2024?

Will AWM-enhanced AI agents achieve a 55% improvement in success rates on major benchmarks by the end of 2024?

How will Chai-1 perform in a major AI competition by December 31, 2024?

Will OpenAI's AGI system surpass human-level problem-solving in a public demonstration by end of 2024?

Will OpenAI's o3 model score 80%+ on ARC-AGI before release?

Will the AI model achieve 98% accuracy in an independent study by August 31, 2025?

Amazon AGI team achieves significant AI breakthrough by end of 2025?

What will be the primary goal of the AGI initiative if established by end of 2025?

Will any AI solve all ARC-AGI tasks by November 10, 2024?

Will future ARC-AGI iterations achieve 100% solvability by end of 2025?

Will the top-scoring ARC-AGI entry use brute-force program search by November 10, 2024?

How many ARC-AGI tasks will the top 3 AI entries solve by November 10, 2024?

Will any AI solve all ARC-AGI tasks by November 10, 2024?

Will future ARC-AGI iterations achieve 100% solvability by end of 2025?

Will the top-scoring ARC-AGI entry use brute-force program search by November 10, 2024?

How many ARC-AGI tasks will the top 3 AI entries solve by November 10, 2024?