DeepNewz Markets

Markets Stories

Search

Loading...

Browse all stories on DeepNewz

Market

Will future ARC-AGI iterations achieve 100% solvability by end of 2025?

3

New York University•NYU•Mechanical Turk

Resolution / Starting Odds

Yes • 50%

No • 50%

Official announcements from ARC-AGI organizers

Story

Independent NYU Study Finds 98.7% of ARC-AGI Tasks Solvable by Humans Ahead of November 10 Competition Deadline

Sep 4, 2024, 05:53 PM

Researchers at New York University (NYU) conducted an independent study on the ARC-AGI tasks, revealing that 98.7% of the public tasks are solvable by humans. The study found that 790 out of 800 tasks could be completed by at least one Mechanical Turk worker. This finding underscores the gap between human and AI performance on these tasks. The ARC-AGI competition, which challenges participants to develop AI capable of solving these tasks, will end on November 10, 2024. Researchers aim for future iterations to achieve 100% solvability and to establish human baselines on the private test set. Many high-scoring entries in the competition currently rely on basic brute-force program search.

View original story

Similar markets

Will OpenAI's 'o3' model exceed 90% on ARC-AGI benchmark by end of 2025?

Yes • 50%

No • 50%

Will OpenAI's o3 score 90%+ on ARC-AGI by end of 2025?

Yes • 50%

No • 50%

Who will release a model surpassing 'o3' on ARC-AGI by end of 2025?

Google DeepMind • 25%

Anthropic • 25%

Meta AI • 25%

Other • 25%

How will OpenAI's 'o3' perform on ARC-AGI benchmark compared to others by 2025?

o3 remains the top performer • 25%

Another model surpasses o3 • 25%

o3 ties with another model • 25%

No new models tested • 25%

Will OpenAI's AGI system surpass human-level problem-solving in a public demonstration by end of 2024?

Yes • 33%

No • 33%

Unclear/Disputed • 34%

Will OpenAI's board declare AGI achievement by end of 2025?

Yes • 50%

No • 50%

Will OpenAI board declare AGI achieved by end of 2025?

Yes • 50%

No • 50%

Will OpenAI reach level 3 in their AGI progress system by end of 2025?

Yes • 50%

No • 50%

What will be the primary goal of the AGI initiative if established by end of 2025?

Technological leadership • 25%

National security • 25%

Economic growth • 25%

Other • 25%

What will be the ARC-AGI high-compute performance of 'o3' by end of 2025?

Below 85% • 25%

85% to 90% • 25%

90% to 95% • 25%

Above 95% • 25%

What level will OpenAI reach in their AGI progress system by end of 2025?

Level 1 • 25%

Level 2 • 25%

Level 3 • 25%

Level 4 or higher • 25%

Which company will announce the first AGI milestone by the end of 2025?

OpenAI • 25%

Google • 25%

DeepMind • 25%

Other • 25%

Markets based on same story

Loading...

Looking for markets...

Show all

Will any AI solve all ARC-AGI tasks by November 10, 2024?

Yes • 50%

No • 50%

Will the top-scoring ARC-AGI entry use brute-force program search by November 10, 2024?

No • 50%

Yes • 50%

How many ARC-AGI tasks will the top 3 AI entries solve by November 10, 2024?

797-799 • 25%

790-793 • 25%

800 • 25%

794-796 • 25%

What method will the top-scoring ARC-AGI entry use by November 10, 2024?

Other • 25%

Hybrid approach • 25%

Brute-force program search • 25%

Machine learning • 25%