DeepNewz Markets

Market

Will OpenAI's o3 model score 80%+ on ARC-AGI before release?

OpenAI

Resolution / Starting Odds

Yes • 50%

No • 50%

Official ARC-AGI benchmark results published by OpenAI or ARC

Story

OpenAI's o3 Model Scores 75.7% on ARC-AGI, Set for Early 2025 Release

Dec 21, 2024, 12:48 PM

OpenAI has unveiled its latest AI model, o3, marking a significant advance in AI reasoning capabilities. o3 and its smaller counterpart, o3-mini, are set for release in early 2025, following safety testing and red teaming. o3 scored 75.7% on the ARC-AGI benchmark's semi-private evaluation set, and a high-compute configuration reached 87.5%. Despite these results, experts caution that o3 does not yet constitute artificial general intelligence (AGI): it still fails some tasks that are straightforward for humans, and it is expensive to operate, with costs that can reach thousands of dollars per task. The model is nonetheless a step forward in AI's ability to adapt to novel tasks.

Similar markets

Will OpenAI's o3 score 90%+ on ARC-AGI by end of 2025?

Will OpenAI's 'o3' model exceed 90% on ARC-AGI benchmark by end of 2025?

How will OpenAI's 'o3' perform on ARC-AGI benchmark compared to others by 2025?

Will OpenAI's 'o3' model surpass 30% on Frontier Math by the end of 2025?

Will OpenAI release the o3 model by the end of 2024?

Will OpenAI's 'o3' model achieve a Codeforces rating of 2800+ by end of 2025?

Will OpenAI's o1 model achieve a new benchmark performance in AI research by January 31, 2025?

Will OpenAI's 'o3' model reach a Codeforces rating of 2800 by March 2025?

Will OpenAI's o1-preview model achieve a user satisfaction rating of 90% or higher by June 30, 2025?

Will OpenAI's o1 model achieve a significant milestone towards AGI by June 30, 2025?

Will OpenAI release a new model that surpasses o1-preview in overall performance by end of 2024?

Will OpenAI's o1 models achieve a significant breakthrough in AGI research by the end of 2024?

Will a Fortune 500 company adopt OpenAI's o3 model within 3 months of release?

Will OpenAI release the o3 model by March 31, 2025?

What will be the primary application domain for OpenAI's o3 model within 6 months of release?

What will be the primary concern about OpenAI's o3 model within 3 months of release?
