DeepNewz Markets

Market

What will be OpenAI o1 model's performance on benchmark tasks by end of 2024?

OpenAI • OpenAI o1 • Strawberry • ChatGPT • ChatGPT Plus • Team • AI

Resolution / Starting Odds

Top 10% • 25%

Top 1% • 25%

Top 5% • 25%

Below Top 10% • 25%

Official benchmark results published by OpenAI or independent evaluators

Story

OpenAI Releases o1 and o1-mini AI Models with Advanced Reasoning and Fact-Checking Capabilities

Sep 12, 2024, 05:15 PM

OpenAI has officially released its new AI model, OpenAI o1, internally known as Strawberry, after several months of development. The model is designed to improve reasoning and to solve complex tasks in fields such as mathematics, science, and coding. The o1 series includes o1-preview and a smaller, cost-efficient version called o1-mini. Both are available to ChatGPT Plus and Team users and can be selected in the model picker; OpenAI also plans to bring o1-mini to ChatGPT free-tier users. The new models are reported to perform at PhD-level accuracy on benchmark tasks in physics, chemistry, and biology, and to reason through problems in a way that resembles human thinking. The o1 model can also reportedly fact-check itself, marking a significant milestone in AI development.

Similar markets

Will OpenAI's o1 model achieve a new benchmark performance in AI research by January 31, 2025?

Yes • 50%

No • 50%

Will OpenAI's 'o1' model surpass GPT-4o in a public benchmark by end of 2024?

Yes • 50%

No • 50%

Which feature of OpenAI's o1 model will set a new benchmark in AI performance by June 30, 2025?

Reinforcement learning • 25%

Search-based reasoning • 25%

Thinking before answering • 25%

Other • 25%

Improvement in scientific reasoning • 25%

Other • 25%

How many AI models will OpenAI's o1 model train from scratch by December 31, 2024?

None • 25%

1 to 2 • 25%

3 to 4 • 25%

5 or more • 25%

Will OpenAI's o1-preview model maintain the top spot on LiveBench AI by end of 2024?

Yes • 50%

No • 50%

What will be OpenAI's o1 model's performance in solving International Mathematics Olympiad problems by December 31, 2024?

Less than 80% • 25%

80% to 85% • 25%

85% to 90% • 25%

Over 90% • 25%

Will OpenAI's o1 model achieve a significant milestone towards AGI by June 30, 2025?

Yes • 50%

No • 50%

Will OpenAI's 'o3' model exceed 90% on ARC-AGI benchmark by end of 2025?

Yes • 50%

No • 50%

Will OpenAI release a new model that surpasses o1-preview in overall performance by end of 2024?

Will OpenAI's O1 models achieve a significant breakthrough in AGI research by the end of 2024?

Will OpenAI's O1 model outperform GPT-4 in a standardized benchmark test by December 31, 2024?

In which area will OpenAI's o1 model show the next major performance improvement by March 31, 2025?

Will a major tech company adopt OpenAI o1 model for internal use by end of 2024?

Will OpenAI o1 model achieve a major scientific breakthrough by June 30, 2025?

Will OpenAI o1 model be integrated into a popular consumer application by March 31, 2025?

How many users will adopt OpenAI o1 model by June 30, 2025?
