DeepNewz Markets

Markets Stories

Search

Loading...

Browse all stories on DeepNewz

OpenAI AI Model Scores Over 90% on MATH Dataset, Available by 2026

Jul 15, 2024, 04:57 PM

OpenAI has reportedly tested an AI model internally that scored over 90% on a MATH dataset, which is a benchmark of championship math problems. This development suggests significant progress in AI capabilities, potentially linked to the 'Strawberry' project. Other AI models, such as Deepseek-Math (7B), Gemini 1.5 Pro (May), and GPT-4o, scored 51.7%, 67.7%, and 76.6% respectively. The high score achieved by OpenAI's model indicates a breakthrough in AI performance, though some experts caution about overfitting on such datasets. The model is predicted to be available by 2026 and may involve the Q* algorithm.

View original story

Markets

Loading...

Looking for markets...

Will OpenAI's AI model scoring over 90% on MATH dataset be available by 2026?

OpenAI•Strawberry

Resolution / Starting Odds

Yes • 50%

No • 50%

Official announcement from OpenAI or a reputable news source confirming the release and performance of the model

Will OpenAI's AI model scoring over 90% on MATH dataset be integrated into a widely used educational platform by 2026?

OpenAI•Strawberry

Resolution / Starting Odds

No • 50%

Yes • 50%

Official announcements from educational platforms or OpenAI

Will OpenAI's AI model scoring over 90% on MATH dataset be used in a major international math competition by 2026?

OpenAI•Strawberry

Resolution / Starting Odds

No • 50%

Yes • 50%

Official records or announcements from major international math competitions

What will be the primary algorithm used in OpenAI's new AI model scoring over 90% on the MATH dataset?

OpenAI•Strawberry

Resolution / Starting Odds

Recurrent Neural Network-based • 25%

Other • 25%

Q* algorithm • 25%

Transformer-based • 25%

Official technical papers or announcements from OpenAI

Which AI model will be the next to surpass 80% on the MATH dataset?

OpenAI•Strawberry

Resolution / Starting Odds

Gemini 1.5 Pro (May) • 25%

OpenAI's model • 25%

GPT-4o • 25%

Deepseek-Math (7B) • 25%

Official publications or announcements from AI research organizations

Which AI model will have the highest score on the MATH dataset by the end of 2025?

OpenAI•Strawberry

Resolution / Starting Odds

GPT-4o • 25%

OpenAI's model • 25%

Deepseek-Math (7B) • 25%

Gemini 1.5 Pro (May) • 25%

Official publications or announcements from AI research organizations