What ranking will DeepSeek-R1 achieve on AI performance benchmarks by the end of 2025?
Top 1 • 25%
Top 2-5 • 25%
Top 6-10 • 25%
Outside Top 10 • 25%
Performance benchmark reports from AI research organizations
DeepSeek Releases MIT-Licensed 685B DeepSeek-R1 Model Rivaling OpenAI's o1 at 30x Lower Cost
Jan 20, 2025, 12:34 PM
DeepSeek has officially released its new open-source reasoning models, DeepSeek-R1 and DeepSeek-R1-Zero, under the MIT License. The models, with 685 billion parameters, perform on par with OpenAI's o1 across math, code, and reasoning tasks: DeepSeek-R1-Zero achieved 71.0% pass@1 on AIME 2024, comparable to OpenAI's o1, and reached 86.7% with majority voting, surpassing it. The release includes a technical report detailing a training pipeline built on large-scale reinforcement learning, with DeepSeek-R1-Zero trained without any supervised fine-tuning and a 'language consistency reward' introduced to reduce language mixing in reasoning outputs. The models are available on Hugging Face and through the DeepSeek website and API, as well as in the company's chat web app and Android/iOS apps. DeepSeek also released smaller distilled models, including a 1.5B model distilled from Qwen that outperforms GPT-4o and Claude 3.5 Sonnet on math benchmarks, scoring 28.9% on AIME and 83.9% on MATH. The models offer significant cost savings, with DeepSeek-R1's API pricing up to roughly 30 times cheaper than OpenAI's o1.
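For readers who want to probe the claims themselves, the sketch below shows one way to query the smallest distilled checkpoint locally with Hugging Face transformers. The repo id, sampling settings, and prompt are illustrative assumptions based on the public release, not details taken from the story above.

```python
# Minimal sketch: generate a response from the 1.5B distilled R1 checkpoint
# using Hugging Face transformers (assumed repo id from the public release).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"  # assumed distilled checkpoint

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

# R1-style models emit their chain of thought in <think>...</think> tags
# before the final answer, so expect a long reasoning trace in the output.
messages = [{"role": "user", "content": "What is 17 * 24? Reason step by step."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512, temperature=0.6, do_sample=True)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

The full 685B model and the hosted API serve the same chat-style interface, so a similar prompt can be sent through DeepSeek's own endpoints; hardware requirements for the full model are far beyond a single GPU, which is why the distilled variant is used here.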