Which AI model will next outperform DeepSeek-V3 on LiveBench by end of 2025?
Claude 4.0 • 25%
GPT-5 • 25%
Anthropic's New Model • 25%
Other • 25%
Resolves based on LiveBench benchmark results published on the official website or in press releases
DeepSeek Launches Open-Source DeepSeek-V3 Model with 671 Billion Parameters, 60 Tokens/Second Speed, and $5.6 Million Training Cost
Dec 26, 2024, 02:37 PM
DeepSeek has officially released its latest AI model, DeepSeek-V3, a mixture-of-experts (MoE) model with 671 billion total parameters, of which 37 billion are activated per token. It generates 60 tokens per second, three times faster than its predecessor, DeepSeek-V2. Notably, DeepSeek-V3 scored 60.4 on the LiveBench benchmark, outperforming competitors such as Claude 3.5 Sonnet and GPT-4o across a range of tasks. The model was trained on 14.8 trillion tokens at a cost of approximately $5.6 million, far below the training costs of comparable models from Western labs. DeepSeek-V3 also shows a marked improvement in coding, jumping from a 17% success rate for its predecessor to 48% on the Aider benchmark. The model is fully open-source and available on HuggingFace, marking a notable advancement in the open-source AI landscape.
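The headline figures above can be sanity-checked with a quick back-of-envelope calculation. The sketch below derives the fraction of the network active per token and the DeepSeek-V2 speed implied by the "three times faster" claim; the implied V2 speed is an inference from the article's numbers, not a figure the article states directly.

```python
# Back-of-envelope check of the figures reported in the article above.
total_params = 671e9       # total parameters (MoE)
active_params = 37e9       # parameters activated per token
tokens_per_second = 60     # reported generation speed for V3
speedup_vs_v2 = 3          # "three times faster than DeepSeek-V2"

# Fraction of the network active for any given token (~5.5%)
active_fraction = active_params / total_params
print(f"active fraction: {active_fraction:.1%}")

# Implied DeepSeek-V2 speed, assuming the 3x claim is exact (~20 tokens/s)
v2_tokens_per_second = tokens_per_second / speedup_vs_v2
print(f"implied V2 speed: {v2_tokens_per_second:.0f} tokens/s")
```

The small active fraction is what lets an MoE model of this total size keep inference cost closer to that of a much smaller dense model.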