DeepNewz Markets

Loading...

Browse all stories on DeepNewz

Market

Which AI model will top Aider Polyglot Benchmark by end of 2025?

2

DeepSeek•Llama•Aider Polyglot Benchmark

Resolution / Starting Odds

DeepSeek-V3 • 25%

GPT-4o • 25%

Claude 3.5 Sonnet • 25%

Llama 3.1 405b • 25%

Aider Polyglot Benchmark results published by reputable AI research organizations

Story

DeepSeek Unveils 671B DeepSeek-V3 AI Model, Outperforms GPT-4o with 60 Tokens/Sec Speed

Dec 26, 2024, 05:37 PM

DeepSeek has officially released DeepSeek-V3, a new open-source AI language model with 671 billion Mixture-of-Experts (MoE) parameters and 37 billion activated parameters per token. The model reportedly outperforms leading proprietary models such as GPT-4o, Claude 3.5 Sonnet, and Llama 3.1 405b on various benchmarks, including the Aider Polyglot Benchmark, which tests language models on coding exercises across multiple programming languages. DeepSeek-V3 achieves a score of 48% on this benchmark, significantly improving from the 17% score of its predecessor, DeepSeek-V2.5. The model was trained on 14.8 trillion high-quality tokens using 2.788 million H800 GPU hours over less than two months, with a reported training cost of $5.6 million. DeepSeek-V3 also boasts a speed of 60 tokens per second, three times faster than the previous version, and supports a context length of 128,000 tokens. The model utilizes auxiliary-loss-free load balancing and FP8 mixed-precision, and it operates with high sparsity by leveraging 256 experts with only eight activated per token. The release includes fully open-source models and papers. Pricing is set at $0.27 per million input tokens and $1.10 per million output tokens.

View original story

Similar markets

Which AI model will top Aider multilingual programming benchmark by end of 2025?

DeepSeek V3 • 25%

A new entrant • 25%

Other existing model • 25%

Claude 3.5 Sonnet • 25%

Which AI model will top technical benchmarks by end of 2025?

Meta's Llama 3.1 • 25%

Alibaba's Qwen 2.5 • 25%

DeepSeek-V3 • 25%

OpenAI's GPT-4o • 25%

Which AI model will lead in coding benchmarks by mid-2025?

DeepSeek-V3 • 25%

Other • 25%

Claude 3.5 Sonnet • 25%

GPT-4o • 25%

Which AI model will lead BigCodeBench-Hard rankings by end of 2025?

Claude Sonnet 3.5 • 25%

DeepSeek-V3 • 25%

GPT-4o • 25%

Llama 3.1 405b • 25%

Which AI model will have highest API performance speedup by mid-2025?

A new entrant • 25%

DeepSeek V3 • 25%

Claude 3.5 Sonnet • 25%

Other existing model • 25%

Which AI model will have the fastest token generation speed by mid-2025?

Claude 3.5 • 25%

Other • 25%

DeepSeek V3 • 25%

Sonnet 3.5 • 25%

Which AI model will lead in coding capabilities by the end of 2024?

Claude 3.5 • 25%

Sonnet 3.5 • 25%

Other • 25%

DeepSeek V3 • 25%

Which AI model will be most popular among developers by mid-2025?

Other • 25%

Llama 3.3 • 25%

Gemini Pro • 25%

Phi-4 • 25%

Top-ranked AI model on Chatbot Arena by March 31, 2025?

Gemini 2.0 Flash Thinking • 25%

Other • 25%

Microsoft's AI Model • 25%

OpenAI's o1 • 25%

Which AI model will have the lowest API cost per million tokens by end of 2025?

Llama 3.1 405b • 25%

DeepSeek-V3 • 25%

Sonnet 3.5 • 25%

GPT-4o • 25%

Which AI model will achieve highest accuracy in mathematics tasks by end of 2025?

Other • 25%

Gemini Pro • 25%

Phi-4 • 25%

Llama 3.3 • 25%

Which AI model will lead the market share by end of 2025?

Meta leads • 25%

OpenAI leads • 25%

DeepSeek V3 leads • 25%

Other AI models lead • 25%