DeepNewz Markets

Market

Will DeepSeek V3 achieve a new benchmark score on LiveBench by mid-2025?

DeepSeek•Hugging Face•Claude•Sonnet•LiveBench•V3•V2•Aider Polyglot•DeepSeek V3

Resolution / Starting Odds

Yes • 50%

No • 50%

Resolution source: LiveBench benchmark results published on official or tech analysis websites

Story

DeepSeek Launches V3 Model with 685B Parameters, 60 Tokens/Second, Outperforming Claude 3.5 Sonnet at $0.28/1M Output Tokens

Dec 26, 2024, 02:52 AM

DeepSeek has officially launched its V3 model, now available on Hugging Face. The model has 685 billion parameters and uses a mixture-of-experts (MoE) architecture with 256 experts, 8 of which are active per token. It reportedly outperforms competitors such as Claude 3.5 Sonnet on various benchmarks, including a score of 60.4 on LiveBench. V3 generates 60 tokens per second, three times faster than its predecessor, V2. Its coding performance has also improved substantially, rising from 17.8% to 48.4% on the Aider Polyglot leaderboard. Pricing is competitive at $0.28 per million output tokens, significantly lower than that of its main competitors. The release is seen as a major advancement in the open-source AI landscape, with implications for the future of AI development and deployment.
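To put the reported figures in concrete terms, the following is a rough back-of-the-envelope sketch in Python. It uses only the numbers quoted in the story ($0.28 per million output tokens, 60 tokens per second, 8 of 256 experts active, 17.8% to 48.4% on Aider Polyglot). The function and constant names are illustrative assumptions, and the calculation ignores input-token pricing, batching, and latency, none of which the story specifies.

```python
# Figures quoted in the story; all names below are hypothetical helpers.
PRICE_PER_MILLION_OUTPUT_TOKENS_USD = 0.28
TOKENS_PER_SECOND = 60
TOTAL_EXPERTS = 256
ACTIVE_EXPERTS_PER_TOKEN = 8
AIDER_POLYGLOT_BEFORE_PCT = 17.8
AIDER_POLYGLOT_AFTER_PCT = 48.4


def output_cost_usd(num_output_tokens: int) -> float:
    """Cost of generating the given number of output tokens."""
    return num_output_tokens / 1_000_000 * PRICE_PER_MILLION_OUTPUT_TOKENS_USD


def generation_seconds(num_output_tokens: int) -> float:
    """Time to generate the tokens at the quoted single-stream speed."""
    return num_output_tokens / TOKENS_PER_SECOND


def active_expert_fraction() -> float:
    """Share of experts consulted for each token in the MoE layers."""
    return ACTIVE_EXPERTS_PER_TOKEN / TOTAL_EXPERTS


def aider_gain_points() -> float:
    """Improvement on Aider Polyglot, in percentage points."""
    return AIDER_POLYGLOT_AFTER_PCT - AIDER_POLYGLOT_BEFORE_PCT


if __name__ == "__main__":
    tokens = 100_000
    print(f"Cost for {tokens} output tokens: ${output_cost_usd(tokens):.4f}")
    print(f"Generation time: {generation_seconds(tokens) / 60:.1f} minutes")
    print(f"Active expert fraction: {active_expert_fraction():.2%}")
    print(f"Aider Polyglot gain: {aider_gain_points():.1f} points")
```

Under these assumptions, only about 3% of the experts are active for any given token, which is consistent with the story's description of the model as efficient despite its total parameter count.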


Similar markets

MMLU-Pro • 25%

Other • 25%

Which AI model will next outperform DeepSeek-V3 on LiveBench by end of 2025?

Anthropic's New Model • 25%

Claude 4.0 • 25%

GPT-5 • 25%

Other • 25%


Will DeepSeek-V3 surpass 70 on LiveBench by end of 2025?

Will DeepSeek-V3 achieve a higher LiveBench score than Claude Sonnet 3.5 by March 2025?

Which benchmark will DeepSeek-V3 lead in by December 31, 2025?

Which AI model will next outperform DeepSeek-V3 on LiveBench by end of 2025?

Will DeepSeek-V3 achieve 60% success on Aider benchmark by end of 2025?

Will DeepSeek-V3 outperform GPT-4o in a major benchmark by June 30, 2025?

Will DeepSeek V3 rank first in Aider polyglot benchmark by end of Q1 2025?

Will DeepSeek V3 be integrated into a major commercial application by end of 2025?

Will DeepSeek-V3 surpass GPT-4o in global AI benchmarks by end of 2025?

Will DeepSeek-V3 surpass GPT-4 in MMLU benchmark in independent tests by June 30, 2025?

Will DeepSeek-V3 receive a major update or new version release by December 31, 2025?

Will DeepSeek V3 surpass OpenAI's ChatGPT in a major AI benchmark by June 2025?

Will DeepSeek V3 maintain its pricing advantage over competitors through 2024?

Will DeepSeek V3 surpass Claude 3.5 and Sonnet 3.5 in market share by end of 2024?

Which AI model will have the fastest token generation speed by mid-2025?

Which AI model will lead in coding capabilities by the end of 2024?
