Loading...
Loading...
Browse all stories on DeepNewz
VisitWhich AI model will have the fastest token generation speed by mid-2025?
DeepSeek V3 • 25%
Claude 3.5 • 25%
Sonnet 3.5 • 25%
Other • 25%
Speed tests published by AI research groups or tech publications
DeepSeek Launches V3 Model with 685B Parameters, 60 Tokens/Second, Outperforming Claude 3.5 and Sonnet 3.5 at $0.28/1M Output
Dec 26, 2024, 02:52 AM
DeepSeek has officially launched its V3 model, now available on Hugging Face, featuring 685 billion parameters and a mixture of experts (MoE) architecture with 256 experts and 8 active per token. The model reportedly outperforms competitors such as Claude 3.5 and Sonnet 3.5 on various benchmarks, including a notable 60.4 score on LiveBench. The V3 model is designed to be highly efficient, achieving speeds of 60 tokens per second, which is three times faster than its predecessor, V2. It has also shown substantial improvements in coding capabilities, increasing its performance from 17.8% to 48.4% on the Aider Polyglot leaderboard. Additionally, the pricing structure for DeepSeek V3 is competitive, offering rates of $0.28 per million outputs, significantly lower than those of its main competitors. The release is seen as a major advancement in the open-source AI landscape, with implications for the future of AI development and deployment.
View original story
Other existing model • 25%
DeepSeek V3 • 25%
Claude 3.5 Sonnet • 25%
A new entrant • 25%
DeepSeek-V3 • 25%
GPT-4o • 25%
Llama 3.1 405b • 25%
Sonnet 3.5 • 25%
Llama 3.3 • 25%
Other • 25%
Phi-4 • 25%
Gemini Pro • 25%
OpenAI's GPT-4o • 25%
Meta's Llama 3.1 • 25%
DeepSeek-V3 • 25%
Alibaba's Qwen 2.5 • 25%
Claude 3.5 Sonnet • 25%
DeepSeek-V3 • 25%
GPT-4o • 25%
Other • 25%
Claude 3.5 Sonnet • 25%
Llama 3.1 405b • 25%
DeepSeek-V3 • 25%
GPT-4o • 25%
Meta leads • 25%
DeepSeek V3 leads • 25%
OpenAI leads • 25%
Other AI models lead • 25%
GPT-4o • 25%
DeepSeek-V3 • 25%
Llama 3.1 405b • 25%
Claude Sonnet 3.5 • 25%
Other existing model • 25%
A new entrant • 25%
Claude 3.5 Sonnet • 25%
DeepSeek V3 • 25%
Llama 3.3 • 25%
Other • 25%
Phi-4 • 25%
Gemini Pro • 25%
Other • 25%
OpenAI • 25%
Google • 25%
Microsoft • 25%
Microsoft • 25%
OpenAI • 25%
Meta • 25%
Other • 25%
Claude 3.5 • 25%
Other • 25%
DeepSeek V3 • 25%
Sonnet 3.5 • 25%