Loading...
Loading...
Browse all stories on DeepNewz
VisitWhich AI model will top technical benchmarks by end of 2025?
DeepSeek-V3 • 25%
Meta's Llama 3.1 • 25%
OpenAI's GPT-4o • 25%
Alibaba's Qwen 2.5 • 25%
Results from widely recognized AI benchmarks and competitions
DeepSeek-V3: Chinese AI Startup Releases 671B Parameter Model, Outperforms Leading Competitors
Dec 30, 2024, 01:50 PM
DeepSeek, a Chinese AI startup, has released DeepSeek-V3, a new open-source AI model with 671 billion parameters, utilizing a Mixture-of-Experts (MoE) architecture that activates only 37 billion parameters for specific tasks. The model, trained on 14.8 trillion tokens, achieves a throughput of 60 tokens per second, which is three times faster than its predecessor, DeepSeek-V2. DeepSeek-V3 has demonstrated superior performance in technical tasks, including programming and mathematical problem-solving, outperforming leading models such as Meta's Llama 3.1, OpenAI's GPT-4o, and Alibaba's Qwen 2.5 in various benchmarks. The model is available through Hugging Face and the company's official website, with an API offered to enterprises at promotional pricing until February 8, 2025. DeepSeek's optimizations have allowed it to train the model with 11 times less compute power than similar efforts, highlighting potential limits of US sanctions on AI hardware availability in China.
View original story
Claude 3.5 Sonnet • 25%
DeepSeek-V3 • 25%
GPT-4o • 25%
Other • 25%
Claude Sonnet 3.5 • 25%
GPT-4o • 25%
Llama 3.1 405b • 25%
DeepSeek-V3 • 25%
DeepSeek V3 • 25%
Other existing model • 25%
Claude 3.5 Sonnet • 25%
A new entrant • 25%
Claude 3.5 Sonnet • 25%
Other existing model • 25%
DeepSeek V3 • 25%
A new entrant • 25%
DeepSeek V3 leads • 25%
OpenAI leads • 25%
Meta leads • 25%
Other AI models lead • 25%
DeepSeek-V3 • 25%
GPT-4o • 25%
Llama 3.1 405b • 25%
Claude 3.5 Sonnet • 25%
Google • 25%
Other • 25%
Microsoft • 25%
OpenAI • 25%
Llama 4 • 25%
GPT-5 • 25%
Claude 4 • 25%
Other • 25%
GPT-5 • 25%
Other • 25%
Anthropic's New Model • 25%
Claude 4.0 • 25%
OpenAI's latest model • 25%
Other • 25%
Microsoft's latest model • 25%
Gemini 2.0 Flash Thinking • 25%
Yes • 50%
No • 50%
Yes • 50%
No • 50%
Other • 25%
Meta • 25%
Alibaba • 25%
Microsoft • 25%