Loading...
Loading...
Browse all stories on DeepNewz
VisitWhich platform will host the most downloads of DeepSeek-V3 by June 2025?
HuggingFace • 25%
GitHub • 25%
DeepSeek official site • 25%
Other platforms • 25%
Download statistics from platforms like HuggingFace, GitHub, etc.
DeepSeek Releases Open-Source DeepSeek-V3 Model; Surpasses Llama 3.1 405b, Offers Cheaper API Access
Dec 26, 2024, 11:51 AM
DeepSeek has unveiled DeepSeek-V3, a new open-source language model with 671 billion parameters utilizing a Mixture of Experts (MoE) architecture with 256 experts and 8 activated per token. The model surpasses Llama 3.1 405b, Claude Sonnet 3.5, and GPT-4o on various benchmarks, including ranking first on BigCodeBench-Hard with an average score of 34.5% and achieving a 60.4 LiveBench score. DeepSeek-V3 was trained on 14.8 trillion tokens at a cost of approximately $5.6 million, significantly less than comparable models. The model delivers enhanced capabilities at 60 tokens per second, three times faster than its predecessor. Additionally, DeepSeek-V3 API is 250% cheaper than Sonnet 3.5, priced at $0.27 per million input tokens and $1.10 per million output tokens. DeepSeek-V3 is fully open-source and available on platforms such as HuggingFace, with API compatibility intact. The release marks a significant advancement in open-source AI, demonstrating that high-performing models can be developed with limited compute resources.
View original story
More than 1,000,000 • 25%
Less than 100,000 • 25%
100,000 to 500,000 • 25%
500,001 to 1,000,000 • 25%
Amazon • 25%
Google • 25%
Microsoft • 25%
Other • 25%
Asia • 25%
North America • 25%
Other • 25%
Europe • 25%
North America • 25%
Asia-Pacific • 25%
Other • 25%
Europe • 25%
Education • 25%
Healthcare • 25%
Finance • 25%
Technology • 25%
1 million • 25%
5 million • 25%
10 million • 25%
Other • 25%
MATH-500 • 25%
Other • 25%
MMLU • 25%
MMLU-Pro • 25%
Yes • 50%
No • 50%
Data Analysis • 25%
Other • 25%
Coding Assistance • 25%
Natural Language Processing • 25%
Less than 10% • 25%
Greater than 50% • 25%
Between 10% and 30% • 25%
Between 30% and 50% • 25%
Content Creation • 25%
Other • 25%
Research and Development • 25%
Customer Service • 25%
Yes • 50%
No • 50%
DeepSeek-V3 • 25%
GPT-4o • 25%
Llama 3.1 405b • 25%
Sonnet 3.5 • 25%