Groq's Llama3 AI Model Hits 1,000+ T/s, Outpaces GPT4
May 10, 2024, 10:48 PM
Meta's open-source Llama3 model, served on Groq Inc.'s inference platform, has drawn attention for its speed. Running on Groq, it is reported to generate more than 1,000 tokens per second (T/s) and to process requests about four times faster than the latest GPT4 model: the Llama3-70b-8192 model completes requests in approximately 25% of the time GPT4 takes. Llama3 also uses grouped query attention (GQA) across its models, which improves inference efficiency and overall performance. Because the model is open source, it can be widely used and customized, further democratizing AI technology.
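To make the GQA efficiency claim concrete, here is a minimal sketch of how sharing key/value heads shrinks the key-value cache a model keeps per sequence during inference. The layer count, head counts, and head dimension below are illustrative assumptions for a 70B-class model, not figures from the story; the 8,192-token context is taken from the model name Llama3-70b-8192, and `kv_cache_bytes` is a hypothetical helper written for this example.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len, bytes_per_value=2):
    """Bytes needed to cache keys and values for one sequence.

    The leading factor of 2 counts one key tensor and one value tensor per layer.
    bytes_per_value=2 assumes 16-bit values (an assumption, not from the story).
    """
    return 2 * num_layers * num_kv_heads * head_dim * seq_len * bytes_per_value


# Illustrative assumptions for a 70B-class model (not stated in the story).
NUM_LAYERS = 80
NUM_QUERY_HEADS = 64   # standard multi-head attention keeps one KV head per query head
NUM_KV_HEADS = 8       # grouped query attention shares each KV head across a group of query heads
HEAD_DIM = 128
SEQ_LEN = 8192         # context length suggested by the name Llama3-70b-8192

mha_bytes = kv_cache_bytes(NUM_LAYERS, NUM_QUERY_HEADS, HEAD_DIM, SEQ_LEN)
gqa_bytes = kv_cache_bytes(NUM_LAYERS, NUM_KV_HEADS, HEAD_DIM, SEQ_LEN)
reduction = mha_bytes / gqa_bytes

print(f"Multi-head attention cache: {mha_bytes / 2**30:.1f} GiB per sequence")
print(f"Grouped query attention cache: {gqa_bytes / 2**30:.1f} GiB per sequence")
print(f"Reduction factor: {reduction:.0f}x")
```

Under these assumptions the cache shrinks by the ratio of query heads to key/value heads, which means less memory traffic for each generated token. This is one plausible reason GQA helps inference efficiency; it does not by itself account for the throughput figures reported for Groq's platform.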