Loading...
Loading...
Browse all stories on DeepNewz
VisitGroq's Llama3 AI Model Hits 1,000+ T/s, Outpaces GPT4
May 10, 2024, 10:48 PM
Groq Inc.'s latest AI model, Llama3, has been making significant strides in the AI industry with its advanced capabilities. The model, which operates on Groq's platform, has been noted for its exceptional speed, blazing through more than 1,000+ T/s and processing requests four times faster than the latest GPT4 model. This performance enhancement is attributed to the Llama3-70b-8192 model, which completes requests in approximately 25% of the time it takes GPT4. Additionally, Llama3 has introduced grouped query attention (GQA) across its models, improving inference efficiency and overall performance. The open-source nature of Llama3 also facilitates widespread use and customization, further democratizing AI technology.
View original story
Falcon 2 • 33%
Meta's Llama 3 • 33%
OpenAI's latest model • 34%
Falcon 2 • 33%
Meta's Llama 3 • 33%
OpenAI's models • 34%
Phi-3 14B • 25%
Mixtral 8x7B • 25%
GPT-3.5 • 25%
Llama-3 8B • 25%
Falcon 2 • 33%
Meta's Llama 3 • 33%
OpenAI's models • 34%
OpenAI • 25%
Google DeepMind • 25%
Anthropic • 25%
Microsoft • 25%
GPT-4o • 25%
Claude 3 • 25%
Google Bard • 25%
Other • 25%
Llama3 • 25%
NVIDIA's Latest Model • 25%
BERT • 25%
GPT-4 • 25%