DeepNewz Markets

Llama 3 on Groq Hits 1,000+ T/s, Outpaces GPT-4

May 10, 2024, 10:48 PM

Meta's Llama 3 models, served on Groq Inc.'s inference platform, have drawn attention for their speed. Groq's deployment is reported to generate more than 1,000 tokens per second (T/s) and to process requests about four times faster than the latest GPT-4 model. The comparison is attributed to the Llama3-70b-8192 model, which reportedly completes requests in approximately 25% of the time GPT-4 takes. Llama 3 also uses grouped query attention (GQA) across its model sizes, which improves inference efficiency. Because the Llama 3 weights are openly available, the models can be widely used and customized, which broadens access to AI technology.
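The "four times faster" and "approximately 25% of the time" figures are two statements of the same ratio. A minimal sketch of that arithmetic follows, assuming a steady generation rate and ignoring queueing and prompt processing. The response length and the slower model's rate are hypothetical values chosen for illustration, not measurements.

```python
def completion_time_s(num_tokens: int, tokens_per_second: float) -> float:
    """Seconds to generate num_tokens at a steady rate (simplified model)."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Illustrative assumptions only; none of these are benchmark results.
FAST_TPS = 1000.0        # the "1,000+ T/s" figure quoted in the story
SLOW_TPS = FAST_TPS / 4  # hypothetical model running four times slower
RESPONSE_TOKENS = 500    # hypothetical response length

fast = completion_time_s(RESPONSE_TOKENS, FAST_TPS)
slow = completion_time_s(RESPONSE_TOKENS, SLOW_TPS)
time_ratio = fast / slow  # fraction of the slower model's time

print(f"fast: {fast} s, slow: {slow} s, ratio: {time_ratio:.0%}")
```

Under these assumptions, a fourfold throughput advantage means the faster model finishes the same response in one quarter of the time, whatever the response length.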

Markets

Will Groq release an upgraded Llama3 model by May 2025?

Starting odds: Yes 50%, No 50%

Resolution source: Official announcements from Groq

Will Llama3 become the most used AI model in academia by mid-2025?

Starting odds: Yes 50%, No 50%

Resolution source: Surveys or usage data reports from major academic institutions

Will Llama3 outperform OpenAI's next model in speed by end of 2024?

Starting odds: Yes 50%, No 50%

Resolution source: Standardized AI speed testing results published by an independent AI research organization

AI Model with Highest Commercial Adoption by 2024

Starting odds: Llama3 25%, NVIDIA's Latest Model 25%, BERT 25%, GPT-4 25%

Resolution source: Market adoption reports from credible tech analysis firms

Fastest AI model by end of 2024

Starting odds: Next-gen OpenAI model 34%, Llama3 33%, GPT-4 33%

Resolution source: Standardized AI speed testing results published by an independent AI research organization

Leading AI Innovation Company in 2024

Starting odds: OpenAI 25%, Groq 25%, Google DeepMind 25%, NVIDIA 25%

Resolution source: Analysis from tech industry reports and AI model release announcements