Groq Inc. Achieves 40,792 Tokens/s on Llama3 70B Model in AI Breakthrough
Jun 6, 2024, 03:57 PM
Groq Inc. has made significant advances in AI language-model inference, particularly with the Llama3 models. The company reports an input rate of 40,792 tokens per second on the Llama3 70B model, using FP16 multiply with FP32 accumulate operations; this follows its previous milestone of 30,000 tokens per second on the Llama3 8B model. The gains are attributed to an approach that eliminates MatMul operations in favor of addition and negation, a method that has maintained strong performance at billion-parameter scales while reducing memory usage by up to 61%. Groq's technology also demonstrates fast, high-precision inference, processing roughly 8,000 tokens in 0.2 seconds with lossless precision, and has sustained over 1,200 tokens per second on Llama3 8B at 13 W, moving LLMs closer to brain-like efficiency.
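The idea of replacing MatMul with addition and negation can be illustrated with a minimal sketch: if each weight is constrained to {-1, 0, +1} (a ternary scheme), every output element becomes a running sum of added, subtracted, or skipped inputs, so no multiplications are needed. The function below is a hypothetical illustration of that principle, not Groq's actual implementation; the higher-precision accumulator mirrors the FP32-accumulate pattern mentioned above.

```python
def ternary_matvec(weights, x):
    """Matrix-vector product without multiplications.

    weights: list of rows, each entry in {-1, 0, 1}.
    x: input vector of floats.
    Each multiply-by-weight is replaced by an add, a negate-and-add,
    or a skip, so the whole product is multiplication-free.
    """
    out = []
    for row in weights:
        acc = 0.0  # accumulate in full precision (analogous to FP32 accumulate)
        for w, v in zip(row, x):
            if w == 1:
                acc += v      # +1 weight: plain addition
            elif w == -1:
                acc -= v      # -1 weight: negation, then addition
            # w == 0: contribution is zero, skip entirely
        out.append(acc)
    return out

W = [[1, -1, 0],
     [0, 1, 1]]
x = [2.0, 3.0, 4.0]
print(ternary_matvec(W, x))  # [-1.0, 7.0]
```

The memory savings reported for this family of methods come from the same constraint: a ternary weight needs about 1.6 bits of information rather than 16, which is consistent with reductions on the order of tens of percent once activations and other state are included.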