Meta's Llama 3.1 Models Offer 4X Cheaper AI Deployments, Achieve 99% Performance
Aug 14, 2024, 09:16 AM
Meta's AI division has released the Llama 3.1 models, which Neural Magic's research team has quantized to 4 bits. The quantization enables deployments roughly 4X cheaper, shrinking the hardware requirement from two 8x80GB nodes to a single 4x80GB node. The quantized models (405B, 70B, and 8B) recover approximately 100% of the original performance. The HQQ Llama-3.1-70B model in particular achieves 99% of the base model's performance across various benchmarks, a significant advance in model efficiency and cost-effectiveness.
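The hardware claim follows from simple arithmetic on weight storage: going from 16-bit to 4-bit weights cuts the memory footprint by 4X, which is what lets a 405B-parameter model drop from two 8x80GB nodes to one 4x80GB node. A minimal sketch of that estimate (illustrative only; it ignores KV cache, activations, and quantization overhead such as scales and zero-points):

```python
# Back-of-envelope weight-memory estimate at different bit-widths.
# This is a rough illustration of the article's 4X claim, not a
# deployment calculator.

def weight_memory_gb(params_billion: float, bits_per_weight: int) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes)."""
    return params_billion * 1e9 * bits_per_weight / 8 / 1e9

# Llama 3.1 405B at 16-bit vs. 4-bit:
fp16_gb = weight_memory_gb(405, 16)  # 810 GB: exceeds one 8x80GB (640 GB) node
int4_gb = weight_memory_gb(405, 4)   # 202.5 GB: fits one 4x80GB (320 GB) node

print(f"405B @ 16-bit: {fp16_gb:.1f} GB of weights")
print(f"405B @ 4-bit:  {int4_gb:.1f} GB of weights")
```

At 16 bits the weights alone overflow a single 8x80GB node (hence two nodes), while at 4 bits they fit comfortably within a single 4x80GB node, with headroom left for activations and KV cache.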
Markets
No • 50%
Yes • 50%
Resolution source: Public announcements or press releases from major tech companies

No • 50%
Yes • 50%
Resolution source: Public financial reports or cost analysis from companies using the models

Yes • 50%
No • 50%
Resolution source: Benchmark results published by independent AI research organizations

Meta's Llama 3.1-70B • 25%
OpenAI's GPT-4 • 25%
Google's Bard • 25%
Other • 25%
Resolution source: Performance benchmarks published by independent AI research organizations

Meta's Llama 3.1 models • 25%
OpenAI's GPT-4 models • 25%
Google's Bard models • 25%
Other • 25%
Resolution source: Market analysis reports from AI industry analysts

Meta's Llama 3.1-70B • 25%
OpenAI's GPT-4 quantized • 25%
Google's Bard quantized • 25%
Other • 25%
Resolution source: Public announcements or press releases from companies adopting the models