DeepNewz Markets

Market

Which feature of NVIDIA's AI Foundry will be most popular by end of 2024?

Llama•Hugging Face•Together Inference•NVIDIA•AI Foundry•AI•NVIDIA Nemotron•NVIDIA NeMo Retriever

Resolution / Starting Odds

Custom generative AI models • 25%

Synthetic data generation • 25%

Fine-tuning • 25%

NeMo Retriever microservices • 25%

Usage statistics or surveys from NVIDIA or third-party market research firms

Story

Groq Inc. and NVIDIA Turbocharge Llama 3.1 405B Model for Record-Breaking Speeds and Cost Efficiency

Jul 23, 2024, 03:18 PM

Groq Inc. has turbocharged the Llama 3.1 model, achieving record-breaking speeds and cost efficiency. The Llama 3.1 405B model, hosted by Groq Inc., runs at speeds up to 330 tokens per second, making it 100 times faster than previous models. This advancement is expected to significantly reduce costs, with some estimates suggesting it could be 10 times cheaper. The model is also available for download on Hugging Face. Additionally, Groq Inc. has partnered with Together Inference and Fine-tuning to bring these models to a broader audience, with speeds of up to 400 tokens per second for the Llama 3.1 8B model. NVIDIA has also announced its AI Foundry service, which will allow enterprises and nations to build custom generative AI models using Llama 3.1 405B and NVIDIA Nemotron models, with comprehensive features including synthetic data generation and fine-tuning. The Llama 3.1 70B model with 128k context is also part of this offering, and NVIDIA NeMo Retriever microservices are included for accurate responses.

View original story

Similar markets

What will be the most popular feature of NVIDIA's Fugatto AI model among users by June 30, 2025?

Voice Modification • 25%

Music Generation • 25%

Sound Transformation • 25%

Other • 25%

Which AI model will have the highest usage on SambaNova Cloud by mid-2024?

Llama 3.1 8B • 25%

Llama 3.1 70B • 25%

Llama 3.1 405B • 25%

Other • 25%

What will be the most popular use case for NVIDIA's NIM Agent Blueprints by the end of 2024?

Digital humans • 25%

PDF data extraction • 25%

Virtual screenings • 25%

Other • 25%

What will be the most significant feature enhancement for Nvidia's Fugatto AI model by December 31, 2025?

Increased Parameters • 25%

New Audio Inputs • 25%

Enhanced Output Quality • 25%

Other • 25%

Which industry will widely adopt NVIDIA's Fugatto AI model first by December 31, 2025?

Music Industry • 25%

Film Industry • 25%

Gaming Industry • 25%

Other • 25%

Which industry will first widely adopt Nvidia's Fugatto AI by 2025?

Music • 25%

Gaming • 25%

Film • 25%

Advertising • 25%

What will be the primary application sector for Nvidia's Fugatto AI model by December 31, 2025?

Music Production • 25%

Advertising • 25%

Gaming • 25%

Other • 25%

What will be the primary focus of Nvidia and Accenture's AI training by mid-2025?

Machine Learning • 25%

Natural Language Processing • 25%

Computer Vision • 25%

Other • 25%

Sector most impacted by NVIDIA AI automation by end of 2025?

Manufacturing • 25%

Healthcare • 25%

Transportation • 25%

Retail • 25%

Which feature of Together AI's Platform will be most praised by end of 2024?

Faster Inference Speed • 25%

Cost Reduction • 25%

Security in Private Clouds • 25%

Ease of Integration • 25%

Which AI image model will be the most popular on RenderNet by end of Q1 2025?

Flux AI • 25%

Midjourney • 25%

ChatGPT • 25%

Other • 25%

What will be the next AI feature integrated into NVIDIA's Holoscan for Media by end of 2024?

Image Recognition • 25%

Natural Language Processing • 25%

Predictive Analytics • 25%

Automated Editing • 25%

Market

Story

Similar markets

What will be the most popular feature of NVIDIA's Fugatto AI model among users by June 30, 2025?

Which AI model will have the highest usage on SambaNova Cloud by mid-2024?

What will be the most popular use case for NVIDIA's NIM Agent Blueprints by the end of 2024?

What will be the most significant feature enhancement for Nvidia's Fugatto AI model by December 31, 2025?

Which industry will widely adopt NVIDIA's Fugatto AI model first by December 31, 2025?

Which industry will first widely adopt Nvidia's Fugatto AI by 2025?

What will be the primary application sector for Nvidia's Fugatto AI model by December 31, 2025?

What will be the primary focus of Nvidia and Accenture's AI training by mid-2025?

Sector most impacted by NVIDIA AI automation by end of 2025?

Which feature of Together AI's Platform will be most praised by end of 2024?

Which AI image model will be the most popular on RenderNet by end of Q1 2025?

What will be the next AI feature integrated into NVIDIA's Holoscan for Media by end of 2024?

Will Groq Inc.'s Llama 3.1 405B model achieve a 10x cost reduction by end of 2024?

Will NVIDIA's AI Foundry service be adopted by at least 5 Fortune 500 companies by mid-2025?

Will the Llama 3.1 405B model be downloaded more than 10,000 times on Hugging Face by end of 2024?

Which entity will achieve the highest token generation speed for Llama 3.1 models by end of 2024?

Will Groq Inc.'s Llama 3.1 405B model achieve a 10x cost reduction by end of 2024?

Will NVIDIA's AI Foundry service be adopted by at least 5 Fortune 500 companies by mid-2025?

Will the Llama 3.1 405B model be downloaded more than 10,000 times on Hugging Face by end of 2024?

Which entity will achieve the highest token generation speed for Llama 3.1 models by end of 2024?