Which AI model will have the highest usage on SambaNova Cloud by mid-2024?
Llama 3.1 8B • 25%
Llama 3.1 70B • 25%
Llama 3.1 405B • 25%
Other • 25%
SambaNova usage statistics
SambaNova Launches Fastest AI Platform with Record 132 Tokens/Sec for Llama 3.1 405B
Sep 10, 2024, 02:48 PM
SambaNova has launched its new cloud inference platform, SambaNova Cloud, giving developers access to the Llama 3.1 models (8B, 70B, and 405B) running on its custom AI chips. The platform sets a new record for inference speed, achieving 132 tokens per second for Llama 3.1 405B at full precision and 570 tokens per second for Llama 3.1 70B, roughly 10 times faster than traditional GPUs. The API is free to use with no waitlist, enabling developers to build advanced AI applications. Separately, Llama 3.1 405B reaches 100 tokens per second on the TogetherCompute API, with a 128k long-context version coming soon.
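To put the quoted throughput figures in perspective, here is a minimal back-of-envelope sketch (in Python) of how long each service would take to generate a fixed number of output tokens. The rates come straight from the article; the calculation assumes a steady decode rate and ignores prompt processing and network latency.

```python
# Throughput figures quoted in the article (tokens per second).
RATES = {
    "Llama 3.1 405B (SambaNova Cloud)": 132.0,
    "Llama 3.1 70B (SambaNova Cloud)": 570.0,
    "Llama 3.1 405B (TogetherCompute)": 100.0,
}

def generation_time(num_tokens: int, tokens_per_sec: float) -> float:
    """Seconds to emit num_tokens at a steady decode rate
    (ignores prompt processing and network latency)."""
    return num_tokens / tokens_per_sec

for name, rate in RATES.items():
    print(f"{name}: {generation_time(1000, rate):.1f} s per 1,000 tokens")
```

At these rates, a 1,000-token response takes about 7.6 s on the 405B model and under 2 s on the 70B model, which is what makes interactive use of a 405B-parameter model notable.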