Which major AI research institution will adopt Cerebras Inference service by mid-2025?
Google DeepMind • 25%
OpenAI • 25%
Microsoft Research • 25%
Other • 25%
Resolution source: Official announcements from AI research institutions or Cerebras Systems
Cerebras Launches AI Inference Service, 20x Faster with 1,850 Tokens/sec for 8B Model
Aug 27, 2024, 05:22 PM
Cerebras Systems has launched what it claims is the world's fastest AI inference service, directly challenging Nvidia's dominance in AI computing. The new service, Cerebras Inference, runs on the company's custom wafer-scale AI accelerator chips and processes Llama 3.1 models at 1,850 tokens per second for the 8B model and 450 tokens per second for the 70B model. Cerebras asserts that the service is 20x faster than traditional GPU-based systems and, at 60 cents per million tokens, one-fifth the cost of hyperscaler offerings. The launch positions the service as a cost-effective alternative to Nvidia's GPUs, with Cerebras claiming 100x better price-performance.
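The quoted figures can be turned into a back-of-envelope comparison. The sketch below uses only the throughput and price numbers stated above; the hyperscaler baseline price is an assumption derived from the article's "one-fifth the cost" claim, and the workload size is hypothetical.

```python
def generation_time_s(num_tokens: int, tokens_per_sec: float) -> float:
    """Seconds to generate num_tokens at a given decode throughput."""
    return num_tokens / tokens_per_sec

def cost_usd(num_tokens: int, usd_per_million_tokens: float) -> float:
    """Dollar cost of num_tokens at a per-million-token price."""
    return num_tokens / 1_000_000 * usd_per_million_tokens

# Figures quoted in the article.
CEREBRAS_8B_TPS = 1850        # Llama 3.1 8B, tokens/sec
CEREBRAS_70B_TPS = 450        # Llama 3.1 70B, tokens/sec
CEREBRAS_PRICE = 0.60         # USD per million tokens

# Implied hyperscaler baseline (assumption: 5x the Cerebras price).
BASELINE_PRICE = CEREBRAS_PRICE * 5

tokens = 10_000_000  # hypothetical monthly token volume
print(f"8B decode time:  {generation_time_s(tokens, CEREBRAS_8B_TPS):,.0f} s")
print(f"70B decode time: {generation_time_s(tokens, CEREBRAS_70B_TPS):,.0f} s")
print(f"Cerebras cost:   ${cost_usd(tokens, CEREBRAS_PRICE):.2f}")
print(f"Baseline cost:   ${cost_usd(tokens, BASELINE_PRICE):.2f}")
```

At 10M tokens the price gap is small in absolute terms ($6.00 vs. $30.00 under these assumptions), which is why the article frames the advantage as price-performance rather than price alone.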