What will be the main advantage cited for adopting the MatMul-free language model by end of 2024?
Memory reduction • 33%
GPU efficiency • 33%
Performance at scale • 33%
Public statements or press releases by companies adopting the model
Researchers Develop Scalable MatMul-Free Language Model with 61% Memory Reduction
Jun 6, 2024, 12:12 PM
Researchers have developed a scalable, MatMul-free language model that eliminates matrix multiplication operations while maintaining strong performance at billion-parameter scales. The new approach, which replaces MatMul operations with addition and negation, has been shown to reduce memory usage by up to 61% and improve GPU efficiency. The model processes billion-parameter-scale models at 13 W, beyond human-readable throughput, moving large language models (LLMs) closer to brain-like efficiency. The implementation has been a collaborative effort involving researchers W Guo, J Long, Y Zeng, and Z Liu from Princeton University, Stevens Institute of Technology, and the University of Pennsylvania.
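To illustrate the core idea of replacing MatMul with addition and negation: when weights are quantized to the ternary set {-1, 0, +1}, a matrix–vector product reduces to sign-controlled accumulation with no multiplies. The sketch below is a minimal illustration of that principle only (function names are hypothetical), not the researchers' actual implementation, which also involves hardware-efficient kernels and quantization-aware training.

```python
import numpy as np

def ternary_matvec(W, x):
    """MatMul-free matrix-vector product for ternary weights W in {-1, 0, +1}.

    Each output element is built purely from additions and negations:
    +1 adds the input element, -1 subtracts it, 0 skips it.
    """
    out = np.zeros(W.shape[0], dtype=float)
    for i in range(W.shape[0]):
        acc = 0.0
        for j in range(W.shape[1]):
            w = W[i, j]
            if w == 1:
                acc += x[j]   # addition only
            elif w == -1:
                acc -= x[j]   # negation + addition
            # w == 0 contributes nothing
        out[i] = acc
    return out
```

Because every weight is -1, 0, or +1, the result matches an ordinary `W @ x`, but no multiplication instruction is needed, which is what enables the reported memory and efficiency gains on suitable hardware.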