DeepNewz Markets

Market

Will AdEMAMix be featured in major AI conferences by end of 2024?

Apple•EPFL•AdEMAMix•AdamW•Adam•FedOpt•DiLoCo

Resolution / Starting Odds

Yes, at NeurIPS 2024 • 25%

Yes, at ICML 2024 • 25%

Yes, at both NeurIPS and ICML • 25%

No, it will not be featured in any major AI conferences • 25%

Conference agendas and presentations from major AI conferences like NeurIPS, ICML, etc.

Story

Apple and EPFL Introduce AdEMAMix, a Novel AI Optimizer with 1.95x Improvement

Sep 10, 2024, 01:11 AM

Researchers from Apple and EPFL have introduced AdEMAMix, a novel optimizer that mixes two exponential moving averages (EMAs) of the gradient to make better use of past gradients and improve large-scale model training. The optimizer, reportedly implemented in about 120 lines of code, claims a 1.95x improvement in token efficiency over the widely used AdamW optimizer: AdamW needs about 95% more training tokens to reach the same level of performance, which means AdEMAMix gets there with roughly 49% fewer tokens. The approach uses two EMAs in the numerator of the Adam update, a fast one with a low beta and a slow one with a high beta. This combination could explain its strong performance in various optimization scenarios, including FedOpt variants like DiLoCo.
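
The dual-EMA idea described above can be sketched for a single scalar parameter as follows. This is a minimal illustration, not the authors' implementation: the function and state names are hypothetical, and the mixing coefficient alpha, the bias correction on the fast average, and all default values are assumptions that the story does not state.

```python
import math


def new_state():
    """Fresh optimizer state for one scalar parameter (hypothetical layout)."""
    return {"t": 0, "m_fast": 0.0, "m_slow": 0.0, "v": 0.0}


def ademamix_step(theta, grad, state, lr=1e-3, beta_fast=0.9,
                  beta_var=0.999, beta_slow=0.9999, alpha=5.0, eps=1e-8):
    """One AdEMAMix-style update: two gradient EMAs in the Adam numerator.

    alpha (weight of the slow EMA) and the default betas are illustrative
    assumptions, not values taken from the story.
    """
    state["t"] += 1
    t = state["t"]

    # Fast EMA (low beta): reacts quickly to recent gradients.
    state["m_fast"] = beta_fast * state["m_fast"] + (1 - beta_fast) * grad
    # Slow EMA (high beta): retains information from much older gradients.
    state["m_slow"] = beta_slow * state["m_slow"] + (1 - beta_slow) * grad
    # Second-moment EMA, as in Adam.
    state["v"] = beta_var * state["v"] + (1 - beta_var) * grad * grad

    # Bias-correct the fast EMA and the second moment (assumed, as in Adam).
    m_fast_hat = state["m_fast"] / (1 - beta_fast ** t)
    v_hat = state["v"] / (1 - beta_var ** t)

    # Numerator mixes the fast EMA with the alpha-weighted slow EMA.
    update = (m_fast_hat + alpha * state["m_slow"]) / (math.sqrt(v_hat) + eps)
    return theta - lr * update


# Toy usage: a few steps on f(x) = x^2, whose gradient is 2x.
theta = 5.0
state = new_state()
for _ in range(100):
    theta = ademamix_step(theta, 2 * theta, state, lr=0.01)
print(theta)
```

Setting alpha to 0 in this sketch drops the slow average and leaves a plain Adam-style step without weight decay, which makes the role of the second EMA easy to isolate.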

View original story

Similar markets

Will Codeium's AI system be featured in a major industry conference by end of 2024?

Yes • 50%

No • 50%

Which AI conference will first feature a presentation on LiquidAI's LFMs by June 2025?

NeurIPS • 25%

ICML • 25%

CVPR • 25%

Other • 25%

Will Zyphra's Tree Attention algorithm be recognized at a major AI conference by end of 2024?

Will CRAB framework be featured in a major AI conference keynote by end of 2024?

At which major AI conference will LiveBench AI be featured first by end of 2024?

Will 'The AI Scientist' be featured in a TED Talk by end of 2024?

Will CircuitNet be featured in a major AI conference keynote by end of 2024?

Will SAM 2 be cited in at least one major academic paper or conference by end of 2024?

Will major AI research institutions adopt Gemma Scope by end of 2024?

MLE-bench adopted as standard benchmark by major AI conference by mid-2025?

Will a peer-reviewed paper on DynamoLLM be published in a top-tier AI journal by end of 2024?

Which AI model will win the 'Best AI Innovation' award at a major AI conference in 2024?

Will AdEMAMix be adopted by a major tech company (excluding Apple) by end of 2024?

Will AdEMAMix be cited in at least 50 academic papers by June 2025?

Will AdEMAMix be integrated into a popular open-source AI framework by March 2025?

Will AdEMAMix be implemented in commercial AI products by end of 2025?
