Meta's 2024 Multi-token Prediction Makes LLMs 3x Faster
May 2, 2024, 03:23 PM
Meta has introduced a method for training large language models (LLMs) such as GPT and Llama, described in the paper 'Better & Faster Large Language Models via Multi-token Prediction'. The 2024 paper, by F Gloeckle, B Y Idrissi, B Rozière, D Lopez-Paz, and G Synnaeve of FAIR at Meta, trains models to predict four future tokens at once rather than only the next token. The method has been shown to improve sample efficiency and to speed up inference by up to three times.
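To make the idea concrete, here is a minimal sketch of how training targets change when a model predicts several future tokens per position. It is an illustration under stated assumptions, not the paper's implementation: the function names `multi_token_targets` and `multi_token_loss` are hypothetical, each prediction head is represented by a plain dictionary of token probabilities, and the per-head losses are simply summed.

```python
import math


def multi_token_targets(tokens, n_future=4):
    """Pair each context prefix with the next n_future tokens (hypothetical helper)."""
    examples = []
    # Stop early enough that every position has a full set of n_future targets.
    for t in range(len(tokens) - n_future):
        context = tokens[: t + 1]
        targets = tokens[t + 1 : t + 1 + n_future]
        examples.append((context, targets))
    return examples


def multi_token_loss(head_probs, targets):
    """Sum of per-head cross-entropies (assumed equal weighting).

    head_probs[i] maps a token to the probability that head i assigns it
    for the position i + 1 steps ahead.
    """
    return -sum(math.log(head_probs[i][tok]) for i, tok in enumerate(targets))


if __name__ == "__main__":
    for context, targets in multi_token_targets([10, 11, 12, 13, 14, 15]):
        print(context, "->", targets)
```

With ordinary next-token training, each position would carry a single target; here each position carries four, one per head, which is where the extra training signal per sample comes from.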