
Meta open-sources new language models built for multi-token prediction. #NLP

Meta Platforms Inc. has released four open-source language models that implement a machine learning approach called multi-token prediction. Instead of generating one token at a time, these models generate four, an approach aimed at making large language models both faster and more accurate. The models are designed for code generation tasks, each has 7 billion parameters, and they were trained on large datasets of code samples. Meta also developed a fifth model with 13 billion parameters. Architecturally, each model consists of a shared trunk that performs the initial computations and four output heads that each generate one token, so a single forward pass produces four tokens at once.
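The trunk-plus-heads structure can be sketched in a few lines. The sketch below is illustrative only: it stands in for the transformer trunk and heads with simple linear layers, and all names, sizes, and the `tanh` activation are assumptions, not details of Meta's models.

```python
import numpy as np

rng = np.random.default_rng(0)
D, V, N_HEADS = 16, 100, 4  # hidden size, vocab size, head count (illustrative)

# Shared trunk: stand-in for the transformer body (here a single linear layer).
W_trunk = rng.normal(size=(D, D))
# One output projection per head; head i predicts the token at position t + i + 1.
W_heads = [rng.normal(size=(D, V)) for _ in range(N_HEADS)]

def predict_next_tokens(x):
    """x: (D,) embedding of the current context -> four predicted token ids."""
    h = np.tanh(W_trunk @ x)          # trunk runs once per decoding step
    return [int(np.argmax(h @ W)) for W in W_heads]  # each head emits one token

tokens = predict_next_tokens(rng.normal(size=D))
assert len(tokens) == N_HEADS  # four tokens from a single forward pass
```

The key property the sketch preserves is that the expensive trunk computation happens once per step and is shared by all four heads, which is what makes emitting four tokens per pass cheap.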

The company’s researchers believe that multi-token prediction may improve code quality by mitigating the limitations of traditional teacher-forcing training methods. In Meta’s benchmark tests, the new models outperformed conventional one-token-at-a-time models in both accuracy and speed: they scored 17% and 12% better on coding tasks and generated output three times faster.
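The article does not explain where the threefold speedup comes from, but a rough accounting makes it plausible: if each forward pass can emit up to four tokens and only some of the extra tokens survive downstream checks, the number of passes per output drops accordingly. The acceptance rate below is a number chosen purely to illustrate the arithmetic, not a figure from Meta.

```python
# Illustrative accounting only, not Meta's measurement: each pass emits
# 1 guaranteed token plus (k - 1) extra tokens, of which a fraction
# `accept` is assumed to be usable.

def passes_needed(n_tokens, k, accept):
    tokens_per_pass = 1 + accept * (k - 1)  # expected tokens kept per pass
    return n_tokens / tokens_per_pass

baseline = passes_needed(1000, 1, 0.0)   # one token per forward pass
multi = passes_needed(1000, 4, 0.67)     # four heads, ~2/3 of extras kept
print(baseline / multi)                  # roughly 3x fewer forward passes
```

Under these assumed numbers, the multi-head decoder needs about a third as many forward passes for the same output length, matching the order of the reported speedup.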

Overall, Meta’s new language models represent a significant advancement in machine learning for code generation tasks. The company’s research suggests that the multi-token prediction approach may offer improvements over traditional training methods, leading to higher-quality code generation.

Source link: https://siliconangle.com/2024/07/04/meta-open-sources-new-multi-token-prediction-language-models/
