NVIDIA Reveals DoRA: Advanced AI Model Fine-Tuning Method #DoRA

NVIDIA has introduced a new fine-tuning method called DoRA, which improves upon the existing LoRA method without adding extra inference overhead. DoRA has shown significant performance enhancements in various language and vision models, outperforming LoRA in tasks like common-sense reasoning and multi-turn benchmarks. The method has been accepted at ICML 2024, indicating its potential impact in machine learning.

DoRA decomposes pretrained weights into magnitude and directional components, fine-tuning both efficiently. Visualizations show that DoRA makes substantial directional adjustments while maintaining magnitude, resembling full fine-tuning patterns. Across different models, DoRA consistently outperforms LoRA in tasks like commonsense reasoning, image-text understanding, and visual instruction tuning.

DoRA can be integrated into the QLoRA framework for low-bit pretrained models, showing superior accuracy compared to FT and QLoRA. In text-to-image generation applications like DreamBooth, DoRA produces better results than LoRA in challenging datasets.

The method is expected to become a standard choice for fine-tuning AI models, compatible with existing methods and suitable for applications like NVIDIA Metropolis, NeMo, NIM, and TensorRT. For more information, refer to the NVIDIA Technical Blog.

Source link

Source link: https://blockchain.news/news/nvidia-unveils-dora-superior-fine-tuning-method-ai-models

NVIDIA Reveals DoRA: Advanced AI Model Fine-Tuning Method #DoRA

OpenAI committed to learning from India for impactful AI. #AIImpact

AI creates entire video – mind-blowing technology! #innovation

AI creates entire video – mind-blowing technology! #innovation

Market insights and trends in immune health supplements industry. #HealthSupplements

Google’s top Gemini demo was fabricated with #deception.

Playful children enjoy splashing in puddles during midjourney adventures. #ChildhoodJoy

Meta introduces cutting-edge AI tool for converting text to 3D. #AIgenerators

Evaluating large language models: The power of collaboration #NLP

Honor Magic V3: Price, Release Date, Spec Rumors #smartphone

Apple gains observer seat on OpenAI board. #AIobserver

#Chameleon: Meta’s AI outshines GPT-4 and Gemini in mixed-modal capabilities

Evaluating large language models: The power of collaboration #NLP

#Chameleon: Meta’s AI outshines GPT-4 and Gemini in mixed-modal capabilities

The crucial link between LLMs, language, and cognition #connection

#UniversityofToronto researchers introduce superior deep-learning model for peptide structure prediction. #AI

East Asian Languages Chapter by Henry Heng LUO, Jun 2024 #Languages

Enhancing Communication with AI Voice Tools for Efficiency #AIVoiceTools

Share this: