in

Together.ai launches Dragonfly: Enhanced Vision-Language Model with Multi-Resolution Zoom #AIinnovation

NVIDIA Partners with Hugging Face to Simplify AI Model Deployments

Together.ai has launched Dragonfly, a vision-language model that enhances fine-grained visual understanding and reasoning about image regions. The model architecture utilizes multi-resolution zoom-and-select capabilities to optimize multi-modal reasoning while maintaining context efficiency. Dragonfly employs two primary strategies: multi-resolution visual encoding and zoom-in patch selection, enabling the model to focus on fine-grained details of image regions. The model has shown promising performance on vision-language benchmarks, achieving competitive results on various tasks.

In collaboration with Stanford Medicine, Together.ai has introduced Dragonfly-Med, a version fine-tuned on 1.4 million biomedical image-instruction data. Dragonfly-Med excels in high-resolution medical data tasks, outperforming previous models on multiple medical imaging benchmarks. The model was evaluated on visual question-answering and clinical report generation tasks, achieving state-of-the-art results on several medical benchmarks.

Dragonfly’s architecture offers a new research direction by focusing on zooming in on image regions to capture more fine-grained visual information. Together.ai plans to continue improving the model’s capabilities and exploring new architectures and visual encoding strategies to benefit broader scientific fields. The collaboration with Stanford Medicine and the utilization of resources like Meta LLaMA3 and CLIP from OpenAI have been crucial in developing Dragonfly. The model’s codebase also builds upon the foundations of Otter and LLaVA-UHD.

Source link

Source link: https://blockchain.news/news/dragonfly-vision-language-model-launch

What do you think?

Leave a Reply

GIPHY App Key not set. Please check settings

The Future of Artificial Intelligence and Its Impact on Jobs: A Comprehensive Analysis | by Orlando Living | Jun, 2024

Analyzing AI’s Impact on Jobs: A Future Perspective #AIJobs

Apple needs to make the iPhone cool again. Today is its chance

Apple must revamp iPhone to regain its cool factor. #RevampiPhone