Generative Fusion Decoding: Merging ASR System with LLM #FusionDecoding

Fuse any ASR System with any LLM - Generative Fusion Decoding

The video walks through installing Generative Fusion Decoding (GFD) locally and then demonstrates the framework. GFD is designed to enhance multimodal text recognition systems with Large Language Models (LLMs): it enables joint decoding of an Automatic Speech Recognition (ASR) or Optical Character Recognition (OCR) system with any LLM, without any additional training. The video also includes links for supporting the channel, getting discounts on GPU rentals, and becoming a Patron, and it encourages viewers to follow the creator on LinkedIn, YouTube, and their blog. Related resources and videos are mentioned for further exploration. The content is copyrighted to Fahd Mirza in 2021.

Source link: https://www.youtube.com/watch?v=d8PKjfjKqeU
