Google Summer of Code 2024 Midterm Evaluations by Tarun Jain #GSoC2024

This article discusses the author’s progress in contributing to Google Summer of Code 2024 at Red Hen Lab. The author shares their journey with the TV News Chat LLM project, focusing on community bonding, data extraction, cleaning, and filtering processes. They also describe the dataset creation using the Self-Instruct framework and fine-tuning the Large Language Model (LLM) on English context. The author details the training process, which includes Supervised Fine-Tuning (SFT) and Parameter-Efficient Fine-Tuning (PEFT) using the LoRA configuration. They also mention using vLLM for inference and merging LORA adapters with base model weights. The article concludes with plans for the next phase of making the model adapt to multilingual questions. Special mentions are given to the mentor and others who supported the author during the project. The author’s code and PR can be found on GitHub. The author expresses gratitude to their mentor and colleagues for their support and insightful discussions throughout the project.

Source link

Source link: https://medium.com/@jaintarun7/google-summer-of-code-2024-mid-term-evaluations-5df8b9291b19?source=rss——llm-5

Google Summer of Code 2024 Midterm Evaluations by Tarun Jain #GSoC2024

BluWhale: Innovating Technology and Building Stronger Communities #revolutionizinginnovation

Comparing and reviewing the top 7 AI tools for students #AIforStudents

#GraphRAG Ollama: Local Setup, Data Privacy Guaranteed #PrivacyFirst

Moshi Kyutai Performances: Redefining Standards with Vocal AI #innovation

AI Model selects cancer treatments to improve therapy responses. #PrecisionMedicine

Understanding and Implementing Medprompt by Anand Subramanian | Jul, 2024

MedpromptUnderstanding

Understanding and Implementing Medprompt by Anand Subramanian | Jul, 2024

MedpromptUnderstanding

Guide on utilizing HuggingFace model for text vectorization. #NLP

Websim: The AI playground for instant creation possibilities. #innovation

#DynamicRendering in One Minute: Gradio’s Dynamic Rendering Tutorial #GradioTutorial

Moshi Kyutai Performances: Redefining standards with vocal AI. #AIvocals

BluWhale: Innovating Technology and Building Stronger Communities #revolutionizinginnovation

Moshi Kyutai Performances: Redefining Standards with Vocal AI #innovation

Guide on utilizing HuggingFace model for text vectorization. #NLP

Moshi Kyutai Performances: Redefining standards with vocal AI. #AIvocals

East Asian Languages Chapter by Henry Heng LUO, Jun 2024 #Languages

Enhancing Communication with AI Voice Tools for Efficiency #AIVoiceTools

Share this: