Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
-
Updated
Aug 3, 2024
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”
FastLongSpeech is a novel framework designed to extend the capabilities of Large Speech-Language Models for efficient long-speech processing without necessitating dedicated long-speech training data.
Enhance long-speech processing with FastLongSpeech, a framework for Large Speech-Language Models. Explore our model and dataset on GitHub! 🚀📦
Add a description, image, and links to the speech-llms topic page so that developers can more easily learn about it.
To associate your repository with the speech-llms topic, visit your repo's landing page and select "manage topics."