A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation
deep-learning pytorch speech-synthesis codec vector-quantization wavlm vocos focal-modulation neural-speech-coding
Updated Sep 22, 2025 - Jupyter Notebook