pyf98

Follow

Yifan Peng pyf98

Follow

Research on Multimodal LLMs and Speech AI

110 followers · 1 following

Santa Clara, CA
https://pyf98.github.io
in/yifan-peng
@pengyf21
https://scholar.google.com/citations?user=wH2FALMAAAAJ&hl=en

Achievements

Achievements

Pinned Loading

NVIDIA-NeMo/NeMo NVIDIA-NeMo/NeMo Public

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15.8k 3.1k
espnet/espnet espnet/espnet Public

End-to-End Speech Processing Toolkit

Python 9.5k 2.3k
NeMo_VoiceTextBlender NeMo_VoiceTextBlender Public

NAACL 2025 main conference: "VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning"

Python 12 2
DPHuBERT DPHuBERT Public

INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"

Python 114 12
speech-model-compression speech-model-compression Public

A collection of papers related to speech model compression

26 2