GitHub - Vyvo-Labs/SpeechPlus: SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀

SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀

🛠️ Installation

uv venv --python 3.12
source .venv/bin/activate
uv pip install -r requirements.txt

🎙️ Usage

from speechplus.inference import generate_speech_from_text

generate_speech_from_text(
    text="Hello, this is a demonstration of SpeechPlus.",
    model_path="./output/checkpoint-1000",
    tokenizer_path="./output/checkpoint-1000",
    output_path="generated_speech.wav",
    sample_rate=24000,
    max_length=2048,
)

🎵 Training

python3 speechplus/train.py

😍 Contributing

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

📜 License

This project is licensed under the terms of the Apache License 2.0.

🤗 Citation

@article{ji2024wavtokenizer,
  title={Wavtokenizer: an efficient acoustic discrete codec tokenizer for audio language modeling},
  author={Ji, Shengpeng and Jiang, Ziyue and Wang, Wen and Chen, Yifu and Fang, Minghui and Zuo, Jialong and Yang, Qian and Cheng, Xize and Wang, Zehan and Li, Ruiqi and others},
  journal={arXiv preprint arXiv:2408.16532},
  year={2024}
}

@article{ji2024language,
  title={Language-codec: Reducing the gaps between discrete codec representation and speech language models},
  author={Ji, Shengpeng and Fang, Minghui and Jiang, Ziyue and Huang, Rongjie and Zuo, Jialung and Wang, Shulei and Zhao, Zhou},
  journal={arXiv preprint arXiv:2402.12208},
  year={2024}
}
@misc{allal2025smollm2smolgoesbig,
      title={SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model},
      author={Loubna Ben Allal and Anton Lozhkov and Elie Bakouch and Gabriel Martín Blázquez and Guilherme Penedo and Lewis Tunstall and Andrés Marafioti and Hynek Kydlíček and Agustín Piqueres Lajarín and Vaibhav Srivastav and Joshua Lochner and Caleb Fahlgren and Xuan-Son Nguyen and Clémentine Fourrier and Ben Burtenshaw and Hugo Larcher and Haojun Zhao and Cyril Zakka and Mathieu Morlon and Colin Raffel and Leandro von Werra and Thomas Wolf},
      year={2025},
      eprint={2502.02737},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2502.02737},
}

📝 Acknowledgments

SmolVoice: https://github.com/Deep-unlearning/SmolVoice

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.github		.github
assets		assets
notebook		notebook
speechplus		speechplus
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Repository files navigation

SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀

🛠️ Installation

🎙️ Usage

🎵 Training

😍 Contributing

📜 License

🤗 Citation

📝 Acknowledgments

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Languages

Uh oh!

License

Vyvo-Labs/SpeechPlus

Folders and files

Latest commit

History

Repository files navigation

SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀

🛠️ Installation

🎙️ Usage

🎵 Training

😍 Contributing

📜 License

🤗 Citation

📝 Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Languages

Packages