-
Notifications
You must be signed in to change notification settings - Fork 2
Home
Komiljon Mukhammadiev edited this page Apr 30, 2025
·
1 revision
This wiki complements the awesome-generative-ai repository by providing structured documentation, categorized lists, and helpful guidance on generative models, datasets, and tools.
Explore state-of-the-art resources across the following domains:
- Stable Diffusion, DALLΒ·E, and emerging diffusion models
- Prompt engineering resources
- Fine-tuning and personalization techniques (e.g., DreamBooth, LoRA)
- Real-time and zero-shot voice cloning
- Multilingual speech synthesis tools
- Projects like GPT-SoVITS, Bark, and OpenVoice
- Avatar animation from voice or text
- Facial reenactment and lip-syncing models
- Language-to-image/audio/video generation
- LLM orchestration with generative tasks (e.g., prompt-to-speech-to-video)
- Suno, Riffusion, MusicLM and other AI-driven music tools
- Prompt-to-melody or harmony generation
Each section in this wiki includes:
- Curated links to GitHub repositories
- Associated academic papers
- Live demos (if available)
- Tags and descriptions for easy browsing
Use the sidebar to navigate to subpages like:
voice-cloning.md
text-to-image.md
datasets.md
multimodal.md
- And more coming soon!
We welcome community contributions!
Feel free to suggest new tools, correct outdated info, or improve descriptions.
Maintained by @Mrkomiljon
Last updated: 2025.04.30