Home

🎨 Welcome to the Awesome Generative AI Wiki!

This wiki complements the awesome-generative-ai repository by providing structured documentation, categorized lists, and helpful guidance on generative models, datasets, and tools.

📂 Categories Covered

Explore state-of-the-art resources across the following domains:

🖼️ Text-to-Image

Stable Diffusion, DALL·E, and emerging diffusion models
Prompt engineering resources
Fine-tuning and personalization techniques (e.g., DreamBooth, LoRA)

🗣️ Voice Cloning & TTS

Real-time and zero-shot voice cloning
Multilingual speech synthesis tools
Projects like GPT-SoVITS, Bark, and OpenVoice

🧑‍🎤 Talking Head Generation

Avatar animation from voice or text
Facial reenactment and lip-syncing models

🧠 LLMs & Multimodal AI

Language-to-image/audio/video generation
LLM orchestration with generative tasks (e.g., prompt-to-speech-to-video)

🎵 AI Music Generation

Suno, Riffusion, MusicLM and other AI-driven music tools
Prompt-to-melody or harmony generation

🚀 How to Use This Wiki

Each section in this wiki includes:

Curated links to GitHub repositories
Associated academic papers
Live demos (if available)
Tags and descriptions for easy browsing

Use the sidebar to navigate to subpages like:

voice-cloning.md
text-to-image.md
datasets.md
multimodal.md
And more coming soon!

🤝 Contributing

We welcome community contributions!
Feel free to suggest new tools, correct outdated info, or improve descriptions.

Maintained by @Mrkomiljon
Last updated: 2025.04.30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!