Skip to content
Komiljon Mukhammadiev edited this page Apr 30, 2025 · 1 revision

🎨 Welcome to the Awesome Generative AI Wiki!

This wiki complements the awesome-generative-ai repository by providing structured documentation, categorized lists, and helpful guidance on generative models, datasets, and tools.


πŸ“‚ Categories Covered

Explore state-of-the-art resources across the following domains:

πŸ–ΌοΈ Text-to-Image

  • Stable Diffusion, DALLΒ·E, and emerging diffusion models
  • Prompt engineering resources
  • Fine-tuning and personalization techniques (e.g., DreamBooth, LoRA)

πŸ—£οΈ Voice Cloning & TTS

  • Real-time and zero-shot voice cloning
  • Multilingual speech synthesis tools
  • Projects like GPT-SoVITS, Bark, and OpenVoice

πŸ§‘β€πŸŽ€ Talking Head Generation

  • Avatar animation from voice or text
  • Facial reenactment and lip-syncing models

🧠 LLMs & Multimodal AI

  • Language-to-image/audio/video generation
  • LLM orchestration with generative tasks (e.g., prompt-to-speech-to-video)

🎡 AI Music Generation

  • Suno, Riffusion, MusicLM and other AI-driven music tools
  • Prompt-to-melody or harmony generation

πŸš€ How to Use This Wiki

Each section in this wiki includes:

  • Curated links to GitHub repositories
  • Associated academic papers
  • Live demos (if available)
  • Tags and descriptions for easy browsing

Use the sidebar to navigate to subpages like:

  • voice-cloning.md
  • text-to-image.md
  • datasets.md
  • multimodal.md
  • And more coming soon!

🀝 Contributing

We welcome community contributions!
Feel free to suggest new tools, correct outdated info, or improve descriptions.


Maintained by @Mrkomiljon
Last updated: 2025.04.30