Whisper.cpp with diarization
- Diarize with onnxruntime
- Segment using pyannote-audio model
- Transcribe using OpenAI Whisper
- Support macOS, Windows, Linux
- Comes with FFmpeg
- Support any audio or video format
- Download models automatically with --setup
- Colored output for easy identification of speakers
Download a release from the releases page and extract it
./loud test.wav --json transcript.json --setup
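For scripted use, the command above can be wrapped in a few lines of Python. This is only a minimal sketch: the helper names `build_command` and `transcribe` are hypothetical, the binary path `./loud` and the `--json` and `--setup` flags are taken from the command above, and the layout of the JSON file is not documented here, so the sketch returns the parsed data without assuming any particular keys.

```python
import json
import subprocess
from pathlib import Path


def build_command(audio, json_out, setup=False, binary="./loud"):
    """Assemble the argument list for the command shown above.

    `binary` defaults to ./loud as in the example; the helper itself is
    hypothetical and not part of the tool.
    """
    cmd = [binary, str(audio), "--json", str(json_out)]
    if setup:
        # --setup downloads the models automatically (see the feature list).
        cmd.append("--setup")
    return cmd


def transcribe(audio, json_out, setup=False, binary="./loud"):
    """Run the tool, then load whatever JSON it wrote to `json_out`."""
    # check=True raises CalledProcessError if the tool exits with an error.
    subprocess.run(build_command(audio, json_out, setup, binary), check=True)
    # The JSON schema is an unknown here, so the data is returned as-is.
    return json.loads(Path(json_out).read_text(encoding="utf-8"))


if __name__ == "__main__":
    # Print the command that would be run, without executing it.
    print(" ".join(build_command("test.wav", "transcript.json", setup=True)))
```

Building the argument list separately from running it makes the command easy to inspect or log before anything is executed. Pass a different `binary` path if the executable is named or located differently on your system.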
To build from source, see building.md