๐ค Whisper Speaker Diarization Demo
AI-powered speaker identification and transcription
๐ฌ Processing Information
- ๐ป CPU processing only (slower than GPU)
- ๐ฆ No file size limits - process any audio length
- ๐ For multi-language audio, use larger models (medium, large-v2, large-v3)
- โก Larger models provide better accuracy but take longer to process
- โ ๏ธ Very large files may take significant time and memory
Supported: MP3, WAV, M4A, FLAC, etc.
๐ฏ Whisper Model
๐ Language
Standard: Traditional diarization | Separation: Pre-separate speakers
1 4
๐ How to Use
- Upload audio (any size)
- Choose processing mode
- Configure settings (optional)
- Click process and wait
- Download results
๐ฏ Processing Modes
- Standard: Traditional speaker diarization
- Speaker Separation: Pre-separate speakers first
๐ Model Selection
- tiny.en/base.en/small.en: Fast, English only
- medium.en: Better accuracy, English only
- medium/large-v2/large-v3: Best for multi-language audio
โ ๏ธ Large File Warning
- Large files will take longer to process
- Monitor system resources during processing