๐ŸŽค Whisper Speaker Diarization Demo

AI-powered speaker identification and transcription

Telecom Paris

๐Ÿ”ฌ Processing Information

  • ๐Ÿ’ป CPU processing only (slower than GPU)
  • ๐Ÿ“ฆ No file size limits - process any audio length
  • ๐ŸŒ For multi-language audio, use larger models (medium, large-v2, large-v3)
  • โšก Larger models provide better accuracy but take longer to process
  • โš ๏ธ Very large files may take significant time and memory

Supported: MP3, WAV, M4A, FLAC, etc.

๐ŸŽฏ Whisper Model
๐ŸŒ Language
๐ŸŽฏ Processing Mode

Standard: Traditional diarization | Separation: Pre-separate speakers

1 4

๐Ÿ“š How to Use

  1. Upload audio (any size)
  2. Choose processing mode
  3. Configure settings (optional)
  4. Click process and wait
  5. Download results

๐ŸŽฏ Processing Modes

  • Standard: Traditional speaker diarization
  • Speaker Separation: Pre-separate speakers first

๐ŸŒ Model Selection

  • tiny.en/base.en/small.en: Fast, English only
  • medium.en: Better accuracy, English only
  • medium/large-v2/large-v3: Best for multi-language audio

โš ๏ธ Large File Warning

  • Large files will take longer to process
  • Monitor system resources during processing