Mixed speakers
Separated outputs
Supported Formats
MP3, WAV, FLAC, OGG, M4A, OPUS, MP4, MOV
Max File Size
100 MB
Processing Time
~Real-time to 2x audio length
Outputs
JSON timestamps + per-speaker WAV
1from audiopod import AudioPod23# Initialize client4client = AudioPod(api_key="YOUR_API_KEY")56# Diarize7result = client.speaker_separation.process(8 file="meeting.wav",9)1011# Outputs: result.segments (JSON), result.speakers (list), result.files (dict of wavs)
Processing Speed
AI Model
Quality Score
Success Rate
Meetings, panels, interviews and podcasts.
Structured outputs for editing and analytics.
Consistent labels and clear speaker boundaries.