segmentation-3.0
Pyannote segmentation-3.0 is a speaker segmentation model for detecting speaker changes, overlapping speech, and voice activity in audio. It produces frame-level predictions used as input to the full speaker diarization pipeline. The model can also run standalone for voice activity detection or overlapped speech detection without the full diarization stack.
10,202,982 ↓ · 955 ♡