Details on the model, its performance, and more are available on arXiv. For more information on how to run this diarization model, see https://github.com/revdotcom/reverb/tree/main/diarization
Reverb diarization V2 provides a 22.25% relative improvement in WDER (Word Diarization Error Rate) compared to the baseline pyannote3.0 model, evaluated on over 1,250,000 tokens across five different test suites.
| Test suite | WDER |
|---|---|
| earnings21 | 0.046 |
| rev16 | 0.078 |
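A relative improvement of this kind is conventionally computed as the drop in error rate divided by the baseline error rate. The sketch below illustrates that arithmetic only; the function name and the two input values are hypothetical and are not measurements from the Reverb evaluation.

```python
def relative_improvement(baseline_wder: float, new_wder: float) -> float:
    """Return the relative reduction in error rate, as a fraction of the baseline."""
    if baseline_wder <= 0:
        raise ValueError("baseline_wder must be positive")
    return (baseline_wder - new_wder) / baseline_wder


# Illustrative numbers only (assumed for the example, not reported results):
# a baseline WDER of 0.10 reduced to 0.07775 is a 22.25% relative improvement.
example = relative_improvement(0.10, 0.07775)
print(f"{example:.2%}")
```

Because WDER is an error rate, lower is better, so a positive result means the new model makes fewer word-level speaker attribution errors than the baseline.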
Usage
# taken from https://huggingface.co/pyannote/speaker-diarization-3.1 - see for more details
# instantiate the pipeline
from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained(
    "Revai/reverb-diarization-v2",
    use_auth_token="HUGGINGFACE_ACCESS_TOKEN_GOES_HERE")
# run the pipeline on an audio file
diarization = pipeline("audio.wav")
# dump the diarization output to disk using RTTM format
with open("audio.rttm", "w") as rttm:
    diarization.write_rttm(rttm)
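Once the RTTM file is on disk, it can be read back with plain Python. The following is a minimal sketch, assuming the usual RTTM layout in which each `SPEAKER` line carries the start time in the fourth field, the duration in the fifth, and the speaker label in the eighth. The `Turn` class, the helper names, and the sample lines are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass
class Turn:
    speaker: str
    start: float  # seconds
    end: float    # seconds


def parse_rttm(lines: Iterable[str]) -> List[Turn]:
    """Parse SPEAKER records from RTTM lines into speaker turns."""
    turns = []
    for line in lines:
        fields = line.split()
        # Skip blank lines and any record type other than SPEAKER.
        if len(fields) < 8 or fields[0] != "SPEAKER":
            continue
        start = float(fields[3])
        duration = float(fields[4])
        turns.append(Turn(speaker=fields[7], start=start, end=start + duration))
    return turns


def speaking_time(turns: Iterable[Turn]) -> Dict[str, float]:
    """Sum the total speaking time, in seconds, for each speaker label."""
    totals: Dict[str, float] = {}
    for turn in turns:
        totals[turn.speaker] = totals.get(turn.speaker, 0.0) + (turn.end - turn.start)
    return totals


# Made-up sample lines in RTTM layout (not real model output).
sample = [
    "SPEAKER audio 1 0.000 2.500 <NA> <NA> SPEAKER_00 <NA> <NA>",
    "SPEAKER audio 1 2.500 1.500 <NA> <NA> SPEAKER_01 <NA> <NA>",
    "SPEAKER audio 1 4.000 1.000 <NA> <NA> SPEAKER_00 <NA> <NA>",
]
turns = parse_rttm(sample)
totals = speaking_time(turns)
for speaker, seconds in sorted(totals.items()):
    print(f"{speaker}: {seconds:.1f}s")
```

To process a real file, pass an open file handle instead of the sample list, for example `parse_rttm(open("audio.rttm"))`.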
Cite this Model
If you use this model, please use the following citation:
@misc{bhandari2024reverbopensourceasrdiarization,
title={Reverb: Open-Source ASR and Diarization from Rev},
author={Nishchal Bhandari and Danny Chen and Miguel Ángel del Río Fernández and Natalie Delworth and Jennifer Drexler Fox and Migüel Jetté and Quinten McNamara and Corey Miller and Ondřej Novotný and Ján Profant and Nan Qin and Martin Ratajczak and Jean-Philippe Robichaud},
year={2024},
eprint={2410.03930},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2410.03930},
}
License
See LICENSE for details.