
nttcslab-sp/diar-forced-alignment

Forced-Aligned Diarization Labels for AMI and AliMeeting

This repository provides diarization labels for the AMI and AliMeeting datasets obtained via forced alignment, as used in our ASRU 2025 paper "Can We Really Repurpose Multi-Speaker ASR Corpus for Speaker Diarization?" The labels were created using the Montreal Forced Aligner with pretrained models.

Note that the provided labels are not perfect because of alignment errors, transcription ambiguity, out-of-vocabulary words, and similar issues. Nevertheless, labels obtained via forced alignment are commonly treated as ground truth in VAD research (e.g., [Kraljevski+, Interspeech 2025] and [Tan+, Computer Speech & Language 2020]).
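To illustrate how word-level forced-alignment output can be turned into diarization labels, here is a minimal sketch. It is not the pipeline used to build these annotations: the input layout (a list of per-speaker word intervals in seconds), the `max_gap` merging threshold, the helper names `merge_word_intervals` and `to_rttm`, and the choice of RTTM as the output format are all assumptions made for illustration.

```python
from typing import List, Tuple

Interval = Tuple[float, float]  # (start_sec, end_sec); assumed input layout


def merge_word_intervals(words: List[Interval], max_gap: float = 0.0) -> List[Interval]:
    """Merge word intervals separated by at most max_gap seconds into segments.

    max_gap is a hypothetical threshold, not a value taken from the paper.
    """
    segments: List[Interval] = []
    for start, end in sorted(words):
        if segments and start - segments[-1][1] <= max_gap:
            # Close enough to the previous segment: extend it.
            prev_start, prev_end = segments[-1]
            segments[-1] = (prev_start, max(prev_end, end))
        else:
            # Gap is too large (or this is the first word): open a new segment.
            segments.append((start, end))
    return segments


def to_rttm(recording_id: str, speaker: str, segments: List[Interval]) -> List[str]:
    """Format segments as RTTM SPEAKER lines (onset and duration in seconds)."""
    return [
        f"SPEAKER {recording_id} 1 {start:.3f} {end - start:.3f} <NA> <NA> {speaker} <NA> <NA>"
        for start, end in segments
    ]


# Made-up word timings for one speaker, for demonstration only.
words = [(0.50, 0.80), (0.80, 1.10), (1.15, 1.40), (3.00, 3.25)]
segments = merge_word_intervals(words, max_gap=0.1)
rttm_lines = to_rttm("rec1", "spkA", segments)
print("\n".join(rttm_lines))
```

With these made-up timings, the first three words merge into one segment and the last word forms its own. Setting `max_gap` to 0 keeps every silence between words, however short, as non-speech, which is the strictest reading of the alignment.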

Annotations

AMI

AliMeeting

Citation

If you use our annotations, please cite our paper below.

@inproceedings{horiguchi_asru2025,
    author = {Horiguchi, Shota and Tawara, Naohiro and Ashihara, Takanori and Ando, Atsushi and Delcroix, Marc},
    title = {Can We Really Repurpose Multi-Speaker ASR Corpus for Speaker Diarization?},
    booktitle = {IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)},
    year = {2025},
    month = {Dec},
}
