voxceleb (17 files)
wav/vox2_test_wav.zip | 7.70GB
wav/vox2_dev_wav.zip | 231.00GB
wav/vox1_test_wav.zip | 1.07GB
wav/vox1_dev_wav.zip | 32.60GB
txt/vox2_test_txt.zip | 53.34MB
txt/vox2_dev_txt.zip | 1.58GB
txt/vox1_test_txt.zip | 4.57MB
txt/vox1_dev_txt.zip | 139.08MB
meta/vox2_meta.csv | 165.24kB
meta/vox1_meta.csv | 40.78kB
meta/veri_test2.txt | 2.33MB
meta/veri_test.txt | 2.34MB
meta/list_test_hard2.txt | 34.16MB
meta/list_test_hard.txt | 34.26MB
meta/list_test_all2.txt | 35.95MB
meta/list_test_all.txt | 36.05MB
meta/iden_split.txt | 4.91MB
Type: Dataset
Bibtex:
@article{,
title= {voxceleb},
journal= {Proc. Interspeech 2018},
author= {Joon Son Chung and Arsha Nagrani and Andrew Zisserman},
year= {2018},
url= {https://www.robots.ox.ac.uk/~vgg/data/voxceleb},
abstract= {This torrent shares the VoxCeleb1 and VoxCeleb2 datasets. The original dataset creators no longer provide access to the dataset. Many speaker recognition papers in recent years have used VoxCeleb, so the data should remain available for academic purposes to keep that work reproducible.
The audio data is stored as mono-channel, 16,000 Hz, signed 16-bit (little-endian) PCM WAV files. This torrent does not include video data.
},
keywords= {speaker recognition, speaker identification, speaker verification},
terms= {The VoxCeleb metadata is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. The audio data is scraped from publicly available YouTube videos. Data is only made available for academic purposes.},
license= {},
superseded= {}
}
Citation:
Chung, J. S., Nagrani, A., & Zisserman, A. (2018). voxceleb [Data set]. Academic Torrents. https://academictorrents.com/details/bdd9f57a6f47aa197f502b68bc0195f5ac786ec4
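
The abstract describes the audio as mono-channel, 16,000 Hz, signed 16-bit PCM WAV. After extracting one of the wav archives, that claim can be spot-checked per file. The following is a minimal sketch using Python's standard `wave` module. The function name `matches_voxceleb_format` is hypothetical, and any path passed to it is a placeholder, since the directory layout inside the archives is not described here.

```python
import wave

# Expected header values, taken from the dataset description above.
EXPECTED_CHANNELS = 1             # mono
EXPECTED_SAMPLE_RATE = 16000      # Hz
EXPECTED_SAMPLE_WIDTH_BYTES = 2   # 16-bit samples


def matches_voxceleb_format(path):
    """Return True if the WAV header at `path` matches the described format.

    Hypothetical helper for illustration: it inspects only the header
    fields (channels, sample rate, sample width), not the audio samples.
    """
    with wave.open(path, "rb") as wav_file:
        return (
            wav_file.getnchannels() == EXPECTED_CHANNELS
            and wav_file.getframerate() == EXPECTED_SAMPLE_RATE
            and wav_file.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
        )
```

Calling `matches_voxceleb_format` on an extracted file returns `False` for any header that deviates from the description, which makes it easy to flag unexpected files before feature extraction. Signedness and byte order are not checked explicitly, because 16-bit PCM WAV data is signed little-endian by convention.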