Name: voxceleb
Creator: Joon Son Chung and Arsha Nagrani and Andrew Zisserman
Published: 2024-04-03 22:27:46
License: https://academictorrents.com/nolicensespecified

voxceleb (17 files)

wav/vox2_test_wav.zip	7.70GB
wav/vox2_dev_wav.zip	231.00GB
wav/vox1_test_wav.zip	1.07GB
wav/vox1_dev_wav.zip	32.60GB
txt/vox2_test_txt.zip	53.34MB
txt/vox2_dev_txt.zip	1.58GB
txt/vox1_test_txt.zip	4.57MB
txt/vox1_dev_txt.zip	139.08MB
meta/vox2_meta.csv	165.24kB
meta/vox1_meta.csv	40.78kB
meta/veri_test2.txt	2.33MB
meta/veri_test.txt	2.34MB
meta/list_test_hard2.txt	34.16MB
meta/list_test_hard.txt	34.26MB
meta/list_test_all2.txt	35.95MB
meta/list_test_all.txt	36.05MB
meta/iden_split.txt	4.91MB

Type: Dataset

Tags: speaker recognitionspeaker identificationspeaker verification

Bibtex:

@article{,
title= {voxceleb},
journal= {Proc. Interspeech 2018},
author= {Joon Son Chung and Arsha Nagrani and Andrew Zisserman},
year= {2018},
url= {https://www.robots.ox.ac.uk/~vgg/data/voxceleb},
abstract= {This torrent shares the VoxCeleb1 and VoxCeleb2 datasets. The original dataset creators do not provide access to the dataset anymore. To ensure papers in the field of speaker recognition can be reproduced (many have used VoxCeleb in recent years) the data should be available for academic purposes. 

The audio data is stored as mono-channel, 16000hz, signed 16-bit (little-endian) PCM wav files. This torrent does not include video data.
},
keywords= {speaker recognition, speaker identification, speaker verification},
terms= {The VoxCeleb metadata is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. The audio data is scraped from publicly available YouTube videos. Data is only made available for academic purposes.},
license= {},
superseded= {}
}

voxceleb Joon Son Chung and Arsha Nagrani and Andrew Zisserman

voxceleb
Joon Son Chung and Arsha Nagrani and Andrew Zisserman