This repository contains the CSV files for the processed dataset used to train VoiceLDM. These files include the transcriptions generated using the Whisper model.

Speech Segments

as_speech_en.csv
cv1.csv (cv.csv has been split into two due to file size limitations on GitHub.)
cv2.csv
voxceleb.csv

Non-Speech Segments

as_noise.csv
noise_demand.csv

Evaluation Segments

Additionally, I've included the CSV file corresponding to the ac_filtered test set, which was specifically used to evaluate VoiceLDM.

ac_filtered.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Speech Segments

Non-Speech Segments

Evaluation Segments

Files

README.md

Latest commit

History

README.md

File metadata and controls

Speech Segments

Non-Speech Segments

Evaluation Segments