GitHub - ducspe/ResNet18LSTMBaselineReimplementation: This is the baseline code to compare my new approach to.

This is a reimplementation of the baseline code for our paper "See the silence: improving visual-only voice activity detection by optical flow and RGB fusion" available at: https://github.com/ducspe/VVADpaper

This baseline is compared to our new modified approach available at: https://github.com/ducspe/VisualOnlyVoiceActivityDetection.

The ResNet and LSTM architecture from the original authors is kept unchanged. The supporting infrastructure code around it, however, was rewritten and condensed significantly, since we do not need the audio branch or the audio-visual version present in the original implementation.

Unlike the original implementation, we do not label manually; instead, a separate module infers ground-truth labels from the audio stream. Audio-related code is available in the processing folder.
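
To illustrate the idea of inferring per-frame labels from audio, here is a minimal sketch. It is not the code from the processing folder: it assumes a simple energy threshold on the audio aligned with each video frame, and the function name `frame_labels_from_audio`, the `threshold_db` default, the sample rate, and the frame rate are all hypothetical choices made for this example.

```python
import math


def frame_labels_from_audio(samples, sample_rate, video_fps, threshold_db=-40.0):
    """Return one 0/1 speech label per video frame.

    `samples` is mono audio scaled to [-1, 1]. The energy threshold is an
    assumption for illustration, not necessarily what this repository uses.
    """
    samples_per_frame = sample_rate / video_fps
    n_frames = int(len(samples) / samples_per_frame)
    labels = []
    for i in range(n_frames):
        # Slice the audio that plays during video frame i.
        start = int(round(i * samples_per_frame))
        end = int(round((i + 1) * samples_per_frame))
        window = samples[start:end]
        # Mean power of the window; guard against an empty slice.
        power = sum(s * s for s in window) / max(len(window), 1)
        # Convert to decibels; the small constant avoids log(0) on digital silence.
        level_db = 10.0 * math.log10(power + 1e-12)
        labels.append(1 if level_db > threshold_db else 0)
    return labels


# Synthetic example: 0.5 s of silence followed by 0.5 s of a 220 Hz tone.
sample_rate = 16000  # hypothetical audio sample rate
video_fps = 25       # hypothetical video frame rate
silence = [0.0] * 8000
tone = [0.5 * math.sin(2 * math.pi * 220 * n / sample_rate) for n in range(8000)]
labels = frame_labels_from_audio(silence + tone, sample_rate, video_fps)
print(labels)
```

A fixed threshold like this works only on clean recordings; a real labeling module would likely need smoothing or a more robust speech detector.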

The data_dda folder contains a subset of the preprocessed TCD-TIMIT dataset. The structure of this folder must be kept when working with the full dataset.

The preprocessed inputs can be obtained by using the code from: https://github.com/ducspe/TCD-TIMIT-Preprocessing

Folders and files:
data_dda
networks
processing
.gitignore
README.md
learn_train_video_statistics.py
losses.py
requirements.txt
utils.py
video_train_dataset.py
video_train_statistics.npy
video_validation_dataset.py
vvad_test.py
vvad_train.py
