GitHub - talg2324/audio_id: Deep Learning approach for audio-based user authentication.

audio_id

Course project on continuous audio signal monitoring including collection, processing, and authentication. Deep Learning approach for audio-based user authentication.

Project Flow

The goal of this project is to utilize mel coefficient features and deep learning for user authentication by voice analysis. We implement various temporal-spectral features with no prior knowledge and learn the best features for this task:

Collect sound data with the data_acq_gui script.

Perform feature analysis to analyze quality of various sound features

Build a CNN model optimizing all possible combinations of features to detect the best feature combination
Train a CNN and save weights
Evaluate the CNN with ROC and PRC curves, measure AUC, and evaluate various working points (max sensitivity, max specificity, max accuracy)

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
acq_gui		acq_gui
examples		examples
Dataset.py		Dataset.py
README.md		README.md
feat_analysis.py		feat_analysis.py
main_script.py		main_script.py
pre_processing.py		pre_processing.py
sequential_forward_feature_selection.ipynb		sequential_forward_feature_selection.ipynb
train_model.ipynb		train_model.ipynb
training.py		training.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

audio_id

Project Flow

About

Releases

Packages

Contributors 3

Languages

talg2324/audio_id

Folders and files

Latest commit

History

Repository files navigation

audio_id

Project Flow

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages