Skip to content
forked from lololouuuu/P4

PAV - P4: reconocimiento y verificación del locutor

Notifications You must be signed in to change notification settings

pablomarinreyes/P4

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PAV Speaker Identifier with Deep Neural Networks

Speaker recognition baseline for PAV subject in ETSETB UPC (Telecom BCN)

This program creates a Multilayer perceptron to classify speaker.

Se puede ejecutar directamente si la carpeta pav_spkid_pytorch está en el directorio PAV, y dentro de este directorio también se encuentra la carpeta P4 con todos los archivos

EJEMPLOS DE COMANDOS USADOS

python3 train.py --save_path model_h30_25fram --hsize 30 --in_frames 25 --db_path ../P4/work/lpcc --tr_list_file cfg/all.train --va_list_file cfg/all.test --ext lpcc

python3 test.py --weights_ckpt model_h30_25fram/bestval_e19_weights.ckpt --train_cfg model_h30_25fram/train.opts --log_file logs/model_h30_25fram.log --db_path ../P4/work/lpcc --te_list_file cfg/all.test --ext lpcc

python3 verify --db_path ../P4/work/lpcc --te_list_file ../P4/lists/verif/all.test --candidates_list ../P4/lists/verif/all.test.candidates --weights_ckpt model_h30_25fram/bestval_e19_weights.ckpt --log_file logs/verif_test_h30_25f.log --train_cfg model_h30_25fram/train.opts --ext lpcc

PARA EVALUAR CON EL SET DE TEST

python3 class_score.py --r logs/model_h30_25fram.log python3 verify_score.py --r logs/verif_test_h30_25f.log

COMANDOS USADOS PARA LA EVALUACION CIEGA FINAL

python3 test.py --weights_ckpt model_h30_25fram/bestval_e19_weights.ckpt --train_cfg model_h30_25fram/train.opts --log_file class_DNN.log --db_path ../P4/work/lpcc/final/spk_cls --te_list_file ..P4/lists/final/class.test --ext lpcc

python3 verify --db_path ../P4/work/lpcc/final --te_list_file ../P4/lists/final/verif.test --candidates_list ../P4/lists/final/verif.test.candidates --weights_ckpt model_h30_25fram/bestval_e19_weights.ckpt --log_file verif_DNN.log --train_cfg model_h30_25fram/train.opts --ext lpcc --blind

About

PAV - P4: reconocimiento y verificación del locutor

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 72.3%
  • CSS 12.2%
  • Shell 6.0%
  • Python 3.5%
  • CMake 2.8%
  • Perl 1.5%
  • Other 1.7%