Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[egs] Adding new non-splicing layers in some TDNN setups. #1937

Merged
merged 1 commit into from
Oct 13, 2017

Conversation

danpovey
Copy link
Contributor

I found that we can get the same improvement that @freewym was getting from adding non-splicing layers between each TDNN layer, by just adding two non-splicing layers towards the beginning of the network. There is even a small improvement on Switchboard.

I will probably merge this right away, as its an easy win. However there are a couple of outstanding issues.

  • we should run this on some other setups as well and see if it improves them too.
  • someone should test whether using just one of the two additional layers is sufficient.
  • eventually we should try the same modification to other architectures such as TDNN+LSTM or TDNN+LSTM+attention.

Does anyone have time for this? @hhadian, maybe you could instruct one of the masters students how to run these experiments, they may learn something from the exercise.

@danpovey danpovey merged commit 510428f into kaldi-asr:master Oct 13, 2017
@freewym
Copy link
Contributor

freewym commented Oct 13, 2017

For AMI SDM1, I found further increasing the num-epochs from 6 (default) to 8 is helpful:

System tdnn1g tdnn1g_nonsplicing tdnn1g_nonsplicing_epochs8
WER on dev 37.5 37.2 36.7
WER on eval 41.4 41.0 40.6
Final train prob -0.210036 -0.209284 -0.200288
Final valid prob -0.27294 -0.268485 -0.262626
Final train prob (xent) -2.46404 -2.41837 -2.35348
Final valid prob (xent) -2.69502 -2.66269 -2.61947

@danpovey
Copy link
Contributor Author

danpovey commented Oct 13, 2017 via email

kronos-cm added a commit to kronos-cm/kaldi that referenced this pull request Oct 27, 2017
* 'master' of https://github.com/kaldi-asr/kaldi: (45 commits)
  [scripts] Fix a bug in attention xconfig regarding key-scale (thanks: Pegah) (kaldi-asr#1975)
  [src,scripts] nnet1: various minor changes, (kaldi-asr#1962)
  [build] Make nvcc respect CXX (kaldi-asr#1965)
  [egs] Further small HKUST and SRE fixes (kaldi-asr#1971)
  [egs] Small egs fixes to HKUST and sre10  (kaldi-asr#1958)
  [egs,scripts] Add support for l2-regularize to LSTM layers; give example.
  [scripts] Small fix to nnet3 training scripts: pass egs_command to get_egs.sh (kaldi-asr#1970)
  [egs] Updates to Chime4 recipe (kaldi-asr#1956)
  [src,scripts,egs] Make it possible to specify component-level l2 regularization (kaldi-asr#1959)
  [egs] fix to scoring script error in HKUST recipe (kaldi-asr#1957)
  [egs,scripts] Disambiguate cmd.sh and path.sh pathnames: add './' (kaldi-asr#1953)
  [scripts] Simplify and remove non-useful diagnostics in nnet3 training scripts
  [egs] Fixed a missing xconfig change of the tdnn recipe in the previous commit (kaldi-asr#1948)
  [scripts] Add new tuning option for speech activity detection (SAD) scripts (kaldi-asr#1949)
  [egs] Adding new non-splicing layers in the AMI TDNN setup (kaldi-asr#1947)
  [egs] Librispeech: make sure phone list is sorted, for repeatability (kaldi-asr#1946)
  [doc] Fix path in kaldi for dummies (thanks: Ermand Hoxha)
  [scripts,egs] Use KALDI_ROOT instead of relative path (kaldi-asr#1941)
  [src,egs] Minor tweaks in nnet3 code (diagnostics,testing); add stage option to swbd setup.
  [egs] Adding new non-splicing layers in some TDNN setups. (kaldi-asr#1937)
  ...
Skaiste pushed a commit to Skaiste/idlak that referenced this pull request Sep 26, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants