
Add beam search #95

Merged
merged 10 commits into SeanNaren:master
Jun 29, 2017

Conversation

ryanleary
Collaborator

@ryanleary ryanleary commented Jun 19, 2017

This is a WIP integration of beam search based on my port of the CTC beam decoder in TensorFlow. Will address #86.

Remaining:

  • Additional testing
  • Add optional dictionary constraint
  • Add optional LM functionality
  • Add documentation
  • Update requirements.txt

If you want to test it out, you will need to manually install my pytorch_ctc bindings: https://github.com/ryanleary/pytorch-ctc
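For anyone unfamiliar with what the decoder does, here is a minimal, self-contained sketch of CTC prefix beam search over a (T × num_labels) probability matrix. It is only meant to illustrate the idea behind the PR; it is not the pytorch_ctc implementation and has no dictionary or LM support.

```python
# Toy CTC prefix beam search (no LM, no dictionary) - illustrative only.
from collections import defaultdict

def ctc_beam_search(probs, labels, blank=0, beam_width=10):
    """probs: T rows of per-label probabilities; labels: string indexed by label id."""
    # Each beam entry maps a prefix (tuple of label ids) to
    # (prob of that prefix ending in blank, prob of it ending in a non-blank).
    beams = {(): (1.0, 0.0)}
    for row in probs:
        next_beams = defaultdict(lambda: (0.0, 0.0))
        for prefix, (p_b, p_nb) in beams.items():
            for c, p in enumerate(row):
                if c == blank:
                    nb_b, nb_nb = next_beams[prefix]
                    next_beams[prefix] = (nb_b + (p_b + p_nb) * p, nb_nb)
                else:
                    new_prefix = prefix + (c,)
                    nb_b, nb_nb = next_beams[new_prefix]
                    if prefix and prefix[-1] == c:
                        # Repeated label: extending the prefix requires a blank in between.
                        next_beams[new_prefix] = (nb_b, nb_nb + p_b * p)
                        # Staying on the same label collapses into the existing prefix.
                        ob_b, ob_nb = next_beams[prefix]
                        next_beams[prefix] = (ob_b, ob_nb + p_nb * p)
                    else:
                        next_beams[new_prefix] = (nb_b, nb_nb + (p_b + p_nb) * p)
        # Keep only the top `beam_width` prefixes by total probability.
        beams = dict(sorted(next_beams.items(),
                            key=lambda kv: kv[1][0] + kv[1][1],
                            reverse=True)[:beam_width])
    best = max(beams.items(), key=lambda kv: kv[1][0] + kv[1][1])[0]
    return "".join(labels[i] for i in best)

# Toy example: 3 timesteps over labels "_ab", where index 0 is the CTC blank.
print(ctc_beam_search([[0.6, 0.3, 0.1],
                       [0.2, 0.5, 0.3],
                       [0.7, 0.2, 0.1]], "_ab", beam_width=4))
```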

@ryanleary
Collaborator Author

@SeanNaren - do you notice anything in here that would break training? I consistently get OOM errors during clip_grad in this branch for some reason. Cannot explain at the moment.

Traceback (most recent call last):
  File "train.py", line 382, in <module>
    main()
  File "train.py", line 248, in main
    torch.nn.utils.clip_grad_norm(model.parameters(), args.max_norm)
File "/home/ryan/anaconda3/envs/pytorch/lib/python3.6/site-packages/torch/nn/utils/clip_grad.py", line 25, in clip_grad_norm
    param_norm = p.grad.data.norm(norm_type)
RuntimeError: cuda runtime error (2) : out of memory at /py/conda-bld/pytorch_1493680494901/work/torch/lib/THC/THCGeneral.c:833

Perhaps something with the new BatchSoftmax layer? I've tried commenting out large swaths but it still seems to OOM.
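For context, this is where the call from the traceback sits in a typical training step. The loop below is illustrative, not the actual train.py; `model`, `criterion`, and the batch layout are assumptions.

```python
import torch

def train_step(model, criterion, optimizer, inputs, targets, max_norm):
    model.train()
    optimizer.zero_grad()
    out = model(inputs)                  # forward pass; raw activations in training mode
    loss = criterion(out, targets)
    loss.backward()                      # gradients are materialized here
    # Rescale gradients in place so their total norm is at most max_norm
    # (the call that raises the OOM in the traceback above), then step.
    # In newer PyTorch this is torch.nn.utils.clip_grad_norm_.
    torch.nn.utils.clip_grad_norm(model.parameters(), max_norm)
    optimizer.step()
    return loss
```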

@SeanNaren
Owner

Hey man, it's definitely involving the new batch softmax layer I think, but the behaviour is really strange; will be digging a little further into it.

@ryanleary
Collaborator Author

ryanleary commented Jun 20, 2017

I was referencing pytorch/pytorch#1020 in the implementation of that module.

@ryanleary
Collaborator Author

Fixed it. Accidentally omitted the final transform when in training mode.
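The actual one-line fix isn't shown in the thread, but for readers following along, a layer like the BatchSoftmax discussed above typically branches on self.training so that the CTC loss sees raw activations during training and the beam-search decoder sees normalized probabilities at inference. A rough sketch (the class name follows the thread; the body is illustrative, not the PR's code):

```python
import torch.nn as nn
import torch.nn.functional as F

class BatchSoftmax(nn.Module):
    """Apply softmax over the label dimension only at inference time."""
    def forward(self, x):
        # x: (seq_len, batch, num_labels) output of the final linear layer
        if self.training:
            return x                     # CTC loss is computed on raw activations
        return F.softmax(x, dim=-1)      # probabilities for the beam-search decoder
```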

@ryanleary
Collaborator Author

Working through the kenlm integration now.
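Not the PR's code, but for readers wondering what the optional LM functionality involves: a KenLM model is usually queried for each candidate prefix and blended into the beam score with alpha/beta weighting, roughly like this. The weights, the .arpa path, and the helper name are placeholders.

```python
import kenlm  # Python bindings from https://github.com/kpu/kenlm

lm = kenlm.Model("lm.arpa")  # path to an ARPA/binary language model (assumed to exist)

def rescore(prefix_text, acoustic_logprob, alpha=0.8, beta=1.0):
    """Combine the acoustic (CTC) log-probability of a prefix with an LM score."""
    lm_logprob = lm.score(prefix_text, bos=True, eos=False)  # log10 by KenLM convention
    word_count = len(prefix_text.split())
    # alpha weights the LM, beta rewards longer hypotheses (word insertion bonus).
    return acoustic_logprob + alpha * lm_logprob + beta * word_count
```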

@willfrey

@SeanNaren
Owner

SeanNaren commented Jun 21, 2017

@ryanleary nice catch :)
@willfrey that's what's being merged

@ryanleary ryanleary changed the title WIP: Add beam search Add beam search Jun 26, 2017
@ryanleary
Collaborator Author

@SeanNaren it may be best to consider this experimental still, but I think it's probably safe to merge.

@SeanNaren SeanNaren merged commit a52830f into SeanNaren:master Jun 29, 2017
@SeanNaren
Owner

Thanks a lot for this!
