
Use PyTorch as logits transpose for ONNX support #141

Merged: 1 commit into openai:main on Sep 26, 2022
Conversation

@mgoin (Contributor) commented on Sep 26, 2022

Because the NumPy-style .T was used for the final transpose of the logits output, torch.onnx.export would fail:

/usr/local/lib/python3.7/dist-packages/torch/onnx/utils.py in _run_symbolic_function(g, block, n, inputs, env, operator_export_type)
   1420         else:
   1421             raise symbolic_registry.UnsupportedOperatorError(
-> 1422                 domain, op_name, opset_version
   1423             )
   1424     except RuntimeError:
UnsupportedOperatorError: Exporting the operator ::numpy_T to ONNX opset version 13 is not supported. Please feel free to request support or submit a pull request on PyTorch GitHub.

If this line at whisper/model.py:L192 is changed from
logits = (x @ self.token_embedding.weight.to(x.dtype).T).float()
to
logits = (x @ torch.transpose(self.token_embedding.weight.to(x.dtype), 0, 1)).float()
then the export succeeds!

Code to test export:

import whisper
import torch

# Load the smallest model; its hidden size (d_model) is 384
tiny_model = whisper.load_model("tiny")

# Encoder input: an 80-bin log-mel spectrogram with 3000 frames (30 s of audio)
torch.onnx.export(tiny_model.encoder, torch.randn(1, 80, 3000).to("cuda"), "tiny/whisper-encoder.onnx")

# Decoder inputs: token IDs plus a dummy tensor standing in for the encoder output
torch.onnx.export(tiny_model.decoder, (torch.tensor([[50258]]).to("cuda"), torch.randn(1, 384, 384).to("cuda")), "tiny/whisper-decoder_main.onnx")
torch.onnx.export(tiny_model.decoder, (torch.tensor([[50258, 50259, 50359]]).to("cuda"), torch.randn(1, 384, 384).to("cuda")), "tiny/whisper-decoder_language.onnx")
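
For a quick sanity check of the exported encoder, here is a minimal sketch (not part of the PR; it assumes onnxruntime is installed and reuses the file path from the export above):

import numpy as np
import onnxruntime as ort

# Feed a random mel spectrogram through the exported encoder
sess = ort.InferenceSession("tiny/whisper-encoder.onnx")
mel = np.random.randn(1, 80, 3000).astype(np.float32)
(audio_features,) = sess.run(None, {sess.get_inputs()[0].name: mel})
print(audio_features.shape)  # expect (1, 1500, 384) for the tiny model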

@Y-T-G commented on Sep 26, 2022

[[50258, 50259, 50359]]

@mgoin May I know where you obtained these shapes from?

@jongwook (Collaborator)

Thanks for checking! I haven't tried ONNX but the change seems benign.

@jongwook merged commit 9c8183a into openai:main on Sep 26, 2022
@mgoin (Contributor, Author) commented on Sep 26, 2022

[[50258, 50259, 50359]] @mgoin May I know where you obtained these shapes from?

@Y-T-G those shapes were just taken from some sample audio I ran through; I printed the tensor shapes and used them to make the dummy inputs.

Thanks for the accept!

@ArtyomZemlyak
Hi!

Those are not shapes; they are the SOT tokens:

  • 50258 - sot_token
  • 50259 - language token
  • 50359 - task token (50359 is specifically for transcribe)

These 3 tokens are formed here:

langs = tuple(LANGUAGES.keys())
sot_sequence = [sot]
if language is not None:
    sot_sequence.append(sot + 1 + langs.index(language))
if task is not None:
    sot_sequence.append(transcribe if task == "transcribe" else translate)
return Tokenizer(tokenizer=tokenizer, language=language, sot_sequence=tuple(sot_sequence))
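
For reference, a quick way to print these IDs yourself (a sketch assuming whisper's get_tokenizer helper, which builds the sequence above):

from whisper.tokenizer import get_tokenizer

# Multilingual tokenizer for English transcription; should print
# (50258, 50259, 50359) -- sot, language, and task tokens
tokenizer = get_tokenizer(multilingual=True, language="en", task="transcribe")
print(tokenizer.sot_sequence)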

@ArtyomZemlyak

But I have problems with the overall speed of the model, as mentioned in #134.

@mgoin If you can run the ONNX version of the model without bottlenecks, it would be great to see your inference code (or just to know that you had no problems with it and everything runs well).

@nyadla-sys

@ArtyomZemlyak, can you please share the code to run inference on the ONNX files?

@Y-T-G commented on Oct 2, 2022

@mgoin I see. So I assume the values would differ for different model sizes.
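
(A side note, not from the thread: the hidden size used in the dummy inputs can be read off the loaded model, assuming the ModelDimensions fields in openai/whisper; the special token IDs themselves are shared by all the multilingual models.)

import whisper

# d_model differs per size: tiny=384, base=512, small=768, medium=1024, large=1280
model = whisper.load_model("base")
print(model.dims.n_text_state)  # 512 for base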

@David19970306

But I have problems with the overall speed of the model, as mentioned in #134.

@mgoin If you can run the ONNX version of the model without bottlenecks, it would be great to see your inference code (or just to know that you had no problems with it and everything runs well).

Do you have the code for running the ONNX model file? I ran into a problem: how do we convert the logits into the tokens we need?
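
For what it's worth, a minimal greedy-decoding sketch (not from this thread): take the argmax of the last position's logits at each step. It assumes the decoder was exported with dynamic axes so the token sequence can grow, that the inputs are (tokens, audio_features) as in the export code above, and that 50257 is the end-of-transcript token of the multilingual vocabulary.

import numpy as np
import onnxruntime as ort

# Encoder: mel spectrogram -> audio features (random input, just for illustration)
encoder = ort.InferenceSession("tiny/whisper-encoder.onnx")
mel = np.random.randn(1, 80, 3000).astype(np.float32)
(audio_features,) = encoder.run(None, {encoder.get_inputs()[0].name: mel})

# Decoder: greedy loop over the vocabulary axis of the logits
decoder = ort.InferenceSession("tiny/whisper-decoder_main.onnx")
input_names = [i.name for i in decoder.get_inputs()]
tokens = np.array([[50258, 50259, 50359]], dtype=np.int64)  # sot, language, task

for _ in range(224):  # stop at eot or a length cap
    (logits,) = decoder.run(None, dict(zip(input_names, [tokens, audio_features])))
    next_token = int(logits[0, -1].argmax())  # most likely next token
    if next_token == 50257:  # eot
        break
    tokens = np.concatenate([tokens, np.array([[next_token]], dtype=np.int64)], axis=1)

print(tokens)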

@AntyRia commented on Apr 2, 2024

(quoting the PR description above)

Hi, thank you for your contribution. I am a novice who has just started working with the Whisper model. May I ask whether I can save the entire Whisper model as a .pth file and then convert it to an ONNX model? I made some simple attempts, but they didn't seem to succeed.
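
A hedged note, not from the thread: an intermediate .pth is not required; torch.onnx.export works on the in-memory modules directly, as in the test code above. If you do want to save and reload weights first, here is a sketch using PyTorch's standard state_dict round-trip:

import torch
import whisper

model = whisper.load_model("tiny", device="cpu")
torch.save(model.state_dict(), "whisper-tiny.pth")  # weights only

model2 = whisper.load_model("tiny", device="cpu")   # rebuild the architecture
model2.load_state_dict(torch.load("whisper-tiny.pth"))
torch.onnx.export(model2.encoder, torch.randn(1, 80, 3000), "whisper-encoder.onnx")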
