Skip to content

Encoding speed comparison of ONNX vs Normal model in infloat/multilingual-e5

Notifications You must be signed in to change notification settings

tkys/multilingual-e5_onnx

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

multilingual-e5_onnx encodeing speed comparison

ノーマルmultilingual-e5-smallと量子最適化された.onnxモデルでの推論の速度比較

https://huggingface.co/intfloat/multilingual-e5-small

https://huggingface.co/intfloat/multilingual-e5-small/blob/main/onnx/model.onnx

.onnxモデルはHFリポジトリ内で提供されているモデルをそのまま使う

Result

multilingual-e5.onnx is x2 faster than normal multilingual-e5.bin @googlecolab/cpu

plot


Benchmark

## Benchmark ##

Trail:0 Done
Trail:1 Done
Trail:2 Done
Trail:3 Done
Trail:4 Done
Trail:5 Done
Trail:6 Done
Trail:7 Done
Trail:8 Done
Trail:9 Done


## Result ##

Trail_Count:10

onnx_time_total	14.439099999999998
onnx_token_total	13279
onnx_speed	920 token/sec


norm_time_total	28.244899999999998
norm_token_total	13279
norm_speed	470 token/sec

About

Encoding speed comparison of ONNX vs Normal model in infloat/multilingual-e5

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published