Skip to content

Version history and Available Models

Dimitrii Voronin edited this page Nov 13, 2024 · 11 revisions

Version History

Version Date Comment
v1 2020-12-15 Initial release
v1.1 2020-12-24 better vad models compatible with chunks shorter than 250 ms
v1.2 2020-12-30 Number Detector added
v2 2021-01-11 Add Language Classifier heads (en, ru, de, es)
v2.1 2021-02-11 Add micro (10k params) VAD models
v2.2 2021-03-22 Add micro 8000 sample rate VAD models
v2.3 2021-04-12 Add mini (100k params) VAD models (8k and 16k sample rate) + new adaptive utils for full audio and single audio stream
v2.4 2021-07-09 Add 116 languages classifier and group classifier
v2.4 2021-07-09 Deleted 116 language classifier, added 95 language classifier instead (get rid of lowspoken languages for quality improvement)
v3.0 2021-12-06 Deleted old VAD models, added new VAD model
v3.1 2021-12-17 Added ONNX Silero VAD model (16000 Hz only)
v4.0 2022-10-26 Added v4 JIT and ONNX models. ONNX model now supports batching and both (16k and 8k) sampling rates
v4.0 2023-04-27 Deprecated the number and language detector models
v5.0 2024-06-27 Added v5 JIT and ONNX models

Available models

Currently we provide the following endpoints:

model= Params Model type Streaming Languages PyTorch ONNX opset Colab
'silero_vad' ~260K VAD Yes 6000+ ✔️ 15 ✔️, 16 ✔️ Open In Colab

Deprecated models

model= Params Model type Streaming Languages PyTorch ONNX Colab
'silero_number_detector' 1.1M Number Detector No ru, en, de, es ✔️ ✔️ Open In Colab
'silero_lang_detector' 1.1M Language Classifier No ru, en, de, es ✔️ ✔️ Open In Colab
'silero_lang_detector_95' 4.7M Language Classifier No 95 languages ✔️ ✔️ Open In Colab
Clone this wiki locally