ReazonSpeech

This repository provides access to the main user tooling of ReazonSpeech project.

https://research.reazon.jp/projects/ReazonSpeech/

Install

$ git clone https://github.com/reazon-research/ReazonSpeech
$ pip install ReazonSpeech/pkg/nemo-asr  # or k2-asr, espnet-asr or espnet-oneseg

Packages

reazonspeech.nemo.asr

Implements a fast, accurate speech recognition based on FastConformer-RNNT.
The total number of parameters is 619M. Requires Nvidia Nemo.

reazonspeech.k2.asr

Next-gen Kaldi model that is very fast and accurate.
The total number of parameters is 159M. Requires sherpa-onnx.

reazonspeech.espnet.asr

Speech recognition with a Conformer-Transducer model.
The total number of parameters is 120M. Requires ESPnet.

reazonspeech.espnet.oneseg

Provides a set of tools to analyze Japanese "one-segment" TV stream.
Use this package to create Japanese audio corpus.

LICENSE

Copyright 2022-2024 Reazon Holdings, inc.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

   https://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
colab		colab
pkg		pkg
.gitignore		.gitignore
LICENSE		LICENSE
README.rst		README.rst
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ReazonSpeech

Install

Packages

LICENSE

About

Releases 3

Contributors 4

Languages

License

reazon-research/ReazonSpeech

Folders and files

Latest commit

History

Repository files navigation

ReazonSpeech

Install

Packages

LICENSE

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 3

Contributors 4

Languages