Automatic Speech Recognition is a model deep learning to convert speech into text. This model is trained using the OpenSLR dataset. The dataset used is the Javanese language dataset. The dataset is divided into 2 parts, namely train, and validation. The dataset is trained using the wav2vec2
based model from Facebook.
git clone https://github.com/JohanesSetiawan/asr-javanese-api.git
pip install Flask flask-cors huggingsound pyngrok
if you using server, and not the local machine.
or
pip install Flask flask-cors huggingsound
if you using local machine.
python api.py
Paste the API link that appears in the terminal to the transcribe.php
file, in the:
url: "<FLASK_API>/transcribe", // Flask server API endpoint variable.
and the endpoint will be:
/transcribe
You can use API in locally or using Google Colab to run API.