A machine learning project that translates speech from one language to another in real time while preserving the speaker's tone and emotion, and saves the result as an MP3 file.
Python 3, SpeechRecognition, PyAudio, google-trans-new, gTTS, playsound, deep-translator, cx_Freeze
- Clone this project, then create and activate a virtualenv (recommended).
# Create virtualenv
virtualenv -p python3 env
# Linux/macOS
source env/bin/activate
# Windows
env\Scripts\activate
- Install the required dependencies.
pip install -r requirements.txt
- Run the code and speak (have fun).
python main.py
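For orientation, the flow that main.py is expected to follow (listen, recognize, translate, synthesize to MP3) can be sketched roughly as below. This is a minimal sketch under stated assumptions, not the project's actual code: the function names, the default language codes ("en-US" in, "fr" out) and the output path are hypothetical, and it does not attempt to preserve tone or emotion. It uses only the SpeechRecognition, deep-translator and gTTS calls named in the dependency list.

```python
from typing import Callable


def recognize_from_microphone(language: str = "en-US") -> str:
    """Record one utterance from the default microphone and return its text."""
    import speech_recognition as sr  # imported lazily; needs PyAudio for the mic

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        audio = recognizer.listen(source)
    # Sends the audio to Google's free web speech API.
    return recognizer.recognize_google(audio, language=language)


def translate_text(text: str, target: str = "fr") -> str:
    """Translate text into the target language, auto-detecting the source."""
    from deep_translator import GoogleTranslator

    return GoogleTranslator(source="auto", target=target).translate(text)


def save_speech(text: str, lang: str = "fr", path: str = "output.mp3") -> str:
    """Synthesize text with gTTS and save it as an MP3 file; return the path."""
    from gtts import gTTS

    gTTS(text=text, lang=lang).save(path)
    return path


def translate_voice(
    recognize: Callable[[], str] = recognize_from_microphone,
    translate: Callable[[str], str] = translate_text,
    synthesize: Callable[[str], str] = save_speech,
) -> str:
    """Run the three stages in order and return whatever the last stage returns."""
    spoken_text = recognize()
    translated_text = translate(spoken_text)
    return synthesize(translated_text)
```

Calling `translate_voice()` with no arguments would record from the microphone and write output.mp3 in the working directory. Passing the stages as arguments keeps each one replaceable, for example to swap in google-trans-new for translation or to exercise the flow without a microphone.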
This app uses cx_Freeze to build an executable. The build settings can be changed by modifying the setup.py file.
- Windows:
python setup.py bdist_msi
- Linux:
python setup.py bdist_rpm
- macOS:
python setup.py bdist_mac
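As a reference point when editing the build settings, a cx_Freeze setup.py typically has the shape sketched below. This is an assumption about the general layout, not a copy of this project's setup.py: the app name, version, description and package list are hypothetical placeholders. The `build_exe` options and the `Executable` entry point are standard cx_Freeze concepts.

```python
import sys


def build_exe_options() -> dict:
    """Options passed to cx_Freeze's build_exe command (illustrative values)."""
    return {
        # Packages to bundle explicitly; assumed from the dependency list.
        "packages": ["speech_recognition", "gtts", "deep_translator", "playsound"],
    }


def main() -> None:
    # Imported here so the options above can be inspected without cx_Freeze.
    from cx_Freeze import Executable, setup

    setup(
        name="voice-translator",  # hypothetical name
        version="0.1.0",  # hypothetical version
        description="Voice translation app",  # hypothetical description
        options={"build_exe": build_exe_options()},
        executables=[Executable("main.py")],
    )


# Only build when a command such as bdist_msi is given on the command line.
if __name__ == "__main__" and len(sys.argv) > 1:
    main()
```

With a file like this, adding a module that cx_Freeze fails to detect automatically would usually mean appending it to the `packages` list before rerunning the build command for your platform.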