Voice Assistant

This is an English version of linyiLYi's implementation of a voice assistant using Apple's MLX. I tried to keep it as close as possible to the original version.

A simple Python script for voice interaction with a local large language model. The Whisper implementation comes from the official MLX examples repository. The large language model is the Yi model from 01.AI (Lingyi Wanwu); Yi-34B-Chat is the more capable variant and is recommended if memory allows.
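The interaction described above can be pictured as one listen, generate, speak turn. The following is only a minimal sketch of that flow, not the code in main.py: run_turn and its three callable parameters are hypothetical names introduced here for illustration.

```python
def run_turn(listen, generate, speak):
    """Run one voice-interaction turn.

    listen, generate and speak are hypothetical callables standing in for
    speech recognition, the local language model, and speech output.
    """
    heard = listen().strip()
    if not heard:
        # Nothing was recognized, so skip the model call entirely.
        return None
    reply = generate(heard)
    speak(reply)
    return reply
```

Passing the three stages in as functions keeps the sketch independent of any particular recognizer, model, or speech backend.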

macOS Installation Guide

Below is the installation process for macOS. On Windows and Linux, speech_recognition and pyttsx3 can replace the macOS-specific hear/whisper and say commands used below.
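As a rough illustration of the substitution mentioned above, the sketch below speaks text with the macOS say command and falls back to pyttsx3 on other platforms. The function names build_say_command and speak are assumptions made for this example and do not come from the project's script.

```python
import platform
import subprocess


def build_say_command(text, voice=None):
    """Build the argument list for the macOS `say` command."""
    cmd = ["say"]
    if voice:
        # `say -v <voice>` selects a specific system voice.
        cmd += ["-v", voice]
    cmd.append(text)
    return cmd


def speak(text):
    """Speak text with `say` on macOS, or with pyttsx3 elsewhere."""
    if platform.system() == "Darwin":
        subprocess.run(build_say_command(text), check=True)
    else:
        # Imported lazily so that macOS users do not need pyttsx3 installed.
        import pyttsx3

        engine = pyttsx3.init()
        engine.say(text)
        engine.runAndWait()
```

Calling speak("Hello") then works the same way on either platform, provided pyttsx3 is installed on non-macOS systems.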

Setting Up the Environment

conda create -n VoiceAI python=3.11
conda activate VoiceAI
pip install -r requirements.txt
CMAKE_ARGS="-DLLAMA_METAL=on" pip install llama-cpp-python

# Install audio processing tools
brew install portaudio
pip install pyaudio
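After running the commands above, a quick check like the following can confirm that the two separately installed packages are importable in the active environment. This is a sketch only; missing_modules is a hypothetical helper, and the list covers just llama-cpp-python and pyaudio (imported as llama_cpp and pyaudio), not everything in requirements.txt.

```python
import importlib.util

# Import names of the packages installed by the commands above.
REQUIRED_MODULES = ("llama_cpp", "pyaudio")


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]
```

An empty list from missing_modules() suggests the installation steps completed; any names returned point to the package that needs reinstalling.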

Installing the hear Voice Recognition Module

Download the installation package from the open-source project hear at this link. After unzipping the folder, run sudo bash install.sh (administrator rights required). Once installed, macOS speech recognition can be called directly from the command line. Note that keyboard dictation must be enabled in the system settings: Settings -> Keyboard -> Dictation (turn on the switch). The first time you use it on macOS, you also need to allow the hear module to run under Settings -> Privacy & Security.
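To confirm that the install script put hear on the PATH, a small check such as the one below may help. It is a sketch that only looks up the executables and does not call hear itself, so it makes no assumptions about hear's command-line options; find_commands is a hypothetical helper name.

```python
import shutil


def find_commands(names=("hear", "say")):
    """Map each command name to its full path, or None if it is not on PATH."""
    return {name: shutil.which(name) for name in names}
```

If find_commands()["hear"] is None, the installation step above likely needs to be repeated or the shell restarted.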

Model Files

The model files are stored in the models/ folder and specified in the script via the variable MODEL_PATH. Downloading the GGUF-format models published by TheBloke and XeIaso is recommended; of these, the 6B model uses less memory.
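One way to pick a value for MODEL_PATH is to list the GGUF files present in models/ and load one with llama-cpp-python. The sketch below assumes that layout; find_gguf_models and load_model are hypothetical helpers, and the n_ctx default is an illustrative choice rather than the project's setting.

```python
from pathlib import Path


def find_gguf_models(models_dir="models"):
    """Return the .gguf files found directly inside models_dir, sorted by name."""
    return sorted(Path(models_dir).glob("*.gguf"))


def load_model(model_path, n_ctx=2048):
    """Load a GGUF model with llama-cpp-python (n_ctx here is an assumed value)."""
    # Imported lazily so that listing models works without llama-cpp-python.
    from llama_cpp import Llama

    # n_gpu_layers=-1 asks llama.cpp to offload all layers to the GPU (Metal on macOS).
    return Llama(model_path=str(model_path), n_gpu_layers=-1, n_ctx=n_ctx)
```

For example, MODEL_PATH could be set to the first entry returned by find_gguf_models() once a model file has been downloaded.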

Peter-obi/voice-assistant
