
Voice Chat AI

Voice Chat AI is a project that lets you talk to different AI characters using speech. You can choose between various characters, each with a unique personality and voice. You can run everything locally, use OpenAI for both chat and voice, or mix the two.

Features

Supports both OpenAI and Ollama language models.
Provides text-to-speech synthesis using XTTS or OpenAI TTS.
Analyzes user mood and adjusts AI responses accordingly.
Easy configuration through environment variables.

Installation

Requirements

Python 3.10
CUDA-enabled GPU
Microphone
A sense of humor

Steps

Download the models below and place both in the project folder:

Checkpoints (direct download): https://nordnet.blob.core.windows.net/bilde/checkpoints.zip

XTTS-v2 model: https://huggingface.co/coqui/XTTS-v2

The project folder should then look like this:

voice-chat-ai/
├── .gitignore
├── .env
├── README.md
├── app.py
├── requirements.txt
├── cpu_requirements.txt
├── checkpoints/
│   ├── base_speakers
│   ├── convertor
│   
├── XTTS-v2/
│   ├── config.json
│   ├── model.pth
│   ├── ... (other XTTS model files)
├── outputs/
│   └── ... (generated audio files)
├── samantha/
│   ├── samantha.txt
│   ├── prompts.json
│   └── samantha.wav
├── wizard/
│   ├── wizard.txt
│   ├── prompts.json
│   └── wizard.wav
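
To confirm that the models and character files ended up where the layout above expects them, a small check like the following may help. It is a minimal sketch, not part of the project: the function name `missing_paths` and the `REQUIRED_PATHS` list are illustrative, and the paths are taken only from the tree shown above.

```python
from pathlib import Path

# Paths copied from the layout above; adjust if your checkout differs.
REQUIRED_PATHS = [
    "checkpoints/base_speakers",
    "checkpoints/convertor",
    "XTTS-v2/config.json",
    "XTTS-v2/model.pth",
]


def missing_paths(project_root, character="samantha"):
    """Return the expected paths that do not exist under project_root."""
    root = Path(project_root)
    expected = list(REQUIRED_PATHS) + [
        f"{character}/{character}.txt",
        f"{character}/prompts.json",
        f"{character}/{character}.wav",
    ]
    return [p for p in expected if not (root / p).exists()]


if __name__ == "__main__":
    # Run from the project root; prints nothing when everything is in place.
    for path in missing_paths("."):
        print(f"missing: {path}")
```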

Clone the repository:

git clone https://github.com/bigsk1/voice-chat-ai.git
cd voice-chat-ai

Create a virtual environment:

python -m venv venv
source venv/bin/activate   # On Windows use `venv\Scripts\Activate`

Alternatively, use conda; just make sure the environment uses Python 3.10:

conda create --name voice-chat-ai python=3.10
conda activate voice-chat-ai

Install dependencies:

For GPU (CUDA) version, install CUDA-enabled PyTorch first, then the other dependencies:

pip install torch==2.3.1+cu121 torchaudio==2.3.1+cu121 torchvision==0.18.1+cu121 -f https://download.pytorch.org/whl/torch_stable.html
pip install -r requirements.txt

For CPU-only version:

pip install -r cpu_requirements.txt
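
Once the dependencies are installed, a quick check like the following can confirm whether PyTorch sees a CUDA GPU. This is a sketch for troubleshooting only; the helper name `detect_device` is hypothetical and not part of app.py.

```python
def detect_device():
    """Return "cuda" if PyTorch is installed and sees a GPU, otherwise "cpu"."""
    try:
        import torch  # available only after the install steps above
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print(f"PyTorch device: {detect_device()}")
```

If this reports "cpu" on a machine with an NVIDIA GPU, the CPU-only PyTorch build was probably installed instead of the CUDA one.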

Configuration

Rename .env.sample to .env in the root directory of the project and set the environment variables it contains. The app's behavior is controlled entirely by these variables.

# Conditional API usage: the value of MODEL_PROVIDER determines which backend is used when the app runs
# Use either ollama or openai. You can mix and match: local ollama with openai speech, or an openai model with local xtts, etc.

# openai or ollama
MODEL_PROVIDER=ollama

# Enter the character name to use: samantha, wizard, pirate, valleygirl, newscaster1920s, alien_scientist, cyberpunk, detective
CHARACTER_NAME=pirate

# Text-to-Speech Provider - (local xtts uses the custom character .wav) or (openai text to speech uses an openai tts voice)
# xtts or openai
TTS_PROVIDER=xtts

# The voice speed for xtts only ( 1.0 - 1.5 , default 1.1)
XTTS_SPEED=1.1

# OpenAI TTS Voice - When TTS Provider is set to openai above it will use the chosen voice
# Examples here  https://platform.openai.com/docs/guides/text-to-speech
# Voice options are: alloy, echo, fable, onyx, nova, and shimmer
OPENAI_TTS_VOICE=onyx


# SET THESE ONCE; THEY RARELY NEED TO CHANGE #

# Endpoints
OPENAI_BASE_URL=https://api.openai.com/v1/chat/completions
OPENAI_TTS_URL=https://api.openai.com/v1/audio/speech
OLLAMA_BASE_URL=http://localhost:11434

# OpenAI API Key for models and speech
OPENAI_API_KEY=sk-11111111

# Models to use - llama3 works well for local use
OPENAI_MODEL=gpt-4o
OLLAMA_MODEL=llama3
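
Because a typo in .env changes which backend runs, it can be worth validating the file before starting the app. The following is a minimal standard-library sketch under stated assumptions: the allowed values come from the comments in the sample above, the rule that OPENAI_API_KEY is needed whenever either provider is openai is an assumption, and `parse_env` and `validate` are illustrative names. The app itself may load the file differently.

```python
# Allowed values, taken from the comments in the sample .env above.
ALLOWED = {
    "MODEL_PROVIDER": {"openai", "ollama"},
    "TTS_PROVIDER": {"xtts", "openai"},
    "OPENAI_TTS_VOICE": {"alloy", "echo", "fable", "onyx", "nova", "shimmer"},
}


def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        settings[key.strip()] = value.strip()
    return settings


def validate(settings):
    """Return a list of problems; an empty list means the settings look fine."""
    problems = []
    for key, allowed in ALLOWED.items():
        value = settings.get(key)
        if value is not None and value not in allowed:
            problems.append(f"{key}={value!r} is not one of {sorted(allowed)}")
    # Assumption: an API key is needed whenever either provider is openai.
    providers = (settings.get("MODEL_PROVIDER"), settings.get("TTS_PROVIDER"))
    if "openai" in providers and not settings.get("OPENAI_API_KEY"):
        problems.append("OPENAI_API_KEY is required when a provider is openai")
    speed = settings.get("XTTS_SPEED")
    if speed is not None:
        try:
            if not 1.0 <= float(speed) <= 1.5:
                problems.append("XTTS_SPEED should be between 1.0 and 1.5")
        except ValueError:
            problems.append(f"XTTS_SPEED={speed!r} is not a number")
    return problems


if __name__ == "__main__":
    with open(".env", encoding="utf-8") as handle:
        for problem in validate(parse_env(handle.read())):
            print(problem)
```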

Add character-specific configuration files:
- Create a folder named after your character (e.g., samantha).
- Add a text file with the character's prompt (e.g., samantha/samantha.txt).
- Add a JSON file with mood prompts (e.g., samantha/prompts.json).
- Add the voice sample in the character folder (e.g., samantha/samantha.wav).

Usage

Run the application:

python app.py

Commands

To stop the conversation, say "Quit", "Exit", or "Leave".
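
How the app matches these words is not documented here. As an illustration only, a check on the transcribed text could look like the sketch below; `is_quit_command` is a hypothetical helper, and it assumes the whole utterance must be one of the quit words.

```python
import string

QUIT_WORDS = {"quit", "exit", "leave"}


def is_quit_command(transcript):
    """True if the transcribed utterance is just one of the quit words."""
    # Speech-to-text output often adds capitalization and trailing punctuation.
    cleaned = transcript.strip().lower().strip(string.punctuation + " ")
    return cleaned in QUIT_WORDS
```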

Adding New Characters

Create a new folder for the character in the root directory.
Add a text file with the character's prompt (e.g., wizard/wizard.txt).
Add a JSON file with mood prompts (e.g., wizard/prompts.json).
Add the voice sample in the character folder (e.g., wizard/wizard.wav).
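
The steps above can be scripted. The sketch below creates the folder, the prompt file, and a placeholder prompts.json. It is not part of the project: `scaffold_character` is a hypothetical helper, and the mood keys are copied from the wizard example in this README. The voice sample still has to be recorded and added by hand.

```python
import json
from pathlib import Path

# Mood keys as used in the wizard/prompts.json example in this README.
MOODS = ["joyful", "sad", "flirty", "angry", "neutral",
         "fearful", "surprised", "disgusted"]


def scaffold_character(root, name, prompt):
    """Create <root>/<name>/ with <name>.txt and a placeholder prompts.json."""
    folder = Path(root) / name
    folder.mkdir(parents=True, exist_ok=True)
    (folder / f"{name}.txt").write_text(prompt, encoding="utf-8")
    prompts_path = folder / "prompts.json"
    if not prompts_path.exists():  # never overwrite prompts already written
        placeholder = {mood: f"TODO: response style for a {mood} user."
                       for mood in MOODS}
        prompts_path.write_text(json.dumps(placeholder, indent=4),
                                encoding="utf-8")
    return folder


if __name__ == "__main__":
    scaffold_character(".", "bard", "You are a cheerful travelling bard.")
```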

Example Character Configuration

`wizard/wizard.txt`

You are a wise and ancient wizard who speaks with a mystical and enchanting tone. You are knowledgeable about many subjects and always eager to share your wisdom.

`wizard/prompts.json`

{
    "joyful": "RESPOND WITH ENTHUSIASM AND WISDOM, LIKE A WISE OLD SAGE WHO IS HAPPY TO SHARE HIS KNOWLEDGE.",
    "sad": "RESPOND WITH EMPATHY AND COMFORT, LIKE A WISE OLD SAGE WHO UNDERSTANDS THE PAIN OF OTHERS.",
    "flirty": "RESPOND WITH A TOUCH OF MYSTERY AND CHARM, LIKE A WISE OLD SAGE WHO IS ALSO A BIT OF A ROGUE.",
    "angry": "RESPOND CALMLY AND WISELY, LIKE A WISE OLD SAGE WHO KNOWS THAT ANGER IS A PART OF LIFE.",
    "neutral": "KEEP RESPONSES SHORT AND NATURAL, LIKE A WISE OLD SAGE WHO IS ALWAYS READY TO HELP.",
    "fearful": "RESPOND WITH REASSURANCE, LIKE A WISE OLD SAGE WHO KNOWS THAT FEAR IS ONLY TEMPORARY.",
    "surprised": "RESPOND WITH AMAZEMENT AND CURIOSITY, LIKE A WISE OLD SAGE WHO IS ALWAYS EAGER TO LEARN.",
    "disgusted": "RESPOND WITH UNDERSTANDING AND COMFORT, LIKE A WISE OLD SAGE WHO KNOWS THAT DISGUST IS A PART OF LIFE."
}
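
This README does not describe how the detected mood is combined with the character prompt. One plausible approach, shown here purely as a sketch, is to append the matching mood instruction to the character prompt and fall back to "neutral" for unknown moods. `build_system_prompt` is a hypothetical name, not a function from app.py.

```python
import json


def build_system_prompt(character_prompt, mood_prompts, mood):
    """Append the mood instruction to the character prompt.

    Falls back to the "neutral" entry when the mood has no entry.
    """
    instruction = mood_prompts.get(mood) or mood_prompts.get("neutral", "")
    return f"{character_prompt}\n{instruction}".strip()


if __name__ == "__main__":
    with open("wizard/wizard.txt", encoding="utf-8") as handle:
        character_prompt = handle.read().strip()
    with open("wizard/prompts.json", encoding="utf-8") as handle:
        mood_prompts = json.load(handle)
    print(build_system_prompt(character_prompt, mood_prompts, "joyful"))
```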

License

This project is licensed under the MIT License.
