A real-time voice-driven animation system in Python, utilizing advanced audio processing and speech recognition to animate and visualize speech dynamically on a visual canvas.

IsaacTomeho/VoiceAnimation

Voice-Driven Animation System

Overview

The Voice-Driven Animation System is a Python application that uses real-time audio processing and speech recognition to create dynamic visualizations of spoken words. The system captures audio, processes and recognizes speech, and then visually animates the words on a screen.

Features

  • Real-Time Audio Processing: Uses PyAudio to capture live audio data.
  • Speech Recognition: Uses Google's speech recognition, accessed through the speech_recognition package, to convert spoken words into text.
  • Dynamic Visualization: Utilizes Vispy for real-time visualization of the audio signals and text.
  • Interactive GUI: Features a PyQt5-based GUI for real-time control over audio processing settings.
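
To make the audio-to-visual step concrete, here is a minimal sketch of one way a captured chunk could be reduced to a single loudness value that an animation might use. It is not taken from this repository: the function names `rms_level` and `sine_chunk` are hypothetical, and the code assumes signed 16-bit PCM in native byte order, a format PyAudio can deliver when a stream is opened with `paInt16`. A synthetic sine wave stands in for microphone input so the example runs without audio hardware.

```python
import array
import math


def rms_level(pcm_bytes: bytes) -> float:
    """Return the RMS amplitude of signed 16-bit PCM audio, scaled to 0.0-1.0.

    Assumption: samples are 16-bit signed integers in native byte order.
    """
    samples = array.array("h", pcm_bytes)  # "h" = signed short (16-bit)
    if not samples:
        return 0.0
    mean_square = sum(s * s for s in samples) / len(samples)
    return math.sqrt(mean_square) / 32768.0


def sine_chunk(freq_hz: float = 440.0, rate: int = 16000,
               n: int = 1024, amplitude: float = 0.5) -> bytes:
    """Generate a synthetic test chunk (hypothetical stand-in for a mic read)."""
    samples = array.array(
        "h",
        (int(amplitude * 32767 * math.sin(2 * math.pi * freq_hz * i / rate))
         for i in range(n)),
    )
    return samples.tobytes()


if __name__ == "__main__":
    # A half-scale sine wave has an RMS of roughly 0.5 / sqrt(2).
    print(round(rms_level(sine_chunk()), 3))
```

A value like this could drive the size or brightness of an on-screen element; how the actual application maps audio to visuals is not described here.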

Technologies Used

  • Python: Primary programming language.
  • PyAudio: For capturing audio data.
  • Vispy: For rendering visual animations.
  • SciPy: Provides signal processing capabilities.
  • speech_recognition: Handles the conversion of speech to text.
  • PyQt5: For the graphical user interface.
  • PyDub: Used for manipulating audio data.
  • Threading: Supports concurrent execution for seamless audio processing and UI interaction.
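
The threading entry above can be illustrated with a small producer-consumer sketch. This is an assumption about how such a design is commonly structured, not a description of this project's code: `capture_worker` and `run_pipeline` are hypothetical names, and a list of byte strings stands in for chunks that would otherwise come from a PyAudio stream.

```python
import queue
import threading


def capture_worker(chunks, out_queue):
    """Stand-in for a microphone loop: push each chunk, then a None sentinel."""
    for chunk in chunks:
        out_queue.put(chunk)
    out_queue.put(None)  # signals that no more audio is coming


def run_pipeline(chunks, process):
    """Capture on a background thread; process chunks on the calling thread."""
    q = queue.Queue(maxsize=8)  # bounded so a slow consumer applies backpressure
    worker = threading.Thread(
        target=capture_worker, args=(chunks, q), daemon=True
    )
    worker.start()

    results = []
    while True:
        chunk = q.get()
        if chunk is None:
            break
        results.append(process(chunk))

    worker.join()
    return results


if __name__ == "__main__":
    fake_chunks = [b"\x00\x01" * 4, b"\x00\x02" * 8]
    print(run_pipeline(fake_chunks, len))  # prints [8, 16]
```

In a GUI application the consuming side would more likely poll the queue from a timer callback than block in a loop, so that the interface stays responsive.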

Install the required packages

pip install numpy pyaudio vispy PyQt5 scipy SpeechRecognition pydub
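
After installing, a short check like the following can confirm which packages are importable. It is a sketch, not part of the project: `REQUIRED` and `check_dependencies` are hypothetical names. Note that some import names differ from the pip names, for example `SpeechRecognition` is imported as `speech_recognition`. PyAudio also depends on the PortAudio system library, so its pip install may fail until PortAudio is present.

```python
import importlib.util

# Maps the pip package name to the module name used in import statements.
REQUIRED = {
    "numpy": "numpy",
    "pyaudio": "pyaudio",
    "vispy": "vispy",
    "PyQt5": "PyQt5",
    "scipy": "scipy",
    "SpeechRecognition": "speech_recognition",
    "pydub": "pydub",
}


def check_dependencies(required=REQUIRED):
    """Return {pip_name: True/False} depending on whether the module is found."""
    return {
        pip_name: importlib.util.find_spec(module) is not None
        for pip_name, module in required.items()
    }


if __name__ == "__main__":
    for name, found in check_dependencies().items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```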
