gunashresht/Lip-sync-insight

LipSyncInsight

Hey guys, let me introduce LipSyncInsight, a lip-reading application that tracks the user's lips in a video and predicts the spoken text.

Contents:

Demo:

image

Dataset:

The model is trained on a subset of the GRID Corpus dataset. gdown is used to download a one-speaker subset of the full 34-speaker dataset from Google Drive.
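The download step can be sketched roughly as follows. This is a minimal sketch, not the project's actual script: the file ID is a hypothetical placeholder (the real one is not given here), and it assumes the subset is shared as a zip archive. The helper names `drive_url` and `download_subset` are made up for illustration; `gdown.download` is the library call.

```python
import zipfile
from pathlib import Path


def drive_url(file_id: str) -> str:
    """Build the direct-download URL form that gdown accepts for a Drive file."""
    return f"https://drive.google.com/uc?id={file_id}"


def download_subset(file_id: str, archive: str = "data.zip", extract_to: str = "data") -> Path:
    """Download the one-speaker subset and unpack it (assumes a zip archive)."""
    import gdown  # third-party; imported here so the helper above works without it

    gdown.download(drive_url(file_id), archive, quiet=False)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(extract_to)
    return Path(extract_to)


# Hypothetical usage; replace the placeholder with the real Google Drive file ID:
# download_subset("YOUR_FILE_ID")
```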

Dependencies:

  • Python, TensorFlow, Keras -> data preparation, pipeline, model training and testing.
  • Streamlit -> web application.
  • LipNet -> lip-reading model architecture idea.
  • ffmpeg -> video file format conversion.
  • OpenCV -> video capture and frame processing.
  • gdown -> downloading the dataset.
  • imageio -> making GIFs.

Working:

image

Here, a Bi-LSTM is used in place of the Bi-GRU from the LipNet architecture.

image image correlation matrix
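To make the Bi-GRU to Bi-LSTM swap concrete, here is a rough Keras sketch of a LipNet-style network where the recurrent layer type is a parameter. It is not the trained model from this repo: the input shape, layer sizes, vocabulary size, and the extra output class (reserved for a CTC blank, as in LipNet) are all illustrative assumptions, and `build_model` is a hypothetical name.

```python
def build_model(input_shape=(75, 46, 140, 1), vocab_size=40, rnn="lstm"):
    """Build a small LipNet-style model; every size here is an assumption.

    input_shape is (frames, height, width, channels).
    rnn selects the recurrent layer: "lstm" (used here) or "gru" (as in LipNet).
    """
    # Imported lazily so this module can be loaded without TensorFlow installed.
    from tensorflow import keras
    from tensorflow.keras import layers

    rnn_cls = {"lstm": layers.LSTM, "gru": layers.GRU}[rnn]
    frames = input_shape[0]

    return keras.Sequential([
        keras.Input(shape=input_shape),
        # Spatiotemporal convolution over the stack of mouth-region frames.
        layers.Conv3D(32, 3, padding="same", activation="relu"),
        # Pool only over height and width so the frame count is preserved.
        layers.MaxPool3D((1, 2, 2)),
        # Flatten each frame's feature map into one vector per time step.
        layers.Reshape((frames, -1)),
        # The swap: bidirectional LSTM instead of bidirectional GRU.
        layers.Bidirectional(rnn_cls(64, return_sequences=True)),
        # One distribution per frame over the vocabulary plus one blank class.
        layers.Dense(vocab_size + 1, activation="softmax"),
    ])
```

Calling `build_model(rnn="gru")` gives the GRU variant with everything else unchanged, which makes it easy to compare the two side by side.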

References:
