Hey guys, let me introduce LipSyncInsight, a lip reading application that recognizes the user's lip movements and predicts the spoken text.
The model is trained on a subset of the GRID corpus dataset: gdown is used to download one speaker's data (out of the full 34 speakers) from Google Drive.
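As a rough sketch of the download step, the Drive file can be fetched with gdown. The file ID and helper name below are placeholders, not the real dataset ID, and the actual download call is left commented out since it needs network access:

```python
# Sketch of the gdown download step (file ID is a placeholder, helper name
# drive_url is hypothetical, not from the repo).

def drive_url(file_id: str) -> str:
    """Build the direct-download URL format that gdown accepts."""
    return f"https://drive.google.com/uc?id={file_id}"

# import gdown
# gdown.download(drive_url("<FILE_ID>"), "grid_subset.zip", quiet=False)
```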
- Python / TensorFlow / Keras -> data preparation, pipeline, model training & testing.
- Streamlit -> web application.
- LipNet -> lip reading model architecture idea.
- ffmpeg -> video file format conversion.
- OpenCV -> video capture and frame processing.
- gdown -> downloading the dataset.
- imageio -> making GIFs.
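The data-preparation step has to map each character of a GRID transcript to an integer id and back before training. A pure-Python sketch of that mapping (the vocabulary, helper names, and the choice of reserving id 0 for the CTC blank are all assumptions, not taken from the repo):

```python
# Illustrative char<->id mapping for GRID transcripts.
# Assumption: id 0 is reserved for the CTC blank token.
vocab = list("abcdefghijklmnopqrstuvwxyz' ")

char_to_num = {c: i for i, c in enumerate(vocab, start=1)}
num_to_char = {i: c for c, i in char_to_num.items()}

def encode(text: str) -> list[int]:
    """Map a transcript like 'bin blue at f two now' to integer ids."""
    return [char_to_num[c] for c in text]

def decode(ids: list[int]) -> str:
    """Map integer ids back to text, skipping the blank id 0."""
    return "".join(num_to_char[i] for i in ids if i != 0)
```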
Here, instead of the Bi-GRU layers used in the original LipNet, we use Bi-LSTM layers.
- https://keras.io/examples/audio/ctc_asr/
- https://github.com/rizkiarm/LipNet
- Lip reading.pdf
- Lip reading1.pdf