EmoGen/evaluation at master · sahilg06/EmoGen

History

Name		Name	Last commit message	Last commit date
parent directory ..
scores_LSE		scores_LSE
README.md		README.md
gen_videos_from_filelist.py		gen_videos_from_filelist.py
img_eval.py		img_eval.py
real_videos_inference.py		real_videos_inference.py

README.md

Evaluation of sync quality

You can calculate the LSE-D and LSE-C scores using the instructions below.

Steps to set-up the evaluation repository for LSE-D and LSE-C metric:

We use the pre-trained syncnet model available in this repository.

Clone the SyncNet repository.

git clone https://github.com/joonson/syncnet_python.git

Follow the procedure given in the above linked repository to download the pretrained models and set up the dependencies.
- Note: Please install a separate virtual environment for the evaluation scripts. The versions used by EmoGen and the publicly released code of SyncNet is different and can cause version mis-match issues. To avoid this, we suggest the users to install a separate virtual environment for the evaluation scripts

cd syncnet_python
pip install -r requirements.txt
sh download_model.sh

The above step should ensure that all the dependencies required by the repository is installed and the pre-trained models are downloaded.

Running the evaluation scripts:

Copy our evaluation scripts given in this folder to the cloned repository.

    cd Wav2Lip/evaluation/scores_LSE/
    cp *.py syncnet_python/
    cp *.sh syncnet_python/

Our evaluation technique does not require ground-truth of any sorts. Given lip-synced videos we can directly calculate the scores from only the generated videos. Please store the generated videos (from our test sets or your own generated videos) in the following folder structure.

video data root (Folder containing all videos)
├── All .mp4 files

Change the folder back to the cloned repository.

cd syncnet_python

To run evaluation on the ReSynced dataset or your own generated videos, please run the following command:

sh calculate_scores_real_videos.sh /path/to/video/data/root

The generated scores will be present in the all_scores.txt generated in the syncnet_python/ folder

Evaluation of image quality using FID metric.

We use the pytorch-fid repository for calculating the FID metrics. We dump all the frames in both ground-truth and generated videos and calculate the FID score. Please see this thread on how to calculate the FID scores.

Evaluation of emotion incorporation.

We use our emotion discriminator to evaluate the generated emotional talking face videos. Train it as an emotion classifier. The higher the emotion classification accuracy (EmoAcc) of the video-based emotion classifier on the generated videos, the better the emotion incorporation ability of the model. As we are using arbitrary emotions to generate our videos, those arbitrary emotions can be exploited as ground truth labels for the classifier to evaluate our model.

Opening issues related to evaluation scripts

Please open the issues with the "Evaluation" label if you face any issues in the evaluation scripts.

Acknowledgements

Our evaluation pipeline in based on two existing repositories. LSE metrics are based on the syncnet_python repository and the FID score is based on pytorch-fid repository. We thank the authors of both the repositories for releasing their wonderful code.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

evaluation

evaluation

README.md

Evaluation of sync quality

Steps to set-up the evaluation repository for LSE-D and LSE-C metric:

Running the evaluation scripts:

Evaluation of image quality using FID metric.

Evaluation of emotion incorporation.

Opening issues related to evaluation scripts

Acknowledgements

Files

evaluation

Directory actions

More options

Directory actions

More options

Latest commit

History

evaluation

Folders and files

parent directory

README.md

Evaluation of sync quality

Steps to set-up the evaluation repository for LSE-D and LSE-C metric:

Running the evaluation scripts:

Evaluation of image quality using FID metric.

Evaluation of emotion incorporation.

Opening issues related to evaluation scripts

Acknowledgements