Skip to content

melaniezhang/whisper-diarization

 
 

Repository files navigation

CS 229 Final Project

Fall 2022

This repository is a fork of openai/whisper with some added tools and scripts related to speaker diarization.

Files authored by us: clustering/clustering_diarizer.py, data_processing/process_ami_annotations.py

The code we wrote to perform all dataset pre-processing and run our clustering experiments is in this Colab notebook.

The code we used to fine-tune the Whisper model (adapted from this fine-tuning notebook and edited heavily) is in this Colab notebook.

###Setup Instructions

  1. sudo apt update && sudo apt install ffmpeg (dependency for the Whisper package)
  2. Clone this repo: git clone https://github.com/melaniezhang/whisper-diarization.git
  3. pip install -r requirements.txt
  4. pip install librosa
  5. python clustering/clustering_diarizer.py

About

CS229 final project

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 52.7%
  • Jupyter Notebook 47.3%