This is a fork of the Scription editor by smlum adapted for Whisper and WhisperX transcripts. It supports WhisperX's json files containing speaker diarization.
Scription is an editor for automated transcription services like Amazon Transcribe and Mozilla Deepspeech. It links transcript text to audio playback to bring love and joy to the transcription process ❤️ It's currently being developed bit by bit - if you have any feedback please feel free to send me a message.
- Highlight and scroll text as the audio plays
- Control audio playback by clicking words in the text
- Skip around in the audio with keyboard shortcuts
And some other useful stuff:
- Highlight quotes and export them to csv
- Seperate speech by speakers (AWS)
- Highlight low confidence words (AWS)
- Add punctuation (AWS)
- Run a transcription job using Amazon Transcribe or Mozilla Deepspeech
- Download the json output file
- Load the json file into Scription
- Load in your corresponding audio (see below for large audio files)
- You're good to go!
'Save project' creates a text file which you can load into Scription at a later time. It preserves any text edits and annotations.
If you have 'Autosave' turned on it saves your edits every 5 seconds using cookies. This is less secure, but if you refresh the page, they should still be there.
'Export text' creates a plain text file which includes the speaker tags - essentially the same thing as copy and pasting.
'Export annotations' creates a csv file with highlighted quotes by each category.
Audio playback can be controlled using keyboard shortcuts:
- Go back 5s Ctrl + ,
- Skip 5s Ctrl + .
- Slow down Ctrl + Shift + ,
- Speed up Ctrl + Shift + .
Large audio files (above ~50mb) can cause playback issues. So can files with variable bitrates. Ideally you want the files to be less than 50mb.
To get around this you can compress audio down to a small file size. I recommend using a lossy file format (like mp3). It also helps to format it to mono, use a constant bitrate and reduce the bitrate.
You can manually adjust these using something like Audacity's "export to mp3", for example:
This can be a pain for multiple files. I used the following ffmpeg script to iterate through a folder of mp3 files, change the bitrates and sample rates to 8k, change to mono and save new audio files with the '.min.mp3' suffix:
find ./ -name “*.mp3” -exec ffmpeg -i "{}" -codec:a libmp3lame -b:a 8k -ac 1 -ar 8000 '$(basename {} min)’.mp3 \;
git clone https://github.com/cvl01/scription
cd scription
Set APP_DOMAIN
in .env
3. Install packages (requires node)
npm run install
In the root of the project, run gulp.
gulp
npm run start
Or use a local Apache or Nginx server pointed towards the index.html in the root.
The Scription web app uses your browser's local storage. Nothing is uploaded onto another server using the app.
Pull requests are welcome! For major changes, please open an issue first to discuss what you'd like to change.
Scription is built using Bulma and hyperaudio
Thanks to likeleto for adding Google and Yandex support.
If you need some help to setup scription, want to ask a question or simply get involved in the community, feel free to give me a shout.
scription was created by Sam Lumley and is licensed under the open source AGPLv3 license. If you're interested in using it in a proprietary application feel free to get in touch!