Skip to content
forked from pinlab/Schnitt

audio cutter based on VAD, with a swing GUI

License

Notifications You must be signed in to change notification settings

mnaoizy/schnitt

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Project schnitt

  • voice activity detector (VAD) algorithm for splitting sound files
  • Java Swing GUI for visualization, error analysis and editing
  • group project for my MA programming course (Fall, 2015, Kobe University)

Wave panel

  • editable hypo tier (green)
  • non-editable target tier (dark blue)
  • automatically updating eval tier (red) with VAD errors

wav panel

Type Description
TN true negative; silence detected as silence
TN true positive; speech detected as speech
WC word clipping
NDS(1) noise detected as speech, during silence
NDS(2) noise detected as speech, arching 2 speech activities
FEC front end clipping
REC rear end clipping
HEAD overhead: hypo starts before voice activity
TAIL tail: hypo ends after voice activity ends

Evaluation

The following FDA (more precisely Finite State Transducer) is implemented. wav panel

About

audio cutter based on VAD, with a swing GUI

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 99.7%
  • Shell 0.3%