- voice activity detector (VAD) algorithm for splitting sound files
- Java Swing GUI for visualization, error analysis and editing
- group project for my MA programming course (Fall, 2015, Kobe University)
- editable hypo tier (green)
- non-editable target tier (dark blue)
- automatically updating eval tier (red) with VAD errors
Type | Description |
---|---|
TN | true negative; silence detected as silence |
TN | true positive; speech detected as speech |
WC | word clipping |
NDS(1) | noise detected as speech, during silence |
NDS(2) | noise detected as speech, arching 2 speech activities |
FEC | front end clipping |
REC | rear end clipping |
HEAD | overhead: hypo starts before voice activity |
TAIL | tail: hypo ends after voice activity ends |
The following FDA (more precisely Finite State Transducer) is implemented.