Skip to content

hbwu-ntu/AudioDecBenchmark

 
 

Repository files navigation

Audio Codec Benchmark

Codec Collection:

Criteria

Waveform (Lower is better)

L1Loss in waveform

Mel Distance (Lower is better)

The Mel Distance is the distance between the log mel spectrograms of the reconstructed and ground truth waveforms.

STFT Distance (Lower is better)

This metric calculates the distance between the log magnitude spectrograms of the reconstructed and ground truth waveforms, using window lengths of [2048, 512], and is better at capturing fidelity in higher frequencies compared to the Mel Distance.

PESQ (Higher is better)

PESQ is an intrusive perceptual quality metric that assesses audio quality based on the mean opinion score of a human listener.

STOI (Higher is better)

STOI is an intrusive perceptual quality metric that assesses audio quality based on the intelligibility of the reconstructed speech.

About

Audio Codec Benchmark

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 91.6%
  • JavaScript 6.1%
  • HTML 1.6%
  • CSS 0.7%