Dataset and baseline for the first Audiocaption task
-
Updated
Jul 25, 2024 - Python
Dataset and baseline for the first Audiocaption task
Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.
Add a description, image, and links to the audiocaption topic page so that developers can more easily learn about it.
To associate your repository with the audiocaption topic, visit your repo's landing page and select "manage topics."