Audio Understanding Test Set

Audio Understanding Test Set A structured dataset for evaluating audio understanding capabilities of multimodal AI models. Contains 137 test prompts across 22 categories, paired with a 20-minute voice sample and 49 completed model outputs from Gemini 3.1 Flash Lite. Overview Property Value Total prompts 137 Implemented (with prompt text) 49 Suggested (description only) 88 Completed outputs 49 Categories 22 Model under test… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Audio-Understanding-Test-Set.

View on Hugging Face

Project Details

Tags

task_categories:audio-classificationtask_categories:automatic-speech-recognitionlanguage:enlicense:cc-by-4.0size_categories:n<1kmodality:audiodoi:10.57967/hf/8154region:usaudio-understandingvoice-analysismultimodal-evaluationspeaker-analysisemotion-detectionaudio-engineeringvoice-cloningforensic-audio

Explore More Projects