Hugging Face Dataset

ASR-WPM-And-Background-Noise-Eval

ASR WPM and Background Noise Evaluation Dataset

A dataset of annotated audio recordings for evaluating how different factors affect the transcription accuracy of Whisper and other ASR/STT systems.

Purpose

This dataset provides controlled audio samples with annotations to evaluate ASR performance across:

Speaking pace (fast, normal, slow, mumbled, whispered, weird voices)
Background noise (cafe, music, conversations in various languages, traffic, sirens, etc.)
Microphone…

See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/ASR-WPM-And-Background-Noise-Eval.
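As a rough illustration of the kind of evaluation described above, the sketch below computes speaking pace in words per minute and a word error rate (WER) for one sample. It assumes each sample provides a reference transcript, an ASR hypothesis, and a clip duration in seconds. The function names and the choice of WER as the accuracy metric are assumptions made for this example and are not taken from the dataset itself.

```python
def words_per_minute(transcript: str, duration_seconds: float) -> float:
    """Speaking pace: whitespace-separated word count scaled to one minute."""
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    return len(transcript.split()) * 60.0 / duration_seconds


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word dropped (deletion)
                curr[j - 1] + 1,     # extra hypothesis word (insertion)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)
```

Grouping WER values by pace or noise annotation would then show which conditions degrade transcription most. This sketch only lowercases the text, so any further normalization (punctuation, numerals) would need to be applied before scoring.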

Project Information

Topics

task_categories:automatic-speech-recognition, language:en, license:mit, size_categories:n<1K, modality:audio, doi:10.57967/hf/7192, region:us, audio, speech, whisper, asr, stt, wpm, background-noise, speech-recognition, evaluation