SloPalSpeech: A 2,8000-Hour Slovak Speech Corpus from Parliamentary Data
Paper
β’
2509.19270
β’
Published
This model is a fine-tuned version of openai/whisper-large-v3-turbo.
It is adapted for Slovak ASR using SloPalSpeech: 2,806 hours of aligned, β€30 s speechβtext pairs from official plenary sessions of the Slovak National Council.
| Dataset | Base WER | Fine-tuned WER | Ξ (abs) |
|---|---|---|---|
| Common Voice 21 (sk) | 31.7 | 13.2 | -18.5 |
| FLEURS (sk) | 10.7 | 6.4 | -4.3 |
Numbers from the paperβs final benchmark runs.
For more details, please see our paper on arXiv. If you use this model in your work, please cite it as:
@misc{boΕΎΓk2025slopalspeech2800hourslovakspeech,
title={SloPalSpeech: A 2,800-Hour Slovak Speech Corpus from Parliamentary Data},
author={Erik BoΕΎΓk and Marek Ε uppa},
year={2025},
eprint={2509.19270},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2509.19270},
}
This work was supported by VΓB Banka who provided the GPU resources and backing necessary to accomplish it, enabling progress in Slovak ASR research.
Base model
openai/whisper-large-v3