TY - JOUR AU - Fedoseev, Vladimir Igorevich AU - Konev , Anton Aleksandrovich AU - Repyuk, Natalia Sergeevna PY - 2026 TI - Speech Corpora for Different Languages: A Systematic Review JF - Journal of Computer Science VL - 22 IS - 1 DO - 10.3844/jcssp.2026.9.24 UR - https://thescipub.com/abstract/jcssp.2026.9.24 AB - The study of speech signals relies on carefully curated audio recordings, which are compiled and stored within specialized speech corpora. This article provides a comprehensive overview of such corpora across multiple languages, with particular focus on Russian, English, and Arabic. It notes that Russian and Arabic are represented by fewer corpora compared to the more extensive resources available for English. The discussion includes an examination of typical speech corpus structures, a description of standard parameters for characterizing corpora, and an outline of common metrics used to describe the speech signal itself.