Performance of Czech Speech Recognition with Language Models Created from Public Resources
In this paper, we investigate the usability of publicly available n-gram corpora for the creation of language models (LM) applicable for Czech speech recognition systems.N-gram LMs with various parameters and settings were created from two publicly available sets, Czech Web 1T 5-gram corpus provided by Nail Cream Google and 5-gram corpus obtained f