Speech Audio Characteristics
Audio can be encoded at different sampling rates (i.e. samples per second – the most common being: 8, 16, 32, 44.1, 48, and 96 kHz), and different bits per sample (the most common being: 8-bits, 16-bits or 32-bits). Speech recognition engines work best if the acoustic model they use was trained with speech audio which was recorded at the same sampling rate/bits per sample as the speech being recognized.
Read more about this topic: Acoustic Model
Famous quotes containing the word speech:
“There are certain things in which mediocrity is intolerable: poetry, music, painting, public eloquence. What torture it is to hear a frigid speech being pompously declaimed, or second-rate verse spoken with all a bad poets bombast!”
—Jean De La Bruyère (16451696)