Acoustic Model - Speech Audio Characteristics

Speech Audio Characteristics

Audio can be encoded at different sampling rates (i.e. samples per second – the most common being: 8, 16, 32, 44.1, 48, and 96 kHz), and different bits per sample (the most common being: 8-bits, 16-bits or 32-bits). Speech recognition engines work best if the acoustic model they use was trained with speech audio which was recorded at the same sampling rate/bits per sample as the speech being recognized.

Read more about this topic: Acoustic Model

Famous quotes containing the word speech:

“What of the heart without her? Nay, poor heart,
Of thee what word remains ere speech be still?
A wayfarer by barren ways and chill,
Steep ways and weary, without her thou art,
Where the long cloud, the long wood’s counterpart,
Sheds doubled darkness up the labouring hill.”
—Dante Gabriel Rossetti (1828–1882)