Speech Audio Characteristics
Audio can be encoded at different sampling rates (samples per second; the most common are 8, 16, 32, 44.1, 48, and 96 kHz) and at different bit depths (bits per sample; the most common are 8, 16, and 32 bits). Speech recognition engines work best when the acoustic model they use was trained on speech audio recorded at the same sampling rate and bit depth as the speech being recognized.
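As an illustration of the matching requirement, the following minimal sketch checks whether a WAV file has the sampling rate and bit depth a model was trained on. It uses only Python's standard `wave` module. The names `AudioFormat`, `read_wav_format`, and `matches_model` are hypothetical, chosen for this example, and the 16 kHz / 16-bit model format in the usage note is an assumed value, not a property of any particular engine.

```python
import wave
from dataclasses import dataclass


@dataclass(frozen=True)
class AudioFormat:
    """Sampling rate and bit depth of an audio recording (hypothetical helper type)."""
    sample_rate_hz: int
    bits_per_sample: int


def read_wav_format(path: str) -> AudioFormat:
    """Read the sampling rate and bit depth from the header of a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        # getsampwidth() returns bytes per sample, so multiply by 8 to get bits.
        return AudioFormat(wav.getframerate(), wav.getsampwidth() * 8)


def matches_model(path: str, model_format: AudioFormat) -> bool:
    """Return True if the recording has the format the acoustic model was trained on."""
    return read_wav_format(path) == model_format
```

For example, with a model assumed to be trained on 16 kHz, 16-bit audio, `matches_model("utterance.wav", AudioFormat(16000, 16))` would return `False` for an 8 kHz telephone recording, which signals that the audio should be resampled or a different acoustic model selected before recognition.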