Acoustic Model - Speech Audio Characteristics

Speech Audio Characteristics

Audio can be encoded at different sampling rates (i.e. samples per second – the most common being: 8, 16, 32, 44.1, 48, and 96 kHz), and different bits per sample (the most common being: 8-bits, 16-bits or 32-bits). Speech recognition engines work best if the acoustic model they use was trained with speech audio which was recorded at the same sampling rate/bits per sample as the speech being recognized.

Read more about this topic:  Acoustic Model

Famous quotes containing the word speech:

    our concern was speech, and speech impelled us
    To purify the dialect of the tribe
    And urge the mind to aftersight and foresight,
    —T.S. (Thomas Stearns)