Acoustic Model - Speech Audio Characteristics

Audio can be encoded at different sampling rates (samples per second; the most common are 8, 16, 32, 44.1, 48, and 96 kHz) and at different bit depths (bits per sample; the most common are 8, 16, and 32 bits). Speech recognition engines work best when the acoustic model they use was trained on speech audio recorded at the same sampling rate and bit depth as the speech being recognized.
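
As an illustration of checking this in practice, the following minimal Python sketch reads the sampling rate and bits per sample from an uncompressed PCM WAV file using the standard-library wave module and compares them with the values a model was trained on. The function names audio_format and matches_model, and the default of 16 kHz at 16 bits, are assumptions made for the example and do not come from any particular recognition engine.

```python
import wave


def audio_format(source):
    """Return (sample_rate_hz, bits_per_sample) for a PCM WAV file.

    `source` may be a file path or a binary file-like object.
    """
    with wave.open(source, "rb") as wav:
        # getsampwidth() reports bytes per sample, so convert to bits.
        return wav.getframerate(), wav.getsampwidth() * 8


def matches_model(source, model_rate_hz=16000, model_bits=16):
    """Check whether the audio matches the model's training format.

    The default 16 kHz / 16-bit values are illustrative assumptions.
    """
    rate_hz, bits = audio_format(source)
    return rate_hz == model_rate_hz and bits == model_bits
```

If matches_model returns False, the audio would typically need to be resampled or converted before recognition, or a model trained on the matching format would need to be chosen.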
