Acoustic Model - Speech Audio Characteristics

Speech Audio Characteristics

Audio can be encoded at different sampling rates (i.e. samples per second – the most common being: 8, 16, 32, 44.1, 48, and 96 kHz), and different bits per sample (the most common being: 8-bits, 16-bits or 32-bits). Speech recognition engines work best if the acoustic model they use was trained with speech audio which was recorded at the same sampling rate/bits per sample as the speech being recognized.

Read more about this topic:  Acoustic Model

Famous quotes containing the word speech:

    Our speech has its weaknesses and its defects, like all the rest. Most of the occasions for the troubles of the world are grammatical.
    Michel de Montaigne (1533–1592)