Desktop-based Speech Recognition
For speech recognition on a standard desktop PC, the limiting factor is the sound card. Most sound cards today can record audio at sampling rates between 16 kHz and 48 kHz, with bit depths of 8 to 16 bits per sample, and can play back at up to 96 kHz.
As a general rule, a speech recognition engine works better with acoustic models trained on speech audio recorded at higher sampling rates and more bits per sample. However, audio with too high a sampling rate or too many bits per sample can slow the recognition engine down, so a compromise is needed. For desktop speech recognition, the current standard is therefore acoustic models trained on speech audio recorded at a sampling rate of 16 kHz with 16 bits per sample.
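The trade-off can be made concrete by comparing raw data rates and by checking whether a recording matches the 16 kHz, 16-bit format. The following is a minimal sketch using Python's standard `wave` module. The function names (`bytes_per_second`, `matches_target`, `make_placeholder_wav`) are hypothetical helpers introduced for illustration, and uncompressed mono PCM is an assumption, not something the text specifies.

```python
import io
import wave

TARGET_RATE_HZ = 16_000    # sampling rate named in the text
TARGET_SAMPLE_BYTES = 2    # 16 bits per sample = 2 bytes


def bytes_per_second(rate_hz, bits_per_sample, channels=1):
    """Raw (uncompressed PCM) data rate for the given audio format."""
    return rate_hz * bits_per_sample * channels // 8


def matches_target(wav_file):
    """Return True if the WAV data is sampled at 16 kHz with 16-bit samples."""
    with wave.open(wav_file, "rb") as w:
        return (w.getframerate() == TARGET_RATE_HZ
                and w.getsampwidth() == TARGET_SAMPLE_BYTES)


def make_placeholder_wav(rate_hz, sample_bytes, seconds=1):
    """Build an in-memory mono WAV of zero-valued bytes, for demonstration only."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(sample_bytes)
        w.setframerate(rate_hz)
        w.writeframes(b"\x00" * (rate_hz * sample_bytes * seconds))
    buf.seek(0)  # rewind so the buffer can be read back
    return buf


if __name__ == "__main__":
    print(bytes_per_second(16_000, 16))  # 32000
    print(bytes_per_second(48_000, 16))  # 96000
    print(matches_target(make_placeholder_wav(16_000, 2)))  # True
    print(matches_target(make_placeholder_wav(48_000, 2)))  # False
```

Under these assumptions, 48 kHz audio at 16 bits carries three times as much data per second as 16 kHz audio at the same bit depth, which illustrates why a higher sampling rate means more work for the recognition engine.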