Desktop-based Speech Recognition
For speech recognition on a standard desktop PC, the limiting factor is the sound card. Most sound cards today can record at sampling rates between 16 kHz and 48 kHz, with 8 to 16 bits per sample, and can play back at up to 96 kHz.
As a general rule, a speech recognition engine works better with acoustic models trained on speech audio recorded at higher sampling rates and with more bits per sample. However, audio with too high a sampling rate or too many bits per sample produces more data to process and can slow the recognition engine down, so a compromise is needed. For desktop speech recognition, the current standard is therefore acoustic models trained on speech audio recorded at a sampling rate of 16 kHz with 16 bits per sample.
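To make the trade-off concrete, the following minimal Python sketch computes the uncompressed PCM data rate for a given sampling rate and sample size, and checks whether a WAV recording matches the 16 kHz/16-bit format described above. The function names (pcm_bytes_per_second, write_test_tone, matches_desktop_format) are hypothetical and chosen for illustration only. The sketch assumes mono, uncompressed PCM audio and uses only the Python standard library.

```python
import io
import math
import struct
import wave


def pcm_bytes_per_second(sample_rate_hz, bits_per_sample, channels=1):
    """Uncompressed PCM data rate in bytes per second."""
    return sample_rate_hz * bits_per_sample * channels // 8


def write_test_tone(sample_rate_hz=16000, seconds=1.0, freq_hz=440.0):
    """Return the bytes of a mono, 16-bit WAV file containing a sine tone."""
    n_frames = int(sample_rate_hz * seconds)
    frames = b"".join(
        struct.pack(
            "<h",  # little-endian signed 16-bit sample
            int(32767 * 0.5 * math.sin(2 * math.pi * freq_hz * i / sample_rate_hz)),
        )
        for i in range(n_frames)
    )
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)            # mono
        w.setsampwidth(2)            # 2 bytes = 16 bits per sample
        w.setframerate(sample_rate_hz)
        w.writeframes(frames)
    return buf.getvalue()


def matches_desktop_format(wav_bytes, sample_rate_hz=16000, bits_per_sample=16):
    """True if the WAV data has the expected sampling rate and sample size."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as w:
        return (
            w.getframerate() == sample_rate_hz
            and w.getsampwidth() * 8 == bits_per_sample
        )


if __name__ == "__main__":
    for rate, bits in [(16000, 16), (48000, 16)]:
        print(rate, "Hz,", bits, "bit:", pcm_bytes_per_second(rate, bits), "bytes/s")
    print(matches_desktop_format(write_test_tone()))
```

At 16 kHz and 16 bits per sample, mono audio amounts to 32,000 bytes per second, while 48 kHz at the same sample size triples that to 96,000 bytes per second. This is one way to see why higher-resolution audio means more data for the recognition engine to handle.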