Voice Activity Detection

Voice activity detection (VAD), also known as speech activity detection or speech detection, is a technique used in speech processing to detect the presence or absence of human speech. The main uses of VAD are in speech coding and speech recognition. It can facilitate speech processing, and can also be used to deactivate some processes during non-speech sections of an audio session: for example, it can avoid unnecessary coding and transmission of silence packets in Voice over Internet Protocol (VoIP) applications, saving computation and network bandwidth.

VAD is an important enabling technology for a variety of speech-based applications. Therefore, various VAD algorithms have been developed that offer different features and trade-offs among latency, sensitivity, accuracy and computational cost. Some VAD algorithms also provide further analysis, for example whether the speech is voiced, unvoiced or sustained. Voice activity detection is usually language independent.
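
As an illustration of the basic idea, the following is a minimal sketch of one simple approach, a frame-energy threshold with a short hangover period. It is not a description of any particular standardized algorithm. The function names, the 160-sample frame length (20 ms at an assumed 8 kHz sampling rate), the -40 dB threshold and the two-frame hangover are all assumptions chosen for the example; samples are assumed to be floats in the range -1.0 to 1.0.

```python
import math


def frame_energy_db(frame):
    """Mean-square energy of one frame in dB (the floor avoids log of zero)."""
    mean_square = sum(s * s for s in frame) / len(frame)
    return 10.0 * math.log10(max(mean_square, 1e-12))


def energy_vad(samples, frame_len=160, threshold_db=-40.0, hangover=2):
    """Return one boolean per full frame: True where speech is assumed present.

    All parameter defaults are illustrative assumptions, not recommended values.
    """
    decisions = []
    frames_left = 0  # remaining hangover frames after the last loud frame
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        if frame_energy_db(frame) > threshold_db:
            frames_left = hangover
            decisions.append(True)
        elif frames_left > 0:
            # Keep the decision active briefly so quiet word endings are not cut.
            frames_left -= 1
            decisions.append(True)
        else:
            decisions.append(False)
    return decisions


# Usage: two silent frames, two frames of a 200 Hz tone, then four silent frames.
silence = [0.0] * 320
tone = [0.5 * math.sin(2 * math.pi * 200 * n / 8000) for n in range(320)]
decisions = energy_vad(silence + tone + [0.0] * 640)
```

The sketch makes the trade-offs visible: a lower threshold raises sensitivity at the cost of more false detections on noise, and a longer hangover reduces clipped word endings at the cost of marking more silence as speech. A fixed energy threshold is only suitable for clean, level-controlled audio; practical detectors typically adapt to the background noise.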

VAD was first investigated for use in time-assignment speech interpolation (TASI) systems.
