Computer Science
In the fields of computational linguistics and natural language processing (NLP), esp. corpus linguistics and machine-learned NLP, it is common to disregard hapax legomena (and sometimes other infrequent words), as they are likely to have little value for computational techniques. This disregard has the added benefit of significantly reducing the memory use of an application, since, by Zipf's law, many words are hapaxes.
Read more about this topic: Hapax Legomenon
Famous quotes containing the words computer and/or science:
“The Buddha, the Godhead, resides quite as comfortably in the circuits of a digital computer or the gears of a cycle transmission as he does at the top of a mountain or in the petals of a flower.”
—Robert M. Pirsig (b. 1928)
“He has been described as an innkeeper who hated his guests, a philosopher, and poet who left no written record of his thought, a despiser of women who gave all he had to one, an aristocrat, a proletarian, a pagan, an arcadian, an atheist, a lover of beauty, and, inadvertently, the stepfather of domestic science in America.”
—Administration in the State of Colo, U.S. public relief program (1935-1943)