SMART Information Retrieval System

The SMART (System for the Mechanical Analysis and Retrieval of Text) Information Retrieval System is an information retrieval system developed at Cornell University in the 1960s. Many important concepts in information retrieval were developed as part of research on the SMART system, including the vector space model, relevance feedback, and Rocchio Classification.

Gerard Salton led the group that developed SMART. Other contributors included Mike Lesk.

The SMART system also provides a set a corpora, queries and reference rankings, taken from different subjects, notably

  • ADI: publications from information science reviews
  • CACM: computer science
  • Cranfield collection: publications from aeronautic reviews
  • CISI: library science
  • Medlars collection: publications from medical reviews
  • Time magazine collection: archives of the generalist review Time in 1963

To the legacy of the SMART system belongs the so-called SMART notation, a mnemonic scheme for denoting tf-idf weighting variants in the vector space model. The mnemonic for representing a combination of weights takes the form ddd.qqq, where the first three letters represents the term weighting of the document vector and the second three letters represents the term weighting for the query vector. The letter representation for a term, and document, is as follows :

Term frequency Document frequency Normalization
n (natural): n (no): 1 n (none): 1
l (logarithm): 1+log t (idf): log c (cosine):
a (augmented): 0.5 + p (prob idf): b (byte size):
b (boolean): \begin{cases} 1, & \text{if tf}_{t,d} > 0 \\ 0, & \text{otherwise}
\end{cases}
L (log average):

where tf is the term frequency of term in document .

Famous quotes containing the words smart, information and/or system:

    For he is the cleanest in the use of his forepaws of any quadruped.
    For the dexterity of his defense is an instance of the love of God
    to him exceedingly.
    —Christopher Smart (1722–1771)

    When action grows unprofitable, gather information; when information grows unprofitable, sleep.
    Ursula K. Le Guin (b. 1929)

    Columbus stood in his age as the pioneer of progress and enlightenment. The system of universal education is in our age the most prominent and salutary feature of the spirit of enlightenment, and it is peculiarly appropriate that the schools be made by the people the center of the day’s demonstration. Let the national flag float over every schoolhouse in the country and the exercises be such as shall impress upon our youth the patriotic duties of American citizenship.
    Benjamin Harrison (1833–1901)