Knowledge Discovery

Knowledge Extraction is the creation of knowledge from structured (relational databases, XML) and unstructured (text, documents, images) sources. The result must be in a machine-readable and machine-interpretable format and must represent knowledge in a way that facilitates inferencing. Although it is methodologically similar to Information Extraction (NLP) and ETL (Data Warehouse), the main criterion is that the extraction result goes beyond the creation of structured information or the transformation into a relational schema. It requires either the reuse of existing formal knowledge (reusing identifiers or ontologies) or the generation of a schema based on the source data.

The RDB2RDF W3C group is currently standardizing a language for the extraction of RDF from relational databases. Another popular example of Knowledge Extraction is the transformation of Wikipedia into structured data and its mapping to existing knowledge (see DBpedia and Freebase).
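
As a rough illustration of extracting RDF from a relational source while reusing existing identifiers, the following minimal sketch turns rows of a table into N-Triples statements. It is not the RDB2RDF group's mapping language. The `person` table, the `http://example.org/person/` base URI, and the function names are assumptions made up for this example; only the FOAF vocabulary and the `rdf:type` property are existing, reused identifiers.

```python
import sqlite3

# Existing vocabularies reused for the extracted knowledge.
FOAF = "http://xmlns.com/foaf/0.1/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Hypothetical base URI used to mint an identifier for each row.
BASE = "http://example.org/person/"


def rows_to_triples(conn):
    """Map each row of the (hypothetical) person table to N-Triples lines."""
    triples = []
    query = "SELECT id, name FROM person ORDER BY id"
    for person_id, name in conn.execute(query):
        subject = f"<{BASE}{person_id}>"
        # Type the row's resource with an existing ontology class.
        triples.append(f"{subject} <{RDF_TYPE}> <{FOAF}Person> .")
        # Escape backslashes and quotes so the literal stays valid.
        escaped = name.replace("\\", "\\\\").replace('"', '\\"')
        triples.append(f'{subject} <{FOAF}name> "{escaped}" .')
    return triples


def demo():
    """Build a tiny in-memory database and extract triples from it."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE person (id INTEGER PRIMARY KEY, name TEXT)")
    conn.executemany(
        "INSERT INTO person VALUES (?, ?)", [(1, "Alice"), (2, "Bob")]
    )
    return rows_to_triples(conn)


if __name__ == "__main__":
    for triple in demo():
        print(triple)
```

The point of the sketch is the difference from a plain ETL step: the output does not land in another relational schema but in statements whose class and property identifiers come from a shared ontology, so a reasoner or another dataset using the same vocabulary can interpret them.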
