Current Challenges and Research
Despite the high F1 numbers reported on the MUC-7 dataset, the problem of Named Entity Recognition is far from being solved. The main efforts are directed to reducing the annotation labor, robust performance across domains and scaling up to fine-grained entity types.
A recently emerging task of identifying "important expressions" in text and cross-linking them to Wikipedia can be seen as an instance of extremely fine-grained named entity recognition, where the types are the actual Wikipedia pages describing the (potentially ambiguous) concepts. Below is an example output of a Wikification system:
http://en.wikipedia.org/wiki/Michael_I._Jordan Michael Jordan is a professor athttp://en.wikipedia.org/wiki/University_of_California,_Berkeley Berkeley
Read more about this topic: Named-entity Recognition
Famous quotes containing the words current, challenges and/or research:
“Absolutely speaking, Do unto others as you would that they should do unto you is by no means a golden rule, but the best of current silver. An honest man would have but little occasion for it. It is golden not to have any rule at all in such a case.”
—Henry David Thoreau (18171862)
“The approval of the public is to be avoided like the plague. It is absolutely essential to keep the public from entering if one wishes to avoid confusion. I must add that the public must be kept panting in expectation at the gate by a system of challenges and provocations.”
—André Breton (18961966)
“It is a good morning exercise for a research scientist to discard a pet hypothesis every day before breakfast. It keeps him young.”
—Konrad Lorenz (19031989)