Fast and accurate annotation of short texts with Wikipedia pages
|Name||Fast and accurate annotation of short texts with Wikipedia pages|
We address the problem of cross-referencing text fragments with Wikipedia pages, in a way that synonymy and poly-semy issues are resolved accurately and efficiently. We take inspiration from a recent flow of work and extend their scenario from the annotation of long documents to the annotation of short texts, such as snippets of search-engine results, tweets, news, blogs, etc.. These short and poorly composed texts pose new challenges in terms of efficiency and effectiveness of the annotation process, that we address by designing and engineering Tagme, the first system that performs an accurate and on-the- fly annotation of these short textual fragments. A large set of experiments shows that Tagme outperforms state-of-the-art algorithms when they are adapted to work on short texts and it results fast and competitive on long texts.
|ieee paper year||2012|