Preview

Proceedings of the Institute for System Programming of the RAS (Proceedings of ISP RAS)

Advanced search

WikifyMe: Creating Testbed for Wikifiers

Abstract

Finding relationships between words in text and articles from Wikipedia is an extremely popular task known as wikification. However there is still no gold standard corpus for wikifiers comparison. We present WikifyMe, the online tool for collaborative work on universal test collection which allows users to easily prepare tests for two most difficult problems in wikification: word-sense disambiguation and keyphrase extraction.

About the Authors

Sergey Bartunov
ISP RAS, Moscow
Russian Federation


Alexander Boldakov
ISP RAS, Moscow
Russian Federation


Denis Turdakov
ISP RAS, Moscow
Russian Federation


References

1. Jeff Barr and Luis Felipe Cabrera. Ai gets a brain. Queue, 4:24–29, May 2006.

2. Timothy Chklovski and Rada Mihalcea. Building a sense tagged corpus with open mind word expert. In Proceedings of the ACL-02 workshop on Word sense disambiguation: recent successes and future directions - Volume 8, WSD ’02, pages 116–122, Stroudsburg, PA, USA, 2002. Association for Computational Linguistics.

3. Silviu Cucerzan. Large-scale named entity disambiguation based on wikipedia data. In Proceedings of EMNLP-CoNLL 2007, page 708716, 2007.

4. Paolo Ferragina and Ugo Scaiella. Tagme: on-the-fly annotation of short text fragments (by wikipedia entities). In Proceedings of the 19th ACM international conference on Information and knowledge management, CIKM ’10, pages 1625–1628, New York, NY, USA, 2010. ACM.

5. M. Grineva, D. Lizorkin, M. Grinev, A. Boldakov, D. Turdakov, A. Sysoev, and A. Kiyko. Blognoon: Exploring a topic in the blogosphere. In Proceedings of the 18th international conference on World wide web, 2011.

6. Maria Grineva, Maxim Grinev, and Dmitry Lizorkin. Extracting key terms from noisy and multitheme documents. In Proceedings of the 18th international conference on World wide web, WWW ’09, pages 661–670, New York, NY, USA, 2009. ACM.

7. Sayali Kulkarni, Amit Singh, Ganesh Ramakrishnan, and Soumen Chakrabarti. Collective annotation of Wikipedia entities in web text. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD ’09, pages 457–466, New York, NY, USA, 2009. ACM.

8. Olena Medelyan, Ian H. Witten, and David Milne. Topic indexing with wikipedia, 2008.

9. Rada Mihalcea. Using wikipedia for automatic word sense disambiguation. In North American Chapter of the Association for Computational Linguistics (NAACL 2007), 2007.

10. David Milne and Ian H. Witten. Learning to link with wikipedia. In Proceeding of the 17th ACM conference on Information and knowledge management, CIKM ’08, pages 509–518, New York, NY, USA, 2008. ACM.

11. Denis Turdakov and Pavel Velikhov. Semantic relatedness metric for wikipedia concepts based on link analysis and its application to word sense disambiguation. In Proceedings of the SYRCODIS 2008 Colloquium on Databases and Information Systems, 2008.


Review

For citations:


Bartunov S., Boldakov A., Turdakov D. WikifyMe: Creating Testbed for Wikifiers. Proceedings of the Institute for System Programming of the RAS (Proceedings of ISP RAS). 2011;21. (In Russ.)



Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2079-8156 (Print)
ISSN 2220-6426 (Online)