Strategies in automatic traversal of Wikipedia articles for mining multilingual resources

Andrés Dominguez Burgos, Koen Kerremans, Rita Temmerman

Research output: Chapter in Book/Report/Conference proceeding › Conference paper

Abstract

In this article we present Termontospider, a wiki crawler that optimally traverses Wikipedia in search of domain-specific texts from which terminological and ontological information can be extracted. The crawler is part of a tool suite for automatically developing multilingual termontological databases, i.e. ontologically underpinned multilingual terminological databases. The focus is on analyzing how internal links, categories and other metadata are best valued when assigning weights and choosing search mechanisms for network traversal.
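The kind of weighted traversal the abstract describes can be illustrated with a minimal sketch. It is not Termontospider's actual implementation: the in-memory `PAGES` graph, the `score` function, the `category_weight` and `min_score` parameters, and all page and category names are hypothetical, chosen only to show how category metadata might steer a best-first search over internal links.

```python
import heapq

# Toy in-memory stand-in for Wikipedia: page -> (categories, internal links).
# All page names, categories, and weights are hypothetical.
PAGES = {
    "Festival": ({"Cultural events"}, ["Carnival", "Football", "Parade"]),
    "Carnival": ({"Cultural events", "Festivals"}, ["Parade"]),
    "Parade": ({"Festivals"}, ["Football"]),
    "Football": ({"Sports"}, ["Stadium"]),
    "Stadium": ({"Sports", "Buildings"}, []),
}


def score(page, domain_categories, category_weight=1.0):
    """Score a page by how many of its categories belong to the target domain."""
    categories, _ = PAGES[page]
    return category_weight * len(categories & domain_categories)


def crawl(seed, domain_categories, min_score=1.0, max_pages=10):
    """Best-first traversal: always expand the highest-scoring page seen so far."""
    # heapq is a min-heap, so scores are negated to pop the best page first.
    frontier = [(-score(seed, domain_categories), seed)]
    seen = {seed}
    selected = []
    while frontier and len(selected) < max_pages:
        neg_score, page = heapq.heappop(frontier)
        if -neg_score < min_score:
            continue  # off-domain page: neither keep it nor follow its links
        selected.append(page)
        for target in PAGES[page][1]:
            if target in PAGES and target not in seen:
                seen.add(target)
                heapq.heappush(
                    frontier, (-score(target, domain_categories), target)
                )
    return selected


if __name__ == "__main__":
    print(crawl("Festival", {"Cultural events", "Festivals"}))
```

In this sketch, pages that score below `min_score` are dropped and their links are never followed, which keeps the crawl inside the target domain. A real crawler would presumably combine several signals (link anchors, interlanguage links, other metadata) rather than category overlap alone.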
Original language: English
Title of host publication: Proceedings of the workshop on Challenges to knowledge representation in multilingual contexts - TKE2012
Editors: Rute Costa, Manuel Silva, António Lucas Soares
Place of Publication: Madrid
Publisher: Universidad Politécnica de Madrid
Pages: 1-8
Number of pages: 8
Publication status: Published - 2012
Event: Knowledge Representation in Multilingual Contexts (TKE 2012 conference) - Politécnica de Madrid, Spain
Duration: 19 Jun 2012 → …

Publication series

Name: Proceedings of the workshop on Challenges to knowledge representation in multilingual contexts - TKE2012

Conference

Conference: Knowledge Representation in Multilingual Contexts (TKE 2012 conference)
Country/Territory: Spain
City: Politécnica de Madrid
Period: 19/06/12 → …

Bibliographical note

Rute Costa, Manuel Silva and António Lucas Soares

Keywords

  • Termontospider
  • Wikipedia
  • terminology
  • cultural events
