Building a wide coverage multilingual lexical knowledge base: Multilingual Central Repository

  • Aitor Gonzalez-Agirre Departamento de Informática, Universidade do Minho
  • German Rigau

Abstract

The use of wide coverage and general domain semantic resources has become a common practice and often necesary by existing systems Natural Language Processing (NLP). WordNet is by far the most widely used semantic resource in NLP. Following the success of WordNet, the EuroWordNet project has designed a multilingual semantic infrastructure to develop wordnets for a set of European languages. In EuroWordNet, these wordnets are interconnected with links stored in the Inter-Lingual Index (ILI). Following the EuroWordNet architecture, the MEANING project has developed the first versions of Multilingual Central Repository (MCR) using WordNet 1.6 as ILI. Thus, maintaining the compatibility between wordnets of different languages ​​and versions. This version of the MCR integrates six different versions of the English WordNet (1.6 to 3.0) and wordnets in Spanish, Catalan, Basque and Italian, along with more than a million semantic relationships between concepts and semantic properties different ontologies. We recently developed a new version of MCR using WordNet 3.0 as ILI. This new version of the MCR integrates wordnets of five different languages: English, Spanish, Catalan, Basque and Galician. The current version of MCR, like the previous one, systematically integrates thousands of semantic relations between concepts. In addition, the MCR is enriched with about 460,000 semantic and ontological properties including Base Level Concepts, Top Ontology, WordNet Domains and AdimenSUMO, providing all ontological consistency the integrated semantic wordnets and resources on it.
Published
2013-07-20
How to Cite
Gonzalez-Agirre, A., & Rigau, G. (2013). Building a wide coverage multilingual lexical knowledge base: Multilingual Central Repository. Linguamática, 5(1), 13-28. Retrieved from https://linguamatica.com/index.php/linguamatica/article/view/159
Section
Dossier