LinguaKit: a multilingual tool for linguistic analysis and information extraction

  • Pablo Gamallo Universidade de Santiago de Compostela
  • Marcos Garcia Universidade de A Corunha

Abstract

This paper presents LinguaKit, a multilingual suite of tools for linguistic analysis, extraction, annotation, and correction. LinguaKit lets the user perform tasks such as lemmatization, PoS-tagging, and syntactic parsing, among others. It also includes applications for sentiment analysis (opinion mining), extraction of multiword expressions, and conceptual annotation and entity linking to DBpedia. Most of the developed modules work for four language varieties: Portuguese, Spanish, English, and Galician. The system is programmed in Perl and is freely available under a GPLv3 license.

Published
2017-07-01
How to Cite
Gamallo, P., & Garcia, M. (2017). LinguaKit: a multilingual tool for linguistic analysis and information extraction. Linguamática, 9(1), 19-28. https://doi.org/10.21814/lm.9.1.243
Section
Research Articles