Discovering Fuzzy Synsets from the Redundancy across several Dictionaries

Authors

  • Fábio Santos CISUC, Departamento de Engenharia Informática, Universidade de Coimbra
  • Hugo Gonçalo Oliveira CISUC, Departamento de Engenharia Informática, Universidade de Coimbra

Keywords:

wordnets, synsets, fuzzy clustering, rede léxico-semântica, sinónimos, confiança, dicionários

Abstract

In a wordnet, concepts are typically represented as groups of words, commonly known as synsets, and each membership of a word to a synset denotes a different sense of that word.
However, since word senses are complex entities, without well-defined boundaries, we suggest to handle them less artificially, by representing them as fuzzy objects, where each word has its membership degree, which can be related to the confidence on using the word to denote the concept conveyed by the synset.
We thus propose an approach to discover synsets from a synonymy network, ideally redundant and extracted from several broad-coverage sou rces. The more synonymy relations there are between two words, the higher the confidence on the semantic equivalence of at least one of their senses.The proposed approach was applied to a network extracted from three Portuguese dictionaries and resulted in a large set of fuzzy synsets.
Besides describing this approach and illustrating its results, we rely on three evaluations -- comparison against a handcrafted Portuguese thesaurus; comparison against the results of a previous approach with a similar goal; and manual evaluation -- to believe that our outcomes are positive and that, in the future, they might my expanded by exploring additional synonymy sources.

References

Published

2015-12-30

Issue

Section

Research Articles

How to Cite

Discovering Fuzzy Synsets from the Redundancy across several Dictionaries. (2015). Linguamática, 7(2), 3-17. https://linguamatica.com/index.php/linguamatica/article/view/V7N2.1