Discovering Fuzzy Synsets from the Redundancy across several Dictionaries
Abstract
In a wordnet, concepts are typically represented as groups of words, commonly known as synsets, and each membership of a word to a synset denotes a different sense of that word.
However, since word senses are complex entities, without well-defined boundaries, we suggest to handle them less artificially, by representing them as fuzzy objects, where each word has its membership degree, which can be related to the confidence on using the word to denote the concept conveyed by the synset.
We thus propose an approach to discover synsets from a synonymy network, ideally redundant and extracted from several broad-coverage sou rces. The more synonymy relations there are between two words, the higher the confidence on the semantic equivalence of at least one of their senses.The proposed approach was applied to a network extracted from three Portuguese dictionaries and resulted in a large set of fuzzy synsets.
Besides describing this approach and illustrating its results, we rely on three evaluations -- comparison against a handcrafted Portuguese thesaurus; comparison against the results of a previous approach with a similar goal; and manual evaluation -- to believe that our outcomes are positive and that, in the future, they might my expanded by exploring additional synonymy sources.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).