Universidad Complutense de Madrid
E-Prints Complutense

Combinatoria léxica, polisemia y polisemia regular en una base de conocimiento léxico conceptual : el caso de "Redes diccionario combinatorio del español contemporáneo" y functional grammar knowledge base

Impacto

Descargas

Último año



Sherwood Droz, Maia (2014) Combinatoria léxica, polisemia y polisemia regular en una base de conocimiento léxico conceptual : el caso de "Redes diccionario combinatorio del español contemporáneo" y functional grammar knowledge base. [Tesis]

[img]
Vista previa
PDF
4MB


Resumen

This dissertation attempts to link the linguistic information contained in REDES Diccionario combinatorio del español contemporáneo (Bosque, 2004), or REDES, to the
ontological framework of Functional Grammar Knowledge Base, or FunGramKB. REDES is a dictionary that gathers systematic restrictions imposed by some 4,000 Spanish predicates to their selection of lexical arguments, while FunGramKB is a multilingual and multipurpose lexical conceptual knowledge base (KB) designed for Natural Language Processing (NLP). This work is thus part of the field known as electronic lexicography for the XXIst century or the third millennium
(Fuertes and Tarp, 2011).This new lexicography comprises electronic lexicographical resources which are much
more complex than the commonly used digitalized versions of traditional dictionaries. It is composed of lexical databases or lexical knowledge bases, of variable complexity and depth, built on electronic platforms. Even though electronic lexicography can also serve the typical dictionary queries posed by humans, the best-known NLP applications include machine translation (MT),
question and answer systems, information extraction, and voice recognition programs. However, the main problem facing NLP continues to be Word Sense Disambiguation (WSD). Most words in a language are polysemic and a successful NLP application will depend on its ability to assign the
correct sense to each word in a given context. A great deal of the work that is taking place in NLP is therefore focused on finding strategies to effectively disambiguate words in their context.In working with REDES and FunGramKB, it could seem at first glance that we are dealing with two distant fields –print lexicography and knowledge engineering, respectively–, but this thesis assumes that linking these resources would yield significant benefits for both. Furthermore, it would combine two sources of valuable linguistic and theoretical data: on the one hand, REDES
contributes thousands of patterns of systematic predicate-argument word combinations, taken from real use in Spanish-language corpora. These combinations are not presented as “collocations”, that is, combinations between a single predicate and a single argument, or vice versa, but rather,
between predicates and “lexical classes”, groups of arguments that share a common semantic basis. On the other hand, FunGramKB contributes a multilevel electronic platform designed for NLP, which includes a conceptual, a lexical and a grammatical model, and has been built around a hierarchical and taxonomical ontology of universal cognitive concepts.


Tipo de documento:Tesis
Información Adicional:

Tesis inédita de la Universidad Complutense de Madrid, Facultad de Filología, leída el 04-02-2014

Directores (o tutores):
NombreEmail del director (o tutor)
Mairal Usón, Ricardo
Palabras clave:Lengua, lexicografía
Materias:Humanidades > Filología > Lengua española
Código ID:25081
Depositado:11 Abr 2014 09:17
Última Modificación:11 Abr 2014 13:05

Descargas en el último año

Sólo personal del repositorio: página de control del artículo