Complutense University Library

Annotating Expressions of Engagement in online book reviews: A contrastive (English-Spanish) corpus study for computational processing

Mora, Natalia (2011) Annotating Expressions of Engagement in online book reviews: A contrastive (English-Spanish) corpus study for computational processing. [Trabajo Fin de Máster]

[img]
Preview
PDF
520kB

Official URL: https://portal.ucm.es/web/filologia_inglesa_i/inicio

View download statistics for this eprint

==>>> Export to other formats

Abstract

This dissertation studies the expression of Engagement and alternative points of view in English and Spanish online book reviews, following the Appraisal model designed by
Martin and White (2005). The study has three main aims: 1) to test two main aspects of the linguistic category of Engagement empirically, namely the identification of span
realising Engagement and the classification of Engagement into different subtypes; 2) to extract relevant contrastive features of the use of Engagement in English and Spanish in
online book reviews; 3) to create a bilingual (comparable) machine-readable annotated corpus with Engagement features in English and Spanish which can serve as the training
corpus for machine learning algorithms and be offered to the scientific community for further research. Following standard methodologies in the field of Natural Language
Processing, two agreement studies are carried out, designed to measure inter-annotator agreement based on an initial set of 10 reviews. A larger set of 28 reviews (14 English,
14 Spanish) is further annotated by one single human coder in order to extract relevant results on contrastive aspects and provide publicly-available machine-readable annotated
texts with Engagement categories. The findings reveal disagreement mainly on span length and the annotation of some specific categories, namely Pronounce and Counter.
In addition, differences regarding frequency in the use of Engagement types were found in both languages, although the expressions employed were formally similar. Finally,
the results of the annotation of the larger data set showed that more expressions than what was initially expected can be annotated context-independently, although regarding
some other expressions, register and collocations were seen to have a decisive influence on their interpretation of some expressions, in the same way that genre has on their
frequency of use, for resources aimed at emphasising reviewer’s personal opinion were more frequent than those who acknowledged and evaluated external sources.


Item Type:Trabajo Fin de Máster
Directors:
DirectorsDirector email
Lavid López, Julia
Uncontrolled Keywords:Online book reviews; Expressions of Engagement; Computational linguistic; Natural Language Processing; Engagement in English and Spanish
Subjects:Humanities > Philology > English philology
Humanities > Philology > Linguistics
ID Code:13754
Deposited On:07 Nov 2011 09:50
Last Modified:06 Feb 2014 09:52

Repository Staff Only: item control page