Utilize este identificador para referenciar este registo:
http://hdl.handle.net/10071/22743
Autoria: | Filipe, S. Batista, F. Ribeiro, R. |
Editor: | Alberto Simões, Pedro Rangel Henriques, Ricardo Queirós |
Data: | 2020 |
Título próprio: | Different lexicon-based approaches to emotion identification in Portuguese tweets |
Volume: | 83 |
Paginação: | 12:1 - 12:8 |
Título do evento: | 9th Symposium on Languages, Applications and Technologies (SLATE 2020) |
ISSN: | 2190-6807 |
ISBN: | 978-3-95977-165-8 |
DOI (Digital Object Identifier): | 10.4230/OASIcs.SLATE.2020.12 |
Palavras-chave: | Emotion detection Emotion lexicon Portuguese language Tweets |
Resumo: | This paper presents the existing literature on the identification of emotions and describes various lexica-based approaches and translation strategies to identify emotions in Portuguese tweets. A dataset of tweets was manually annotated to evaluate our classifier and also to assess the difficulty of the task. A lexicon-based approach was used in order to classify the presence or absence of eight different emotions in a tweet. Different strategies have been applied to refine and improve an existing and widely used lexicon, by means of automatic machine translation and aligned word embeddings. We tested six different classification approaches, exploring different ways of directly applying resources available for English by means of different translation strategies. The achieved results suggest that a better performance can be obtained both by improving a lexicon and by directly translating tweets into English and then applying an existing English lexicon. |
Arbitragem científica: | yes |
Acesso: | Acesso Aberto |
Aparece nas coleções: | CTI-CRI - Comunicações a conferências internacionais |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
conferenceobject_74084.pdf | Versão Editora | 395,57 kB | Adobe PDF | Ver/Abrir |
Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.