Please use this identifier to cite or link to this item:
http://hdl.handle.net/10071/22744
Author(s): | Pinto, A.; Moniz, H.; Batista, F. |
Editor: | Alberto Simões, Pedro Rangel Henriques, Ricardo Queirós |
Date: | 2020 |
Title: | Detection of emerging words in Portuguese tweets |
Volume: | 83 |
Pages: | 3:1 - 3:10 |
Event title: | 9th Symposium on Languages, Applications and Technologies (SLATE 2020) |
ISSN: | 2190-6807 |
ISBN: | 978-3-95977-165-8 |
DOI (Digital Object Identifier): | 10.4230/OASIcs.SLATE.2020.3 |
Keywords: | Emerging words; Portuguese language |
Abstract: | This paper tackles the problem of detecting emerging words in a language, based on social network content. It proposes an approach for detecting new words on Twitter and reports the results achieved on a collection of 8 million Portuguese tweets. The study uses geolocated tweets written in Portuguese territory and collected between January 2018 and June 2019. The first six months of data were used to define an initial vocabulary of known words, and the following 12 months were used to identify new words, thus testing the approach. The resulting set of words was manually analyzed, revealing a number of distinct events and suggesting that Twitter may be a valuable resource for researching neology and the dynamics of a language. |
Peer reviewed: | yes |
Access type: | Open Access |
Appears in Collections: | CTI-CRI - Comunicações a conferências internacionais |
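
The split described in the abstract (an initial vocabulary built from the first six months, with new words sought in the following twelve) can be illustrated with a minimal sketch. This is not the authors' implementation: the tokenizer, the `emerging_words` function name, the `min_count` threshold, and the sample tweets are all assumptions made for illustration, and the record does not describe the filtering steps the paper actually applies.

```python
import re
from collections import Counter
from datetime import date

# Assumed tokenizer: runs of letters (accented letters included), lowercased.
# A real pipeline would also need to handle URLs, mentions, and hashtags.
TOKEN_RE = re.compile(r"[^\W\d_]+")


def tokenize(text):
    return TOKEN_RE.findall(text.lower())


def emerging_words(tweets, split_date, min_count=2):
    """Return candidate new words with their counts.

    tweets: iterable of (date, text) pairs.
    Tokens seen before split_date form the known vocabulary; tokens seen
    on or after it are candidates. min_count is a hypothetical threshold
    that drops one-off tokens such as typos.
    """
    known = set()
    candidates = Counter()
    for day, text in tweets:
        tokens = tokenize(text)
        if day < split_date:
            known.update(tokens)
        else:
            candidates.update(tokens)
    return {
        word: count
        for word, count in candidates.items()
        if word not in known and count >= min_count
    }


# Invented sample data; the split date follows the abstract's
# "first six months" of a collection starting in January 2018.
sample = [
    (date(2018, 1, 5), "Bom dia Lisboa"),
    (date(2018, 8, 1), "bom dia covfefe"),
    (date(2018, 9, 1), "Covfefe outra vez"),
    (date(2018, 9, 2), "outra"),
]
new_words = emerging_words(sample, split_date=date(2018, 7, 1))
```

Filtering against the known vocabulary only after all tweets have been read means the input does not have to be sorted by date.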
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
conferenceobject_74086.pdf | Publisher version | 1.36 MB | Adobe PDF |