Utilize este identificador para referenciar este registo:
http://hdl.handle.net/10071/22744
Autoria: | Pinto, A. Moniz, H. Batista, F. |
Editor: | Alberto Simões, Pedro Rangel Henriques, Ricardo Queirós |
Data: | 2020 |
Título próprio: | Detection of emerging words in Portuguese tweets |
Volume: | 83 |
Paginação: | 3:1 - 3:10 |
Título do evento: | 9th Symposium on Languages, Applications and Technologies (SLATE 2020) |
ISSN: | 2190-6807 |
ISBN: | 978-3-95977-165-8 |
DOI (Digital Object Identifier): | 10.4230/OASIcs.SLATE.2020.3 |
Palavras-chave: | Emerging words Portuguese language |
Resumo: | This paper tackles the problem of detecting emerging words on a language, based on social networks content. It proposes an approach for detecting new words on Twitter, and reports the achieved results for a collection of 8 million Portuguese tweets. This study uses geolocated tweets, collected between January 2018 and June 2019, and written in the Portuguese territory. The first six months of the data were used to define an initial vocabulary on known words, and the following 12 months were used for identifying new words, thus testing our approach. The set of resulting words were manually analyzed, revealing a number of distinct events, and suggesting that Twitter may be a valuable resource for researching neology, and the dynamics of a language. |
Arbitragem científica: | yes |
Acesso: | Acesso Aberto |
Aparece nas coleções: | CTI-CRI - Comunicações a conferências internacionais |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
conferenceobject_74086.pdf | Versão Editora | 1,36 MB | Adobe PDF | Ver/Abrir |
Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.