Please use this identifier to cite or link to this item:
http://hdl.handle.net/10071/28590
Author(s): | Vicente, M. Carvalho, J. P. Batista, F. |
Editor: | José-Luis Sierra-Rodríguez José-Paulo Leal Alberto Simões |
Date: | 2015 |
Title: | Using unstructured profile information for gender classification of Portuguese and English |
Volume: | 563 |
Book title/volume: | SLATE 2015: 4th International Symposium on Languages, Applications and Technologies: Languages, Applications and Technologies |
Pages: | 57 - 64 |
Reference: | Vicente, M., Carvalho, J. P., & Batista, F. (2015). Using unstructured profile information for gender classification of Portuguese and English. EM J. L. Sierra-Rodríguez, J. P. Leal, & A. Simões (Eds.). SLATE 2015: 4th International Symposium on Languages, Applications and Technologies: Languages, Applications and Technologies (pp. 57-64). Springer. https://doi.org/10.1007/978-3-319-27653-3_6 |
ISBN: | 978-3-319-27653-3 |
DOI (Digital Object Identifier): | 10.1007/978-3-319-27653-3_6 |
Keywords: | Twitter users Gender detection Fuzzy c-Means Supervised methods Unsupervised methods |
Abstract: | This paper reports experiments on automatically detecting the gender of Twitter users, based on unstructured information found on their Twitter profile. A set of features previously proposed is evaluated on two datasets of English and Portuguese users, and their performance is assessed using several supervised and unsupervised approaches, including Naive Bayes variants, Logistic Regression, Support Vector Machines, Fuzzy c-Means clustering, and k-means. Results show that features perform well in both languages separately, but even best results were achieved when combining both languages. Supervised approaches reached 97.9 % accuracy, but Fuzzy c-Means also proved suitable for this task achieving 96.4 % accuracy. |
Peerreviewed: | yes |
Access type: | Open Access |
Appears in Collections: | IT-CRI - Comunicações a conferências internacionais |
Files in This Item:
File | Size | Format | |
---|---|---|---|
conferenceObject_26082.pdf | 291,58 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.