Please use this identifier to cite or link to this item: http://hdl.handle.net/10071/25096
Author(s): Vicente, M.
Batista, F.
Carvalho, J.
Editor: Adnan Yazici, Nikhil R. Pal, Uzat Kaymak
Date: 2015
Title: Twitter gender classification using user unstructured information
ISSN: 1544-5615
ISBN: 978-1-4673-7428-6
DOI (Digital Object Identifier): 10.1109/FUZZ-IEEE.2015.7338102
Keywords: Twitter
Gender detection
Fuzzy c-means
Supervised and unsupervised methods
Abstract: This paper describes an approach to automatically detect the gender of Twitter users, based only on clues provided by their profile information in an unstructured form. A number of features that capture phenomena specific of Twitter users is proposed and evaluated on a dataset of about 242K English language users. Different supervised and unsupervised approaches are used to assess the performance of the proposed features, including Naive Bayes variants, Logistic Regression, Support Vector Machines, Fuzzy c-Means clustering, and K-means. An unsupervised approach based on Fuzzy c-Means proved to be very suitable for this task, returning the correct gender for about 96% of the users.
Peerreviewed: yes
Access type: Open Access
Appears in Collections:IT-CRI - Comunicações a conferências internacionais

Files in This Item:
File Description SizeFormat 
conferenceobject_24678.pdfVersão Aceite139,68 kBAdobe PDFView/Open


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.