Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/25098
Registo completo
Campo DCValorIdioma
dc.contributor.authorBatista, F.-
dc.contributor.authorJoão P. Carvalho-
dc.contributor.editorAdnan Yazici, Nikhil R. Pal, Uzat Kaymak-
dc.date.accessioned2022-04-08T09:25:46Z-
dc.date.available2022-04-08T09:25:46Z-
dc.date.issued2015-
dc.identifier.isbn978-1-4673-7428-6-
dc.identifier.issn1544-5615-
dc.identifier.urihttp://hdl.handle.net/10071/25098-
dc.description.abstractThis paper introduces two fuzzy fingerprint based text classification techniques that were successfully applied to automatically label companies from CrunchBase, based purely on their unstructured textual description. This is a real and very challenging problem due to the large set of possible labels (more than 40) and also to the fact that the textual descriptions do not have to abide by any criteria and are, therefore, extremely heterogeneous. Fuzzy fingerprints are a recently introduced technique that can be used for performing fast classification. They perform well in the presence of unbalanced datasets and can cope with a very large number of classes. In the paper, a comparison is performed against some of the best text classification techniques commonly used to address similar problems. When applied to the CrunchBase dataset, the fuzzy fingerprint based approach outperformed the other techniques.eng
dc.language.isoeng-
dc.publisherIEEE-
dc.relationinfo:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UID%2FCEC%2F50021%2F2013/PT-
dc.rightsopenAccess-
dc.subjectText classificationeng
dc.subjectFuzzy fingerprintseng
dc.subjectText miningeng
dc.subjectCrunchbaseeng
dc.subjectDocument classificationeng
dc.titleText based classification of companies in CrunchBaseeng
dc.typeconferenceObject-
dc.event.titleIEEE International Conference on Fuzzy Systems-
dc.event.typeConferênciapt
dc.event.locationIstambuleng
dc.event.date2015-
dc.peerreviewedyes-
dc.journalIEEE International Fuzzy Systems conference proceedings-
degois.publication.locationIstambuleng
degois.publication.titleText based classification of companies in CrunchBaseeng
dc.date.updated2022-04-08T10:22:26Z-
dc.description.versioninfo:eu-repo/semantics/submittedVersion-
dc.identifier.doi10.1109/FUZZ-IEEE.2015.7337892-
dc.subject.fosDomínio/Área Científica::Ciências Naturais::Ciências Físicaspor
iscte.identifier.cienciahttps://ciencia.iscte-iul.pt/id/ci-pub-24671-
iscte.alternateIdentifiers.scopus2-s2.0-84975687563-
Aparece nas coleções:IT-CRI - Comunicações a conferências internacionais

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
conferenceobject_24671.pdfVersão Submetida332,42 kBAdobe PDFVer/Abrir


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.