Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/27854
Registo completo
Campo DCValorIdioma
dc.contributor.authorMedeiros, H.-
dc.contributor.authorBatista, F.-
dc.contributor.authorMoniz, H.-
dc.contributor.authorTrancoso, I.-
dc.contributor.authorNunes, L.-
dc.contributor.editorLeal, J. P., Rocha, R., and Simões, A.-
dc.date.accessioned2023-02-13T11:07:07Z-
dc.date.available2023-02-13T11:07:07Z-
dc.date.issued2013-01-01-
dc.identifier.citationMedeiros, H., Batista, F., Moniz, H., Trancoso, I., & Nunes, L. (2013). Comparing different machine learning approaches for disfluency structure detection in a corpus of university lectures. In J. P. Leal, R. Rocha, & A. Simões (Eds.), 2nd Symposium on Languages, Applications and Technologies, SLATE 2013 (vol. 29, pp. 259-269). OASIcs. https://doi.org/10.4230/OASIcs.SLATE.2013.259-
dc.identifier.isbn978-3-939897-52-1-
dc.identifier.issn2190-6807-
dc.identifier.urihttp://hdl.handle.net/10071/27854-
dc.description.abstractThis paper presents a number of experiments focusing on assessing the performance of different machine learning methods on the identification of disfluencies and their distinct structural regions over speech data. Several machine learning methods have been applied, namely Naive Bayes, Logistic Regression, Classification and Regression Trees (CARTs), J48 and Multilayer Perceptron. Our experiments show that CARTs outperform the other methods on the identification of the distinct structural disfluent regions. Reported experiments are based on audio segmentation and prosodic features, calculated from a corpus of university lectures in European Portuguese, containing about 32h of speech and about 7.7% of disfluencies. The set of features automatically extracted from the forced alignment corpus proved to be discriminant of the regions contained in the production of a disfluency. This work shows that using fully automatic prosodic features, disfluency structural regions can be reliably identified using CARTs, where the best results achieved correspond to 81.5% precision, 27.6% recall, and 41.2% F-measure. The best results concern the detection of the interregnum, followed by the detection of the interruption point.eng
dc.language.isoeng-
dc.publisherOASIcs-
dc.relationinfo:eu-repo/grantAgreement/FCT/PIDDAC/SFRH%2FBD%2F44671%2F2008/PT-
dc.relationinfo:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/PEst-OE%2FEEI%2FLA0021%2F2011/PT-
dc.relationinfo:eu-repo/grantAgreement/FCT/3599-PPCDT/CMU-PT%2FHuMach%2F0039%2F2008/PT-
dc.relation.ispartof2nd Symposium on Languages, Applications and Technologies, SLATE 2013-
dc.rightsopenAccess-
dc.subjectMachine learningeng
dc.subjectSpeech processingeng
dc.subjectProsodic featureseng
dc.subjectAutomatic detection of disfluencieseng
dc.titleComparing different machine learning approaches for disfluency structure detection in a corpus of university lectureseng
dc.typeconferenceObject-
dc.event.title2nd Symposium on Languages, Applications and Technologies, SLATE 2013-
dc.event.typeConferênciapt
dc.event.locationPortoeng
dc.event.date2013-
dc.pagination259 - 269-
dc.peerreviewedyes-
dc.volume29-
dc.date.updated2023-02-13T11:04:08Z-
dc.description.versioninfo:eu-repo/semantics/publishedVersion-
dc.identifier.doi10.4230/OASIcs.SLATE.2013.259-
dc.subject.fosDomínio/Área Científica::Ciências Naturais::Matemáticaspor
dc.subject.fosDomínio/Área Científica::Ciências Sociais::Geografia Económica e Socialpor
iscte.identifier.cienciahttps://ciencia.iscte-iul.pt/id/ci-pub-12189-
iscte.alternateIdentifiers.scopus2-s2.0-84893245717-
Aparece nas coleções:ISTAR-CRI - Comunicações a conferências internacionais
IT-CRI - Comunicações a conferências internacionais

Ficheiros deste registo:
Ficheiro TamanhoFormato 
conferenceobject_12189.pdf413 kBAdobe PDFVer/Abrir


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.