Use this identifier to cite or link to this record: http://hdl.handle.net/10071/25451
Full record
DC Field | Value | Language
dc.contributor.author | Dias, J. | -
dc.contributor.author | Pellegrini, T. | -
dc.contributor.author | Hedayati, V. | -
dc.contributor.author | Trancoso, I. | -
dc.contributor.author | Hämäläinen, A. | -
dc.contributor.editor | Chng E.S., Li H., Meng H., Ma B. and Xie L. | -
dc.date.accessioned | 2022-05-19T09:49:31Z | -
dc.date.available | 2022-05-19T09:49:31Z | -
dc.date.issued | 2014-01-01 | -
dc.identifier.isbn | 9781634394352 | -
dc.identifier.issn | 2308-457X | -
dc.identifier.uri | http://hdl.handle.net/10071/25451 | -
dc.description.abstract | Phone-like acoustic models (AMs) used in large-vocabulary automatic speech recognition (ASR) systems are usually trained with speech collected from young adult speakers. Using such models, ASR performance may decrease by about 10% absolute when transcribing elderly speech. Ageing is known to alter speech production in ways that require ASR systems to be adapted, in particular at the level of acoustic modeling. In this study, we investigated automatic age estimation in order to select age-specific adapted AMs. A large corpus of read speech from European Portuguese speakers aged 60 or over was used. Age estimation (AE) based on i-vectors and support vector regression achieved mean error rates of about 4.2 and 4.5 years for males and females, respectively. Compared with a baseline ASR system with AMs trained using young adult speech and a WER of 13.9%, the selection of five-year-range adapted AMs, based on the estimated age of the speakers, led to a decrease in WER of about 9.3% relative (1.3% absolute). Comparable gains in ASR performance were observed when considering two larger age ranges (60-75 and 76-90) instead of six five-year ranges, suggesting that it would be sufficient to use the two large ranges only. | eng
dc.language.iso | eng | -
dc.publisher | International Speech and Communication Association | -
dc.relation | UID/MULTI/0446/2013 | -
dc.rights | openAccess | -
dc.subject | Automatic speech recognition | eng
dc.subject | Elderly speech | eng
dc.subject | Automatic age estimation | eng
dc.subject | I-vector extraction | eng
dc.title | Speaker age estimation for elderly speech recognition in European Portuguese | eng
dc.type | conferenceObject | -
dc.event.title | Celebrating the Diversity of Spoken Languages | -
dc.event.type | Conferência (Conference) | pt
dc.event.location | Singapore | eng
dc.event.date | 2014 | -
dc.peerreviewed | yes | -
dc.journal | 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014) | -
degois.publication.location | Singapore | eng
degois.publication.title | Speaker age estimation for elderly speech recognition in European Portuguese | eng
dc.date.updated | 2022-05-19T10:48:25Z | -
dc.description.version | info:eu-repo/semantics/acceptedVersion | -
dc.subject.fos | Domínio/Área Científica::Ciências Naturais::Ciências Físicas (Scientific Domain/Area::Natural Sciences::Physical Sciences) | por
iscte.identifier.ciencia | https://ciencia.iscte-iul.pt/id/ci-pub-22986 | -
iscte.alternateIdentifiers.scopus | 2-s2.0-84910028544 | -
Appears in collections: ISTAR-CRI - Comunicações a conferências internacionais (Papers at international conferences)

Files in this record:
File | Description | Size | Format
conferenceobject_22986f.pdf | Accepted version | 84.99 kB | Adobe PDF



All records in the repository are protected by copyright law, with all rights reserved.