Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/29220
Registo completo
Campo DCValorIdioma
dc.contributor.authorFreitas, J.-
dc.contributor.authorTeixeira, A.-
dc.contributor.authorDias, M. S.-
dc.contributor.editorBilmes, J., Fosler-Lussier, E., Hasegawa-Johnson, M., and Livescu, K.-
dc.date.accessioned2023-08-30T14:09:41Z-
dc.date.available2023-08-30T14:09:41Z-
dc.date.issued2013-
dc.identifier.citationFreitas, J., Teixeira, A., & Dias, M. S. (2013). Multimodal silent speech interface based on video, depth, surface electromyography and ultrasonic Doppler: Data collection and first recognition results. In J. Bilmes, E. Fosler-Lussier, M. Hasegawa-Johnson, & K. Livescu (Eds.), Workshop on Speech Production in Automatic Speech Recognition (SPASR-2013) (pp. 44-49). International Speech and Communication Association. https://www.isca-speech.org/archive/spasr_2013/freitas13_spasr.html-
dc.identifier.issn2308-457X-
dc.identifier.urihttp://hdl.handle.net/10071/29220-
dc.description.abstractSilent Speech Interfaces use data from the speech production process, such as visual information of face movements. However, using a single modality limits the amount of available information. In this study we start to explore the use of multiple data input modalities in order to acquire a more complete representation of the speech production model. We have selected 4 non-invasive modalities – Visual data from Video and Depth, Surface Electromyography and Ultrasonic Doppler - and created a system that explores the synchronous combination of all 4, or of a subset of them, into a multimodal Silent Speech Interface (SSI). This paper describes the system design, data collection and first word recognition results. As the first acquired corpora are necessarily small for this SSI, we use for classification an example based recognition approach based on Dynamic Time Warping followed by a weighted k-Nearest Neighbor classifier. The first classification results using different vocabularies, with digits, a small set of commands related to Ambient Assisted Living and minimal nasal pairs, show that word recognition benefits can be obtained from a multimodal approach.eng
dc.language.isoeng-
dc.publisherInternational Speech and Communication Association-
dc.relationinfo:eu-repo/grantAgreement/FCT/6820 - DCRRNI ID/PEst-C%2FEEI%2FUI0127%2F2011/PT-
dc.relationinfo:eu-repo/grantAgreement/EC/FP7/251415/EU-
dc.relationinfo:eu-repo/grantAgreement/FCT/5876-PPCDTI/PTDC%2FEEA-PLP%2F098298%2F2008/PT-
dc.relationFCOMP-01-0124-FEDER-022682-
dc.relationFP7-PEOPLE-2009-IAP-
dc.relation.ispartofWorkshop on Speech Production in Automatic Speech Recognition (SPASR-2013)-
dc.rightsopenAccess-
dc.subjectSilent speech interfaceseng
dc.subjectMultimodaleng
dc.subjectVideo and depth informationeng
dc.subjectSurface electromyographyeng
dc.subjectUltrasonic doppler sensingeng
dc.titleMultimodal silent speech interface based on video, depth, surface electromyography and ultrasonic Doppler: Data collection and first recognition resultseng
dc.typeconferenceObject-
dc.event.titleWorkshop on Speech Production in Automatic Speech Recognition (SPASR-2013)-
dc.event.typeWorkshoppt
dc.event.locationLyoneng
dc.event.date2013-
dc.pagination44 - 49-
dc.peerreviewedyes-
dc.date.updated2023-08-30T15:06:56Z-
dc.description.versioninfo:eu-repo/semantics/publishedVersion-
dc.subject.fosDomínio/Área Científica::Ciências Naturais::Ciências da Computação e da Informaçãopor
dc.subject.fosDomínio/Área Científica::Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informáticapor
dc.subject.fosDomínio/Área Científica::Humanidades::Línguas e Literaturaspor
iscte.subject.odsIndústria, inovação e infraestruturaspor
iscte.subject.odsReduzir as desigualdadespor
iscte.identifier.cienciahttps://ciencia.iscte-iul.pt/id/ci-pub-96464-
Aparece nas coleções:ISTAR-CRI - Comunicações a conferências internacionais

Ficheiros deste registo:
Ficheiro TamanhoFormato 
conferenceobject_96464.pdf1,08 MBAdobe PDFVer/Abrir


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.