Multimodal silent speech interface based on video, depth, surface electromyography and ultrasonic Doppler: Data collection and first recognition results

Freitas, J.; Teixeira, A.; Dias, M. S.

Use this identifier to cite this record: http://hdl.handle.net/10071/29220

Full record

DC Field	Value	Language
dc.contributor.author	Freitas, J.	-
dc.contributor.author	Teixeira, A.	-
dc.contributor.author	Dias, M. S.	-
dc.contributor.editor	Bilmes, J., Fosler-Lussier, E., Hasegawa-Johnson, M., and Livescu, K.	-
dc.date.accessioned	2023-08-30T14:09:41Z	-
dc.date.available	2023-08-30T14:09:41Z	-
dc.date.issued	2013	-
dc.identifier.citation	Freitas, J., Teixeira, A., & Dias, M. S. (2013). Multimodal silent speech interface based on video, depth, surface electromyography and ultrasonic Doppler: Data collection and first recognition results. In J. Bilmes, E. Fosler-Lussier, M. Hasegawa-Johnson, & K. Livescu (Eds.), Workshop on Speech Production in Automatic Speech Recognition (SPASR-2013) (pp. 44-49). International Speech and Communication Association. https://www.isca-speech.org/archive/spasr_2013/freitas13_spasr.html	-
dc.identifier.issn	2308-457X	-
dc.identifier.uri	http://hdl.handle.net/10071/29220	-
dc.description.abstract	Silent Speech Interfaces use data from the speech production process, such as visual information of face movements. However, using a single modality limits the amount of available information. In this study we start to explore the use of multiple data input modalities in order to acquire a more complete representation of the speech production model. We have selected four non-invasive modalities (visual data from Video and Depth, Surface Electromyography and Ultrasonic Doppler) and created a system that explores the synchronous combination of all four, or of a subset of them, into a multimodal Silent Speech Interface (SSI). This paper describes the system design, data collection and first word recognition results. As the first acquired corpora are necessarily small for this SSI, we use for classification an example-based recognition approach that applies Dynamic Time Warping followed by a weighted k-Nearest Neighbor classifier. The first classification results using different vocabularies, with digits, a small set of commands related to Ambient Assisted Living and minimal nasal pairs, show that word recognition benefits can be obtained from a multimodal approach.	eng
dc.language.iso	eng	-
dc.publisher	International Speech and Communication Association	-
dc.relation	info:eu-repo/grantAgreement/FCT/6820 - DCRRNI ID/PEst-C%2FEEI%2FUI0127%2F2011/PT	-
dc.relation	info:eu-repo/grantAgreement/EC/FP7/251415/EU	-
dc.relation	info:eu-repo/grantAgreement/FCT/5876-PPCDTI/PTDC%2FEEA-PLP%2F098298%2F2008/PT	-
dc.relation	FCOMP-01-0124-FEDER-022682	-
dc.relation	FP7-PEOPLE-2009-IAP	-
dc.relation.ispartof	Workshop on Speech Production in Automatic Speech Recognition (SPASR-2013)	-
dc.rights	openAccess	-
dc.subject	Silent speech interfaces	eng
dc.subject	Multimodal	eng
dc.subject	Video and depth information	eng
dc.subject	Surface electromyography	eng
dc.subject	Ultrasonic doppler sensing	eng
dc.title	Multimodal silent speech interface based on video, depth, surface electromyography and ultrasonic Doppler: Data collection and first recognition results	eng
dc.type	conferenceObject	-
dc.event.title	Workshop on Speech Production in Automatic Speech Recognition (SPASR-2013)	-
dc.event.type	Workshop	pt
dc.event.location	Lyon	eng
dc.event.date	2013	-
dc.pagination	44 - 49	-
dc.peerreviewed	yes	-
dc.date.updated	2023-08-30T15:06:56Z	-
dc.description.version	info:eu-repo/semantics/publishedVersion	-
dc.subject.fos	Scientific Domain/Area::Natural Sciences::Computer and Information Sciences	eng
dc.subject.fos	Scientific Domain/Area::Engineering and Technology::Electrical Engineering, Electronic Engineering, Information Engineering	eng
dc.subject.fos	Scientific Domain/Area::Humanities::Languages and Literature	eng
iscte.subject.ods	Industry, innovation and infrastructure	eng
iscte.subject.ods	Reduced inequalities	eng
iscte.identifier.ciencia	https://ciencia.iscte-iul.pt/id/ci-pub-96464	-
Appears in collections:	ISTAR-CRI - Communications at international conferences
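
The abstract describes an example-based recognizer: Dynamic Time Warping (DTW) followed by a weighted k-Nearest Neighbor classifier. The following is a minimal sketch of that general idea, not the authors' implementation. The function names (`dtw_distance`, `classify`), the Euclidean frame distance, the inverse-distance vote weighting, and the value of `k` are all assumptions made for illustration; the record does not specify them, nor how features from the four modalities are combined.

```python
import math
from collections import defaultdict


def dtw_distance(a, b):
    """DTW cost between two sequences of equal-dimension feature vectors.

    Assumption: Euclidean distance between frames and the standard
    three-way step pattern (insertion, deletion, match).
    """
    n, m = len(a), len(b)
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # insertion
                                 cost[i][j - 1],      # deletion
                                 cost[i - 1][j - 1])  # match
    return cost[n][m]


def classify(query, examples, k=3, eps=1e-9):
    """Label a query sequence by a weighted vote of its k nearest examples.

    examples: list of (sequence, label) pairs.
    Assumption: each neighbor votes with weight 1 / (distance + eps).
    """
    scored = sorted((dtw_distance(query, seq), label) for seq, label in examples)
    votes = defaultdict(float)
    for dist, label in scored[:k]:
        votes[label] += 1.0 / (dist + eps)
    return max(votes, key=votes.get)


# Toy one-dimensional "feature" sequences standing in for real sensor data.
examples = [
    ([[0.0], [1.0], [2.0]], "one"),
    ([[0.0], [1.0], [1.0], [2.0]], "one"),
    ([[2.0], [1.0], [0.0]], "two"),
]
query = [[0.0], [0.0], [1.0], [2.0]]
predicted = classify(query, examples, k=3)
```

An example-based approach of this kind needs no model training, which fits the abstract's remark that the first acquired corpora are necessarily small.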

Files in this record:

File	Size	Format
conferenceobject_96464.pdf	1.08 MB	Adobe PDF
