Extending automatic transcripts in a unified data representation towards a prosodic-based metadata annotation and evaluation

Batista, F.; Moniz, H.; Trancoso, I.; Mamede, N.; Mata, A. I.

Please use this identifier to cite or link to this record: http://hdl.handle.net/10071/14185

Full record

DC Field	Value	Language
dc.contributor.author	Batista, F.	-
dc.contributor.author	Moniz, H.	-
dc.contributor.author	Trancoso, I.	-
dc.contributor.author	Mamede, N.	-
dc.contributor.author	Mata, A. I.	-
dc.date.accessioned	2017-07-28T09:50:11Z	-
dc.date.available	2017-07-28T09:50:11Z	-
dc.date.issued	2012	-
dc.identifier.issn	2236-9740	por
dc.identifier.uri	https://ciencia.iscte-iul.pt/id/ci-pub-9070	-
dc.identifier.uri	http://hdl.handle.net/10071/14185	-
dc.description.abstract	This paper describes a framework that extends automatic speech transcripts in order to accommodate relevant information coming from manual transcripts, the speech signal itself, and other resources, like lexica. The proposed framework automatically collects, relates, computes, and stores all relevant information together in a self-contained data source, making it possible to easily provide a wide range of interconnected information suitable for speech analysis, training, and evaluating a number of automatic speech processing tasks. The main goal of this framework is to integrate different linguistic and paralinguistic layers of knowledge for a more complete view of their representation and interactions in several domains and languages. The processing chain is composed of two main stages, where the first consists of integrating the relevant manual annotations in the speech recognition data, and the second consists of further enriching the previous output in order to accommodate prosodic information. The described framework has been used for the identification and analysis of structural metadata in automatic speech transcripts. Initially put to use for automatic detection of punctuation marks and for capitalization recovery from speech data, it has also been recently used for studying the characterization of disfluencies in speech. It was already applied to several domains of Portuguese corpora, and also to English and Spanish Broadcast News corpora.	por
dc.language.iso	eng	por
dc.publisher	Luso-Brazilian Association of Speech Sciences	por
dc.relation	info:eu-repo/grantAgreement/FCT/SFRH/SFRH%2FBD%2F44671%2F2008/PT	por
dc.relation	info:eu-repo/grantAgreement/FCT/3599-PPCDT/83853/PT	por
dc.relation	info:eu-repo/grantAgreement/FCT/3599-PPCDT/120017/PT	por
dc.relation	PEst-OE/EEI/LA0021/2011	por
dc.rights	openAccess	por
dc.subject	Automatic speech processing	por
dc.subject	Speech alignment	por
dc.subject	Structural metadata	por
dc.subject	Speech prosody	por
dc.subject	Speech data representation	por
dc.subject	Multiple-domain speech corpora	por
dc.subject	Cross-language speech processing	por
dc.title	Extending automatic transcripts in a unified data representation towards a prosodic-based metadata annotation and evaluation	por
dc.type	article	en_US
dc.pagination	115-138	por
dc.publicationstatus	Published	por
dc.peerreviewed	yes	por
dc.journal	Journal of Speech Sciences	por
dc.distribution	International	por
dc.volume	2	por
dc.number	2	por
degois.publication.firstPage	115	por
degois.publication.lastPage	138	por
degois.publication.issue	2	por
degois.publication.title	Journal of Speech Sciences	por
dc.date.updated	2017-07-28T09:49:05Z	-
Appears in collections:	CTI-RI - Articles in international peer-reviewed scientific journals