Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/14185
Registo completo
Campo DCValorIdioma
dc.contributor.authorBatista, F.-
dc.contributor.authorMoniz, H.-
dc.contributor.authorTrancoso, I.-
dc.contributor.authorMamede, N.-
dc.contributor.authorMata, A. I.-
dc.date.accessioned2017-07-28T09:50:11Z-
dc.date.available2017-07-28T09:50:11Z-
dc.date.issued2012-
dc.identifier.issn2236-9740por
dc.identifier.urihttps://ciencia.iscte-iul.pt/id/ci-pub-9070-
dc.identifier.urihttp://hdl.handle.net/10071/14185-
dc.description.abstractThis paper describes a framework that extends automatic speech transcripts in order to accommodate relevant information coming from manual transcripts, the speech signal itself, and other resources, like lexica. The proposed framework automatically collects, relates, computes, and stores all relevant information together in a self-contained data source, making it possible to easily provide a wide range of interconnected information suitable for speech analysis, training, and evaluating a number of automatic speech processing tasks. The main goal of this framework is to integrate different linguistic and paralinguistic layers of knowledge for a more complete view of their representation and interactions in several domains and languages. The processing chain is composed of two main stages, where the first consists of integrating the relevant manual annotations in the speech recognition data, and the second consists of further enriching the previous output in order to accommodate prosodic information. The described framework has been used for the identification and analysis of structural metadata in automatic speech transcripts. Initially put to use for automatic detection of punctuation marks and for capitalization recovery from speech data, it has also been recently used for studying the characterization of disfluencies in speech. It was already applied to several domains of Portuguese corpora, and also to English and Spanish Broadcast News corpora.por
dc.language.isoengpor
dc.publisherLuso-Brazilian Association of Speech Sciencespor
dc.relationinfo:eu-repo/grantAgreement/FCT/SFRH/SFRH%2FBD%2F44671%2F2008/PTpor
dc.relationinfo:eu-repo/grantAgreement/FCT/3599-PPCDT/83853/PTpor
dc.relationinfo:eu-repo/grantAgreement/FCT/3599-PPCDT/120017/PTpor
dc.relationPEst-OE/EEI/LA0021/2011por
dc.rightsopenAccesspor
dc.subjectAutomatic speech processingpor
dc.subjectSpeech alignmentpor
dc.subjectStructural metadatapor
dc.subjectSpeech prosodypor
dc.subjectSpeech data representationpor
dc.subjectMultiple-domain speech corporapor
dc.subjectCross-language speech processingpor
dc.titleExtending automatic transcripts in a unified data representation towards a prosodic-based metadata annotation and evaluationpor
dc.typearticleen_US
dc.pagination115-138por
dc.publicationstatusPublicadopor
dc.peerreviewedyespor
dc.journalJournal of Speech Sciencespor
dc.distributionInternacionalpor
dc.volume2por
dc.number2por
degois.publication.firstPage115por
degois.publication.lastPage138por
degois.publication.issue2por
degois.publication.titleJournal of Speech Sciencespor
dc.date.updated2017-07-28T09:49:05Z-
Aparece nas coleções:CTI-RI - Artigos em revistas científicas internacionais com arbitragem científica

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
Extending Automatic Transcripts in a Unified Data Representation.pdf743,25 kBAdobe PDFVer/Abrir


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.