Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/7049
Registo completo
Campo DCValorIdioma
dc.contributor.authorBatista, F.-
dc.contributor.authorMoniz, H.-
dc.contributor.authorTrancoso, I.-
dc.contributor.authorMamede, N.-
dc.date.accessioned2014-05-02T14:42:32Z-
dc.date.available2014-05-02T14:42:32Z-
dc.date.issued2012-
dc.identifier10.1109/TASL.2011.2159594en_US
dc.identifier.issn1558-7916por
dc.identifier.urihttps://ciencia.iscte-iul.pt/public/pub/id/6531en_US
dc.identifier.urihttp://hdl.handle.net/10071/7049-
dc.descriptionWOS:000299525800012 (Nº de Acesso Web of Science)-
dc.description“Prémio Científico ISCTE-IUL 2013”-
dc.description.abstractThis paper focuses on the tasks of recovering capitalization and punctuation marks from texts without that information, such as spoken transcripts, produced by automatic speech recognition systems. These two practical rich transcription tasks were performed using the same discriminative approach, based on maximum entropy, suitable for on-the-fly usage. Reported experiments were conducted both over Portuguese and English broadcast news data. Both force aligned and automatic transcripts were used, allowing to measure the impact of the speech recognition errors. Capitalized words and named entities are intrinsically related, and are influenced by time variation effects. For that reason, the so-called language dynamics have been addressed for the capitalization task. Language adaptation results indicate, for both languages, that the capitalization performance is affected by the temporal distance between the training and testing data. In what regards the punctuation task, this paper covers the three most frequent punctuation marks: full stop, comma, and question marks. Different methods were explored for improving the baseline results for full stop and comma. The first uses punctuation information extracted from large written corpora. The second applies different levels of linguistic structure, including lexical, prosodic, and speaker related features. The comma detection improved significantly in the first method, thus indicating that it depends more on lexical features. The second method provided even better results, for both languages and both punctuation marks, best results being achieved mainly for full stop. As for question marks, there is a small gain, but differences are not very significant, due to the relatively small number of question marks in the corpora.por
dc.language.isoengpor
dc.publisherIEEE Signal Processing Societypor
dc.rightsembargoedAccesspor
dc.subjectAutomatic speech processingpor
dc.subjectCapitalizationpor
dc.subjectLanguage dynamicspor
dc.subjectNatural language processingpor
dc.subjectPunctuation markspor
dc.subjectRich transcriptionpor
dc.titleBilingual Experiments on Automatic Recovery of Capitalization and Punctuation of Automatic Speech Transcriptspor
dc.typearticleen_US
dc.pagination474-485por
dc.publicationstatusPublicadopor
dc.peerreviewedSimpor
dc.relation.publisherversionThe definitive version is available at IEEE: http://dx.doi.org/10.1109/TASL.2011.2159594por
dc.journalIEEE Transactions on Audio, Speech, and Language Processingpor
dc.distributionInternacionalpor
dc.volume20por
dc.number2por
degois.publication.firstPage474por
degois.publication.lastPage485por
degois.publication.issue2por
degois.publication.titleIEEE Transactions on Audio, Speech, and Language Processingpor
dc.date.updated2014-05-02T14:34:58Z-
Aparece nas coleções:CTI-RI - Artigos em revistas científicas internacionais com arbitragem científica

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
Batista 2012 IEEE.pdf
  Restricted Access
925,74 kBAdobe PDFVer/Abrir Request a copy


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.