Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/25540
Registo completo
Campo DCValorIdioma
dc.contributor.authorFerreira, J. P.-
dc.contributor.authorChesi, C.-
dc.contributor.authorBaldewijns, D.-
dc.contributor.authorDias, M. S.-
dc.contributor.authorBraga, D.-
dc.contributor.authorPinto, F. M.-
dc.contributor.authorCho, H.-
dc.contributor.authorCorreia, M.-
dc.contributor.authorFerreira, A.-
dc.contributor.editorCalzolari, N., Choukri, K., Declerck, T., Loftsson, H., Maegaard, B., Mariani, J., Moreno, A., Odijk, J., and Piperidis, S.-
dc.date.accessioned2022-05-25T11:45:11Z-
dc.date.available2022-05-25T11:45:11Z-
dc.date.issued2014-
dc.identifier.citationFerreira, J. P., Chesi, C., Baldewijns, D., Dias, M. S., Braga, D., Pinto, F. M., Cho, H., Correia, M., & Ferreira, A. (2014). Casa de la Lhéngua: A set of language resources and natural language processing tools for Mirandese. In N. Calzolari, K. Choukri, T. Declerck, H. Loftsson, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, & S. Piperidis (Eds.), Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014 (pp. 536-540). European Language Resources Association (ELRA). http://hdl.handle.net/10071/25540-
dc.identifier.isbn978-295174088-4-
dc.identifier.urihttp://hdl.handle.net/10071/25540-
dc.description.abstractThis paper describes the efforts for the construction of Language Resources and NLP tools for Mirandese, a minority language spoken in North-eastern Portugal, now available on a community-led portal, Casa de la Lhéngua. The resources were developed in the context of a collaborative citizenship project led by Microsoft, in the context of the creation of the first TTS system for Mirandese. Development efforts encompassed the compilation of a corpus with over 1M tokens, the construction of a GTP system, syllable-division, inflection and a Part-of-Speech (POS) tagger modules, leading to the creation of an inflected lexicon of about 200.000 entries with phonetic transcription, detailed POS tagging, syllable division, and stress mark-up. Alongside these tasks, which were made easier through the adaptation and reuse of existing tools for closely related languages, a casting for voice talents among the speaking community was conducted and the first speech database for speech synthesis was recorded for Mirandese. These resources were combined to fulfil the requirements of a well-tested statistical parameter synthesis model, leading to an intelligible voice font. These language resources are available freely at Casa de la Lhéngua, aiming at promoting the development of real-life applications and fostering linguistic research on Mirandese.eng
dc.language.isoeng-
dc.publisherEuropean Language Resources Association (ELRA)-
dc.relation.ispartofProceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014-
dc.rightsopenAccess-
dc.subjectLanguage resourceseng
dc.subjectMinority languageeng
dc.subjectMirandeseeng
dc.subjectSpeech synthesiseng
dc.subjectLexical databaseeng
dc.titleCasa de la Lhéngua: A set of language resources and natural language processing tools for Mirandeseeng
dc.typeconferenceObject-
dc.event.title9th International Conference on Language Resources and Evaluation, LREC 2014-
dc.event.typeConferênciapt
dc.event.locationReykjavik, Icelandeng
dc.event.date2014-
dc.pagination536 - 540-
dc.peerreviewedyes-
dc.journalProceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)-
degois.publication.firstPage536-
degois.publication.lastPage540-
degois.publication.locationReykjavikeng
degois.publication.titleCasa de la Lhéngua: A set of language resources and natural language processing tools for Mirandeseeng
dc.date.updated2023-06-26T13:04:03Z-
dc.description.versioninfo:eu-repo/semantics/publishedVersion-
dc.subject.fosDomínio/Área Científica::Ciências Naturais::Ciências da Computação e da Informaçãopor
dc.subject.fosDomínio/Área Científica::Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informáticapor
dc.subject.fosDomínio/Área Científica::Humanidades::Línguas e Literaturaspor
iscte.subject.odsIndústria, inovação e infraestruturaspor
iscte.subject.odsReduzir as desigualdadespor
iscte.identifier.cienciahttps://ciencia.iscte-iul.pt/id/ci-pub-96271-
iscte.alternateIdentifiers.wosWOS:WOS:000355611002022-
iscte.alternateIdentifiers.scopus2-s2.0-85037083070-
Aparece nas coleções:ISTAR-CRI - Comunicações a conferências internacionais

Ficheiros deste registo:
Ficheiro TamanhoFormato 
conferenceobject_72124.pdf17,36 MBAdobe PDFVer/Abrir


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.