Utilize este identificador para referenciar este registo:
http://hdl.handle.net/10071/25398
Autoria: | Barreiro, A. Batista, F. |
Data: | 2016 |
Título próprio: | Machine translation of non-contiguous multiword units |
Paginação: | 22 - 30 |
Título do evento: | Proceedings of a meeting held 17 June 2016, San Diego, California, USA. Held at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2016) |
ISBN: | 978-1-5108-2521-5 |
Resumo: | Non-adjacent linguistic phenomena such as non-contiguous multiwords and other phrasal units containing insertions, i.e., words that are not part of the unit, are difficult to process and remain a problem for NLP applications. Non-contiguous multiword units are common across languages and constitute some of the most important challenges to high quality machine translation. This paper presents an empirical analysis of non-contiguous multiwords, and highlights our use of the Logos Model and the Semtab function to deploy semantic knowledge to align non-contiguous multiword units with the goal to translate these units with high fidelity. The phrase level manual alignments illustrated in the paper were produced with the CLUE-Aligner, a Cross-Language Unit Elicitation alignment tool. |
Arbitragem científica: | yes |
Acesso: | Acesso Aberto |
Aparece nas coleções: | IT-CRI - Comunicações a conferências internacionais |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
conference_object_28921.pdf | Versão Aceite | 572,2 kB | Adobe PDF | Ver/Abrir |
Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.