Use this identifier to cite or link to this record:
http://hdl.handle.net/10071/23672
Authors: | Rei, R.; Batista, F.; Guerreiro, N. M.; Coheur, L. |
Editors: | Benites, F., Tuggener, D., Hurlimann, M., Cieliebak, M., & Vogel, M. |
Date: | 2021 |
Title: | Multilingual simultaneous sentence end and punctuation prediction |
Volume: | 2957 |
Event title: | 2021 Swiss Text Analytics Conference, SwissText 2021 |
ISSN: | 1613-0073 |
Abstract: | This paper describes the model and its corresponding setup, proposed by the Unbabel & INESC-ID team for the 1st Shared Task on Sentence End and Punctuation Prediction in NLG Text (SEPP-NLG 2021). The shared task covers 4 languages (English, German, French and Italian) and includes two subtasks: Subtask 1 - detecting the end of a sentence, and Subtask 2 - predicting a range of punctuation marks. Our team proposes a single multilingual and multitask model that is able to produce suitable results for all the languages and subtasks involved. The results show that it is possible to achieve state-of-the-art results using one single multilingual model for both tasks and multiple languages. Using a single multilingual model to solve the task for multiple languages is of particular importance, since training a different model for each language is a cumbersome and time-consuming process. |
Peer reviewed: | yes |
Access: | Open Access |
Appears in collections: | CTI-CRI - International conference papers |
Files in this record:
File | Description | Size | Format | |
---|---|---|---|---|
conferenceobject_82723.pdf | Publisher's version | 622.53 kB | Adobe PDF | View/Open |
All records in the repository are protected by copyright law, with all rights reserved.