Please use this identifier to cite or link to this item: http://hdl.handle.net/10071/23672
Author(s): Rei, R.
Batista, F.
Guerreiro, N. M.
Coheur, L.
Editor: Benites, F., Tuggener, D., Hurlimann, M., Cieliebak, M., & Vogel, M.
Date: 2021
Title: Multilingual simultaneous sentence end and punctuation prediction
Volume: 2957
Event title: 2021 Swiss Text Analytics Conference, SwissText 2021
ISSN: 1613-0073
Abstract: This paper describes the model and its corresponding setup, proposed by the Unbabel & INESC-ID team for the 1st Shared Task on Sentence End and Punctuation Prediction in NLG Text (SEPP-NLG 2021). The shared task covers 4 languages (English, German, French and Italian) and includes two subtasks: Subtask 1 - detecting the end of a sentence, and subtask 2 - predicting a range of punctuation marks. Our team proposes a single multilingual and multitask model that is able to produce suitable results for all the languages and subtasks involved. The results show that it is possible to achieve state-of-the-art results using one single multilingual model for both tasks and multiple languages. Using a single multilingual model to solve the task for multiple languages is of particular importance, since training a different model for each language is a cumbersome and time-consuming process.
Peerreviewed: yes
Access type: Open Access
Appears in Collections:CTI-CRI - Comunicações a conferências internacionais

Files in This Item:
File Description SizeFormat 
conferenceobject_82723.pdfVersão Editora622,53 kBAdobe PDFView/Open


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.