Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/31538
Registo completo
Campo DCValorIdioma
dc.contributor.authorBico, M. I.-
dc.contributor.authorBaptista, J.-
dc.contributor.authorBatista, F.-
dc.contributor.authorCardeira, E.-
dc.date.accessioned2024-04-15T08:19:32Z-
dc.date.available2024-04-15T08:19:32Z-
dc.date.issued2024-
dc.identifier.citationBico, M. I., Baptista, J., Batista, F., & Cardeira, E. (2024). Enriching Portuguese medieval texts with named entity recognition. International Journal of Humanities and Arts Computing, 18(1), 109-124. https://doi.org/10.3366/ijhac.2024.0324-
dc.identifier.issn1753-8548-
dc.identifier.urihttp://hdl.handle.net/10071/31538-
dc.description.abstractHistorical data poses unique challenges to natural language processing (NLP) and information retrieval (IR) tools, including digitization errors, lack of annotated data, and diachronic-specific issues. However, the increasing recognition of the value in historical documents has promoted efforts to semantically enrich and optimize their analysis. This article contributes to this endeavour by enriching the Corpus de Textos Antigos through NLP tools and techniques to enhance its usability and support research. The corpus undergoes linguistic annotation, including part-of-speech tagging, lemma annotation and named entity recognition (NER). Subsequently, the article delves into the tasks of entity disambiguation and entity linking, which involve identifying and disambiguating named entities by referring to a knowledge base (KB). Addressing the challenges posed by factors such as text state, epoch and the chosen KB, the article presents insights into related work, annotation results and the linguistic interest of a medieval annotated corpus for named entities. It concludes by discussing the challenges and providing avenues for future research in this domain.eng
dc.language.isoeng-
dc.publisherEdinburgh University Press-
dc.rightsopenAccess-
dc.subjectCorpus analysiseng
dc.subjectNamed entity disambiguationeng
dc.subjectNamed entity linkingeng
dc.subjectNatural language processingeng
dc.subjectInformation retrievaleng
dc.subjectPortuguese medieval textseng
dc.titleEnriching Portuguese medieval texts with named entity recognitioneng
dc.typearticle-
dc.pagination109 - 124-
dc.peerreviewedyes-
dc.volume18-
dc.number1-
dc.date.updated2024-04-15T09:17:24Z-
dc.description.versioninfo:eu-repo/semantics/acceptedVersion-
dc.identifier.doi10.3366/ijhac.2024.0324-
dc.subject.fosDomínio/Área Científica::Humanidades::Línguas e Literaturaspor
iscte.subject.odsEducação de qualidadepor
iscte.identifier.cienciahttps://ciencia.iscte-iul.pt/id/ci-pub-103751-
iscte.journalInternational Journal of Humanities and Arts Computing-
Aparece nas coleções:CTI-RI - Artigos em revistas científicas internacionais com arbitragem científica

Ficheiros deste registo:
Ficheiro TamanhoFormato 
article_103751.pdf265,34 kBAdobe PDFVer/Abrir


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.