Please use this identifier to cite or link to this item: http://hdl.handle.net/10071/23678
Author(s): Coelho, J.
Neto, A.
Tavares, M.
Coutinho, C.
Oliveira, J.
Ribeiro, R.
Batista, F.
Editor: Cucchiara, R., Fred, A., & Filipe, J.
Date: 2021
Title: Transformer-based language models for semantic search and mobile applications retrieval
Volume: 1
Pages: 225 - 232
Event title: 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR
ISSN: 2184-3228
ISBN: 978-989-758-533-3
DOI (Digital Object Identifier): 10.5220/0010657300003064
Keywords: Semantic search
Word embeddings
ElasticSearch
Mobile applications
Transformer-based models
Abstract: Search engines are being extensively used by Mobile App Stores, where millions of users world-wide use them every day. However, some stores still resort to simple lexical-based search engines, despite the recent advances in Machine Learning, Information Retrieval, and Natural Language Processing, which allow for richer semantic strategies. This work proposes an approach for semantic search of mobile applications that relies on transformer-based language models, fine-tuned with the existing textual information about known mobile applications. Our approach relies solely on the application name and on the unstructured textual information contained in its description. A dataset of about 500 thousand mobile apps was extended in the scope of this work with a test set, and all the available textual data was used to fine-tune our neural language models. We have evaluated our models using a public dataset that includes information about 43 thousand applications, and 56 manually annotated non-exact queries. The results show that our model surpasses the performance of all the other retrieval strategies reported in the literature. Tests with users have confirmed the performance of our semantic search approach, when compared with an existing deployed solution.
Peerreviewed: yes
Access type: Open Access
Appears in Collections: CTI-CRI - Comunicações a conferências internacionais
ISTAR-CRI - Comunicações a conferências internacionais
IT-CRI - Comunicações a conferências internacionais

Files in This Item:
File: conferenceobject_82724.pdf
Description: Accepted version
Size: 274.2 kB
Format: Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.