Use this identifier to cite this record: http://hdl.handle.net/10071/7329
Title: Comparison of existing open-source tools for Web crawling and indexing of free Music
Authors: Serrão, C.
Ricardo, A.
Keywords: Content Analysis and Indexing
Information Storage and Retrieval
Information Filtering
Retrieval Process
Selection Process
Open Source
Creative Commons
Music
MP3
Date: 2013
Publisher: Journal of Telecommunications
Abstract: This paper surveys existing open-source web crawlers that also include an indexing component. The goal is to identify which tool is best suited to crawling and indexing a large collection of MP3 music files freely available on the Internet. Each tool is briefly described, with an overview, some known users, and its main advantages and disadvantages. To highlight the most significant differences among the tools, the paper summarizes their features: the programming language in which they are written, the deployment platform, the type of index used, database integration, front-end capabilities, the existence of a plugin system, and support for parsing MP3 and Adobe Flash (SWF) files. The tools are then classified by intended collection size: tools for mirroring small collections, tools for medium collections, and tools for large collections that can handle large amounts of data. Finally, the paper assesses which tools are best suited to handling large collections in a distributed way.
Peer reviewed: Yes
URI: https://sites.google.com/site/journaloftelecommunications/volume-18-issue-1-january-2013
https://ciencia.iscte-iul.pt/public/pub/id/14731
http://hdl.handle.net/10071/7329
ISSN: 2042-8839
Publisher version: The definitive version is available at: http://www.scribd.com/doc/123153248
Appears in collections: CTI-RI - Articles in international peer-reviewed scientific journals
ADETTI-RI - Article in an international peer-reviewed scientific journal




All records in the repository are protected by copyright law, with all rights reserved.