Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/24971
Autoria: Mariano, P.
Almeida, S. M.
Santana, P.
Data: 2025
Título próprio: On the automated learning of air pollution prediction models from data collected by mobile sensor networks
Título da revista: Energy Sources, Part A: Recovery, Utilization, and Environmental Effects
Volume: 47
Número: 1
Paginação: 1772 - 1788
Referência bibliográfica: Mariano, P., Almeida, S. M., & Santana, P. (2025). On the automated learning of air pollution prediction models from data collected by mobile sensor networks. Energy Sources, Part A: Recovery, Utilization, and Environmental Effects, 47(1), 1772-1788. https://doi.org/10.1080/15567036.2021.1968076
ISSN: 1556-7036
DOI (Digital Object Identifier): 10.1080/15567036.2021.1968076
Palavras-chave: Machine learning
Air pollution
Time-series
Land- use
Decision tree
Support vector machine
Resumo: This paper addresses the problem of automated learning of air pollution predictive models that were trained using information gathered by a set of mobile low-cost sensors. Concretely, fast to compute machine learning methods (Decision Trees and Support Vector Machines) were used to build regression models that predict air pollution levels for a given location. The models were trained using the data collected by the OpenSense project, in particular, number of particulate matter, particle diameter, and lung deposited surface area (LDSA). We examined two different sets of attributes: one based on a geographical description of the location under analysis (e.g. distribution of households and roads), and another based on a time series of past air pollution observations in that location. Overall, we have found out that past measures lead to better pollution predictions. The best R2 score was 0.751 obtained with the model that predicts LDSA and was trained with the data set with time series attributes, while the worst R2 was 0.009 obtained with the geographical data set to predict number of particles. The performance of the best model is on par with similar air pollution systems. Moreover it can be used in a production system that requires frequent updates.
Arbitragem científica: yes
Acesso: Acesso Aberto
Aparece nas coleções:ISTAR-RI - Artigos em revistas científicas internacionais com arbitragem científica

Ficheiros deste registo:
Ficheiro TamanhoFormato 
article_82950.pdf1,55 MBAdobe PDFVer/Abrir


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.