Reinforcement learning-based control of traffic lights in non-stationary environments: A case study in a microscopic simulator

Oliveira, D. de.; Bazzan, A. L. C.; Silva, B. C. da.; Basso, E. W.; Nunes, L.; Rossetti, R.; Oliveira, E. de.; Silva, R. da.; Lamb, L.

Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/5328

Autoria:	Oliveira, D. de. Bazzan, A. L. C. Silva, B. C. da. Basso, E. W. Nunes, L. Rossetti, R. Oliveira, E. de. Silva, R. da. Lamb, L.
Editor:	Dunin-Kęplicz, B., Omicini, A., and Padget, J.
Data:	2006
Título próprio:	Reinforcement learning-based control of traffic lights in non-stationary environments: A case study in a microscopic simulator
Volume:	223
Título e volume do livro:	CEUR Workshop Proceedings. European Workshop on Multi-Agent Systems 2006
Paginação:	31-42
Título do evento:	4th European Workshop on Multi-Agent Systems (EUMAS'06)
Referência bibliográfica:	Oliveira, D. de., Bazzan, A. L. C., Silva, B. C. da., Basso, E. W., Nunes, L., Rossetti, R., Oliveira, E. de., Silva, R. da., & Lamb, L. (2006). Reinforcement learning-based control of traffic lights in non-stationary environments: A case study in a microscopic simulator. CEUR Workshop Proceedings. European Workshop on Multi-Agent Systems 2006, 223. http://hdl.handle.net/10071/5328
ISSN:	1613-0073
Resumo:	Coping with dynamic changes in traffic volume has been the object of recent publications. Recently, a method was proposed, which is capable of learning in non-stationary scenarios via an approach to detect context changes. For particular scenarios such as the traffic control one, the performance of that method is better than a greedy strategy, as well as other reinforcement learning approaches, such as Q-learning and Prioritized Sweeping. The goal of the present paper is to assess the feasibility of applying the above mentioned approach in a more realistic scenario, implemented by means of a microscopic traffic simulator. We intend to show that to use of context detection is suitable to deal with noisy scenarios where non-stationarity occurs not only due to the changing volume of vehicles, but also because of the random behavior of drivers in what regards the operational task of driving (e.g. deceleration probability). The results confirm the tendencies already detected in the previous paper, although here the increase in noise makes the learning task much more difficult, and the correct separation of contexts harder.
Arbitragem científica:	yes
Acesso:	Acesso Restrito
Aparece nas coleções:	CTI-CRI - Comunicações a conferências internacionais

Ficheiros deste registo:

Ficheiro	Descrição	Tamanho	Formato
RLinITSUMOcr.pdf Restricted Access		255,26 kB	Adobe PDF	Ver/Abrir Request a copy

Mostrar registo em formato completo Visualizar estatísticas