Hierarchical reinforcement learning using path clustering

Gil, P.; Nunes, L.

Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/27772

Registo completo

Campo DC	Valor	Idioma
dc.contributor.author	Gil, P.	-
dc.contributor.author	Nunes, L.	-
dc.date.accessioned	2023-02-07T12:47:39Z	-
dc.date.available	2023-02-07T12:47:39Z	-
dc.date.issued	2013-01-01	-
dc.identifier.citation	Gil, P., & Nunes, L. (2013). Hierarchical reinforcement learning using path clustering. In 2013 8th Iberian Conference on Information Systems and Technologies (CISTI), 6615769. IEEE.	-
dc.identifier.isbn	978-989-98434-0-0	-
dc.identifier.issn	2166-0727	-
dc.identifier.uri	http://hdl.handle.net/10071/27772	-
dc.description.abstract	In this paper we intend to study the possibility to improve the performance of the Q-Learning algorithm, by automatically finding subgoals and making better use of the acquired knowledge. This research explores a method that allows an agent to gather information about sequences of states that lead to a goal, detect classes of common sequences and introduce the states at the end of these sequences as subgoals. We use the taxiproblem (a standard in Hierarchical Reinforcement Learning literature) and conclude that, even though this problem's scale is relatively small, in most of the cases subgoals do improve the learning speed, achieving relatively good results faster than standard Q-Learning. We propose a specific iteration interval as the most appropriate to insert subgoals in the learning process. We also found that early adoption of subgoals may lead to suboptimal learning. The extension to more challenging problems is an interesting subject for future work.	eng
dc.language.iso	eng	-
dc.publisher	IEEE	-
dc.relation.ispartof	2013 8th Iberian Conference on Information Systems and Technologies (CISTI)	-
dc.rights	openAccess	-
dc.subject	Hierarchical reinforcement learning	eng
dc.subject	Q-learning	eng
dc.subject	Performance	eng
dc.subject	Subgoals	eng
dc.title	Hierarchical reinforcement learning using path clustering	eng
dc.type	conferenceObject	-
dc.event.title	8th Iberian Conference on Information Systems and Technologies, CISTI 2013	-
dc.event.type	Conferência	pt
dc.event.location	Lisboa	eng
dc.event.date	2013	-
dc.peerreviewed	yes	-
dc.date.updated	2023-02-07T12:46:16Z	-
dc.description.version	info:eu-repo/semantics/acceptedVersion	-
dc.subject.fos	Domínio/Área Científica::Ciências Naturais::Ciências da Computação e da Informação	por
iscte.identifier.ciencia	https://ciencia.iscte-iul.pt/id/ci-pub-42667	-
iscte.alternateIdentifiers.wos	WOS:WOS:000345737600070	-
iscte.alternateIdentifiers.scopus	2-s2.0-84887948781	-
Aparece nas coleções:	IT-CRI - Comunicações a conferências internacionais

Ficheiros deste registo:

Ficheiro	Tamanho	Formato
conferenceobject_42667.pdf	647,53 kB	Adobe PDF	Ver/Abrir

Mostrar registo em formato simples Visualizar estatísticas