Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/5353
Registo completo
Campo DCValorIdioma
dc.contributor.authorGil, Paulo-
dc.contributor.authorNunes, Luís-
dc.date.accessioned2013-07-30T13:55:17Z-
dc.date.available2013-07-30T13:55:17Z-
dc.date.issued2013-07-30-
dc.identifier.urihttp://hdl.handle.net/10071/5353-
dc.description.abstractIn this paper we intend to study the possibility to improve the performance of the Q-Learning algorithm, by automatically finding subgoals and making better use of the acquired knowledge. This research explores a method that allows an agent to gather information about sequences of states that lead to a goal, detect classes of common sequences and introduce the states at the end of these sequences as subgoals. We use the taxi-problem (a standard in Hierarchical Reinforcement Learning literature) and conclude that, even though this problem's scale is relatively small, in most of the cases subgoals do improve the learning speed, achieving relatively good results faster than standard Q-Learning. We propose a specific iteration interval as the most appropriate to insert subgoals in the learning process. We also found that early adoption of subgoals may lead to suboptimal learning. The extension to more challenging problems is an interesting subject for future workpor
dc.language.isoengpor
dc.rightsrestrictedAccesspor
dc.subjectreinforcement learningpor
dc.subjectQ-Learningpor
dc.subjectsubgoalspor
dc.subjectoptionspor
dc.titleHierarchical reinforcement learning using path clusteringpor
dc.typeconferenceObjectpor
dc.event.titleConferência Ibérica de Sistemas e Tecnologias de Informação, CISTI 2013por
dc.event.typeConferênciapor
dc.event.locationLisboa, Portugalpor
dc.event.date2013por
dc.paginationVol. I, pp. 659 - 664por
dc.publicationstatusPublicadopor
dc.peerreviewedSimpor
Aparece nas coleções:CTI-CRI - Comunicações a conferências internacionais

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
Hierarchical reinforcement learning using path clustering.pdf
  Restricted Access
300,85 kBAdobe PDFVer/Abrir Request a copy


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.