Use this identifier to cite or link to this record:
http://hdl.handle.net/10071/5356
Full record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Jardim, David | - |
dc.contributor.author | Oliveira, Sancho | - |
dc.contributor.author | Nunes, Luís | - |
dc.date.accessioned | 2013-07-30T14:04:49Z | - |
dc.date.available | 2013-07-30T14:04:49Z | - |
dc.date.issued | 2013-07-30 | - |
dc.identifier.uri | http://hdl.handle.net/10071/5356 | - |
dc.description.abstract | In this paper we present a method that allows an agent to discover and create temporal abstractions autonomously. Our method is based on the concept that to reach the goal, the agent must pass through relevant states that we will interpret as subgoals. To detect useful subgoals, our method creates intersections between several paths leading to a goal. Our research focused on domains largely used in the study of temporal abstractions. We used several versions of the room-to-room navigation problem. We determined that, in the problems tested, an agent can learn more rapidly by automatically discovering subgoals and creating abstractions. | por |
dc.language.iso | eng | por |
dc.rights | restrictedAccess | por |
dc.subject | Autonomous Agents | por |
dc.subject | Machine Learning | por |
dc.subject | Reinforcement Learning | por |
dc.subject | Sub-goals | por |
dc.title | Hierarchical Reinforcement Learning: Learning Sub-goals and State-Abstraction | por |
dc.type | conferenceObject | por |
dc.event.title | Workshop on Intelligent Systems and Application (WISA 2011), 6ª Conferência Ibérica de Sistemas e Tecnologias de Informação (CISTI'2011) | por |
dc.event.type | Workshop | por |
dc.event.location | Chaves, Portugal | por |
dc.event.date | 2011 | por |
dc.pagination | Vol. II, pp. 245 - 248 | por |
dc.publicationstatus | Published | por |
dc.peerreviewed | Yes | por |
Appears in collections: | CTI-CRI - Comunicações a conferências internacionais |
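
The abstract in this record describes detecting subgoals by intersecting several paths that lead to a goal. The record does not say how the intersection is computed, so the following is only a minimal sketch under one assumption: each path is a sequence of visited states, and a candidate subgoal is any state that appears in every successful path, excluding the start and the goal. The function name `candidate_subgoals` and the state labels are hypothetical and not taken from the paper.

```python
from typing import Hashable, Iterable, Sequence, Set

State = Hashable


def candidate_subgoals(
    paths: Iterable[Sequence[State]], start: State, goal: State
) -> Set[State]:
    """Return the states shared by every path, minus the start and goal.

    Assumption (not from the paper): "intersection between paths" is
    modelled as plain set intersection over the states each path visits.
    """
    visited_sets = [set(path) for path in paths]
    if not visited_sets:
        return set()

    # Keep only the states that occur in all successful paths.
    common = set.intersection(*visited_sets)

    # The start and goal are trivially shared, so they are not subgoals.
    common.discard(start)
    common.discard(goal)
    return common


# Three hypothetical successful paths through a two-room layout in which
# the only connection between the rooms is the state labelled "door".
example_paths = [
    ["s", "a", "b", "door", "c", "g"],
    ["s", "d", "door", "e", "f", "g"],
    ["s", "a", "d", "door", "e", "g"],
]

print(candidate_subgoals(example_paths, start="s", goal="g"))
```

In this toy layout the only state common to all three paths, apart from the start and the goal, is the doorway, which matches the intuition that bottleneck states in room-to-room navigation make useful subgoals.
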
Files in this record:
File | Description | Size | Format | |
---|---|---|---|---|
HRL Short Paper.pdf (Restricted Access) | | 321.61 kB | Adobe PDF | View/Open, Request a copy |
All records in the repository are protected by copyright law, with all rights reserved.