Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states

Silva, Valdinei Freire da; Costa, Anna Helena Reali

doi:10.1007/BF03194507

Acessibilidade / Reportar erro

Brasil

Journal of the Brazilian Computer Society

Español English

Brasil

Español English

sumário « anterior atual seguinte »

Sumário

• J. Braz. Comp. Soc. 15 (3) • Sept 2009 • https://doi.org/10.1007/BF03194507 copy

Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states

Authorship SCIMAGO INSTITUTIONS RANKINGS

Reinforcement Learning is carried out on-line, through trial-and-error interactions of the agent with the environment, which can be very time consuming when considering robots. In this paper we contribute a new learning algorithm, CFQ-Learning, which uses macro-states, a low-resolution discretisation of the state space, and a partial-policy to get around obstacles, both of them based on the complexity of the environment structure. The use of macro-states avoids convergence of algorithms, but can accelerate the learning process. In the other hand, partial-policies can guarantee that an agent fulfils its task, even through macro-state. Experiments show that the CFQ-Learning performs a good balance between policy quality and learning rate.

machine learning; reinforcement learning; abstraction; partial-policy; macro-states

Sociedade Brasileira de Computação Sociedade Brasileira de Computação - UFRGS, Av. Bento Gonçalves 9500, B. Agronomia, Caixa Postal 15064, 91501-970 Porto Alegre, RS - Brazil, Tel. / Fax: (55 51) 316.6835 - Campinas - SP - Brazil
E-mail: jbcs@icmc.sc.usp.br

Acompanhe os números deste periódico no seu leitor de RSS