SSPQL: Stochastic Shortest Path-based Q-learning


ㆍ Authors: Kwon, Woo-Young; Suh, Il-Hong; Lee, Sang-Hoon
ㆍ Journal: International Journal of Control, Automation and Systems
ㆍ Volume/Issue: 2011 | Vol. 9, No. 2 | pp. 328-338 (11 pages)
ㆍ Publisher: Institute of Control, Robotics and Systems (제어로봇시스템학회)
ㆍ File info: Periodical | ENG | PDF text
ㆍ Subject area: Other

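The full text of this paper is provided free of charge through a paper-linking arrangement with the Korea Institute of Science and Technology Information (KISTI).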

Abstract

Reinforcement learning (RL) has been widely used as a mechanism for autonomous robots to learn state-action pairs by interacting with their environment. However, most RL methods suffer from slow convergence when deriving an optimum policy in practical applications. To solve this problem, stochastic shortest path-based Q-learning (SSPQL) is proposed, combining a stochastic shortest path-finding method with Q-learning, a well-known model-free RL method. The rationale is that if a robot has an internal state-transition model that is learnt incrementally, then the robot can infer a local optimum policy by applying a stochastic shortest path-finding method to that model. By increasing the values of the state-action pairs that make up these local optimum policies, the robot can reach a goal quickly, which in turn speeds up convergence. To demonstrate the validity of the proposed learning approach, several experimental results are presented in the paper.
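
The idea summarized in the abstract can be illustrated with a small Python sketch. It is not the authors' algorithm, whose details are not given on this page. It only assumes the three ingredients the abstract names: a one-step Q-learning update, a transition model learnt from counts, and a shortest-path search over that model whose state-action pairs then have their values raised. The function names, the edge cost 1/p (expected number of attempts for a transition with estimated probability p), and the discounted boost target are all assumptions made for illustration.

```python
import heapq
import math
from collections import defaultdict


def q_update(Q, s, a, r, s2, actions, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update (model-free part)."""
    best_next = max(Q[(s2, b)] for b in actions)
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])


def record_transition(counts, s, a, s2):
    """Incrementally learn the transition model as visit counts."""
    counts[(s, a)][s2] += 1


def shortest_path(counts, start, goal):
    """Dijkstra over the learnt model.

    Assumption: an observed transition with estimated probability p
    costs 1/p. States must be orderable (e.g. ints) for heap ties.
    Returns [(state, action), ...] from start to goal, or [] if no
    path is known yet.
    """
    dist = {start: 0.0}
    prev = {}
    heap = [(0.0, start)]
    while heap:
        d, s = heapq.heappop(heap)
        if s == goal:
            break
        if d > dist.get(s, math.inf):
            continue  # stale heap entry
        for (s0, a), successors in counts.items():
            if s0 != s:
                continue
            total = sum(successors.values())
            for s2, n in successors.items():
                nd = d + total / n  # cost 1/p with p = n / total
                if nd < dist.get(s2, math.inf):
                    dist[s2] = nd
                    prev[s2] = (s, a)
                    heapq.heappush(heap, (nd, s2))
    if goal not in prev:
        return []
    path = []
    s = goal
    while s != start:
        s0, a = prev[s]
        path.append((s0, a))
        s = s0
    path.reverse()
    return path


def boost_along_path(Q, path, goal_reward, gamma=0.9):
    """Raise Q-values of the pairs on the path (never lower them).

    Assumption: the target is the goal reward discounted by the
    number of steps remaining after the pair is executed.
    """
    steps = len(path)
    for i, (s, a) in enumerate(path):
        target = goal_reward * gamma ** (steps - 1 - i)
        Q[(s, a)] = max(Q[(s, a)], target)


# Hypothetical usage on a 4-state chain 0 -> 1 -> 2 -> 3 (goal = 3).
Q = defaultdict(float)
counts = defaultdict(lambda: defaultdict(int))
for s, a, s2 in [(0, "right", 1), (1, "right", 2), (1, "right", 2),
                 (1, "right", 1), (1, "left", 0), (2, "right", 3)]:
    record_transition(counts, s, a, s2)
path = shortest_path(counts, start=0, goal=3)
boost_along_path(Q, path, goal_reward=1.0)
```

In this sketch the boost only ever raises a value, so ordinary Q-learning updates remain free to correct it later if the learnt model turns out to be wrong.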

Keywords

Model-based RL, reinforcement learning, robot learning, stochastic shortest path
