Tavakol Aghaei, Vahid and Ağababaoğlu, Arda and Onat, Ahmet and Yıldırım, Sinan (2019) Bayesian learning for policy search in trajectory control of a planar manipulator. In: IEEE 9th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA
Full text not available from this repository. (Request a copy)
Official URL: https://dx.doi.org/10.1109/CCWC.2019.8666449
Abstract
Application of learning algorithms to robotics and control problems with highly nonlinear dynamics to obtain a plausible control policy in a continuous state space is expected to greatly facilitate the design process. Recently, policy search methods such as policy gradient in Reinforcement Learning (RL) have succeeded in coping with such complex systems. Nevertheless, they are slow in convergence speed and are prone to get stuck in local optima. To alleviate this, a Bayesian inference method based on Markov Chain Monte Carlo (MCMC), utilizing a multiplicative reward function, is proposed. This study aims to compare eNAC, a popular gradient based RL method, with the proposed Bayesian learning method, where the objective is trajectory control of a complex model of a 2-DOF planar manipulator. The results obtained for the convergence speed of the proposed algorithm and time response performance, illustrate that the proposed MCMC algorithm is qualified for complex problems in robotics.
Item Type: | Papers in Conference Proceedings |
---|---|
Uncontrolled Keywords: | Bayesian Learning; Control Systems; Markov Chain Monte Carlo; Reinforcement Learning; Robotics |
Divisions: | Faculty of Engineering and Natural Sciences > Academic programs > Industrial Engineering Faculty of Engineering and Natural Sciences > Academic programs > Mechatronics Faculty of Engineering and Natural Sciences |
Depositing User: | Ahmet Onat |
Date Deposited: | 13 Jun 2023 12:30 |
Last Modified: | 13 Jun 2023 12:30 |
URI: | https://research.sabanciuniv.edu/id/eprint/46041 |