TY - GEN
T1 - Modeling Reinforcement Learning Algorithms for performance analysis
AU - Kulkarni, Shrirang Ambaji
AU - Rao, G. Raghavendra
PY - 2009
Y1 - 2009
N2 - Reinforcement Learning Algorithms present interesting learning techniques. Here an autonomous agent interacts with its environment to choose optimal actions to achieve its goals. The performance of an agent is determined by how quickly it learns and converges to an optimal solution. Q-learning and Prioritized sweeping provide interesting techniques to achieve this. In this paper we try to analyze the performance of Q-learning and Prioritized sweeping as examples of model free and model based reinforcement learning. We also try to analyze the optimal number of backups required for prioritized sweeping. We model the results of prioritized sweeping as a regression model and discuss the prediction of the model by comparing it with the accuracy of our simulation results.
AB - Reinforcement Learning Algorithms present interesting learning techniques. Here an autonomous agent interacts with its environment to choose optimal actions to achieve its goals. The performance of an agent is determined by how quickly it learns and converges to an optimal solution. Q-learning and Prioritized sweeping provide interesting techniques to achieve this. In this paper we try to analyze the performance of Q-learning and Prioritized sweeping as examples of model free and model based reinforcement learning. We also try to analyze the optimal number of backups required for prioritized sweeping. We model the results of prioritized sweeping as a regression model and discuss the prediction of the model by comparing it with the accuracy of our simulation results.
UR - https://www.scopus.com/pages/publications/70349137458
UR - https://www.scopus.com/inward/citedby.url?scp=70349137458&partnerID=8YFLogxK
U2 - 10.1145/1523103.1523111
DO - 10.1145/1523103.1523111
M3 - Conference contribution
AN - SCOPUS:70349137458
SN - 9781605583518
T3 - Proceedings of the International Conference on Advances in Computing, Communication and Control, ICAC3'09
SP - 35
EP - 39
BT - Proceedings of the International Conference on Advances in Computing, Communication and Control, ICAC3'09
T2 - International Conference on Advances in Computing, Communication and Control, ICAC3'09
Y2 - 23 January 2009 through 24 January 2009
ER -