Modeling Reinforcement Learning Algorithms for performance analysis

Shrirang Ambaji Kulkarni, G. Raghavendra Rao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

Reinforcement Learning Algorithms present interesting learning techniques. Here an autonomous agent interacts with its environment to choose optimal actions to achieve its goals. The performance of an agent is determined by how quickly it learns and converges to an optimal solution. Q-learning and Prioritized sweeping provide interesting techniques to achieve this. In this paper we try to analyze the performance of Q-learning and Prioritized sweeping as examples of model free and model based reinforcement learning. We also try to analyze the optimal number of backups required for prioritized sweeping. We model the results of prioritized sweeping as a regression model and discuss the prediction of the model by comparing it with the accuracy of our simulation results.

Original languageEnglish
Title of host publicationProceedings of the International Conference on Advances in Computing, Communication and Control, ICAC3'09
Pages35-39
Number of pages5
DOIs
Publication statusPublished - 2009
EventInternational Conference on Advances in Computing, Communication and Control, ICAC3'09 - Mumbai, India
Duration: 23-01-200924-01-2009

Publication series

NameProceedings of the International Conference on Advances in Computing, Communication and Control, ICAC3'09

Conference

ConferenceInternational Conference on Advances in Computing, Communication and Control, ICAC3'09
Country/TerritoryIndia
CityMumbai
Period23-01-0924-01-09

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Software
  • Control and Systems Engineering

Fingerprint

Dive into the research topics of 'Modeling Reinforcement Learning Algorithms for performance analysis'. Together they form a unique fingerprint.

Cite this