Q-Learning-Based Multivariate Nonlinear Model Predictive Controller: Experimental Validation on Batch Reactor for Temperature Trajectory Tracking

  • Abhiram Varma Vegesna
  • , Muralikrishna Shamaiah Narayanarao
  • , Kishore Bhamidipati
  • , Thirunavukkarasu Indiran*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

This study introduces a Q-learning-based nonlinear model predictive control (QL-NMPC) framework for temperature control in batch reactors. A reinforcement learning agent is trained in simulation to learn optimal control strategies using coolant flow rate and heater current as inputs. The resulting policy, represented as a Q-table, is implemented in real time on a physical reactor setup using the NVIDIA Jetson Orin platform. The proposed QL-NMPC framework employs a value iteration-based Q-learning algorithm, enabling model-free policy optimization without explicit policy evaluation steps, and demonstrates effective temperature tracking while highlighting the potential of reinforcement learning for controlling nonlinear batch processes without relying on system identification.

Original languageEnglish
Pages (from-to)28362-28371
Number of pages10
JournalACS Omega
Volume10
Issue number26
DOIs
Publication statusPublished - 08-07-2025

All Science Journal Classification (ASJC) codes

  • General Chemistry
  • General Chemical Engineering

Fingerprint

Dive into the research topics of 'Q-Learning-Based Multivariate Nonlinear Model Predictive Controller: Experimental Validation on Batch Reactor for Temperature Trajectory Tracking'. Together they form a unique fingerprint.

Cite this