Abstract
The hybrid renewable energy systems (HRES) must be capable of responding to intermittent supply by photovoltaics/wind, the health of storage, and time-dependent tariffs. Instantaneous, rule-based dispatch cannot operate under such stochastic couplings. Some of the reinforcement learning (RL) experiments rely on ad-hoc simulators, ineffective constraint management, and training regimes, which are not operational in latency. The study has offset this gap by creating a modular and physics-consistent RL environment of HRES that is capable of combining forecast-aware state-construction, battery/grid-safety screening, and reproducible benchmarking of Q-Learning, DQN, PPO, and a novel Hybrid Policy Gradient Algorithm (HPGA). Environment includes projection of constraints (SoC, ramp, feeder limits), multi-objective reward shaping (cost, curtailment, aging), as well as a co-simulation loop that reveals non-stationarity of real market and weather. The platform has shown the final outputs of a representative microgrid (200 kW PV, 150 kW wind, 500kWh BESS). The PPO has 91.4 percent renewable utilization and 15.8 percent operating cost reduction over a rule base as compared to HPGA, which boosts convergence stability and battery stress. Decision latency reaches 32 ms/step (PPO) versus 47 (DQN) and 62 (Q-Learning), and policies stabilize within ≈480 episodes compared with ≈700 for tabular methods. By coupling rigorous physical constraints with sample-efficient RL updates, the proposed environment enables deployable, low-latency control and provides a transparent testbed for future algorithms advancing HRES operation toward higher renewable penetration, lower lifecycle cost, and robust grid support.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2025 International Conference on Computational Intelligence, Security, and Artificial Intelligence, IntelliSecAI 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9798331565763 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 2025 International Conference on Computational Intelligence, Security, and Artificial Intelligence, IntelliSecAI 2025 - Al-Khobar, Saudi Arabia Duration: 17-12-2025 → 18-12-2025 |
Publication series
| Name | Proceedings of the 2025 International Conference on Computational Intelligence, Security, and Artificial Intelligence, IntelliSecAI 2025 |
|---|
Conference
| Conference | 2025 International Conference on Computational Intelligence, Security, and Artificial Intelligence, IntelliSecAI 2025 |
|---|---|
| Country/Territory | Saudi Arabia |
| City | Al-Khobar |
| Period | 17-12-25 → 18-12-25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
All Science Journal Classification (ASJC) codes
- Artificial Intelligence
- Computer Networks and Communications
- Software
- Information Systems
Fingerprint
Dive into the research topics of 'Implementation of Reinforcement Learning Environment for Hybrid Renewable Energy Systems'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver