Abstract
Energy control systems need to balance locally produced and consumed power. Their goal is to maximize self-use by charging local storage facilities while maintaining grid-convenience, i.e. avoiding large differences between successive feed-in rates to the grid. To avoid over- or undercharging the battery, control needs to respect certain safety thresholds. Recently, energy control systems have been enhanced with learning components for optimized control. This, however, makes it more difficult to ensure that control respects the above-mentioned safety thresholds, especially in the presence of incomplete information.
This paper proposes an approach for safe learning in energy control systems by combining a shielded reinforcement learning (RL) agent that determines which percentage of the locally produced energy is stored in a battery and which percentage is fed into the grid, with a digital twin of the battery that maintains information on the State of Charge of the battery. To ensure safe learning, we use a formally verified shield, which ensures under certain assumptions that unsafe actions, i.e. overcharging of the battery, are avoided. For formal verification, the RL agent is replaced by a contract and we assume that the digital twin is always available and provides missing information to the RL agent in case of communication losses. This combination then allows us to formally verify the safety of the resulting energy control system with complex discrete and continuous dynamics. We illustrate our approach by developing a Simulink model of a real smart house in Heeten, NL [35]. Our experimental results demonstrate that both self-use and grid-convenience can be achieved while maintaining safe battery use.
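The interplay of agent, shield, and digital twin described in the abstract can be illustrated with a minimal sketch. It is not the paper's Simulink model or its verified shield: the names `BatteryTwin`, `shield`, and `control_step`, the use of kWh per control step, and the simple headroom rule are all assumptions made for illustration. The sketch covers only overcharging and ignores discharge and charging losses.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class BatteryTwin:
    """Hypothetical digital twin: keeps an estimate of the State of Charge (kWh)."""
    capacity_kwh: float
    soc_kwh: float

    def update(self, charged_kwh: float) -> None:
        # Integrate the energy sent to the battery, bounded by the capacity.
        self.soc_kwh = min(self.capacity_kwh, max(0.0, self.soc_kwh + charged_kwh))


def shield(fraction: float, produced_kwh: float, soc_kwh: float,
           soc_max_kwh: float) -> float:
    """Return the largest safe storage fraction that does not exceed the proposal."""
    fraction = min(1.0, max(0.0, fraction))
    if produced_kwh <= 0.0:
        return 0.0
    # Energy that can still be stored before the (assumed) upper threshold is hit.
    headroom_kwh = max(0.0, soc_max_kwh - soc_kwh)
    return min(fraction, headroom_kwh / produced_kwh)


def control_step(proposed_fraction: float, produced_kwh: float,
                 measured_soc_kwh: Optional[float], twin: BatteryTwin,
                 soc_max_kwh: float) -> Tuple[float, float]:
    """One control step: returns (energy stored, energy fed into the grid)."""
    if measured_soc_kwh is not None:
        # A measurement arrived: resynchronise the twin with it.
        twin.soc_kwh = measured_soc_kwh
    # On communication loss (None) the twin's estimate is used instead.
    safe_fraction = shield(proposed_fraction, produced_kwh, twin.soc_kwh, soc_max_kwh)
    stored_kwh = safe_fraction * produced_kwh
    fed_in_kwh = produced_kwh - stored_kwh
    twin.update(stored_kwh)
    return stored_kwh, fed_in_kwh
```

In this sketch the agent's proposal passes through unchanged whenever it is safe, and is reduced only as far as the threshold requires, so the learning component keeps as much freedom as the safety constraint allows.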
| Original language | English |
|---|---|
| Title of host publication | Performance Evaluation Methodologies and Tools |
| Subtitle of host publication | 17th EAI International Conference, Valuetools 2024, Milan, Italy, December 12–13, 2024, Proceedings |
| Editors | Marco Gribaudo, Mauro Iacono, Sahra Sedigh Sarvestani |
| Pages | 274-294 |
| Number of pages | 21 |
| ISBN (Electronic) | 978-3-032-06818-7 |
| DOIs | |
| Publication status | Published - 2 Jan 2026 |
| Event | 17th EAI International Conference on Performance Evaluation Methodologies and Tools 2024, Politecnico Milano, Milan, Italy. Duration: 12 Dec 2024 → 13 Dec 2024. Conference number: 17. https://valuetools.eai-conferences.org/2024/ |
Publication series
| Name | Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering |
|---|---|
| Publisher | Springer |
| Volume | 663 |
| ISSN (Print) | 1867-8211 |
| ISSN (Electronic) | 1867-822X |
Conference
| Conference | 17th EAI International Conference on Performance Evaluation Methodologies and Tools 2024 |
|---|---|
| Abbreviated title | EAI ValueTools 2024 |
| Country/Territory | Italy |
| City | Milan |
| Period | 12/12/24 → 13/12/24 |
| Internet address | https://valuetools.eai-conferences.org/2024/ |
Keywords
- NLA
Fingerprint
Dive into the research topics of 'Safe Battery Use and Grid-Convenience in an Intelligent Energy Control System'. Together they form a unique fingerprint.