Timezone: »

 
Controlling Commercial Cooling Systems Using Reinforcement Learning
Jerry Luo · Cosmin Paduraru · Octavian Voicu · Yuri Chervonyi · Scott Munns · Jerry Li · Crystal Qian · Praneet Dutta · Daniel Mankowitz · Jared Quincy Davis · Ningjia Wu · Xingwei Yang · Chu-Ming Chang · Ted Li · Rob Rose · Mingyan Fan · Hootan Nakhost · Tinglin Liu · Deeni Fatiha · Neil Satra · Juliet Rothenberg · Molly Carlin · Satish Tallapaka · Sims Witherspoon · David Parish · Peter Dolan · Chenyu Zhao

This paper is a technical overview of our recent work on reinforcement learning for controlling commercial cooling systems. Building on previous work on cooling data centers more efficiently, we recently conducted two live experiments in partnership with a building management system provider. These live experiments had a variety of challenges in areas such as evaluation, learning from offline data, and constraint satisfaction. Our paper describes these challenges in the hope that awareness of them will benefit future applied RL work. We also describe the way we adapted our RL system to deal with these challenges, resulting in energy savings of approximately 9% and 13% respectively at the two live experiment sites.

Author Information

Jerry Luo (DeepMind)
Cosmin Paduraru (DeepMind)
Octavian Voicu (DeepMind)
Yuri Chervonyi (Google/Deepmind)
Scott Munns (Trane Technologies)
Jerry Li (Deepmind)

Industrial researcher specializes in Generative models, style transfer, and RL.

Crystal Qian (DeepMind)
Praneet Dutta (Google)
Daniel Mankowitz (DeepMind)
Jared Quincy Davis (DeepMind | Stanford)
Ningjia Wu (Google)
Xingwei Yang (Google)
Chu-Ming Chang (Google)
Ted Li (Google)
Rob Rose (Google)
Mingyan Fan (Google)
Hootan Nakhost (Google)
Tinglin Liu (Google)
Deeni Fatiha (DeepMind)
Neil Satra (Google)
Juliet Rothenberg (DeepMind)
Molly Carlin (DeepMind)
Satish Tallapaka (Google)
Sims Witherspoon (DeepMind)
David Parish (Google)
Peter Dolan (Waymo)

Over a decade in machine learning at Google, focusing on applying Recommender systems to News and TV content and large system optimization, and building and deploying production ML infrastructure.

Chenyu Zhao (Google)

Related Events (a corresponding poster, oral, or spotlight)

More from the Same Authors