Timezone: »
We present Park, a platform for researchers to experiment with Reinforcement Learning (RL) for computer systems. Using RL for improving the performance of systems has a lot of potential, but is also in many ways very different from, for example, using RL for games. Thus, in this work we first discuss the unique challenges RL for systems has, and then propose Park an open extensible platform, which makes it easier for ML researchers to work on systems problems. Currently, Park consists of 12 real world system-centric optimization problems with one common easy to use interface. Finally, we present the performance of existing RL approaches over those 12 problems and outline potential areas of future work.
Author Information
Hongzi Mao (MIT)
Parimarjan Negi (MIT CSAIL)
Akshay Narayan (MIT CSAIL)
Hanrui Wang (Massachusetts Institute of Technology)
Jiacheng Yang (MIT CSAIL)
Haonan Wang (MIT CSAIL)
Ryan Marcus (MIT CSAIL)
Ravichandra Addanki (Massachusetts Institute of Technology)
Mehrdad Khani Shirkoohi (MIT)
Songtao He (Massachusetts Institute of Technology)
Vikram Nathan (MIT)
Frank Cangialosi (MIT CSAIL)
Shaileshh Venkatakrishnan (MIT)
Wei-Hung Weng (MIT)
Song Han (MIT)
Tim Kraska (MIT)
Dr.Mohammad Alizadeh (Massachusetts institute of technology)
More from the Same Authors
-
2020 Poster: MCUNet: Tiny Deep Learning on IoT Devices »
Ji Lin · Wei-Ming Chen · Yujun Lin · john cohn · Chuang Gan · Song Han -
2020 Spotlight: MCUNet: Tiny Deep Learning on IoT Devices »
Ji Lin · Wei-Ming Chen · Yujun Lin · john cohn · Chuang Gan · Song Han -
2020 Poster: Differentiable Augmentation for Data-Efficient GAN Training »
Shengyu Zhao · Zhijian Liu · Ji Lin · Jun-Yan Zhu · Song Han -
2020 Poster: TinyTL: Reduce Memory, Not Parameters for Efficient On-Device Learning »
Han Cai · Chuang Gan · Ligeng Zhu · Song Han -
2020 Poster: High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization »
Qing Feng · Ben Letham · Hongzi Mao · Eytan Bakshy -
2020 Spotlight: High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization »
Qing Feng · Ben Letham · Hongzi Mao · Eytan Bakshy -
2019 Poster: Learning Generalizable Device Placement Algorithms for Distributed Machine Learning »
Ravichandra Addanki · Shaileshh Bojja Venkatakrishnan · Shreyan Gupta · Hongzi Mao · Mohammad Alizadeh -
2019 Poster: Deep Leakage from Gradients »
Ligeng Zhu · Zhijian Liu · Song Han -
2019 Poster: Point-Voxel CNN for Efficient 3D Deep Learning »
Zhijian Liu · Haotian Tang · Yujun Lin · Song Han -
2019 Spotlight: Point-Voxel CNN for Efficient 3D Deep Learning »
Zhijian Liu · Haotian Tang · Yujun Lin · Song Han -
2018 Poster: Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces »
Yu-An Chung · Wei-Hung Weng · Schrasing Tong · Jim Glass -
2018 Spotlight: Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces »
Yu-An Chung · Wei-Hung Weng · Schrasing Tong · Jim Glass