Poster
in
Workshop: Vision Language Models: Challenges of Real World Deployment

Eureka: Intelligent Feature Engineering for Enterprise AI Cloud Resource Demand Prediction

Hangxuan Li ⋅ Renjun Jia ⋅ Xuezhang Wu ⋅ zeqi zheng ⋅ Yunjie Qian ⋅ Lily (Xianling) Zhang

2025 Poster
in
Workshop: Vision Language Models: Challenges of Real World Deployment

Project Page [ Poster] [ OpenReview]

Abstract

The rapid growth of foundation models (LLMs, VLMs) has surged enterprise cloud AI demand, where GPU resources must be dynamically allocated to meet evolving workloads. Accurate demand prediction is critical for efficient and reliable real-world deployment, yet traditional forecasting systems struggle due to sparse historical data and highly volatile workload behaviors. We present Eureka, an LLM-driven agentic framework that automates feature engineering. Our approach has three main components: a domain knowledge-driven Expert Agent that encodes cloud resource expertise to evaluate feature quality, an Automated Feature Generator that explores new feature spaces, lastly a RL Feedback Loop (reinforcement learning) connects the two components and enables continuous learning. Deployed and evaluated on real-world cloud provider datasets, Eureka improves demand fulfillment rate by 16\%, and reduces computing resource migration rates by 33\%. This work introduces a novel intelligent system for cloud resource prediction and AI supply chain management, advancing the efficiency, scalability, and deployability of foundation models in production environments.

Chat is not available.