Workshop
Machine Learning Systems
Alex Beutel · Tianqi Chen · Sameer Singh · Elaine Angelino · Markus Weimer · Joseph Gonzalez

Sat Dec 12th 08:30 AM -- 06:30 PM @ 511 d
Event URL: http://learningsys.org/ »

The broadening use of machine learning, the explosive growth in data, and the complexity of the large-scale learning systems required to analyze these data have together fueled interdisciplinary research at the intersection of Machine Learning and System design. Addressing these challenges demands a combination of the right abstractions -- for algorithms, data structures, and interfaces -- as well as scalable systems capable of addressing real world learning problems. At the same time, it is becoming increasingly clear that data-driven and learning-driven approaches provide natural and powerful solutions to building and managing complex modern systems. In total, the flow of ideas between these two communities continues to offer promising opportunities toward solving even larger problems.

Designing systems for machine learning presents new challenges and opportunities over the design of traditional data processing systems. For example, what is the right abstraction for data consistency in the context of parallel, stochastic learning algorithms? What guarantees of fault tolerance are needed during distributed learning? The statistical nature of machine learning offers an opportunity for more efficient systems but requires revisiting many of the challenges addressed by the systems and database communities over the past few decades. Machine learning focused developments in distributed learning platforms, programming languages, data structures, general purpose GPU programming, and a wide variety of other domains have had and will continue to have a large impact in both academia and industry.

As the relationship between the machine learning and systems communities has grown stronger, new research in using machine learning tools to solve classic systems challenges has also grown. Specifically, as we develop larger and more complex systems and networks for storing, analyzing, serving, and interacting with data, machine learning offers promise for modeling system dynamics, detecting issues, and making intelligent, data-driven decisions within our systems. Machine learning techniques have begun to play critical roles in scheduling, system tuning, and network analysis. Through working with systems and databases researchers to solve systems challenges, machine learning researchers can both improve their own learning systems as well impact the systems community and infrastructure at large.

This workshop aims to bring together researchers from the two communities and facilitate discussion on the challenges of Machine Learning Systems. This is a successor to the Big Learning workshop, which in past NIPS successfully focused on and brought attention to the need for scaling machine learning. Moving forward, this Machine Learning Systems workshop aims to address two major questions: (1) How should we design useful abstractions and build scalable systems to support large-scale machine learning? (2) How can we use machine learning in our systems to make them smarter and more efficient? Focal points for discussions and solicited submissions include but are not limited to:
- Systems for online and batch learning algorithms
- Systems for out-of-core machine learning
- Implementation studies of large-scale distributed learning algorithms --- challenges faced and lessons learned
- Database systems for Big Learning --- models and algorithms implemented, properties (fault tolerance, consistency, scalability, etc.), strengths and limitations
- Programming languages for machine learning
- Data driven systems --- learning for job scheduling, configuration tuning, straggler mitigation, network configuration, and security
- Systems for interactive machine learning
- Systems for serving machine learning models at scale



08:50 AM TensorFlow: A system for machine learning on heterogeneous systems (Talk) Jeff Dean
11:20 AM Intelligent Services (Talk) Joseph Gonzalez
02:30 PM Multiworld Testing (Talk) Sarah Bird
03:10 PM ML and Functional Programming (Talk) Mathias Brandewinder
04:30 PM Privacy in a Data Driven World (Talk) Roxana Geambasu
05:10 PM Scaling Machine Learning (Talk) Alex Smola
06:00 PM Hardware Trends for High Performance Analytics (Talk) Christopher Ré

Author Information

Alex Beutel (Carnegie Mellon University)
Tianqi Chen (University of Washington)
Sameer Singh (University of Washington)
Elaine Angelino (Harvard)
Markus Weimer (Microsoft)
Joseph Gonzalez (UC Berkeley)

More from the Same Authors