Skip to yearly menu bar Skip to main content


Poster
in
Workshop: ML for Systems

When to Reason: Semantic Router for vLLM

Chen Wang ⋅ Xunzhuo Liu ⋅ Yuhan Liu ⋅ Yue Zhu ⋅ Xiangxi Mo ⋅ Junchen Jiang ⋅ Huamin Chen

Abstract

Chat is not available.