Poster Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM PST

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Ali Taghibakhshi ⋅ Sharath Turuvekere Sreenivas ⋅ Saurav Muralidharan ⋅ Marcin Chochowski ⋅ Yashaswi Karnati ⋅ Raviraj Joshi ⋅ Ameya Mahabaleshwarkar ⋅ ZIJIA CHEN ⋅ Yoshi Suhara ⋅ Oluwatobi Olabiyi ⋅ Daniel Korzekwa ⋅ Mostofa Patwary ⋅ Mohammad Shoeybi ⋅ Jan Kautz ⋅ Bryan Catanzaro ⋅ Ashwath Aithal ⋅ Nima Tajbakhsh ⋅ Pavlo Molchanov
