Skip to yearly menu bar Skip to main content


San Diego Poster Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM PST Exhibit Hall C,D,E #3518

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Ali Taghibakhshi · Sharath Turuvekere Sreenivas · Saurav Muralidharan · Marcin Chochowski · Yashaswi Karnati · Raviraj Joshi · Ameya Mahabaleshwarkar · ZIJIA CHEN · Yoshi Suhara · Oluwatobi Olabiyi · Daniel Korzekwa · Mostofa Patwary · Mohammad Shoeybi · Jan Kautz · Bryan Catanzaro · Ashwath Aithal · Nima Tajbakhsh · Pavlo Molchanov

Abstract

Log in and register to view live content