Midi-LLM: Adapting Large Language Models for Text-to-MIDI Music Generation

Shih-Lun Wu ⋅ Yoon Kim ⋅ Anna Huang

2025 Poster in Workshop: Artificial Intelligence for Music: Where Creativity Meets Computation

Abstract

We present Midi-LLM, an LLM for generating multitrack MIDI music from free-form text prompts. Our approach expands a text LLM's vocabulary to include MIDI tokens and uses a two-stage training recipe to endow it with text-to-MIDI abilities. By preserving the LLM’s weight signature, we can directly leverage the vLLM library for accelerated inference. Experiments show that Midi-LLM achieves higher quality, better text control, and faster inference than the recent Text2midi model. Live and static demos are available at https://midi-llm-demo.vercel.app.
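
To illustrate the vocabulary-expansion idea in general terms, here is a minimal sketch in plain Python. It is not the authors' implementation: the token names (`<pitch_60>`, `<dur_1>`), the helper `expand_vocab`, and the choice to initialize new embedding rows near the mean of the existing rows are all assumptions made for illustration. The abstract does not specify Midi-LLM's MIDI tokenization or initialization scheme.

```python
import random


def expand_vocab(vocab, embeddings, new_tokens, seed=0):
    """Append new tokens to a vocabulary and add one embedding row per token.

    vocab:      dict mapping token string -> id, with ids 0..len(vocab)-1
    embeddings: list of rows (lists of floats), one row per id
    Returns new copies; the inputs are left unmodified.
    """
    rng = random.Random(seed)
    dim = len(embeddings[0])
    # Assumption: start new rows near the mean of the existing rows,
    # a common heuristic, not necessarily what Midi-LLM does.
    mean = [sum(row[d] for row in embeddings) / len(embeddings)
            for d in range(dim)]

    new_vocab = dict(vocab)
    new_embeddings = [list(row) for row in embeddings]
    for tok in new_tokens:
        if tok in new_vocab:
            continue  # skip tokens that already exist
        new_vocab[tok] = len(new_vocab)  # next free id
        new_embeddings.append([m + rng.gauss(0.0, 0.01) for m in mean])
    return new_vocab, new_embeddings


# Toy text vocabulary with 2-dimensional embeddings (hypothetical values).
text_vocab = {"<pad>": 0, "a": 1, "calm": 2, "piano": 3}
text_emb = [[0.0, 0.0], [0.1, 0.2], [0.3, -0.1], [-0.2, 0.4]]

# Hypothetical MIDI tokens: four pitches and two durations.
midi_tokens = [f"<pitch_{p}>" for p in range(60, 64)] + ["<dur_1>", "<dur_2>"]

vocab, emb = expand_vocab(text_vocab, text_emb, midi_tokens)
```

In this sketch the original text rows are copied unchanged and only new rows are appended, so the model keeps the same kinds of parameters and only the vocabulary dimension grows.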