Time-Shifted Token Scheduling for Symbolic Music Generation

Ting-Kang Wang, Chih-Pin Tan, Yi-Hsuan Yang

Published: 2025/9/28

Abstract

Symbolic music generation faces a fundamental trade-off between efficiency and quality. Fine-grained tokenizations achieve strong coherence but incur long sequences and high complexity, while compact tokenizations improve efficiency at the expense of intra-token dependencies. To address this, we adapt a delay-based scheduling mechanism (DP) that expands compound-like tokens across decoding steps, enabling autoregressive modeling of intra-token dependencies while preserving efficiency. Notably, DP is a lightweight strategy that introduces no additional parameters and can be seamlessly integrated into existing representations. Experiments on symbolic orchestral MIDI datasets show that our method improves all metrics over standard compound tokenizations and narrows the gap to fine-grained tokenizations.
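The delay-based scheduling idea described in the abstract can be illustrated with a minimal sketch. It assumes a compound token is a fixed-length tuple of fields (for example onset, pitch, duration) and that field k is delayed by exactly k decoding steps, so each field is predicted after the earlier fields of the same token have already been emitted. The field layout, the one-step-per-field delay, the `PAD` value, and the function names `apply_delay` and `undo_delay` are all illustrative assumptions, not details confirmed by the abstract.

```python
PAD = -1  # hypothetical padding id for positions with no field yet


def apply_delay(tokens, pad=PAD):
    """Shift field k of every compound token k steps later.

    tokens: list of T compound tokens, each a tuple of K fields.
    Returns T + K - 1 steps; at step t, slot k holds tokens[t - k][k],
    or `pad` when t - k falls outside the sequence.
    """
    if not tokens:
        return []
    num_fields = len(tokens[0])
    num_steps = len(tokens) + num_fields - 1
    delayed = []
    for t in range(num_steps):
        step = []
        for k in range(num_fields):
            src = t - k  # index of the compound token feeding slot k
            step.append(tokens[src][k] if 0 <= src < len(tokens) else pad)
        delayed.append(tuple(step))
    return delayed


def undo_delay(delayed, num_fields):
    """Invert apply_delay and recover the original compound tokens."""
    num_tokens = len(delayed) - num_fields + 1
    return [
        tuple(delayed[t + k][k] for k in range(num_fields))
        for t in range(num_tokens)
    ]


if __name__ == "__main__":
    # Two toy compound tokens with three fields each.
    notes = [(1, 2, 3), (4, 5, 6)]
    shifted = apply_delay(notes)
    for step in shifted:
        print(step)
    assert undo_delay(shifted, num_fields=3) == notes
```

Under these assumptions, a sequence of T tokens with K fields takes T + K - 1 decoding steps instead of the T × K steps a fully flattened tokenization would need. That matches the efficiency argument in the abstract, and because the transformation only rearranges the data, it adds no model parameters.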
