UniComposer: Band-Level Music Composition with Symbolic and Audio Unification

Abstract

UniComposer

A novel music generation pipeline that composes at the band level, utilizing a hierarchical multi-track music representation complemented by four cascaded diffusion models which progressively generate rhythm features, and unified features extracted from both symbolic and audio music by autoencoders.

Band-level Music Generation

Capable of allocating instruments based on musical features, their expressive potential and performance characteristics differences.

Unification of Symbolic and Audio

Architecture of joining the advantages of both format together, harnessing the richness of audio data and expressiveness of symbolic music.

Architecture

Generation Pipeline

Given input audio/symbolic input, musicology feature (e.g., time signature) and melody feature are extracted.
Four cascaded DMs to gradually generate features for monophonic (e.g., piano), polyphonic (e.g., flute) and percussion (e.g., drum).
Symbolic output are decoded, and can be converted to audio (optional).

Hierarchical Separation Intuition

Band-level music representation.

Full MIDI

57d2cdba98bb3730ae8a3ac76e96cb71.mp3

Main Melody

57d2cdba98bb3730ae8a3ac76e96cb71_melody.mp3

Reduced Mono.

57d2cdba98bb3730ae8a3ac76e96cb71_mono_reduced.mp3

Reduced Poly.

57d2cdba98bb3730ae8a3ac76e96cb71_poly_reduced.mp3

Reduced Perc.

57d2cdba98bb3730ae8a3ac76e96cb71_drum.mp3

Detailed Mono.

57d2cdba98bb3730ae8a3ac76e96cb71_mono.mp3

Detailed Poly.

57d2cdba98bb3730ae8a3ac76e96cb71_poly.mp3

Detailed Perc.

57d2cdba98bb3730ae8a3ac76e96cb71_drum.mp3

Showcases

Multi-Instrument Music Generation

From SINGLE-TRACK input to BAND-LEVEL output.

Input Melody

1_1.mp3

Input Melody

2_1.mp3

Input Melody

3_1.mp3

Output Band-level Music

1.mp3

Output Band-level Music

2.mp3

Output Band-level Music

3.mp3

Unification of Audio and Symbolic Music

Dealing with MP3, WAV and MIDI music in SINGLE framework.

Input: Violin Melody (AUDIO)

violin_piece.mp3

Input: Piano Melody (AUDIO)

flower_dance.mp3

Input: Human Cappella (AUDIO)

en_songs.mp3

Output: Band-level Music

violin_piece_acc_guitar_trumpet.mp3

Output: Band-level Music

flower_dance_acc_guitar_violin.mp3

Output: Band Accompaniment

en_songs_acc_guitar_violin.mp3

2. Ability of translating MP3, WAV into MIDI.

Input: Single-track AUDIO

flower_dance.mp3

Input: Multi-track AUDIO

eg2.mp3

Input: Special Instrument AUDIO

violin_piece.mp3

Converted MIDI

flower_dance_byDM1.mp3

Converted MIDI

eg2_trans.mp3

Converted MIDI

violin_piece_byDM1.mp3

Google Sites

Report abuse