HeartMuLa audio model

Name: HeartMuLa
Also Known As: HeartMuLa-oss-3B
Licence: Apache License 2.0
Creator: HeartMuLa Team

HeartMuLa is a model published in january 2026 that can make full songs with multilingual support including but not limited to English, Chinese, Japanese, Korean and Spanish.

Initially released under non-commercial research and educational use only licence, it then became available under the permissive Apache 2.0. Still,be aware that since HeartMuLa’s repo includes code taken from ConversationTTS, which uses a CC BY-NC 4.0 license, it can’t fully relicense those sections under Apache 2.0 unless the original authors agree.

You can give it style tips, lyrics, or reference audio and it'll generate music.

You can change lots of stuff like the music style for different parts of a song, or ask it to make short tracks for videos.

The paper says HeartMuLa can make clear, long-form music with control over stuff like lyrics and style.

It's built in layers and combines several tools to take on big music tasks.

When they scale it up to larger sizes like 7B parameters it gives results close to top commercial models.

Key Features

No performance evaluations available for this model yet.

No sample outputs available for this model yet.

Where To Find HeartMuLa

If you'd like to access this model, you can explore the following possibilities:

Weights GitHub Apache License 2.0

Hugging Face

Other Models by HeartMuLa Team

HeartTranscriptor

Useful Links

ComfyUI custom node for HeartMuLa

Workflow

ComfyUI custom node for HeartMuLa

Added on: January 22, 2026