Model Overview
Flagship Model · 2026.3
Tempolor 4.6
A flagship music-generation model. Its hierarchical codec representation system generates music in a structured way, from high-level semantics down to fine-grained acoustics, and outputs high-quality 48 kHz stereo audio.
48kHz Stereo
Hierarchical Codec
Precise Remix
Fine-grained Audio Editing
Overview

Tempolor 4.6 is Tempolor's current flagship music-generation model. Architecturally, it builds on previous versions with a redeveloped bottleneck. Its generation paradigm mirrors how music is actually created: a hierarchical, progressive representation system made up of a musicality codec, a music-semantic codec and an acoustic codec for music audio produces high-quality 48 kHz stereo music.

This version decomposes music generation into representation-learning and generation tasks at three levels, giving coarse-to-fine structured generation: the high level handles musicality and structural organization, the middle level handles semantics and content expression, and the low level handles acoustic detail and high-fidelity reconstruction. This hierarchical path "from high-level semantics to fine-grained acoustics" is the current mainstream paradigm, and it balances musicality with audio quality.
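The coarse-to-fine data flow described above can be pictured with a toy sketch. It is not Tempolor's implementation: the `expand` function is a deterministic stand-in for a learned generator, and the token values and expansion factors are hypothetical. It only shows that each level is conditioned on the level above it and produces a longer, finer sequence.

```python
from typing import List

Tokens = List[int]


def expand(tokens: Tokens, factor: int) -> Tokens:
    """Stand-in for a learned generator: each coarse token conditions
    `factor` finer tokens. A real model would sample these instead."""
    return [t * factor + k for t in tokens for k in range(factor)]


def coarse_to_fine(plan: Tokens, factors: List[int]) -> List[Tokens]:
    """Run the levels in order; every level sees only the output of the
    level directly above it."""
    levels = [plan]
    for factor in factors:
        levels.append(expand(levels[-1], factor))
    return levels


# Hypothetical 4-token high-level plan (for example one token per song section).
plan = [0, 1, 2, 3]
# Hypothetical expansion factors: musicality -> semantic -> acoustic.
musicality, semantic, acoustic = coarse_to_fine(plan, [4, 8])
print(len(musicality), len(semantic), len(acoustic))  # 4 16 128
```

In a real system the last level would be decoded to a 48 kHz stereo waveform; that step is omitted here because the source does not describe the decoder.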

For controllability, beyond ordinary generation, the model also supports precise Remix rewriting and fine-grained audio editing, among other features.

Performance

Tempolor 4.6 strikes a balance among long-form structural planning, lyric-carrying capacity and audio fidelity that is better suited to serve as a reference. Thanks to coordination across the codec layers and the LLM's long-range organization ability, the model keeps thematic motifs coherent and emotion unified over longer generation spans.

This version shows clearer layering and spatial separation among drums, bass, harmony and vocals. It not only builds a structurally complete framework but also comes closer to mature delivery standards in fine listening details.

The model's emotional expression and arrangement texture are especially delicate in slow-tempo, relaxing styles, making it the best option in the current Tempolor series for brand theme music, commercial-grade demos and complex lyric writing.

Demo
42 Seventh Street · MV · Nostalgia
Lo-Fi Hip-Hop · MV
Stardust Autocomplete · MV · Longing
Future Bass · MV
Break the Code
Drum and Bass
[Intro]
Neon veins through the city grid
Signals racing where the lost kids hid
Pulse in my chest Matching the kick
Every heartbeat Another trick
Cables snake beneath the ground
Data streams make a hollow sound
I'm just voltage Looping round
Searching for what can't be found
[Chorus]
Break the pattern Break the code
Every circuit's gonna overload
We're electric We explode
Running fast down this digital road
Break the pattern Break the code
Feel the surge Let it all erode
[Inst]
Pixels blur at this velocity
Gravity bends Losing clarity
But I'm alive in the frequency
Riding waves of pure energy
[Chorus]
Break the pattern Break the code
Every circuit's gonna overload
We're electric We explode
Running fast down this digital road
Break the pattern Break the code
Feel the surge Let it all erode
[Bridge]
Static climbs the tower walls
I'm rewired when the system calls
No more questions No more stalls
Just the rush before it falls
[Chorus]
Break the pattern Break the code
Every circuit's gonna overload
We're electric We explode
Running fast down this digital road
Break the pattern Break the code
Feel the surge Let it all erode
光差す方へ
J-Pop
[Intro]
灰色の空に 閉じ込められて
誰にも言えない 想いを抱えて
迷いながら 歩いてきたけど
もう一度だけ 信じてみたい
[Chorus]
光差す方へ 羽ばたいていける
怖くないよ もう一人じゃない
心の奥で 聴こえる声が
「大丈夫」って 背中押してくれる
新しい朝が 今 始まるから
[Inst]
傷ついた日々も 無駄じゃなかった
すべてが今を 作る糧になる
[Chorus]
光差す方へ 羽ばたいていける
怖くないよ もう一人じゃない
心の奥で 聴こえる声が
「大丈夫」って 背中押してくれる
新しい明日へ 今 踏み出すから
[Bridge]
どんなに遠くても 諦めないで
この手を伸ばせば 届くはずだから
[Chorus]
光差す方へ 羽ばたいていける
怖くないよ もう一人じゃない
心の奥で 聴こえる声が
「大丈夫」って 背中押してくれる
未来が待ってる 今 輝き出す
[Outro]
Benchmark Results

Evaluated on Chinese and English test sets (30 each, 60 total) against Mureka v9, Suno v5.5 and MiniMax V2.6, using the Meta Audiobox Aesthetics and SongEval evaluation systems.

Audio Aesthetics Assessment (Meta Audiobox Aesthetics)

CE = Content Enjoyment, CU = Content Usefulness, PC = Production Complexity, PQ = Production Quality

Model           CE      CU      PC      PQ
Tempolor v4.6   7.7251  7.9596  6.2263  8.3291
Suno v5.5       7.7156  7.9949  6.3399  8.3184
Mureka v9       7.6324  7.8275  6.5859  8.1604
MiniMax V2.6    7.6872  7.9131  6.4197  8.2175
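
To make the Audiobox Aesthetics comparison easier to read, the following sketch ranks the four models on each axis using the scores reported above. It assumes a higher score ranks first on every axis, which is a simplification (for PC in particular, "more complex" is not necessarily "better"); the function name `rank_by_axis` is illustrative.

```python
# Scores transcribed from the Meta Audiobox Aesthetics results above.
AUDIOBOX = {
    "Tempolor v4.6": {"CE": 7.7251, "CU": 7.9596, "PC": 6.2263, "PQ": 8.3291},
    "Suno v5.5":     {"CE": 7.7156, "CU": 7.9949, "PC": 6.3399, "PQ": 8.3184},
    "Mureka v9":     {"CE": 7.6324, "CU": 7.8275, "PC": 6.5859, "PQ": 8.1604},
    "MiniMax V2.6":  {"CE": 7.6872, "CU": 7.9131, "PC": 6.4197, "PQ": 8.2175},
}


def rank_by_axis(scores):
    """Return {axis: [model names sorted from highest to lowest score]}."""
    axes = next(iter(scores.values())).keys()
    return {
        axis: sorted(scores, key=lambda model: scores[model][axis], reverse=True)
        for axis in axes
    }


rankings = rank_by_axis(AUDIOBOX)
for axis, ordered in rankings.items():
    print(f"{axis}: " + " > ".join(ordered))
```

On these numbers Tempolor v4.6 has the highest CE and PQ, Suno v5.5 the highest CU, and Mureka v9 the highest PC, with most gaps in the second decimal place.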

Music Aesthetics Assessment (SongEval)

Model           Musicality  Coherence  Naturalness  Memorability  Clarity
Tempolor v4.6   4.4419      4.5639     4.3438       4.5710        4.4458
Suno v5.5       4.3616      4.4814     4.2565       4.4885        4.3634
Mureka v9       4.4763      4.5928     4.4167       4.5873        4.4523
MiniMax V2.6    4.2315      4.3668     4.1447       4.3463        4.2244
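
For SongEval, a useful reading is how far a given model sits from the best score on each dimension. The sketch below computes that gap from the reported figures; the helper name `gap_to_leader` is illustrative, and the dimension order follows the results as listed above.

```python
DIMENSIONS = ["Musicality", "Coherence", "Naturalness", "Memorability", "Clarity"]

# Scores transcribed from the SongEval results above, in DIMENSIONS order.
SONGEVAL = {
    "Tempolor v4.6": [4.4419, 4.5639, 4.3438, 4.5710, 4.4458],
    "Suno v5.5":     [4.3616, 4.4814, 4.2565, 4.4885, 4.3634],
    "Mureka v9":     [4.4763, 4.5928, 4.4167, 4.5873, 4.4523],
    "MiniMax V2.6":  [4.2315, 4.3668, 4.1447, 4.3463, 4.2244],
}


def gap_to_leader(scores, model):
    """For each dimension, return (leading model, leader score - model score)."""
    gaps = {}
    for i, dim in enumerate(DIMENSIONS):
        leader = max(scores, key=lambda m: scores[m][i])
        gaps[dim] = (leader, round(scores[leader][i] - scores[model][i], 4))
    return gaps


gaps = gap_to_leader(SONGEVAL, "Tempolor v4.6")
for dim, (leader, gap) in gaps.items():
    print(f"{dim:<13} leader: {leader:<14} gap: {gap:.4f}")
```

On these numbers Mureka v9 leads every dimension and Tempolor v4.6 is second on each, with the largest gap on Naturalness and the smallest on Clarity.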

Time to generate 120 s of music audio (inference on an NVIDIA L20 GPU)

Model           Time (seconds)
Yue             60
Mureka          36
DiffRhythm      20
AceStep         10
Tempolor V3.0   2.5
Tempolor V3.0 Speed
Tempolor V3.0, an industry-leading commercial music-generation model, reaches a real-time factor (RTF) of 0.02: it generates 2 minutes of music in 2.5 seconds.
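
The RTF figure follows from the timings listed above. Here RTF is the generation time divided by the duration of the generated audio, so lower is faster. The sketch below applies that definition to each system in the comparison; the variable and function names are illustrative.

```python
AUDIO_SECONDS = 120.0  # clip length used in the comparison

# Generation times in seconds, as listed in the comparison above.
GENERATION_SECONDS = {
    "Yue": 60.0,
    "Mureka": 36.0,
    "DiffRhythm": 20.0,
    "AceStep": 10.0,
    "Tempolor V3.0": 2.5,
}


def real_time_factor(generation_seconds: float, audio_seconds: float) -> float:
    """RTF = time spent generating / duration of the generated audio."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return generation_seconds / audio_seconds


rtf = {
    name: real_time_factor(seconds, AUDIO_SECONDS)
    for name, seconds in GENERATION_SECONDS.items()
}
for name, value in rtf.items():
    print(f"{name:<14} RTF {value:.3f}  ({1 / value:.0f}x faster than real time)")
```

For Tempolor V3.0 this gives 2.5 / 120 ≈ 0.021, which rounds to the quoted 0.02.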