Tempolor 4.6 — Research

Flagship Model · 2026.3

Tempolor 4.6

A flagship music-generation model. Using a hierarchical codec representation system, it generates music in a structured way, from high-level semantics down to fine-grained acoustics, and outputs high-quality 48 kHz stereo audio.

48kHz Stereo

Hierarchical Codec

Precise Remix

Fine-grained Audio Editing

Overview

Tempolor 4.6 is Tempolor's current flagship music-generation model. Architecturally, it builds on previous model versions with a redeveloped bottleneck. Its generation paradigm mirrors how music is actually created: a hierarchical, progressive representation system, made up of a musicality codec, a music-semantic codec and a music-audio acoustic codec, produces high-quality 48 kHz stereo music.

This version decomposes music generation into representation-learning and generation tasks at three levels, giving coarse-to-fine structured generation: the high level handles musicality and structural organization, the middle level handles semantics and content expression, and the low level handles acoustic detail and high-fidelity reconstruction. This follows the current mainstream paradigm, a hierarchical path 'from high-level semantics to fine-grained acoustics', and balances musicality with audio quality.
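The coarse-to-fine flow described above can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: the token rates, the vocabulary size and the `expand` stand-in are hypothetical and do not describe Tempolor's actual codecs or generators. The only point is that each level is produced from the level above it, at a finer time resolution.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Level:
    name: str
    tokens_per_second: int  # hypothetical rate, not taken from the source


# Hypothetical levels, ordered coarse to fine.
LEVELS = [
    Level("musicality", 2),   # structure and overall musical plan
    Level("semantic", 10),    # content expression
    Level("acoustic", 50),    # acoustic detail for reconstruction
]


def expand(coarse: List[int], factor: int, vocab: int = 1024) -> List[int]:
    """Stand-in for a learned generator: emit `factor` finer tokens per coarse token."""
    return [(tok * 31 + i) % vocab for tok in coarse for i in range(factor)]


def generate(duration_s: int) -> Dict[str, List[int]]:
    """Produce one token sequence per level, each conditioned on the previous one."""
    first = LEVELS[0]
    tokens = [i % 7 for i in range(duration_s * first.tokens_per_second)]
    out = {first.name: tokens}
    for prev, cur in zip(LEVELS, LEVELS[1:]):
        factor = cur.tokens_per_second // prev.tokens_per_second
        tokens = expand(tokens, factor)
        out[cur.name] = tokens
    return out


plan = generate(4)  # four seconds of toy "music"
lengths = {name: len(toks) for name, toks in plan.items()}
print(lengths)
```

In a real system each `expand` call would be a trained model and the final acoustic tokens would be decoded to a waveform; the sketch only shows the ordering and the growth in sequence length from level to level.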

For controllability, the model goes beyond ordinary generation and also supports precise Remix rewriting, fine-grained audio editing and more.

Performance

Tempolor 4.6 strikes a more dependable balance among long-form structural planning, lyric-carrying capacity and audio fidelity. Thanks to coordination across the codec layers and the LLM's ability to organize material over long ranges, the model keeps thematic motifs coherent and the emotional tone unified across longer generation spans.

This version shows clearer layering and spatial separation among drums, bass, harmony and vocals. It not only builds a structurally complete framework but also comes closer to mature delivery standards in fine listening detail.

The model's emotional expression and arrangement texture are especially delicate in slow-tempo, relaxing styles, making it the strongest option in the current Tempolor series for brand theme music, commercial-grade demos and complex lyric writing.

Demo

42 Seventh Street (MV) · Nostalgia

Lo-Fi Hip-Hop (MV)

Stardust Autocomplete (MV) · Longing

Future Bass (MV)

Break the Code

Drum and Bass

[Intro]

Neon veins through the city grid

Signals racing where the lost kids hid

Pulse in my chest Matching the kick

Every heartbeat Another trick

Cables snake beneath the ground

Data streams make a hollow sound

I'm just voltage Looping round

Searching for what can't be found

[Chorus]

Break the pattern Break the code

Every circuit's gonna overload

We're electric We explode

Running fast down this digital road

Break the pattern Break the code

Feel the surge Let it all erode

[Inst]

Pixels blur at this velocity

Gravity bends Losing clarity

But I'm alive in the frequency

Riding waves of pure energy

[Chorus]

Break the pattern Break the code

Every circuit's gonna overload

We're electric We explode

Running fast down this digital road

Break the pattern Break the code

Feel the surge Let it all erode

[Bridge]

Static climbs the tower walls

I'm rewired when the system calls

No more questions No more stalls

Just the rush before it falls

[Chorus]

Break the pattern Break the code

Every circuit's gonna overload

We're electric We explode

Running fast down this digital road

Break the pattern Break the code

Feel the surge Let it all erode

光差す方へ

J-Pop

[Intro]

灰色の空に閉じ込められて

誰にも言えない想いを抱えて

迷いながら歩いてきたけど

もう一度だけ信じてみたい

[Chorus]

光差す方へ羽ばたいていける

怖くないよもう一人じゃない

心の奥で聴こえる声が

「大丈夫」って背中押してくれる

新しい朝が今始まるから

[Inst]

傷ついた日々も無駄じゃなかった

すべてが今を作る糧になる

[Chorus]

光差す方へ羽ばたいていける

怖くないよもう一人じゃない

心の奥で聴こえる声が

「大丈夫」って背中押してくれる

新しい明日へ今踏み出すから

[Bridge]

どんなに遠くても諦めないで

この手を伸ばせば届くはずだから

[Chorus]

光差す方へ羽ばたいていける

怖くないよもう一人じゃない

心の奥で聴こえる声が

「大丈夫」って背中押してくれる

未来が待ってる今輝き出す

[Outro]

Benchmark Results

Evaluated on Chinese and English test sets (30 each, 60 in total) against Mureka v9, Suno v5.5 and MiniMax V2.6, using the Meta Audiobox Aesthetics and SongEval evaluation systems.

Audio Aesthetics Assessment (Meta Audiobox Aesthetics)

[Bar chart: scores for Tempolor v4.6, Suno v5.5, Mureka v9 and MiniMax V2.6 on Content Enjoyment, Content Usefulness, Production Complexity and Production Quality; axis from 5.0 to 9.0. Exact values are in the table that follows.]

Model	Content Enjoyment (CE) ↑	Content Usefulness (CU) ↑	Production Complexity (PC) ↑	Production Quality (PQ) ↑
Tempolor v4.6	7.7251	7.9596	6.2263	8.3291
Suno v5.5	7.7156	7.9949	6.3399	8.3184
Mureka v9	7.6324	7.8275	6.5859	8.1604
MiniMax V2.6	7.6872	7.9131	6.4197	8.2175
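
To read the table at a glance, the following minimal sketch picks the highest-scoring model for each Audiobox Aesthetics dimension. The scores are copied from the table; the dictionary layout and the `leader_per_metric` helper are illustrative and not part of any evaluation toolkit.

```python
# Scores copied from the Meta Audiobox Aesthetics table (higher is better).
SCORES = {
    "Tempolor v4.6": {"CE": 7.7251, "CU": 7.9596, "PC": 6.2263, "PQ": 8.3291},
    "Suno v5.5":     {"CE": 7.7156, "CU": 7.9949, "PC": 6.3399, "PQ": 8.3184},
    "Mureka v9":     {"CE": 7.6324, "CU": 7.8275, "PC": 6.5859, "PQ": 8.1604},
    "MiniMax V2.6":  {"CE": 7.6872, "CU": 7.9131, "PC": 6.4197, "PQ": 8.2175},
}


def leader_per_metric(scores):
    """Return {metric: model with the highest score on that metric}."""
    metrics = next(iter(scores.values())).keys()
    return {m: max(scores, key=lambda model: scores[model][m]) for m in metrics}


leaders = leader_per_metric(SCORES)
for metric, model in leaders.items():
    print(f"{metric}: {model} ({SCORES[model][metric]:.4f})")
```

On these figures Tempolor v4.6 leads on Content Enjoyment and Production Quality, Suno v5.5 on Content Usefulness, and Mureka v9 on Production Complexity.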

Music Aesthetics Assessment (SongEval)

[Bar chart: scores for Tempolor v4.6, Suno v5.5, Mureka v9 and MiniMax V2.6 on Musicality, Coherence, Naturalness, Memorability and Clarity; axis from 3.5 to 5.0. Exact values are in the table that follows.]

Model	Musicality ↑	Coherence ↑	Naturalness ↑	Memorability ↑	Clarity ↑
Tempolor v4.6	4.4419	4.5639	4.3438	4.5710	4.4458
Suno v5.5	4.3616	4.4814	4.2565	4.4885	4.3634
Mureka v9	4.4763	4.5928	4.4167	4.5873	4.4523
MiniMax V2.6	4.2315	4.3668	4.1447	4.3463	4.2244
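
As a rough single-number summary of the SongEval table, the sketch below takes the unweighted mean of the five dimensions per model and ranks the models by it. The averaging is an assumption made here for convenience; the source reports only the per-dimension scores and defines no aggregate.

```python
from statistics import mean

# Per-model scores copied from the SongEval table, in the order:
# Musicality, Coherence, Naturalness, Memorability, Clarity.
SONGEVAL = {
    "Tempolor v4.6": [4.4419, 4.5639, 4.3438, 4.5710, 4.4458],
    "Suno v5.5":     [4.3616, 4.4814, 4.2565, 4.4885, 4.3634],
    "Mureka v9":     [4.4763, 4.5928, 4.4167, 4.5873, 4.4523],
    "MiniMax V2.6":  [4.2315, 4.3668, 4.1447, 4.3463, 4.2244],
}

# Unweighted mean across the five dimensions (an illustrative aggregate).
averages = {model: mean(vals) for model, vals in SONGEVAL.items()}
ranking = sorted(averages, key=averages.get, reverse=True)

for model in ranking:
    print(f"{model}: {averages[model]:.4f}")
```

By this unweighted mean Mureka v9 comes first and Tempolor v4.6 second, ahead of Suno v5.5 and MiniMax V2.6, which matches the per-dimension ordering in the table.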

Comparison of time to generate 120 s of music audio (inference on an NVIDIA L20 GPU)

[Bar chart: time in seconds to generate 120 s of music audio for Yue, Mureka, DiffRhythm, AceStep and Tempolor V3.0; axis from 0 to 70 s. Only the Tempolor V3.0 value, 2.5 s, is recoverable.]

Tempolor V3.0 Speed

Industry-leading commercial music-generation model

Tempolor V3.0 RTF (real-time factor): 0.02

Generates 2 minutes of music in 2.5 seconds
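
The RTF figure can be checked against the stated timing with a short calculation. This sketch assumes the usual definition of real-time factor, generation time divided by the duration of the audio produced; the source does not spell out its definition, and the function name is illustrative.

```python
def real_time_factor(generation_seconds: float, audio_seconds: float) -> float:
    """Real-time factor: time spent generating divided by audio duration."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return generation_seconds / audio_seconds


generation_seconds = 2.5   # reported generation time on an NVIDIA L20
audio_seconds = 120.0      # two minutes of music

rtf = real_time_factor(generation_seconds, audio_seconds)
speedup = audio_seconds / generation_seconds  # how many times faster than real time

print(f"RTF = {rtf:.4f} (about {round(rtf, 2)}), {speedup:.0f}x faster than real time")
```

Under that definition, 2.5 s for 120 s of audio gives an RTF of roughly 0.0208, which rounds to the quoted 0.02.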