model: add hunyuan dense #14878
base: master
Conversation
Signed-off-by: stevenkuang <stevenkuang@tencent.com>

This reverts commit aa973ca.
Signed-off-by: stevenkuang <stevenkuang@tencent.com>
@@ -684,6 +684,9 @@ def get_vocab_base_pre(self, tokenizer) -> str:
         if chkhsh == "7e57df22b1fe23a7b1e1c7f3dc4e3f96d43a4eb0836d0c6bdc3436d7b2f1c664":
             # ref: https://huggingface.co/tencent/Hunyuan-A13B-Instruct
             res = "hunyuan"
+        if chkhsh == "bba3b3366b646dbdded5dbc42d59598b849371afc42f7beafa914afaa5b70aa6":
+            # ref: https://huggingface.co/tencent/Hunyuan-4B
+            res = "hunyuan"
Just checking; is it using the same pre-tokenizer regex as the MoE?
Hunyuan has two vocabulary types, regardless of whether the model is MoE or dense.
And are they all using the same regex, i.e. this one?
Lines 355 to 360 in 11dd5a4
case LLAMA_VOCAB_PRE_TYPE_HUNYUAN:
    regex_exprs = {
        // original regex from tokenizer.json
        // "(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\\r\\n\\p{L}\\p{N}]?\\p{L}+|\\p{N}| ?[^\\s\\p{L}\\p{N}]+[\\r\\n]*|\\s*[\\r\\n]+|\\s+(?!\\S)|\\s+"
        "(?:'[sS]|'[tT]|'[rR][eE]|'[vV][eE]|'[mM]|'[lL][lL]|'[dD])|[^\\r\\n\\p{L}\\p{N}]?\\p{L}+|\\p{N}| ?[^\\s\\p{L}\\p{N}]+[\\r\\n]*|\\s*[\\r\\n]+|\\s+(?!\\S)|\\s+",
    };
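The expansion on the second line exists because llama.cpp's regex engine doesn't support the inline case-insensitive group `(?i:...)` from tokenizer.json. A quick way to sanity-check that the two forms split text identically, sketched in Python with the third-party `regex` package (which, unlike `re`, supports `\p{L}`/`\p{N}`):

```python
import regex  # third-party "regex" package; the stdlib `re` lacks \p{L}/\p{N}

original = (r"(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}"
            r"| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]+|\s+(?!\S)|\s+")
expanded = (r"(?:'[sS]|'[tT]|'[rR][eE]|'[vV][eE]|'[mM]|'[lL][lL]|'[dD])"
            r"|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}| ?[^\s\p{L}\p{N}]+[\r\n]*"
            r"|\s*[\r\n]+|\s+(?!\S)|\s+")

# Mixed-case contractions, punctuation, whitespace runs, CJK, and digits.
sample = "He'S fine, isn't IT?  \n\n混元 tokenizer 123"
assert regex.findall(original, sample) == regex.findall(expanded, sample)
print(regex.findall(expanded, sample))
```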
Signed-off-by: stevenkuang <stevenkuang@tencent.com>
def set_vocab(self):
    if (self.dir_model / "tokenizer.json").is_file():
        self._set_vocab_gpt2()
        self.gguf_writer.add_add_bos_token(True)
It shouldn't be necessary to set this manually; a correctly configured model has this set in tokenizer_config.json, and it will be picked up from there by gguf.SpecialVocab (called from _set_vocab_gpt2).
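A sketch of the mechanism being referred to (file names and the arch string below are hypothetical): if the model ships `"add_bos_token": true` in its tokenizer_config.json, `gguf.SpecialVocab` reads it and writes the corresponding GGUF key, so the manual `add_add_bos_token(True)` call is redundant:

```python
from pathlib import Path
import gguf

model_dir = Path("models/Hunyuan-4B")  # hypothetical local model directory
writer = gguf.GGUFWriter("hunyuan-4b.gguf", arch="hunyuan")  # arch name illustrative

# Reads tokenizer_config.json (and related files) from model_dir and forwards
# special-token IDs plus flags such as add_bos_token to the writer.
special_vocab = gguf.SpecialVocab(model_dir, load_merges=True)
special_vocab.add_to_gguf(writer)  # emits tokenizer.ggml.add_bos_token, etc.
```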
@stevenkuang-tencent gentle ping
Update: