Kimi-K2-Thinking native tool calling format #17251

KiruyaMomochi · 2025-11-13T20:29:01Z

The implementation might support Kimi-K2-Instruct too, but I don't have enough disk space to test now :(

Almost silly copy-paste from DeepSeek V3.1 #15533, modified according to https://github.com/MoonshotAI/Kimi-K2/blob/main/docs/tool_call_guidance.md: matching function id instead of plain function name.

Considerations:

The official template does not contain any <think> tag at the end, so thinking_forced_open is false. Should we test it by modify the template manually?
Did not add template update instruction to models/templates/README.md for now, because their template has tojson(separators=(',', ':')). Although the value of separators is the same as default value, but we must remove it to make the template work for minja.
DeepSeek V3.1 might be possible to generate <｜tool▁calls▁begin｜>tool... and ignoring <｜tool▁call▁begin｜>, but I have not observed such behavior in Kimi-K2-Thinking and always get <|tool_calls_section_begin|><|tool_call_begin|>, therefore I'm removing the ? in the function regex.

llama.cpp/common/chat.cpp

Line 1751 in c4abcb2

static const common_regex function_regex("(?:<｜tool▁call▁begin｜>)?([^\\n<]+)(?:<｜tool▁sep｜>)");

Actually, I always get an extra <|tool_calls_section_end|> when keeping ?, but I have not been able to fix it, so finally removed the ?.
Have not tested lower quantized variants, maybe they could have different behavior which need to adapt the current parser?

For maintainers: I may have a busy weekend so fell free to edit directly if I'm not able to reply in time.

Closes #17155.

calvin2021y · 2025-11-14T02:47:24Z

#16932

KiruyaMomochi · 2025-11-14T04:50:40Z

Thanks! Will try #16932.

KiruyaMomochi · 2025-11-15T06:37:23Z

Finally get some to test and it worked well!

KiruyaMomochi · 2025-11-15T06:37:47Z

Close in favour of #16932.

KiruyaMomochi added 5 commits November 11, 2025 03:47

chat : Kimi-K2-Thinking tool calling support

0c3c896

fix : escape vertical bar in regex

94d85cc

fix: function call with id

7c8a694

fix: kimi-k2 tool calling grammar

56153aa

fix: kimi-k2 tool calling testing with correct tool calling format

accad29

KiruyaMomochi requested a review from ggerganov as a code owner November 13, 2025 20:29

DajanaV mentioned this pull request Nov 13, 2025

UPSTREAM PR #17251: Kimi-K2-Thinking native tool calling format auroralabs-loci/llama.cpp#202

Open

github-actions bot added the testing Everything test related label Nov 13, 2025

KiruyaMomochi closed this Nov 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Kimi-K2-Thinking native tool calling format #17251

Kimi-K2-Thinking native tool calling format #17251

KiruyaMomochi commented Nov 13, 2025 •

edited

Loading

Uh oh!

calvin2021y commented Nov 14, 2025

Uh oh!

KiruyaMomochi commented Nov 14, 2025

Uh oh!

KiruyaMomochi commented Nov 15, 2025

Uh oh!

KiruyaMomochi commented Nov 15, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Kimi-K2-Thinking native tool calling format #17251

Kimi-K2-Thinking native tool calling format #17251

Conversation

KiruyaMomochi commented Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

calvin2021y commented Nov 14, 2025

Uh oh!

KiruyaMomochi commented Nov 14, 2025

Uh oh!

KiruyaMomochi commented Nov 15, 2025

Uh oh!

KiruyaMomochi commented Nov 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

KiruyaMomochi commented Nov 13, 2025 •

edited

Loading

KiruyaMomochi commented Nov 15, 2025 •

edited

Loading