Conversation
pwilkin
left a comment
There doesn't seem to be any support for structured outputs.
@pwilkin What does
@pwilkin Actually, I have another idea. I could further improve the chat template to recognize formatted tool names from MCP servers (e.g., However, this would require, as an example, mapping How can I implement this kind of custom transformation using the new PEG parser?
```jinja
{#- ========== Workaround for llama.cpp crashing ========== #}
{%- for message in messages %}
{%- if message.role == "assistant" %}
{%- if message.tool_calls | length == 0 %}
{%- set fake_function = namespace(name='fake_name', arguments='{}') %}
{%- set fake_function = namespace(function=fake_function) %}
{%- set message.tool_calls = [fake_function, fake_function] %}
{%- endif %}
{%- endif %}
{%- endfor %}
{#- ========== Workaround for llama.cpp crashing ========== #}
```
Was this fixed? If not, someone should look at it.
This part adds fake functions to assistant messages, which prevents llama-server from crashing.
The crash seems to occur here: llama.cpp/common/jinja/caps.cpp, lines 228 to 252 (commit 740a447).
It assumes that the message list passed into the chat template is immutable. However, in llama.cpp’s Jinja engine, a reference is passed to the template rather than a copy, which makes it effectively mutable.
You can create a custom mapper and add another chat format. See chat-peg-parser.cpp; you can likely inherit the one there. Definitely an interesting model... Is it not possible to hardcode the server name to something like "localhost" for better compatibility?
Yes, that is possible. However, this model is fine-tuned to use specific server names such as Mapping everything to a generic server would in practice require additional reasoning tokens during inference, and the results would not be as good as when using the original format.
You don't even need to create a custom mapper, since for the analysis I made a tagged mapper that can be used out-of-the-box for this :) See the parser usages in
Basically this:

````cpp
if (has_response_format) {
    auto response_format = p.rule("response-format",
        p.content(p.schema(p.json(), "response-format-schema", inputs.json_schema)));
    return ctx.reasoning_parser + p.space() + p.choice({
        p.literal("```json") + p.space() + response_format + p.space() + p.literal("```"),
        response_format
    }) + p.end();
}
````
Oh... I know what you mean. I'll implement it later. Let me convert this PR to a draft before I fully implement it.
@pwilkin Mind taking another look? |
@pwilkin Is the current implementation good to merge now?
Yeah, almost good - please add proper tests to
@pwilkin Done |
@aldehir care to take a look? |
Aight, going to run CI and merge if green.
The MiroThinker series v1.0–v1.7 (and likely every version before v2.0) uses an MCP-style tool call:
It requires the MCP server name to be included in the system prompt, which makes it impossible for the autoparser to work with it.