…sages + async grpo
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

cc @Rocketknight1 for the schema
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
Cursor Bugbot has reviewed your changes and found 1 potential issue.
Reviewed by Cursor Bugbot for commit 3fc2ca8.
````
```
"""
if tokenizer.chat_template == glm4moe_chat_template:
    tokenizer.response_schema = glm4moe_schema
````
Missing return statement in GLM-4-MoE branch
High Severity
The GLM-4-MoE branch in `add_response_schema` sets the `response_schema` but is missing a `return tokenizer` statement. Execution falls through to the final `ValueError`, causing `add_response_schema` to always fail for GLM-4-MoE tokenizers. This breaks tool-calling support, for example during `GRPOTrainer` initialization.
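A minimal sketch of the reported fall-through and its fix, using hypothetical stand-in values for `glm4moe_chat_template` and `glm4moe_schema` (TRL's real objects differ):

```python
# Stand-in values for illustration only; not TRL's actual template/schema.
glm4moe_chat_template = "<glm4moe jinja template>"
glm4moe_schema = {"format": "glm4moe"}

def add_response_schema(tokenizer):
    """Attach the response schema that matches the tokenizer's chat template."""
    if tokenizer.chat_template == glm4moe_chat_template:
        tokenizer.response_schema = glm4moe_schema
        return tokenizer  # the missing statement: without it, control falls through
    raise ValueError("Chat template does not match any known response schema")

class DummyTokenizer:
    chat_template = glm4moe_chat_template
    response_schema = None

tokenizer = add_response_schema(DummyTokenizer())
```

Without the `return tokenizer`, the GLM-4-MoE branch would mutate the tokenizer and then still raise the `ValueError` on the next line.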


- GLM-4-MoE tool-call schema (`<tool_call>name\n<arg_key>...<arg_value>...` format) (done by Claude)
- `glm4moe.jinja` template for identity matching in `add_response_schema`
- `TestAddResponseSchema` and `TestParseResponse` test parametrizations

Part of #5460
Warning
Requires/contains #5459
Note
Medium Risk
Adds a new response-parsing regex/schema and chat-template identity match for GLM-4-MoE; incorrect regex or template mismatches could cause tool-call parsing failures or mis-parsed assistant content.
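To make the risk concrete, here is a hedged sketch of a parser for the `<tool_call>name\n<arg_key>...<arg_value>...` shape described above; the exact tag layout of the real GLM-4-MoE template is an assumption, and the PR's actual regex/schema may differ:

```python
import re

# Assumed tag layout: <tool_call>NAME\n<arg_key>K</arg_key><arg_value>V</arg_value>...</tool_call>
TOOL_CALL_RE = re.compile(r"<tool_call>(\w+)\n(.*?)</tool_call>", re.DOTALL)
ARG_PAIR_RE = re.compile(
    r"<arg_key>(.*?)</arg_key>\s*<arg_value>(.*?)</arg_value>", re.DOTALL
)

def parse_tool_calls(text):
    """Extract {"name": ..., "arguments": {...}} dicts from assistant output."""
    calls = []
    for name, body in TOOL_CALL_RE.findall(text):
        calls.append({"name": name, "arguments": dict(ARG_PAIR_RE.findall(body))})
    return calls

completion = (
    "<tool_call>get_weather\n"
    "<arg_key>city</arg_key><arg_value>Paris</arg_value>\n"
    "</tool_call>"
)
calls = parse_tool_calls(completion)
```

An overly greedy or misanchored regex here is exactly the failure mode the risk note warns about: adjacent tool calls or assistant prose would be swallowed into one match.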
Overview
Adds GLM-4-MoE tool-calling support by introducing a new `glm4moe.jinja` identity template and wiring `add_response_schema()` to attach a GLM-4-MoE-specific `response_schema` that parses `<tool_call>...<arg_key>...<arg_value>...`-formatted calls.

Extends the chat template utilities test suite to run `add_response_schema`/`parse_response` parametrizations against a GLM-4-MoE tiny model, and updates the GRPO agent-training docs to list GLM-4-MoE as a tested/supported model.