Add GPT-OSS tool calling support by qgallouedec · Pull Request #5464 · huggingface/trl

qgallouedec · 2026-04-06T21:57:11Z

Add response schema for GPT-OSS (done by Claude)
Add gptoss.jinja template for identity matching in add_response_schema
Add GPT-OSS to TestAddResponseSchema and TestParseResponse test parametrizations
Add GPT-OSS to supported models in agent training docs

Part of #5460

Warning

Requires/contains #5459

Note

Medium Risk
Introduces a new GPT-OSS response parsing schema and updates GRPO tool-suffix tokenization to depend on real tool names, which could affect tool-call loop formatting and parsing across models if templates behave differently.

Overview
Adds GPT-OSS tool-calling support by introducing a new gptoss.jinja identity template and a corresponding gptoss_schema, and wiring both into add_response_schema so parse_response can extract content and single tool calls from GPT-OSS outputs.

Updates GRPO tool-call suffix extraction in both GRPOTrainer and AsyncRolloutWorker to build the dummy conversation using the actual tool name (not "dummy"), matching templates (like GPT-OSS) that derive tool-response headers from the preceding tool call.

Extends response-schema/parsing tests to include a tiny-GptOssForCausalLM tokenizer with model-specific skips/expectations, and documents GPT-OSS as a supported agent-training model in grpo_trainer.md.

^{Reviewed by Cursor Bugbot for commit 392dece. Bugbot is set up for automated code reviews on this repo. Configure here.}

…sages + async grpo

HuggingFaceDocBuilderDev · 2026-04-06T22:01:05Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

trl/trainer/grpo_trainer.py

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b18e39efd6

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

trl/trainer/grpo_trainer.py

…compatibility

qgallouedec · 2026-04-06T22:36:22Z

cc @Rocketknight1 for the schema

trl/experimental/async_grpo/async_rollout_worker.py

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

There are 3 total unresolved issues (including 1 from previous review).

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 450b9ef. Configure here.}

cursor · 2026-04-09T00:57:57Z

trl/chat_template_utils.py

    """
+    if tokenizer.chat_template == gptoss_chat_template:
+        tokenizer.response_schema = gptoss_schema
+        return tokenizer


Prefix-preserving check diverges from suffix extraction construction

Low Severity

is_chat_template_prefix_preserving still uses a hardcoded "dummy" tool name, while _get_tool_suffix_ids was changed to use the real tool name via tool_messages[0]["name"]. The comment on line 350 explicitly states "Use the same dummy messages as _get_tool_suffix_ids", but the constructions now differ. For GPT-OSS, the tool name is embedded in the rendered text (e.g. to=functions.NAME), so the validation function is no longer testing the exact property that _get_tool_suffix_ids relies on.

Additional Locations (2)

trl/trainer/grpo_trainer.py#L1418-L1419

trl/experimental/async_grpo/async_rollout_worker.py#L559-L560

^{Reviewed by Cursor Bugbot for commit 450b9ef. Configure here.}

trl/trainer/grpo_trainer.py

tests/test_chat_template_utils.py

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

qgallouedec and others added 15 commits April 5, 2026 18:44

Narrow prefix-preserving check to the actual requirement

4b3aa51

Merge branch 'main' into narrow-prefix-preserving-check

0894910

Update chat template examples to use multiplication function calls

730070b

style

4622d77

Move chat templates from inline strings to .jinja files

08d4c51

tools in dummy

276559d

Add chat template files to MANIFEST.in

673c35d

Enhance chat template handling to include tool call formatting in mes…

604c476

…sages + async grpo

align grpo and async

83a7ef6

Merge branch 'main' into chat-templates-files

0f28384

revert no content

e5d7cdf

docstyle ignore

a618809

Merge branch 'main' into chat-templates-files

a0b81b1

Merge branch 'main' into chat-templates-files

67ab0af

Add GPT-OSS tool calling support

b18e39e

qgallouedec changed the title ~~Gpt oss tool calling~~ Add GPT-OSS tool calling support Apr 6, 2026

fix gpt oss

71ce5a0

cursor bot reviewed Apr 6, 2026

View reviewed changes

trl/trainer/grpo_trainer.py Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Apr 6, 2026

View reviewed changes

trl/trainer/grpo_trainer.py Outdated Show resolved Hide resolved

Update tool suffix ID retrieval to use actual tool names for GPT-OSS …

8f1ad1e

…compatibility

qgallouedec mentioned this pull request Apr 6, 2026

Tracking: tool calling support across chat templates #5460

Open

qgallouedec added 2 commits April 6, 2026 22:26

style

9b9771d

align async

b3f4481

qgallouedec requested review from AmineDiro, albertvillanova and kashif April 6, 2026 22:36

qgallouedec added 2 commits April 7, 2026 08:25

Merge branch 'main' into gpt-oss-tool-calling

76a0f66

Merge branch 'main' into gpt-oss-tool-calling

0890038

qgallouedec and others added 3 commits April 7, 2026 14:49

style

3253602

Merge branch 'main' into gpt-oss-tool-calling

b95dbec

Merge branch 'main' into gpt-oss-tool-calling

ec81a1e

albertvillanova approved these changes Apr 8, 2026

View reviewed changes

cursor bot reviewed Apr 8, 2026

View reviewed changes

trl/experimental/async_grpo/async_rollout_worker.py Show resolved Hide resolved

Merge branch 'main' into gpt-oss-tool-calling

450b9ef

cursor bot reviewed Apr 9, 2026

View reviewed changes

qgallouedec commented Apr 9, 2026

View reviewed changes

tests/test_chat_template_utils.py Outdated Show resolved Hide resolved

Apply suggestions from code review

392dece

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

qgallouedec merged commit 720c1f2 into main Apr 9, 2026
16 checks passed

qgallouedec deleted the gpt-oss-tool-calling branch April 9, 2026 01:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add GPT-OSS tool calling support#5464

Add GPT-OSS tool calling support#5464
qgallouedec merged 26 commits intomainfrom
gpt-oss-tool-calling

qgallouedec commented Apr 6, 2026 •

edited by cursor bot

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Apr 6, 2026

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

qgallouedec commented Apr 6, 2026

Uh oh!

Uh oh!

cursor bot left a comment

Uh oh!

cursor bot Apr 9, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

qgallouedec commented Apr 6, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Apr 6, 2026

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

qgallouedec commented Apr 6, 2026

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor bot Apr 9, 2026

Choose a reason for hiding this comment

Prefix-preserving check diverges from suffix extraction construction

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

qgallouedec commented Apr 6, 2026 •

edited by cursor bot

Loading