Avoid image deepcopy in prepare_multimodal_messages by albertvillanova · Pull Request #5475 · huggingface/trl

albertvillanova · 2026-04-08T06:35:41Z

Avoid image deepcopy in prepare_multimodal_messages.

This PR refactors the prepare_multimodal_messages function by replacing the deepcopy of original input messages with an incremental build of the output list of message dictionaries with transformed content.

Follow-up to:

Simplify _get_tool_suffix_ids #5440

Motivation

prepare_multimodal_messages used copy.deepcopy(messages) to avoid mutating the caller's input. This becomes a problem now that messages can contain PIL images (e.g. in "tool" role turns: prepare_multimodal_messages(tool_messages)): deepcopying an image is expensive and can fail for certain image types.

Solution

The fix replaces the deepcopy with an incremental build of the output list. A new message dict ({**message, "content": [...]}) is only created when a string "content" is transformed into a structured list; messages whose content is already a list are passed through as-is. Because the image-filling step only writes into newly-created placeholder dicts, the original messages are never mutated.

Changes

Refactoring for input immutability and safer message transformation:

The function no longer deep-copies the input messages; instead, it builds a new list by creating new message dictionaries only when transformations are needed, ensuring the original input is left unchanged.
All further processing, including counting image placeholders and inserting images, now operates on new messages rather than the original input.
The function's return type description is updated to clarify that a new list of messages is returned, not a deep copy.

Note

Medium Risk
Behavior around immutability changes: messages whose content is already structured may now be returned by reference, so downstream mutation of the returned objects could affect the caller’s originals.

Overview
Refactors prepare_multimodal_messages to avoid copy.deepcopy(messages) (which can be expensive/fail when messages contain PIL images), and instead incrementally builds new_messages while transforming only string content into structured blocks.

Updates placeholder counting and image injection to operate on new_messages, and changes image filling to create new content-part dicts (leaving original parts untouched). Docstring return description is updated from deep-copied to new list.

^{Reviewed by Cursor Bugbot for commit 7107104. Bugbot is set up for automated code reviews on this repo. Configure here.}

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit f57630f. Configure here.}

trl/data_utils.py

HuggingFaceDocBuilderDev · 2026-04-08T06:39:53Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec

it feels correct, let's see what @codex review says

qgallouedec · 2026-04-08T14:06:26Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 71071043a3

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-04-08T14:09:52Z

trl/data_utils.py

+            else:
+                new_content.append(part)


Deep-copy non-image blocks when rebuilding content

This branch reuses the original block dict object for every non-image part, so the returned structure aliases nested objects from the caller’s input when messages are already in structured format. Any downstream in-place edit of prepare_multimodal_messages(...) output (for example, adding keys to text/tool blocks) will mutate the original messages, which is a regression from the previous deep-copy behavior and breaks practical immutability expectations for this helper.

Useful? React with 👍 / 👎.

qgallouedec

thanks

albertvillanova added 2 commits April 8, 2026 08:27

Avoid deepcopy in prepare_multimodal_messages

f237d56

Update docstring

f57630f

cursor bot reviewed Apr 8, 2026

View reviewed changes

trl/data_utils.py Show resolved Hide resolved

Avoid mutating in-place messages with image placeholders

7107104

qgallouedec reviewed Apr 8, 2026

View reviewed changes

chatgpt-codex-connector bot reviewed Apr 8, 2026

View reviewed changes

qgallouedec approved these changes Apr 9, 2026

View reviewed changes

albertvillanova merged commit 1e667d8 into huggingface:main Apr 9, 2026
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid image deepcopy in prepare_multimodal_messages#5475

Avoid image deepcopy in prepare_multimodal_messages#5475
albertvillanova merged 3 commits intohuggingface:mainfrom
albertvillanova:avoid-image-deepcopy-prepare_multimodal_messages

albertvillanova commented Apr 8, 2026 •

edited by cursor bot

Loading

Uh oh!

cursor bot left a comment

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Apr 8, 2026

Uh oh!

qgallouedec left a comment •

edited

Loading

Uh oh!

qgallouedec commented Apr 8, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Apr 8, 2026

Uh oh!

qgallouedec left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

albertvillanova commented Apr 8, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Solution

Changes

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Apr 8, 2026

Uh oh!

qgallouedec left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

qgallouedec commented Apr 8, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

qgallouedec left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

albertvillanova commented Apr 8, 2026 •

edited by cursor bot

Loading

qgallouedec left a comment •

edited

Loading