[docs] vlm addition by stevhliu · Pull Request #45271 · huggingface/transformers

stevhliu · 2026-04-06T18:56:42Z

adds a separate vlm contribution doc for more visibility instead of being hidden in the Contribute to Transformers doc, and integration tests are covered in #45152

HuggingFaceDocBuilderDev · 2026-04-06T19:06:59Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp · 2026-04-07T12:45:37Z

docs/source/en/_toctree.yml

+    - local: new_vlm
+      title: Add a vision language model


i remember another PR of yours, re-ordering these sections. Ig vlm addition should be merged after that?

yeah once #45130 is in we can merge this next :)

zucchini-nlp · 2026-04-07T12:47:10Z

docs/source/en/add_vision_processing_components.md

+- [torchvision](https://docs.pytorch.org/vision/stable/index.html) backend is the default and supports GPU acceleration.
+- [PIL](https://pillow.readthedocs.io/en/stable/index.html) backend is a fallback when no GPU is available.
+
+Both classes share the same preprocessing logic but have different backends. Their constructor signatures and default values must be identical. [`AutoImageProcessor.from_pretrained()`] selects the backend at load time and falls back to PIL when torchvision isn't available. Mismatched signatures cause the same saved config to behave differently across environments.


tbh this is not VLM-specific so prob we can name it differently. Like adding vision components or vision-processing components

Then we can add another for audio/video components if needed

good idea, i like the flexibility!

docs/source/en/new_vlm.md

stevhliu added 2 commits April 6, 2026 11:53

new vlm

693c04d

update

6a54710

stevhliu requested a review from zucchini-nlp April 6, 2026 19:07

zucchini-nlp reviewed Apr 7, 2026

View reviewed changes

feedback

3d62bb2

stevhliu mentioned this pull request Apr 8, 2026

[docs] modular transformers #45327

Draft

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[docs] vlm addition#45271

[docs] vlm addition#45271
stevhliu wants to merge 3 commits intohuggingface:mainfrom
stevhliu:new-vlm

stevhliu commented Apr 6, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Apr 6, 2026

Uh oh!

zucchini-nlp Apr 7, 2026

Uh oh!

stevhliu Apr 7, 2026

Uh oh!

zucchini-nlp Apr 7, 2026

Uh oh!

stevhliu Apr 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

stevhliu commented Apr 6, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Apr 6, 2026

Uh oh!

zucchini-nlp Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

stevhliu Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

zucchini-nlp Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

stevhliu Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants