Add FuseQATConvBN to fuse_ops (#19442)
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19442
Note: Links to docs will display an error until the docs builds have been completed.
❗ 1 Active SEV: there is 1 currently active SEV. If your PR is affected, please view it below.
❌ 1 New Failure, 5 Unrelated Failures as of commit ee9d4df with merge base fe98297.
NEW FAILURE: the following job has failed.
BROKEN TRUNK: the following jobs failed but were present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@ethansfng has exported this pull request. If you are a Meta employee, you can view the originating Diff in D104497938.
Summary:
Adds a FuseQATConvBN pass that folds the QAT Conv-BN simulation chain (`conv → q → dq → div(scale) → add(orig_bias) → batch_norm`) inserted by `prepare_qat_pt2e` into the conv's quantized bias and removes the chain.
The pass runs in two steps inside a single `call()`:
1. Bias prep — for each conv, create a zero-filled quantized bias if one is missing, or quantize an existing float bias as per-tensor int32. This is required so that step 2 has a quantized bias slot to write the BN correction into.
2. Fold — for each matched chain, compute the BN correction
`C = (orig_bias - running_mean) * bn_weight / sqrt(running_var + eps) + bn_bias`
and absorb it into the conv's quantized bias in place, then erase the chain and the batch_norm node.
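The two steps above can be sketched in plain NumPy. This is an illustrative stand-in for the actual FX graph rewrite, not the pass itself: all helper names are hypothetical, and the bias quantization convention shown (`bias_scale = input_scale * weight_scale`, zero point 0) is the usual one for int32 conv biases.

```python
import numpy as np

def quantize_bias_per_tensor(bias_fp32, input_scale, weight_scale):
    """Step 1 (sketch): quantize a float bias as per-tensor int32.
    Conventionally bias_scale = input_scale * weight_scale, zero_point = 0."""
    bias_scale = input_scale * weight_scale
    return np.round(bias_fp32 / bias_scale).astype(np.int32), bias_scale

def bn_correction(orig_bias, running_mean, running_var, bn_weight, bn_bias, eps=1e-5):
    """Step 2 (sketch): the BN correction folded into the conv bias:
    C = (orig_bias - running_mean) * bn_weight / sqrt(running_var + eps) + bn_bias
    """
    return (orig_bias - running_mean) * bn_weight / np.sqrt(running_var + eps) + bn_bias

# Identity BN statistics: the correction reduces to (approximately) the
# original bias, so the fold is a near no-op.
orig_bias = np.array([0.5, -0.25], dtype=np.float32)
c = bn_correction(
    orig_bias,
    running_mean=np.zeros(2, dtype=np.float32),
    running_var=np.ones(2, dtype=np.float32),
    bn_weight=np.ones(2, dtype=np.float32),
    bn_bias=np.zeros(2, dtype=np.float32),
)

# The corrected bias is then written into the conv's quantized bias in place.
bias_q, bias_scale = quantize_bias_per_tensor(c, input_scale=0.02, weight_scale=0.05)
print(bias_q)  # int32 values of C / (input_scale * weight_scale)
```

After this rewrite the `q → dq → div → add → batch_norm` chain is dead and can be erased, leaving only the conv with its corrected quantized bias.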
Differential Revision: D104497938
Reviewed By: DrJessop