Allow passing tensor arguments in reader constructors by rostan-t · Pull Request #6252 · NVIDIA/DALI

rostan-t · 2026-03-11T10:59:56Z

Category:

New feature (non-breaking change which adds functionality)

Description:

Currently, it is necessary to invoke readers in order to pass tensor arguments. The recommended way to use readers is with next_epoch and the __call__ API is not even documented.

This PR allows constructing readers with tensor arguments.

Additional information:

Affected modules and functionalities:

Dynamic mode.

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Implements new requirements
Affects existing requirements
N/A

REQ IDs: N/A

JIRA TASK: DALI-4600

review-notebook-app · 2026-03-11T11:00:02Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

greptile-apps · 2026-03-11T11:02:41Z

Greptile Summary

This PR extends DALI's dynamic mode reader API to allow Tensor (and tensor-like) objects to be passed directly in reader constructors, removing the requirement to always call __call__ just to supply tensor arguments. Previously tensor args were silently excluded from all reader constructor signatures; now they are accepted, validated, stored in _raw_tensor_args, and forwarded to the backend on each _run/_batches/_samples invocation via the new _process_tensor_args helper.

Key changes:

build_constructor (_op_builder.py): for _is_reader classes, tensor-arg values are split at construction time — scalar values are passed through normally to the backend constructor, while actual Tensor/array objects are stripped from kwargs, converted with the correct dtype, and stored in _raw_tensor_args.
Reader (_ops.py): four new instance attributes (_raw_tensor_args, _tensor_args, _previous_batch_size, _tensor_arg_names) and _process_tensor_args which lazily broadcasts the stored sample tensors to a full Batch when batch-mode is used, caching the result as long as batch_size is unchanged.
build_call_function (_op_builder.py): guards against re-supplying in __call__ any arg that was already provided in the constructor, then injects _raw_tensor_args into raw_kwargs before _process_params.
get_metadata signature updated to take batch_size so the backend can be initialised correctly when tensor args are present.
One minor dead-code assignment (tensor_args = None in the else branch of _batches) is set but immediately overwritten in the loop body on every iteration.

Confidence Score: 5/5

Safe to merge — all previously identified correctness issues have been resolved; only a minor dead-code assignment remains.

All P0/P1 concerns from prior review rounds have been addressed. The remaining finding is a P2 dead-code assignment (tensor_args = None) that has no runtime impact. The core feature logic (constructor tensor-arg extraction, _process_tensor_args caching, __call__ injection, Batch validation in the constructor) is correct and well-tested.

No files require special attention; _op_builder.py and _ops.py carry the bulk of the new logic and were reviewed most carefully.

Important Files Changed

Filename	Overview
dali/python/nvidia/dali/experimental/dynamic/_invocation.py	Fixes a stale import path (`.ops` → `._ops`) in the TYPE_CHECKING block; no runtime logic changed.
dali/python/nvidia/dali/experimental/dynamic/_op_builder.py	Adds tensor-arg handling to the generated constructor: separates scalar and Tensor constructor args, populates `_tensor_arg_names`/`_raw_tensor_args`, and injects stored Tensor args in `__call__`. Logic is correct; one dead `tensor_args = None` assignment in the `else` branch of `_batches` (set then immediately overwritten in the loop) reduces readability.
dali/python/nvidia/dali/experimental/dynamic/_ops.py	Adds `_process_tensor_args` to `Reader`, threads tensor args through `_samples`/`_batches`/`get_metadata`, and initialises `_raw_tensor_args`/`_tensor_arg_names` fields. Caching in `_process_tensor_args` prevents redundant work. The `tensor_args = None` in the else-branch of `_batches` is dead code (overwritten on every loop iteration).
dali/python/nvidia/dali/experimental/dynamic/pytorch/nodes.py	Updates `get_metadata()` call to pass `self._batch_size` (matching the new signature), and switches `_stream(output_stream)` to the named argument form `_stream(stream=output_stream)`. Clean, minimal changes.
dali/python/nvidia/dali/ops/_signatures.py	Removes `include_inputs=False`/`include_kwarg_inputs=False` from `__init__` stub generation for reader classes (defaulting both to `True`), and adds `allow_data_node_kwargs=False`/`allow_batch_kwargs=False` to restrict accepted types. Typical source readers have zero schema inputs so this is unlikely to cause problems in practice.
dali/test/python/experimental_mode/test_reader_decoder.py	Adds tests for Tensor constructor args, partial scalar/tensor mixing, and duplicate-arg detection. Also tightens the `glob` pattern for `test_reader_shards_error`.
dali/test/python/type_annotations/test_typing_dynamic.py	New test file with type-annotation correctness checks for `Tensor`/`Batch` outputs and a focused `test_numpy_reader_roi` test.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A["Reader constructor called"] --> B{is_reader?}
    B -- No --> Z["Pass all kwargs to base __init__"]
    B -- Yes --> C["Iterate tensor_arg_names"]
    C --> D{arg value type?}
    D -- "None or scalar" --> E["Keep in kwargs → static backend arg"]
    D -- "Tensor / array" --> F["Remove from kwargs\nStore in _raw_tensor_args"]
    D -- "Batch" --> G["Raise ValueError"]
    E --> H["Store _tensor_arg_names + _raw_tensor_args"]
    F --> H
    H --> I{Reader usage path}
    I -- "next_epoch/_samples/_batches" --> J["_process_tensor_args(batch_size)\n→ Batch.broadcast each stored Tensor"]
    I -- "__call__" --> K["Check overlap with _tensor_arg_names\nInject _raw_tensor_args into raw_kwargs"]
    J --> L["super()._run(ctx, **tensor_args)"]
    K --> L

_{Reviews (16): Last reviewed commit: "Perform dtype conversion of reader const..." | Re-trigger Greptile}

dali/python/nvidia/dali/experimental/dynamic/_op_builder.py

dali/python/nvidia/dali/experimental/dynamic/_invocation.py

rostan-t · 2026-03-11T11:14:30Z

!build

dali/python/nvidia/dali/experimental/dynamic/_op_builder.py

dali/python/nvidia/dali/experimental/dynamic/_ops.py

dali-automaton · 2026-03-11T11:20:30Z

CI MESSAGE: [45875629]: BUILD STARTED

dali/python/nvidia/dali/experimental/dynamic/_ops.py

dali/test/python/experimental_mode/test_reader_decoder.py

dali/python/nvidia/dali/experimental/dynamic/_ops.py

dali-automaton · 2026-03-11T19:46:06Z

CI MESSAGE: [45875629]: BUILD FAILED

dali/python/nvidia/dali/experimental/dynamic/_invocation.py

dali/python/nvidia/dali/experimental/dynamic/_op_builder.py

dali/python/nvidia/dali/experimental/dynamic/_ops.py

mzient

To change:

make a copy tensor arguments
do not promote non-tensor arguments to tensors
simplify caller_depth logic

dali/python/nvidia/dali/experimental/dynamic/_ops.py

dali/python/nvidia/dali/experimental/dynamic/_op_builder.py

rostan-t · 2026-03-16T10:09:39Z

the init stub change must be scoped to reader operators only

@greptileai This is not necessary. Readers are now the only operators publicly exposed as classes.

greptile-apps · 2026-03-16T10:10:06Z

That makes sense - if readers are the only operators exposed as classes, then the change is already naturally scoped correctly. Thanks for the clarification!

rostan-t · 2026-03-16T10:11:42Z

!build

dali-automaton · 2026-03-16T10:15:55Z

CI MESSAGE: [46240320]: BUILD STARTED

dali-automaton · 2026-03-16T20:49:04Z

CI MESSAGE: [46240320]: BUILD PASSED

dali/python/nvidia/dali/experimental/dynamic/_op_builder.py

dali/python/nvidia/dali/experimental/dynamic/_ops.py

dali-automaton · 2026-03-18T20:00:47Z

CI MESSAGE: [46459031]: BUILD STARTED

dali-automaton · 2026-03-18T22:12:43Z

CI MESSAGE: [46459031]: BUILD FAILED

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

… traces Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

…essing Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

…al tensors Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

… _process_tensor_args Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

PR NVIDIA#6262 fixes the caller depth properly. Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

dali/python/nvidia/dali/experimental/dynamic/_invocation.py

rostan-t · 2026-03-20T11:03:46Z

!build

dali-automaton · 2026-03-20T11:05:25Z

CI MESSAGE: [46597957]: BUILD STARTED

dali-automaton · 2026-03-20T14:01:05Z

CI MESSAGE: [46597957]: BUILD PASSED

rostan-t added the Dynamic Mode label Mar 11, 2026

greptile-apps bot reviewed Mar 11, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_op_builder.py Outdated Show resolved Hide resolved

dali/python/nvidia/dali/experimental/dynamic/_invocation.py Outdated Show resolved Hide resolved

rostan-t force-pushed the ndd-reader-tensor-args branch 2 times, most recently from ed9a066 to c498545 Compare March 11, 2026 11:08

greptile-apps bot reviewed Mar 11, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_op_builder.py Outdated Show resolved Hide resolved

dali/python/nvidia/dali/experimental/dynamic/_ops.py Show resolved Hide resolved

dali-automaton assigned mzient and szkarpinski Mar 11, 2026

greptile-apps bot reviewed Mar 11, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_ops.py Show resolved Hide resolved

dali/test/python/experimental_mode/test_reader_decoder.py Show resolved Hide resolved

dali/python/nvidia/dali/experimental/dynamic/_ops.py Show resolved Hide resolved

greptile-apps bot reviewed Mar 11, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_ops.py Outdated Show resolved Hide resolved

mzient reviewed Mar 13, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_invocation.py Outdated Show resolved Hide resolved

mzient reviewed Mar 13, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_op_builder.py Outdated Show resolved Hide resolved

mzient reviewed Mar 13, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_ops.py Outdated Show resolved Hide resolved

mzient requested changes Mar 13, 2026

View reviewed changes

greptile-apps bot reviewed Mar 13, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_ops.py Show resolved Hide resolved

rostan-t force-pushed the ndd-reader-tensor-args branch from 51fb904 to f283da0 Compare March 13, 2026 17:00

github-advanced-security bot found potential problems Mar 13, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_op_builder.py Dismissed Show dismissed Hide dismissed

greptile-apps bot reviewed Mar 13, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_op_builder.py Outdated Show resolved Hide resolved

rostan-t requested a review from mzient March 16, 2026 10:11

mzient reviewed Mar 17, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_op_builder.py Outdated Show resolved Hide resolved

mzient reviewed Mar 17, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_ops.py Outdated Show resolved Hide resolved

rostan-t requested a review from mzient March 19, 2026 09:25

rostan-t added 22 commits March 19, 2026 12:40

Support constructing readers with tensor args

f76b523

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Detect when default values are passed when invoking a reader

0878b6c

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Fix call stack depth for reader invocations when reconstructing stack…

553d01c

… traces Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Add tests passing tensor arguments

17bb0b5

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Disallow constructing a reader with batch kwargs

a3172ff

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Update signature of reader constructors to allow tensor arguments

a8aa74a

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Update NumpyReader example to pass ROI in the reader constructor

a1fccb3

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Apply suggestions from review

cdaefaf

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Prevent processing again tensor args when not necessary in batch proc…

a3a59ea

…essing Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Fix test_video_resize_tensor_args_partial

ab75701

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Cache processed tensor args passed in the constructor

4ba75c3

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Fix caller_depth handling. Remove special case for readers

aee5ab5

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Pass scalar arguments directly to reader constructors and copy extern…

4785f39

…al tensors Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Set _raw_tensor_args instead of _tensor_args in reader constructor

a5e6c5e

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Fix tensor arg tracking in reader op constructor

d1f0de9

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Properly use reader tensor args in TorchData integration

09b7dd7

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Fix signature of reader constructors

e3de140

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Fix tensor arg handling in reader op constructor

2137a04

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Fix typos

ccbe827

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Copy all tensors passed to constructors and perform only broadcast in…

f2382ac

… _process_tensor_args Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Revert the change to the caller depth.

59aa5c6

PR NVIDIA#6262 fixes the caller depth properly. Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

Perform dtype conversion of reader constructor arguments

0a640ea

Signed-off-by: Rostan Tabet <rtabet@nvidia.com>

rostan-t force-pushed the ndd-reader-tensor-args branch from da6080c to 0a640ea Compare March 19, 2026 13:33

github-advanced-security bot found potential problems Mar 19, 2026

View reviewed changes

dali/python/nvidia/dali/experimental/dynamic/_invocation.py Dismissed Show dismissed Hide dismissed

Conversation

rostan-t commented Mar 11, 2026

Category:

Description:

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Uh oh!

review-notebook-app bot commented Mar 11, 2026

Uh oh!

greptile-apps bot commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Flowchart

Uh oh!

Uh oh!

Uh oh!

rostan-t commented Mar 11, 2026

Uh oh!

Uh oh!

Uh oh!

dali-automaton commented Mar 11, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dali-automaton commented Mar 11, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mzient left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rostan-t commented Mar 16, 2026

Uh oh!

greptile-apps bot commented Mar 16, 2026

Uh oh!

rostan-t commented Mar 16, 2026

Uh oh!

dali-automaton commented Mar 16, 2026

Uh oh!

dali-automaton commented Mar 16, 2026

Uh oh!

Uh oh!

Uh oh!

dali-automaton commented Mar 18, 2026

Uh oh!

dali-automaton commented Mar 18, 2026

Uh oh!

Uh oh!

rostan-t commented Mar 20, 2026

Uh oh!

dali-automaton commented Mar 20, 2026

Uh oh!

dali-automaton commented Mar 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

greptile-apps bot commented Mar 11, 2026 •

edited

Loading