[OpenMP] device Xteamr: Clean up template parameters #1246

Closed
ro-i wants to merge 1 commit into amd-staging from amd/dev/ro-i/xteamr-template-params
Conversation

ro-i commented Jan 27, 2026

Remove the wave number and wave size template parameters from the entry points of the device xteam reduction functions. Replace the wave size parameter with a call to `__gpu_num_lanes()`, which is optimized out during compilation. Replace the wave number parameter with the constant `32`, which is a safe upper bound for its current uses: the array size must be a constant (no VLA), the maximum number of threads is 1024, and the minimum wave size is 32, so there can be at most 1024 / 32 = 32 waves.
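The bound can be illustrated with a minimal host-side sketch. This is not the code from the patch: `reduce_team_sum`, `kMaxThreadsPerTeam`, `kMinWaveSize`, and `kMaxNumWaves` are hypothetical names, and the wave size is passed in as an ordinary argument where device code would presumably query `__gpu_num_lanes()`. It only shows why a fixed array of 32 per-wave slots is enough under the limits stated above.

```cpp
#include <cassert>
#include <cstdint>

// Limits taken from the PR description (assumed here, not queried from hardware).
constexpr std::uint32_t kMaxThreadsPerTeam = 1024;
constexpr std::uint32_t kMinWaveSize = 32;

// Worst case: the smallest wave size yields the largest number of waves.
constexpr std::uint32_t kMaxNumWaves = kMaxThreadsPerTeam / kMinWaveSize;
static_assert(kMaxNumWaves == 32, "1024 threads / 32 lanes = 32 waves");

// Hypothetical host stand-in for a two-stage team reduction.
// wave_size is a runtime value, but the scratch array has a constant size.
std::uint32_t reduce_team_sum(const std::uint32_t *vals,
                              std::uint32_t num_threads,
                              std::uint32_t wave_size) {
  std::uint32_t wave_partials[kMaxNumWaves] = {};  // constant size, no VLA

  // Number of waves actually in use, rounded up.
  const std::uint32_t num_waves = (num_threads + wave_size - 1) / wave_size;
  assert(num_waves <= kMaxNumWaves);

  // Stage 1: each "wave" accumulates the values of its own threads.
  for (std::uint32_t tid = 0; tid < num_threads; ++tid)
    wave_partials[tid / wave_size] += vals[tid];

  // Stage 2: combine the per-wave partial results.
  std::uint32_t total = 0;
  for (std::uint32_t w = 0; w < num_waves; ++w)
    total += wave_partials[w];
  return total;
}
```

With a wave size of 64, only 16 of the 32 slots are used, so the constant trades a small amount of unused storage for dropping the template parameter.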

@github-actions

Thank you for submitting a Pull Request (PR) to the LLVM Project!

This PR will be automatically labeled and the relevant teams will be notified.

If you wish to, you can add reviewers by using the "Reviewers" section on this page.

If this is not working for you, it is probably because you do not have write permissions for the repository. In which case you can instead tag reviewers by name in a comment by using @ followed by their GitHub username.

If you have received no comments on your PR for a week, you can request a review by "ping"ing the PR by adding a comment “Ping”. The common courtesy "ping" rate is once a week. Please remember that you are asking for valuable time from other developers.

If you have further questions, they may be answered by the LLVM GitHub User Guide.

You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.


ro-i commented Jan 27, 2026

(PSDB is going to fail, at least because the smoke-limbo tests have not been adapted yet; see ROCm/aomp#1895)


ro-i commented Mar 9, 2026

Closed in favor of #1691

ro-i closed this Mar 9, 2026