Skip to content

Reduce blame test flakiness: increase hang dump timeout to 10s#15590

Merged
nohwnd merged 1 commit intomicrosoft:mainfrom
nohwnd:fix-blame-timeout
Mar 26, 2026
Merged

Reduce blame test flakiness: increase hang dump timeout to 10s#15590
nohwnd merged 1 commit intomicrosoft:mainfrom
nohwnd:fix-blame-timeout

Conversation

@nohwnd
Copy link
Copy Markdown
Member

@nohwnd nohwnd commented Mar 26, 2026

Problem

\HangDumpOnTimeout\ (3s timeout) and \HangDumpChildProcesses\ (5s timeout) are flaky in CI because process startup can be slow on hosted agents. \HangDumpChildProcesses\ is currently failing across 6 open PRs.

Fix

Increase both timeouts to 10s — the safe minimum to avoid flakiness without making tests unnecessarily slow.

Failing PRs this unblocks

Co-authored-by: Copilot 223556219+Copilot@users.noreply.github.com

HangDumpOnTimeout used 3s and HangDumpChildProcesses used 5s, both too
tight for CI where process startup can be slow. Increase to 10s which
is the safe minimum to avoid flakiness.

This fixes the HangDumpChildProcesses failure seen across multiple PRs.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Copilot AI review requested due to automatic review settings March 26, 2026 16:16
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Increases blame hang-dump timeouts in acceptance tests to reduce CI flakiness caused by slow process startup on hosted agents.

Changes:

  • Increased HangDumpOnTimeout blame TestTimeout from 3s to 10s.
  • Increased HangDumpChildProcesses blame TestTimeout from 5s to 10s.
  • Added inline comments warning against reducing the timeout due to flakiness.

Comment on lines +136 to +137
// Don't reduce this, 10s is about the safe minimum to not have flakiness.
arguments = string.Concat(arguments, $@" /Blame:""CollectHangDump;HangDumpType=mini;TestTimeout=10s"" /Diag:{TempDirectory.Path}/log.txt");
Copy link

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The new timeout value (10s) and rationale are embedded inline, and the same comment/value are duplicated across tests. To make future adjustments safer and consistent, consider introducing a shared constant (e.g., const string SafeHangDumpTimeout = \"10s\"; or TimeSpan) and reusing it in both places, with a single comment explaining why that value is required.

Copilot uses AI. Check for mistakes.
Comment on lines +235 to +236
// Don't reduce this, 10s is about the safe minimum to not have flakiness.
arguments = string.Concat(arguments, $@" /Blame:""CollectHangDump;HangDumpType=mini;TestTimeout=10s""");
Copy link

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same duplication as above: the TestTimeout=10s string and warning comment are repeated. Recommend factoring into a shared constant/helper to avoid drift if one test is updated and the other is not.

Copilot uses AI. Check for mistakes.
@nohwnd nohwnd changed the title Fix blame test flakiness: increase hang dump timeout to 10s Reduce blame test flakiness: increase hang dump timeout to 10s Mar 26, 2026
@nohwnd nohwnd merged commit 1519be8 into microsoft:main Mar 26, 2026
2 of 5 checks passed
@nohwnd
Copy link
Copy Markdown
Member Author

nohwnd commented Mar 27, 2026

/azp run

@nohwnd
Copy link
Copy Markdown
Member Author

nohwnd commented Mar 27, 2026

/azp run

2 similar comments
@nohwnd
Copy link
Copy Markdown
Member Author

nohwnd commented Mar 27, 2026

/azp run

@nohwnd
Copy link
Copy Markdown
Member Author

nohwnd commented Mar 27, 2026

/azp run

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants