Skip to content

Change _TestHelixAgentPool from 19H1 to RS5 #975

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
3 commits merged into from
Jun 17, 2019
Merged

Conversation

vatsan-madhavan
Copy link
Member

@vatsan-madhavan vatsan-madhavan commented Jun 15, 2019

Changing _TestHelixAgentPool from 19H1 to RS5 seems to bring down execution time in Helix to approx 15 mins.

https://dev.azure.com/dnceng/public/_build/results?buildId=226613&view=logs

Looks like the 19H1 pool may have been the reason our builds have been suffering from chronic timeouts, much like the ServerRS5 pool prior to that.

Fixes #952

@ghost ghost requested review from rladuca, ryalanms and stevenbrix June 15, 2019 18:04
@vatsan-madhavan vatsan-madhavan self-assigned this Jun 15, 2019
@vatsan-madhavan vatsan-madhavan added the auto_merge bot-command label Jun 15, 2019
@ghost
Copy link

ghost commented Jun 15, 2019

Hello @vatsan-madhavan!

Because this pull request has the auto_merge label, I will be glad to assist with helping to merge this pull request once all check-in policies pass.

Do note that I've been instructed to only help merge pull requests of this repository that have been opened for at least 30 minutes. No worries though, I will be back when the time is right! 😉

p.s. you can customize the way I help with merging this pull request, such as holding this pull request until a specific person approves. Simply @mention me (@msftbot) and give me an instruction to get started! Learn more here.

@ghost
Copy link

ghost commented Jun 15, 2019

@msftbot, require at least 1 sign off.

@ghost
Copy link

ghost commented Jun 15, 2019

@msftbot, require approvals from any one of the following: @vatsan-madhavan, @rladuca, @SamBent, @stevenbrix, @ryalanms

@vatsan-madhavan vatsan-madhavan removed the auto_merge bot-command label Jun 15, 2019
@vatsan-madhavan
Copy link
Member Author

This time RS5 seemed to complete quickly, but generic Win10 and Win7 took a lot of time.

I’m not sure what the right solution should be. @MattGal any suggestions?

@grubioe grubioe added the PR metadata: Label to tag PRs, to facilitate with triage label Jun 17, 2019
@MattGal
Copy link
Member

MattGal commented Jun 17, 2019

@vatsan-madhavan I'm observing that certain teams are doing work items that take 1.5+ hours EACH due to setting special environment variables for JIT stress and such; when these teams (i.e. coreCLR) run their tests on any given queue, that queue will be backed up a bit because of the slow running work items sent.

I'll try to identify the folks doing this and see if we can maybe not do this every PR, but for now the best solution is to have longer build timeouts; if you don't the work still gets done, but the build gets retried; this is only making the problem worse and not improving your speeds at all.

…s is the only one that seems immune to timeouts

- Limit Helix test runs to the open build pipline. Do not repeat the same tests in the internal pipeline.
@vatsan-madhavan
Copy link
Member Author

I'll try to identify the folks doing this and see if we can maybe not do this every PR, but for now the best solution is to have longer build timeouts;

Alright, we'll try to up our timeout to 120 mins for now. Is there a tracking issue we can follow to keep up? I'd really like to keep our build times not become arbitrarily long. Can we chat about the possibility of getting separate Helix queues for our team (perhaps shared with WinForms) maybe?

/cc @AdamYoblick

@vatsan-madhavan vatsan-madhavan force-pushed the dev/vatsan/helixpool-fixup branch from a7e35a7 to d035cf5 Compare June 17, 2019 17:53
@AdamYoblick
Copy link
Contributor

AdamYoblick commented Jun 17, 2019 via email

@vatsan-madhavan
Copy link
Member Author

We will add back Win7 after Preview 7. For now, we will test RS5 only.

@vatsan-madhavan vatsan-madhavan added the auto_merge bot-command label Jun 17, 2019
@ghost
Copy link

ghost commented Jun 17, 2019

@msftbot, require at least 1 sign off.

@ghost
Copy link

ghost commented Jun 17, 2019

@msftbot, require approvals from any one of the following: @vatsan-madhavan, @rladuca, @SamBent, @stevenbrix, @ryalanms

@ghost ghost merged commit a89cc5f into master Jun 17, 2019
@ghost ghost deleted the dev/vatsan/helixpool-fixup branch June 17, 2019 18:53
@ghost ghost locked as resolved and limited conversation to collaborators Apr 16, 2022
This pull request was closed.
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
auto_merge bot-command PR metadata: Label to tag PRs, to facilitate with triage
Projects
None yet
Development

Successfully merging this pull request may close these issues.

WPF Builds fail intermittently due to timeouts
5 participants