-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Significant time difference between linux, macos and windows test executions on build machines #924
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Do we have any idea why the categorical split test is so much worse? /cc @codemzs |
@artidoro - any update on investigating why |
#1131 may be related in some way |
@artidoro , are you looking at it now? |
Yes I am going to take a look today, and update the thread. |
I looked at 10 different test runs in all OS and for all versions from the PR verification builds. These are the summaries of the results that I obtained:
The above preliminary analysis seems to show that there is no significant slowdown on mac and linux compared to windows. For a more detailed analysis, we should make sure the machines have the same specifications (I just used the results of PR verification builds). This has probably been solved by #1388, which fixed a bug in the way we were comparing test results with our baselines. |
Let's look on this build
MacOS Release - tests: 19:08
MacOS Debug - tests: 22:59
Linux Release - test 17:02
Linux Debug - test 21:02
Windows Release - test 5:00
Windows Debug - test 7:53
Windows machine:
2018-09-14T22:47:22.5353796Z Results File: D:\a\1\s\bin/AnyCPU.Debug\Microsoft.ML.Predictor.Tests\VssAdministrator_factoryvm-az366_2018-09-14_22_43_12.trx
2018-09-14T22:47:22.5354312Z
2018-09-14T22:47:22.5355746Z Total tests: 105. Passed: 49. Failed: 0. Skipped: 56.
2018-09-14T22:47:22.5356035Z Test Run Successful.
2018-09-14T22:47:22.5357667Z Test execution time: 4.2240 Minutes
Linux machine:
2018-09-14T22:59:26.1088812Z Results File: /__w/1/s/bin/AnyCPU.Debug/Microsoft.ML.Predictor.Tests/_5438db60ae8a_2018-09-14_22_41_24.trx
2018-09-14T22:59:26.1096444Z
2018-09-14T22:59:26.1109704Z Total tests: 105. Passed: 49. Failed: 0. Skipped: 56.
2018-09-14T22:59:26.1124041Z Test Run Successful.
2018-09-14T22:59:26.1137781Z Test execution time: 18.0608 Minutes
It looks like we run same set of tests but for some reason where is huge difference in execution.
Any one willing to investigate?
The text was updated successfully, but these errors were encountered: