Enabling Tests that consume Zbaseline files. #78
The rp.txt files, ini files, and txt files with scored instances are OK, but the -out.txt files were created by running maml.exe on Windows, with file paths replaced by a regex with something like %Data% or %Output%.
What's the point of bringing in these files? And your PR says "Enabling Tests". I don't see any changes in test files; are they coming in the next iteration?
This change enables some tests. You can see the tests being enabled here
Zeeshan and I added the regex that does this for Unix and Mac too.
The tests run fine on Linux and Mac. We changed how the files are compared.
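A minimal sketch of the kind of path normalization discussed above, not the actual test harness code in this repo. It assumes the data and output roots are known up front; the function name `normalize_paths`, the example roots, and the log line are all hypothetical. It replaces each root with a `%Data%` or `%Output%` token and rewrites the rest of the path with forward slashes, so a Windows-generated baseline and a Unix run produce the same text.

```python
import re


def normalize_paths(text, roots):
    """Replace absolute paths under known roots with tokens such as %Data%.

    `roots` maps a directory root (hypothetical here) to its token.
    Both '\\' and '/' are accepted as separators, and the remainder of
    each matched path is rewritten with '/'.
    """
    # Longer roots first, so a nested root is not shadowed by its parent.
    for root, token in sorted(roots.items(), key=lambda kv: -len(kv[0])):
        parts = [re.escape(p) for p in re.split(r"[\\/]+", root.rstrip("\\/"))]
        pattern = re.compile(r"[\\/]+".join(parts) + r"(?P<tail>[\\/]\S*)?")
        text = pattern.sub(
            lambda m, t=token: t + (m.group("tail") or "").replace("\\", "/"),
            text,
        )
    return text


# Hypothetical log lines from a Windows run and a Unix run of the same test.
win_line = r"Loading C:\repo\test\data\adult.train into C:\repo\out\model.zip"
unix_line = "Loading /home/ci/repo/test/data/adult.train into /home/ci/repo/out/model.zip"

win_norm = normalize_paths(
    win_line, {r"C:\repo\test\data": "%Data%", r"C:\repo\out": "%Output%"}
)
unix_norm = normalize_paths(
    unix_line, {"/home/ci/repo/test/data": "%Data%", "/home/ci/repo/out": "%Output%"}
)
```

With this normalization applied to both the baseline and the fresh output, a plain line-by-line comparison no longer depends on the machine or operating system that produced the file.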
Any chance you can break your PR into a few PRs with a reasonable number of files? Codeflow is not working, and I can't find the test file in the GitHub web UI. In reply to: 387482606
I have a better suggestion: I can break this commit into smaller commits. Then you will be able to see these commits in the GitHub web UI. Will that be okay? #Closed
As long as it's something manageable, I'm fine with that. 471 files is hard to review. In reply to: 387499988
AFAIK we need confirmation from CELA regarding redistributing this dataset, and all the others. At least, I remember emails to the CELA department asking whether we can redistribute breast-cancer, iris, and the other datasets we currently have in the test folder. Refers to: test/data/adult.test:1 in 690443f.
@Ivanidzo4ka AFAIK the datasets already in the repo are CELA approved. (We made sure of that before making it public.)
breast-cancer-weights-quarter is a modification of a public dataset, and housing is the same (it's a UCI dataset in which we changed the column order), so I'm not sure whether modifying the data makes it not CELA approved, or whether it's something else. I have a PR to replace housing with the wine dataset, which we download from UCI (PR #17; I hope to polish it soon), so maybe you can change housing to wine as well (although that will require major baseline modification). Sorry for being a nagging cat; my only excuse is unfamiliarity with this whole OSS process. In reply to: 387530660
@Ivanidzo4ka I will remove adult.train and adult.test in the next commit. I will also disable, for now, the tests that currently use housing.txt. (They can be turned on later, after your PR has been merged.)
In reply to: 387782207
@dotnet-bot test this please
Hi @Anipik, thanks much for writing this. Migrating and re-enabling baseline tests is important work. So there are some things I find somewhat confusing about this, and perhaps you can help me out. The title to this PR is "Enabling Tests that consume Zbaseline files". Yet I see here, if I look at the PR, changes to So: I might have expected that if a test is being enabled, there would be some change in those files. Is it not so? Or do I misunderstand the purpose of this PR? Perhaps also a linked issue would be helpful, to understand the wider context.
@TomFinley the first commit, c8315d8, enables the test.
@shauheen @Ivanidzo4ka I am enabling one test at a time. I will create multiple small PRs for it. I will update the description |
.txt, -rp.txt and -out.txt are copied from the TLC project and values are edited in their last decimal places to match the results.
.ini files are copied from the output folder, as I was not able to find them in the TLC project.
Some datasets required by these tests are also added to the repo.
The remaining tests are not enabled because they fail due to some other error and not because zbaselines or datasets are not present.
Update:
The repo already contains zbaselines for the test being enabled.
cc @eerhardt @danmosemsft @codemzs