Skip to content

update Unet3d #4

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 53 commits into from
Sep 8, 2023
Merged

update Unet3d #4

merged 53 commits into from
Sep 8, 2023

Conversation

suexu1025
Copy link
Collaborator

@suexu1025 suexu1025 commented Feb 24, 2023

  • add unet3d support for brats data
  • fix data loading slow on gsc using long file path
  • add run_brats/run_kitts scripts

UNET3D @ BRATS with BS 64 convergent at mean epoch of 63.7 with std 9.2 @0.845 DICE
UNET3D @ KITS with BS 56 convergent at mean epoch of 2213.4 with std 499.9. @0.908 DICE
e2e training time of BRATS vs KITS ~ 1:4

general performance from this PR:
on v4-8 expected throughputs of 17.8 samples/s on GCS.
Improvement over original impl: 28% using disk, 48% using GCS
on v4-128 expected throughputs of 263.2 samples/s on GCS
Improvement over original impl: 2.7x using disk 6.4x using GCS.

suexu1025 and others added 20 commits December 8, 2022 18:09
* add simple loader

* add simple loader

* add simple loader

* add simple loader

* update run sripts

* update list hard-coded

* using gfile

* using gfile

* try with no loading

* try with no loading

* remove glob

* fix type

* fix type

* fix type

* try reduce list

* remove dummy loader

* fix val loader

* comment out transpose

* rm hard coded code

* clean up

* clean up
@suexu1025 suexu1025 requested review from miladm and JackCaoG February 24, 2023 02:14
@JackCaoG
Copy link

I don't have enough context to review the full pr, @miladm Do you want to give it a try?

Copy link

@miladm miladm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PR @suexu1025

Left a few questions and comments here.

We need to outline the perf gains w/ and w/o Brats in this PR. Feel free to make a separate PR for Brats if more convenient - your call.

@suexu1025
Copy link
Collaborator Author

address PR comment and summary of the PR benchmark result.

UNET3D @ BRATS with BS 64 convergent at mean epoch of 63.7 with std 9.2 @0.845 DICE
UNET3D @ KITS with BS 56 convergent at mean epoch of 2213.4 with std 499.9. @0.908 DICE
e2e training time of BRATS vs KITS ~ 1:4

general performance from this PR:
on v4-8 expected throughputs of 17.8 samples/s on GCS.
Improvement over original impl: 28% using disk, 48% using GCS
on v4-128 expected throughputs of 263.2 samples/s on GCS
Improvement over original impl: 2.7x using disk 6.4x using GCS.

@suexu1025 suexu1025 requested a review from miladm March 2, 2023 18:17
@suexu1025 suexu1025 merged commit bc92b9b into pytorch-tpu:master Sep 8, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants