Skip to content

Conversation

wanchaol
Copy link
Collaborator

@wanchaol wanchaol commented May 2, 2024

as titled. This could make 1-D and 2-D works with the lastest main build. thanks @bdhirsh for all the fixes!

We should figure out why dynamic shape gets turned on as a follow up

as titled. This could make 1-D and 2-D works

We should figure out why dynamic shape gets turned on as a follow up
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label May 2, 2024
@wanchaol wanchaol requested review from vkuzo, bdhirsh and drisspg May 2, 2024 22:37
@@ -18,7 +18,7 @@ save_tb_folder = "tb"
[model]
name = "llama3"
flavor = "8B"
norm_type = "fused_rmsnorm" # [layernorm / np_layernorm / rmsnorm / fused_rmsnorm]
norm_type = "rmsnorm" # [layernorm / np_layernorm / rmsnorm / fused_rmsnorm]
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this change intentional?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yeah I feel it's a bit painful to switch between rmsnorm and fused_rmsnorm locally when switching between 1-D and 2-D sharding, so I changed rmsnorm to default, we can always default to fused_rmsnorm once @tianyu-l changes the fused_rmsnorm registration #296.

Let me know if you think we should default to fused_rmsnorm for 8B

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Keeping it as rmsnorm sounds good to me!

@wanchaol wanchaol merged commit 143b586 into main May 3, 2024
tianyu-l pushed a commit to tianyu-l/torchtitan_intern24 that referenced this pull request Aug 16, 2024
as titled. This could make 1-D and 2-D works with the lastest main
build. thanks @bdhirsh for all the fixes!

We should figure out why dynamic shape gets turned on as a follow up
philippguevorguian pushed a commit to YerevaNN/YNNtitan that referenced this pull request Aug 17, 2024
as titled. This could make 1-D and 2-D works with the lastest main
build. thanks @bdhirsh for all the fixes!

We should figure out why dynamic shape gets turned on as a follow up
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Meta Open Source bot.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants