Skip to content

Commit bda42f8

Browse files
authored
[None][feat] Support running heterogeneous model execution for Nemotron-H (#6866)
Signed-off-by: Daniel Afrimi <[email protected]>
1 parent c7e6145 commit bda42f8

File tree

1 file changed

+9
-1
lines changed

1 file changed

+9
-1
lines changed

tensorrt_llm/_torch/models/modeling_nemotron_h.py

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -63,8 +63,16 @@ def __init__(
6363
layer_idx: int,
6464
):
6565
config = model_config.pretrained_config
66+
if isinstance(config.intermediate_size, list):
67+
if len(config.intermediate_size) == 1:
68+
intermediate_size = config.intermediate_size[0]
69+
else:
70+
intermediate_size = config.intermediate_size[layer_idx]
71+
else:
72+
intermediate_size = config.intermediate_size
73+
6674
super().__init__(hidden_size=config.hidden_size,
67-
intermediate_size=config.intermediate_size,
75+
intermediate_size=intermediate_size,
6876
bias=False,
6977
activation=relu2,
7078
dtype=config.torch_dtype,

0 commit comments

Comments
 (0)