Commit f20dc61

evezhier authored and Ubuntu committed
fix: multi-GPU gather responses
Signed-off-by: Olya Kozlova <[email protected]>
1 parent 2024ab2 commit f20dc61

1 file changed, with 1 addition and 1 deletion

tensorrt_llm/_torch/pyexecutor/py_executor.py

Lines changed: 1 addition & 1 deletion
@@ -1443,7 +1443,7 @@ def _enqueue_responses(self, responses: List[Tuple[int, LlmResponse]]):
                 if responses_list is not None:
                     for resp in responses_list:
                         if resp is not None:
-                            gather_responses.append(resp)
+                            gather_responses.extend(resp)
                     responses = gather_responses
         logger.debug(
             f'after gather, rank = {self.dist.rank}, responses = {responses}')
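
The one-line change can be illustrated with a minimal sketch. It assumes, based on the `List[Tuple[int, LlmResponse]]` signature, that the gather step yields one entry per rank and that each entry is either `None` or a list of `(request_id, response)` tuples. The helper `flatten_gathered`, the `Response` alias, and the sample data are hypothetical and not part of the commit; strings stand in for `LlmResponse` objects.

```python
from typing import List, Optional, Tuple

# Hypothetical stand-in: a response is (request_id, payload); the real code
# uses LlmResponse objects rather than strings.
Response = Tuple[int, str]


def flatten_gathered(
    responses_list: Optional[List[Optional[List[Response]]]],
) -> List[Response]:
    """Flatten per-rank response lists into one list, skipping None entries."""
    gathered: List[Response] = []
    if responses_list is not None:
        for resp in responses_list:
            if resp is not None:
                # extend() adds each tuple individually, so the result stays
                # a flat list of (request_id, response) pairs.
                gathered.extend(resp)
    return gathered


# Assumed shape of a gather result across three ranks; rank 1 had nothing.
per_rank = [[(1, "a"), (2, "b")], None, [(3, "c")]]
flat = flatten_gathered(per_rank)

# For comparison: append() keeps each rank's list as a single nested element.
nested = []
for resp in per_rank:
    if resp is not None:
        nested.append(resp)
```

With `append`, code downstream that iterates over `(request_id, response)` pairs would instead receive one list per rank, which is consistent with the multi-GPU symptom named in the commit title.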
