Popular repositories
- tensorrt-inference-server (forked from triton-inference-server/server)
  The TensorRT Inference Server provides a cloud inferencing solution optimized for NVIDIA GPUs.
  C++, 2 stars
- inference (forked from mlcommons/inference)
  Reference implementations of inference benchmarks
  Python
- inference_policies (forked from mlcommons/inference_policies)
  Please use for issues related to inference policies, including suggested changes.
- inference_results_v0.7 (forked from mlcommons/inference_results_v0.7)
  Inference v0.7 results
  C++
- power-dev (forked from mlcommons/power-dev)
  Dev repo for power measurement for the MLPerf benchmarks
  Python
25 contributions in the last year
Contribution activity
June 2025
Created 1 commit in 1 repository
Created a pull request in NVIDIA/TensorRT-LLM that received 9 comments
test: add unit tests for Llama4 min_latency code
Add unit tests for the Llama4 min_latency code: a sanity check (the code runs end to end) and an HF-parity check (output is close to the HF output).
Reviewed 2 pull requests in 1 repository
NVIDIA/TensorRT-LLM (2 pull requests)
- chore: Refine weight prefetching. (reviewed Jun 4)
- feat: add heuristics for checkpoint files prefetching. (reviewed Jun 2)