hann-wang

Follow

Hann Wang hann-wang

Follow

3 followers · 2 following

@AMD-AIG-AIMA
Beijing, China
20:55 (UTC +08:00)

Achievements

Achievements

Pinned Loading

torchtitan torchtitan Public

Forked from pytorch/torchtitan

DeepSeek V2/V3 training with block-wise FP8 linear/grouped mm/attention

Python 1
aiter aiter Public

Forked from ROCm/aiter

AI Tensor Engine for ROCm

Python
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
xDiT xDiT Public

Forked from xdit-project/xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python
onnxruntime onnxruntime Public

Forked from microsoft/onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++
optimum optimum Public

Forked from huggingface/optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Python