⚡ Personal website: alexzhang13.github.io
🎮 My newest benchmark on LMs playing video games: https://www.vgbench.com/
🌎 My most recent papers: VideoGameBench, KernelBench, SWE-Bench Multimodal
PhD student at MIT, prev. Princeton CS
- NYC
-
13:47
(UTC -04:00) - alexzhang13.github.io
Pinned Loading
-
videogamebench
videogamebench PublicBenchmark environment for evaluating vision-language models (VLMs) on popular video games!
-
Ligo-Biosciences/AlphaFold3
Ligo-Biosciences/AlphaFold3 PublicOpen source implementation of AlphaFold3
-
gpu-mode/reference-kernels
gpu-mode/reference-kernels PublicOfficial Problem Sets / Reference Kernels for the GPU MODE Leaderboard!
-
ScalingIntelligence/KernelBench
ScalingIntelligence/KernelBench PublicKernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems
-
flashattention2-custom-mask
flashattention2-custom-mask PublicTriton implementation of FlashAttention2 that adds Custom Masks.
-
world-models-papers
world-models-papers PublicSelected list of papers on World Models that I found interesting and/or useful.
TeX 28
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.