Skip to content

Conversation

LeiWang1999
Copy link
Member

@LeiWang1999 LeiWang1999 commented Feb 14, 2025

This pull request includes several changes across multiple files to improve the build configuration, add support for multiple Python versions, and enhance code generation and storage access functionalities. The most important changes are grouped by theme below:

image

Build Configuration:

  • CMakeLists.txt: Added logic to locate TVM prebuild path and use it as a fallback for the TVM source directory. Also added a new source file for WebGPU target. [[1]](diffhunk://#diff-

Code Generation:

  • src/target/codegen_webgpu.h: Added a new header file for WebGPU code generation, including class CodeGenTileLangWebGPU with methods for handling various expressions and statements.

- Rename ThreadSync and TileLangThreadSync functions in C++ code
- Update Python docstring for ThreadSync with more detailed description
- Reorder library path detection in tilelang environment setup
- Minor comment and code cleanup in CUDA and warp specialization modules
- Standardize pointer type spacing in storage_access.h and storage_access.cc
- Update whitespace and indentation in thread_storage_sync.cc
- Reorder include statements in thread_partial_sync.cc
- Minor code formatting improvements across thread synchronization files
- Correct global function registration to use ThreadSync instead of TileLangThreadSync
- Update TVM global registration to match recent refactoring efforts
- Remove unnecessary whitespace in global function registration
- Compact the TVM global registration line for ThreadSync
- Implement WebGPU code generator (codegen_webgpu.cc and codegen_webgpu.h)
- Add WebGPU target support in lower.py and target.py
- Update CMakeLists.txt to include WebGPU codegen source files
- Introduce WebGPU-specific code generation for WGSL shader language
- Enhance code formatting in codegen_webgpu.cc and codegen_webgpu.h
- Standardize pointer type spacing and indentation
- Improve line breaks and reduce line length for better readability
- Minor code style improvements in WebGPU code generation
- Implement test_webgpu_codegen.py for WebGPU matrix multiplication
- Add assert_gemm_codegen function to validate WebGPU code generation
- Include basic matrix multiplication kernel test case
@LeiWang1999 LeiWang1999 merged commit c19bed5 into tile-ai:main Feb 15, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant