This repository contains code written during my journey learning CUDA.
CUDA (Compute Unified Device Architecture) is a parallel computing platform and programming model developed by NVIDIA that lets developers use CUDA-enabled GPUs to significantly accelerate computing applications. My learning follows the textbook "Programming Massively Parallel Processors", a classic GPU programming text co-authored by NVIDIA's Chief Scientist David B. Kirk and Wen-mei W. Hwu, which comprehensively covers parallel programming techniques and optimization methods for modern GPUs.
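As a quick taste of the programming model, here is a minimal vector-addition sketch (an illustrative example, not taken from any specific chapter of this repository): each GPU thread computes one output element, and the host handles memory transfers and the kernel launch.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Each thread adds one element: C[i] = A[i] + B[i].
__global__ void vecAdd(const float *A, const float *B, float *C, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        C[i] = A[i] + B[i];
    }
}

int main() {
    const int n = 1 << 20;
    const size_t bytes = n * sizeof(float);

    // Allocate and initialize host data.
    float *hA = (float *)malloc(bytes);
    float *hB = (float *)malloc(bytes);
    float *hC = (float *)malloc(bytes);
    for (int i = 0; i < n; ++i) { hA[i] = 1.0f; hB[i] = 2.0f; }

    // Allocate device memory and copy the inputs to the GPU.
    float *dA, *dB, *dC;
    cudaMalloc(&dA, bytes); cudaMalloc(&dB, bytes); cudaMalloc(&dC, bytes);
    cudaMemcpy(dA, hA, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dB, hB, bytes, cudaMemcpyHostToDevice);

    // Launch enough 256-thread blocks to cover all n elements.
    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;
    vecAdd<<<blocks, threads>>>(dA, dB, dC, n);

    // Copy the result back and spot-check one element (expected 3.0).
    cudaMemcpy(hC, dC, bytes, cudaMemcpyDeviceToHost);
    printf("C[0] = %f\n", hC[0]);

    cudaFree(dA); cudaFree(dB); cudaFree(dC);
    free(hA); free(hB); free(hC);
    return 0;
}
```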
- CUDA Toolkit: 12.8
- GPU: NVIDIA GeForce RTX 4060
- Warp size: 32
- Number of Streaming Multiprocessors (SMs): 24
- Maximum threads per block: 1024
- Maximum threads per SM: 1536
- Maximum blocks per SM: 24
- Maximum shared memory per SM: 100 KB (102400 bytes)
- Maximum registers per SM: 65536
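The per-SM limits listed above can also be queried programmatically. A minimal sketch using the CUDA runtime's `cudaGetDeviceProperties` (reading the corresponding `cudaDeviceProp` fields) is shown below:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, 0);  // query device 0

    printf("GPU: %s\n", prop.name);
    printf("Warp size: %d\n", prop.warpSize);
    printf("Number of SMs: %d\n", prop.multiProcessorCount);
    printf("Max threads per block: %d\n", prop.maxThreadsPerBlock);
    printf("Max threads per SM: %d\n", prop.maxThreadsPerMultiProcessor);
    printf("Max blocks per SM: %d\n", prop.maxBlocksPerMultiProcessor);
    printf("Max shared memory per SM: %zu bytes\n", prop.sharedMemPerMultiprocessor);
    printf("Max registers per SM: %d\n", prop.regsPerMultiprocessor);
    return 0;
}
```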
- NVIDIA CUDA Documentation
- CUDA C++ Programming Guide
- CUDA Samples Repository
- "Programming Massively Parallel Processors" - David B. Kirk & Wen-mei W. Hwu
This project is licensed under the MIT License. See the LICENSE file for details.