CUDA Kernel Project Template
Use this template for public CUDA kernel project notes.
Goal
State the kernel behavior and the public benchmark target.
Baseline
Describe the baseline implementation and measurement setup.
Optimization notes
- Memory access pattern:
- Occupancy considerations:
- Profiling observations:
Results
Only publish results from public hardware, public inputs, and reproducible scripts.