
Attn bias #1


Open · wants to merge 11 commits into update_ck_newest
Conversation

alugorey (Owner)

Fixes #ISSUE_NUMBER

pytorchmergebot pushed a commit that referenced this pull request Mar 1, 2025
…pytorch#144120) (pytorch#146372)

Summary:

# Summary

### Sticky points

CUDA Graph RNG handling has changed and now deviates from the original implementation. We are left with a dangling 'offset' value and confusing naming due to BC constraints.
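For readers outside the PR, here is a minimal sketch of the CUDA-graph-aware RNG plumbing this refers to, using ATen's public CUDA generator API. It is not the PR's actual code: `make_rng_state` is a hypothetical helper, and the real call sites live in the flash-attention kernels.

```cpp
// Minimal sketch, assuming ATen's public CUDA generator API (not the PR's
// actual code). Under CUDA Graph capture, philox_cuda_state() hands back
// device pointers to the seed/offset so each graph replay reads fresh
// values; in eager mode it bakes the raw values in at launch time. Call
// sites that cached the raw offset are the dangling 'offset' value noted
// above, and they keep their old names for BC.
#include <ATen/cuda/CUDAGeneratorImpl.h>
#include <mutex>

// Hypothetical helper: reserve `counter_offset` Philox counters for one
// attention kernel launch.
at::PhiloxCudaState make_rng_state(at::CUDAGeneratorImpl* gen,
                                   uint64_t counter_offset) {
  // The generator is shared state; advancing its offset must be serialized.
  std::lock_guard<std::mutex> lock(gen->mutex_);
  return gen->philox_cuda_state(counter_offset);
}
```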

## Dependencies
- Flash PR: Dao-AILab/flash-attention#1419

### Other Points
- The BC linter complains about the removal of generate.py and its functions, which is not a real BC surface
cc albanD

imported-using-ghimport

Test Plan:
Imported from OSS

Building in dev
`buck build @//mode/dev-nosan -c fbcode.nvcc_arch=h100a //caffe2:ATen-cu --show-full-output`

Running `nm` on the resulting .so, I do see that the flash symbols are correctly named:
```
0000000001c3dfb0 t pytorch_flash::run_mha_bwd(pytorch_flash::Flash_bwd_params&, CUstream_st*)::$_0::operator()() const::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const::{lambda()#7}::operator()() const
0000000001c36080 t pytorch_flash::run_mha_fwd(pytorch_flash::Flash_fwd_params&, CUstream_st*, bool)::$_0::operator()() const::{lambda()#2}::operator()() const::{lambda()#1}::operator()() const::{lambda()#6}::operator()() const
0000000001c360e0 t pytorch_flash::run_mha_fwd(pytorch_flash::Flash_fwd_params&, CUstream_st*, bool)::$_0::operator()() const::{lambda()#2}::operator()() const::{lambda()#1}::operator()() const::{lambda()#7}::operator()() const
0000000001c35fc0 t pytorch_flash::run_mha_fwd(pytorch_flash::Flash_fwd_params&, CUstream_st*, bool)::$_0::operator()() const::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const::{lambda()#6}::operator()() const
0000000001c36020 t pytorch_flash::run_mha_fwd(pytorch_flash::Flash_fwd_params&, CUstream_st*, bool)::$_0::operator()() const::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const::{lambda()#7}::operator()() const
```
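As an aside on why this symbol check matters: the declarations below are inferred from the mangled names in the `nm` output, not taken from the PR (the trailing `bool` parameter's name and default are guesses). Keeping the launchers inside the `pytorch_flash` namespace means PyTorch can be linked alongside an external flash-attention build that exports the same function names without duplicate-symbol clashes.

```cpp
// Sketch inferred from the nm output above: the flash-attention launchers
// live in the pytorch_flash namespace rather than the global one, so an
// externally linked flash-attention library that also defines
// run_mha_fwd/run_mha_bwd cannot collide with them at link time.
#include <cuda_runtime.h>  // cudaStream_t (i.e. CUstream_st*)

namespace pytorch_flash {

struct Flash_fwd_params;  // opaque parameter blobs; defined with the kernels
struct Flash_bwd_params;

void run_mha_fwd(Flash_fwd_params& params, cudaStream_t stream,
                 bool force_split_kernel = false);  // bool per the symbol;
                                                    // name/default assumed
void run_mha_bwd(Flash_bwd_params& params, cudaStream_t stream);

}  // namespace pytorch_flash
```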

Reviewed By: vkuzo

Differential Revision: D68502879

Pulled By: drisspg

Pull Request resolved: pytorch#146372
Approved by: https://github.com/jbschlosser