CUDA: Faster FlashAttention kernel#6374
Merged
ggerganov merged 10 commits intoggml-org:gg/flash-attnfrom Apr 2, 2024
Merged
Commits
Commits on Mar 29, 2024
Commits on Mar 30, 2024
Commits on Mar 31, 2024
Commits on Apr 1, 2024
Commits on Apr 2, 2024
- committed
- committed
- committed