Bump FlashAttn version and add deterministic option for FAv2 (#585)
* Deterministic FA, bump minimum supported version

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

* fixes

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

* Fix MQA/GQA

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

* Address review comments

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

---------

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>