Add PyTorch built-in SDPA to Optimus
Add PyTorch's built-in scaled dot-product attention (SDPA) to Optimus. SDPA automatically dispatches to FlashAttention-2 or memory-efficient attention when the hardware supports it, and falls back to the manual implementation otherwise. Training should be noticeably faster with this change, and attention memory usage should be roughly half of what it was before.
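A minimal sketch of what the dispatch could look like (the function name, shapes, and fallback structure are illustrative assumptions, not the actual Optimus code); `q`, `k`, `v` are assumed to be `(batch, heads, seq, head_dim)`:

```python
import math
import torch
import torch.nn.functional as F

def attention(q, k, v, is_causal=False, dropout_p=0.0):
    # Hypothetical helper for illustration; not the actual Optimus implementation.
    if hasattr(F, "scaled_dot_product_attention"):
        # PyTorch >= 2.0: dispatches to FlashAttention-2 or memory-efficient
        # attention when the hardware/backend supports it, else a math fallback.
        return F.scaled_dot_product_attention(
            q, k, v, dropout_p=dropout_p, is_causal=is_causal
        )
    # Manual fallback for older PyTorch: explicit softmax(QK^T / sqrt(d)) @ V.
    scale = 1.0 / math.sqrt(q.size(-1))
    scores = torch.matmul(q, k.transpose(-2, -1)) * scale
    if is_causal:
        mask = torch.ones(
            q.size(-2), k.size(-2), dtype=torch.bool, device=q.device
        ).tril()
        scores = scores.masked_fill(~mask, float("-inf"))
    attn = torch.softmax(scores, dim=-1)
    attn = F.dropout(attn, p=dropout_p, training=True)
    return torch.matmul(attn, v)
```

The fused SDPA kernels avoid materializing the full `(seq, seq)` attention matrix, which is where the speed and memory savings come from; the manual path above keeps the same numerics for hardware or PyTorch versions without fused-kernel support.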