R

RoptimusNet

An FSDP distributed training setup using HuggingFace Accelerate and Transformers, on Slurm