"README.md" did not exist on "4bb42f74cd10a926bd881b408272c0624bdc758f"
Compute/memory requirements scripts
Compare changes
Files
4- Alexandru-Mihai GHERGHESCU authored
Move all the model setup in a different script. Add per architecture variables (for example, feed forward matrices size), since most of the architectures today vary in one way or another. This makes it easier to change values around and get more meaningful results, and also enables users to more easily add new models.
Conflict: This file was added both in the source and target branches, but with different contents.
Ask someone with write access to resolve it.
@@ -22,7 +22,7 @@ since those use fundamentally different approaches.
@@ -66,8 +66,8 @@ cluster](https://lumi-supercomputer.eu/scaling-the-pre-training-of-large-languag
@@ -92,4 +92,5 @@ represents a small percent of the batch update.