Compute/memory requirements scripts
Conflict: This file was added both in the source and target branches, but with different contents.
Ask someone with write access to resolve it.
@@ -8,9 +8,9 @@ which can then be iterated upon.
@@ -22,7 +22,7 @@ since those use fundamentally different approaches.
@@ -46,6 +46,17 @@ total number of GPUs of `32 (the base number of GPUs needed to hold the model,
@@ -55,13 +66,13 @@ cluster](https://lumi-supercomputer.eu/scaling-the-pre-training-of-large-languag
@@ -70,8 +81,9 @@ scenario).
@@ -80,4 +92,5 @@ usually work best when fed big matrices, which keeps them occupied more fully.