diff --git a/README.md b/README.md
index 593c4e7494dcefe8dd858f937fc74c69b947ae2c..25642c5b57ad43e6effb0154071047f6b7dcd70c 100644
--- a/README.md
+++ b/README.md
@@ -22,6 +22,8 @@ information for both newcomers as well as any other interesting new research:
 - [Formal verification](doc/verification.md): somewhat orthogonal to LLM's,
   refers to how to formally verify (and specify) the code output of language
   models.
+- [Datasets for training](doc/datasets.md): Available online datasets used to
+  train (open-source and other) language models
 
 ## The rest of the repositories