Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
A
Awesome LLM
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Iterations
Wiki
Requirements
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Locked files
Build
Pipelines
Jobs
Pipeline schedules
Test cases
Artifacts
Deploy
Releases
Package registry
Container Registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Code review analytics
Issue analytics
Insights
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
NetSys
Awesome LLM
Repository graph
Repository graph
You can move around the graph by using the arrow keys.
d8c7f009d84e64002d0784c2f6ec2257d9bed65a
Select Git revision
Branches
5
feature/docs
feature/models
feature/scripts
main
default
protected
vladb/e2e_overview
5 results
Begin with the selected commit
Created with Raphaël 2.2.0
16
Mar
11
27
Feb
23
6
2
1
31
Jan
30
29
11
9
8
4
31
Dec
21
19
16
13
10
6
28
Nov
19
14
8
1
31
Oct
9
5
4
3
Added changes to README
main
main
Added references to benchmarks in README
Added references to benchmarks in README
Added code for human-eval benchmark
Add ARC benchmark evaluation code
Add 35M model setup
Move optimizer state and saved percentage of fp16 activations to globals
Fix memory requiremenst, add fp16 activations
Update README (fix typos, formatting)
Add note about splitting a model's layers on multiple GPU's
Add compute requirements script and docs
Add more information on memory usage for training
Add link to scripts/ in main README file
Add scripts folder, add memory requirements script
Add gradient checkpointing papers
feature/docs
feature/docs
Add GPT1 paper
feature/models
feature/models
Update mem/compute scripts interaction
feature/scripts
feature/scripts
Add overview of E2E training
vladb/e2e_overv…
vladb/e2e_overview
Add 35M model setup
Move optimizer state and saved percentage of fp16 activations to globals
Fix memory requiremenst, add fp16 activations
Update README (fix typos, formatting)
Add context extension documentation
Add note about splitting a model's layers on multiple GPU's
Add compute requirements script and docs
Add more information on memory usage for training
Add link to scripts/ in main README file
Add scripts folder, add memory requirements script
Merge branch 'main-patch-papers' into 'main'
Merge branch 'fix/datasets_doc' into 'main'
Add link to datasets doc in the README file
Merge branch 'vladb/datasets' into 'main'
Add common datasets table
Add tokenization and mixed precision sections
Merge branch 'main-patch-3fa8' into 'main'
Introduce the Profiling section
Merge branch 'feature/benchzoo' into 'main'
Merge branch 'refactor/structure' into 'main'
Add benchzoo repo
Add QLoRA paper, move LoRA paper to fine-tuning
Loading