Added code for human-eval benchmark (876d85a5) · Commits · NetSys / Awesome LLM · GitLab

Skip to content

GitLab

Explore

Sign in

Primary navigation

Project

A

Awesome LLM
- Activity
- Members
- Labels
- Issues
- Issue boards
- Milestones
- Iterations
- Wiki
- Requirements
- Environments
- Terraform modules
- Incidents

Snippets Groups Projects

876d85a5

Commit 876d85a5 authored 1 year ago by Rareș Stanca Committed by Vlad-Andrei BĂDOIU (78692) 1 year ago

Downloads
- Patches
- Plain Diff

Added code for human-eval benchmark

parent b8330408

No related branches found

No related tags found

Loading

Changes 7

Hide whitespace changes

Inline Side-by-side

Showing

benchmarks/human-eval/README.md 28 additions, 0 deletions

benchmarks/human-eval/README.md
benchmarks/human-eval/core/__init__.py 2 additions, 0 deletions

benchmarks/human-eval/core/__init__.py
benchmarks/human-eval/core/evaluation.py 67 additions, 0 deletions

benchmarks/human-eval/core/evaluation.py
benchmarks/human-eval/core/prompts.py 14 additions, 0 deletions

benchmarks/human-eval/core/prompts.py
benchmarks/human-eval/eval_llama.py 66 additions, 0 deletions

benchmarks/human-eval/eval_llama.py
benchmarks/human-eval/eval_phi.py 72 additions, 0 deletions

benchmarks/human-eval/eval_phi.py
benchmarks/human-eval/requirements.txt 7 additions, 0 deletions

benchmarks/human-eval/requirements.txt

with 256 additions and 0 deletions

Loading

0% Loading or .

You are about to add 0 people to the discussion. Proceed with caution.

Finish editing this message first!

Please register or sign in to comment