Add inference code
- Jan 26, 2024
Alexandru-Mihai GHERGHESCU authored
Output model tokens per second at the end of inference.
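A minimal sketch of how such a throughput measurement might look, assuming a token-by-token generation loop; `generate` and `model.generate_next_token` are hypothetical names, not the repository's actual API.

```python
import time

# Hypothetical generation loop; `model.generate_next_token` is an assumed
# helper, not the repository's actual API.
def generate(model, input_ids, max_new_tokens):
    start = time.perf_counter()
    for _ in range(max_new_tokens):
        next_token = model.generate_next_token(input_ids)
        input_ids.append(next_token)
    elapsed = time.perf_counter() - start
    # Report throughput once, at the end of inference.
    print(f"{max_new_tokens / elapsed:.2f} tokens/s")
    return input_ids
```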
Alexandru-Mihai GHERGHESCU authored
This allows the inference code to start up with a prompt instead of waiting for user input on stdin, which makes scripting easier and is useful for batch generation, benchmarking, etc.
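A minimal sketch of what the prompt handling might look like, assuming an argparse-based CLI; the `--prompt` flag name is an assumption, not necessarily the repository's actual interface.

```python
import argparse
import sys

parser = argparse.ArgumentParser(description="Run model inference")
# Assumed flag name; the actual option in the repository may differ.
parser.add_argument("--prompt", default=None,
                    help="starting prompt; read from stdin if omitted")
args = parser.parse_args()

# Fall back to stdin only when no prompt was supplied, so scripted and
# batch invocations never block waiting for interactive input.
if args.prompt is not None:
    prompt = args.prompt
else:
    prompt = sys.stdin.readline().rstrip("\n")
```

Scripted use would then look like `python inference.py --prompt "Once upon a time"`, which is what makes batch generation and benchmarking straightforward.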
- Jan 25, 2024
Alexandru-Mihai GHERGHESCU authored
Inference example code. At the moment, the code simply loads a model state file and generates text from it. Parameters such as the maximum sequence length, whether training used fp16, and which tokenizer was used for training need to be passed manually by the user (there's a lot of room for error here). To be improved.

Merges changes from !14
Closes !14
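A sketch of the manual setup the commit describes, assuming a PyTorch state dict; every name and value below is illustrative, and each setting must be supplied by hand to match the training run.

```python
import torch

# All values below are illustrative assumptions; each one must be passed
# manually and must match training exactly (hence the room for error).
max_seq_len = 2048                  # maximum sequence length
use_fp16 = True                     # whether training used fp16
tokenizer_path = "tokenizer.model"  # tokenizer used for training

state = torch.load("model_state.pth", map_location="cpu")
# model = build_model(max_seq_len=max_seq_len, fp16=use_fp16)  # hypothetical
# model.load_state_dict(state)
```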