- Dec 03, 2023
Vlad-Andrei BĂDOIU (78692) authored
-
- Nov 29, 2023
Alexandru-Mihai GHERGHESCU authored
Since only the first process communicates with the user, the keyboard interrupt will not reach the other processes, so they will still crash regardless of what we catch. A more complex mechanism would be needed to transmit the user's intention to close the app to the other processes.
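A minimal sketch of one such mechanism, assuming a torch.distributed process group that is already initialized; the `should_stop` helper and its call pattern are assumptions for illustration, not the repository's actual code:

```python
import torch
import torch.distributed as dist

def should_stop(interrupted: bool) -> bool:
    """Agree across all ranks on whether to shut down.

    Rank 0 passes interrupted=True after catching KeyboardInterrupt; the
    broadcast overwrites every other rank's flag with rank 0's value, so
    all processes reach the same decision.
    """
    flag = torch.tensor([float(interrupted)])
    dist.broadcast(flag, src=0)
    return bool(flag.item())
```

Rank 0 would wrap its user-facing loop in try/except KeyboardInterrupt and pass the result here once per iteration, while the other ranks always pass False; all ranks then exit together instead of crashing.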
-
Alexandru-Mihai GHERGHESCU authored
-
Alexandru-Mihai GHERGHESCU authored
-
Alexandru-Mihai GHERGHESCU authored
-
Alexandru-Mihai GHERGHESCU authored
-
Alexandru-Mihai GHERGHESCU authored
-
Alexandru-Mihai GHERGHESCU authored
-
Alexandru-Mihai GHERGHESCU authored
Since the tokenizer model is just a 500 KB file, we can easily add it here and avoid dealing with it later.
-
Alexandru-Mihai GHERGHESCU authored
Previously, the script just crashed without informing the user.
-
Alexandru-Mihai GHERGHESCU authored
-
- Nov 14, 2023
ruanslv authored
Fix key-value caching for seqlen != 1 (Issue #899)
-
- Nov 13, 2023
flu0r1ne authored
Update and add comments about the shapes of the key and value matrices in the attention component. E.g., the second dimension has length seqlen + cache_len, not seqlen as previously stated.
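A small self-contained illustration of the corrected shapes; the sizes and names (bsz, cache_len, seqlen, n_heads, head_dim) are hypothetical:

```python
import torch

bsz, cache_len, seqlen = 2, 5, 3      # hypothetical sizes
n_heads, head_dim = 8, 64

cached_k = torch.randn(bsz, cache_len, n_heads, head_dim)
new_k = torch.randn(bsz, seqlen, n_heads, head_dim)

# After the cache is joined with the newly projected keys, the second
# dimension has length seqlen + cache_len, not seqlen:
keys = torch.cat([cached_k, new_k], dim=1)
assert keys.shape == (bsz, cache_len + seqlen, n_heads, head_dim)
```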
-
Alex authored
Update names for consistency with code
Co-authored-by: ruanslv <ruanslv@gmail.com>
-
- Nov 10, 2023
Joseph Spisak authored
Update README.md
-
Joseph Spisak authored
-
- Nov 08, 2023
Suraj Subramanian authored
-
Suraj Subramanian authored
-
- Nov 03, 2023
flu0r1ne authored
This commit fixes a bug in the key-value caching. Currently, a square attention mask is misapplied to the scores matrix despite not matching its shape, which results in a runtime error. In a correct implementation, the decoder mask needs to describe how the new seq_len tokens interact with all the cached tokens. That is, the attention mask needs to be of shape (seq_len, total_len), indicating how the token at row i (representing token i + cached_len in the transformer model) attends to token j. Accordingly, the matrix needs to mask entries where j > cached_len + i. This patch horizontally prepends (seq_len, cached_len) zeros to an upper-triangular mask of size (seq_len, seq_len) to form the (seq_len, total_len) mask.
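A minimal sketch of this mask construction in PyTorch, with hypothetical sizes; it illustrates the idea of the patch rather than reproducing the actual diff:

```python
import torch

seq_len, cached_len = 3, 4            # hypothetical sizes
total_len = cached_len + seq_len

# Upper-triangular part for the seq_len new tokens: new token i may not
# attend to a later new token.
tri = torch.triu(torch.full((seq_len, seq_len), float("-inf")), diagonal=1)

# Zero columns for the cached tokens: every new token attends to all of
# them. The combined mask is -inf exactly where j > cached_len + i.
mask = torch.hstack([torch.zeros(seq_len, cached_len), tri])
assert mask.shape == (seq_len, total_len)
```

Adding this (seq_len, total_len) mask to the scores matrix, which has the same shape during incremental decoding, avoids the shape mismatch that previously caused the runtime error.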
-
- Nov 02, 2023
Joseph Spisak authored
Correct the typo "bug," to "bug", in README.md
-
JacobHelwig authored
-
Joseph Spisak authored
Delete FAQ.md
-
Joseph Spisak authored
-
Joseph Spisak authored
Update README.md
-
Joseph Spisak authored
-
- Oct 18, 2023
Suraj Subramanian authored
-
- Oct 16, 2023
Joseph Spisak authored
Update issue templates
-
Suraj Subramanian authored
-
- Oct 15, 2023
Joseph Spisak authored
[closes #858] Change "Content Length" to "Context Length" in MODEL_CARD.md
-
yonashub authored
-
- Oct 11, 2023
Joseph Spisak authored
FAQ updates
-
Joseph Spisak authored
Made some small fixes and added some context.
-
sekyondaMeta authored
-
sekyondaMeta authored
-
- Sep 29, 2023
samuelselvan authored
Add the "--continue" flag to wget for the model binary in order to resume the download
-
- Sep 26, 2023
Joseph Spisak authored
Update README.md
-
Joseph Spisak authored
Updated the Meta AI mention to just Meta.
-
- Sep 23, 2023
Kieren authored
-
- Sep 21, 2023
Joseph Spisak authored
Update MODEL_CARD.md
-
Joseph Spisak authored
Update FAQ.md
-