-
Notifications
You must be signed in to change notification settings - Fork 12.2k
MPT support in llama.cpp #3417
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
MPT support in llama.cpp #3417
Changes from 1 commit
Commits
Show all changes
17 commits
Select commit
Hold shift + click to select a range
b49792b
CUDA: added support for ggml_clamp (see also: https://github.com/gger…
jploski 15236e8
mpt : added an implementation based (mostly) on falcon integration, m…
jploski 84e30e8
mpt : protect against "clip_qkv": null in mpt-7b
jploski 00e8c5c
mpt : quick fix to avoid "Strange model" warning when quantizing MPT …
jploski 1be89c4
mpt : addendum to changeset:84e30e8 - leave parameter clamp_kqv out f…
jploski 26c253e
mpt : standardized all tensor names to follow GGUF spec
jploski df072d2
mpt : addendum to changeset:1be89c40 - use "req" parameter of GGUF_GE…
jploski 90e7d6d
mpt : fixed comment s/gptneox/mpt/
jploski 4708012
mpt : remove tabs, trailing whitespace
jploski 1364bcd
mpt : removed ne01 + n_past == ne00 assertion from alibi (cuda/f32) a…
jploski 7d6a24a
mpt : updated convert-mpt-hf-to-gguf.py to reflect changes made to co…
jploski 292363e
Merge branch 'master' of https://github.com/ggerganov/llama.cpp into …
cebtenzzre ad3c2f3
comment out n_past instead of marking it unused
cebtenzzre 1a454eb
mpt : removed hardcoded +178 from convert script in favor of utilizin…
jploski 32172f1
mpt : remove unused tokenizer_json in convert script
cebtenzzre 96cf3f5
ggml : remove obsolete n_past assert in ggml_alibi
ggerganov 9b66378
llama : print clam_kqv and max_alibi_bias hparams
ggerganov File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Merge branch 'master' of https://github.com/ggerganov/llama.cpp into …
…pull-3417
- Loading branch information
commit 292363e5562e5dd1830d29386a6470cb2ffc57fb
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
You are viewing a condensed version of this merge commit. You can view the full changes here.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.