ggml-org / llama.cpp Public

Notifications You must be signed in to change notification settings
Fork 13.6k
Star 89.6k

Code
Issues 270
Pull requests 580
Discussions
Actions
Projects 10
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Pull requests: ggml-org/llama.cpp

Labels 78 Milestones 0

New pull request New

Clear current search query, filters, and sorts

580 Open 7,621 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Add "-e"/"--eval-threads" to distinguish thread counts for single-token eval and prompt eval threading

Parallel processing and thread management

#744 opened Apr 3, 2023 by MagisterLuddite • Draft

Optimize locking behavior threading

Parallel processing and thread management

#813 opened Apr 6, 2023 by janekb04

Loading…

Q4_0 scale selection using RMSE enhancement

New feature or request

Less than 4 bits

Efforts related to viable quantized models using <4 bits

research 🔬 Review Complexity : High

Generally require indepth knowledge of LLMs or GPUs

#835 opened Apr 7, 2023 by sw • Draft

Run several single thread operators parellel threading

Parallel processing and thread management

#850 opened Apr 8, 2023 by howard0su

Loading…

Use Threadpool to schedule the work threading

Parallel processing and thread management

#851 opened Apr 8, 2023 by howard0su • Draft

Add mmap pages stats (disabled by default)

#1015 opened Apr 16, 2023 by prusnak

Loading…

Add command mode to interactive mode. enhancement

New feature or request

Review Complexity : Medium

Generally require more time to grok but manageable by beginner to medium expertise level

#1022 opened Apr 17, 2023 by wbpxre150

Loading…

Add a option to force the token end of text apears even on interative, and also shows loading porcentage

#1058 opened Apr 19, 2023 by jeffersoncgo

Loading…

llama : quantize attention results demo

Demonstrate some concept or idea, not intended to be merged

#1103 opened Apr 21, 2023 by ggerganov • Draft

main: add pledge call on OpenBSD

#1132 opened Apr 22, 2023 by codesoap

Loading…

fix(LoRA): debugging

#1190 opened Apr 26, 2023 by jon-chuang

Loading…

Create run.py enhancement

New feature or request

obsolete?

Marker for potentially obsolete PR

python

python script changes

Review Complexity : Low

Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix

script

Script related

#1204 opened Apr 27, 2023 by jdpsl

Loading…

Implement get_num_physical_cores() for Windows

#1278 opened May 2, 2023 by DannyDaemonic

Loading…

[Research] Steering vectors research 🔬

#1472 opened May 16, 2023 by SlyEcho • Draft

Upgrade v1/v2 format to v3 by leveraging quantize

#1504 opened May 17, 2023 by howard0su

Loading…

ci: add linux binaries to release build

#1505 opened May 17, 2023 by Green-Sky

Loading…

ggml : spread compute across threads in chunks demo

Demonstrate some concept or idea, not intended to be merged

threading

Parallel processing and thread management

#1507 opened May 17, 2023 by ggerganov • Draft

Llama cpp low level python bindings

#1660 opened Jun 1, 2023 by dmahurin

Loading…

Added Arbitrary mixed quantization Less than 4 bits

Efforts related to viable quantized models using <4 bits

research 🔬

#1834 opened Jun 13, 2023 by Milkdrop

Loading…

Disable _O_WTEXT when using main in MinGW

#1897 opened Jun 16, 2023 by asctime

Loading…

Draft: #1776 making bos and eos available for user input

#1986 opened Jun 24, 2023 by HashemAlsaket • Draft

Example work stealing chunked task allocator for issue #291

#2026 opened Jun 27, 2023 by mqy • Draft

Avoid unused constant warnings refactoring

Refactoring

Review Complexity : Low

Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix

#2029 opened Jun 28, 2023 by set-soft

Loading…

try to fix compile warnings on macOS, address issue #2036

#2037 opened Jun 28, 2023 by mqy

Loading…

Avoid allocating beyond MTLDevice.recommendedMaxWorkingSetSize

#2069 opened Jul 2, 2023 by kiltyj • Draft

Previous 1 2 3 4 5 … 23 24 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!