-
Notifications
You must be signed in to change notification settings - Fork 1k
Comparing changes
Open a pull request
base repository: kvcache-ai/ktransformers
base: v0.3.1
head repository: kvcache-ai/ktransformers
compare: main
- 12 commits
- 14 files changed
- 9 contributors
Commits on May 18, 2025
-
VLinearMarlin: padding to input.shape[0] to avoid CUDA error
Fix the following runtime error with --no-use_cuda_graph option Traceback (most recent call last): File "/home/aubrey/miniforge3/envs/kt/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap self.run() File "/home/aubrey/miniforge3/envs/kt/lib/python3.11/multiprocessing/process.py", line 108, in run self._target(*self._args, **self._kwargs) File "/home/aubrey/miniforge3/envs/kt/lib/python3.11/site-packages/ktransformers/server/backend/interfaces/balance_serve.py", line 282, in run_engine engine.loop() File "/home/aubrey/miniforge3/envs/kt/lib/python3.11/site-packages/ktransformers/server/backend/interfaces/balance_serve.py", line 234, in loop self.model_runner.run(self.batch, self.query_manager) File "/home/aubrey/miniforge3/envs/kt/lib/python3.11/site-packages/ktransformers/server/balance_serve/inference/model_runner.py", line 220, in run self.output.logits[0] = self.output.logits[0][self.input[cuda_graph_idx].minibatch.logits_start] ~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ RuntimeError: CUDA error: an illegal memory access was encountered CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
Configuration menu - View commit details
-
Copy full SHA for d347aeb - Browse repository at this point
Copy the full SHA d347aebView commit details -
Merge pull request #1320 from aubreyli/no_cuda_graph_err
VLinearMarlin: padding to input.shape[0] to avoid CUDA error
Configuration menu - View commit details
-
Copy full SHA for 01311d2 - Browse repository at this point
Copy the full SHA 01311d2View commit details
Commits on May 19, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 4f78e37 - Browse repository at this point
Copy the full SHA 4f78e37View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7d79735 - Browse repository at this point
Copy the full SHA 7d79735View commit details
Commits on May 21, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 6645398 - Browse repository at this point
Copy the full SHA 6645398View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2589336 - Browse repository at this point
Copy the full SHA 2589336View commit details
Commits on May 22, 2025
-
Configuration menu - View commit details
-
Copy full SHA for adc0906 - Browse repository at this point
Copy the full SHA adc0906View commit details
Commits on May 23, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 71a5fc5 - Browse repository at this point
Copy the full SHA 71a5fc5View commit details -
Merge pull request #1331 from rnwang04/qwen3_xpu_support
add XPU support for qwen3moe local chat
Configuration menu - View commit details
-
Copy full SHA for 0c44f2e - Browse repository at this point
Copy the full SHA 0c44f2eView commit details
Commits on May 28, 2025
-
docs: add Dockerfile.xpu and GPU driver setup instructions
- Add Dockerfile.xpu for oneAPI-based container - Create Docker_xpu.md with usage instructions - Update xpu.md to include Docker guide
Configuration menu - View commit details
-
Copy full SHA for 404ad39 - Browse repository at this point
Copy the full SHA 404ad39View commit details -
Merge pull request #1337 from liu-shaojun/docker_xpu
Add Dockerfile and usage guide for XPU support
Configuration menu - View commit details
-
Copy full SHA for ce75fcd - Browse repository at this point
Copy the full SHA ce75fcdView commit details
Commits on May 29, 2025
-
raise exception on device error (#1342)
* display the unavailable torch device on error * Raise exception on device error --------- Signed-off-by: Emmanuel Ferdman <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d8bc640 - Browse repository at this point
Copy the full SHA d8bc640View commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff v0.3.1...main