-
AI Research @IBM
- Zurich, Switzerland
- https://orcid.org/0009-0007-7755-6436
- https://ch.linkedin.com/in/yannickschnider
Stars
See vLLM official support: https://github.com/vllm-project/vllm-ascend
Framework to reduce autotune overhead to zero for well known deployments.
Deep Learning Project HS21



