🏠
Working
I am Zhenbin Chen, a PhD student at ShanghaiTech University. My research interests are LLMs and RL.
- ShanghaiTech University
- Foshan, Guangdong
- https://www.zhihu.com/people/chen-zhen-bin-88
- https://scholar.google.com/citations?user=B-9QFwIAAAAJ&hl=zh-CN
Stars
Speculative Decoding
6 repositories
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding"
Fast inference from large language models via speculative decoding
Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)
📰 Must-read papers and blogs on Speculative Decoding ⚡️
