
Wt/camb/interface #31


Closed: wanted to merge 8 commits from the wt/camb/interface branch

Conversation

JackWeiw

Motivation

This PR changes the LLM op interface of paged_prefill_attention.

Modification

Add the relevant params to the dlinfer attention backend and kernel.
Added the params requested by tmo (a usage sketch follows the list below):
cu_seq_lens_kv (Tensor): The cumulative sequence lengths of the key/value sequences.
max_kv_seq_len (int): The maximum length of any key/value sequence.
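
For illustration, here is a minimal sketch of how these two parameters are typically derived from per-sequence key/value lengths before being passed to the attention op. The call shown in the trailing comment is an assumption for readability, not the exact dlinfer signature.

```python
import torch

# Hypothetical example batch: lengths of each key/value sequence.
kv_seq_lens = torch.tensor([5, 12, 7], dtype=torch.int32)

# cu_seq_lens_kv: cumulative key/value sequence lengths -> [0, 5, 17, 24]
cu_seq_lens_kv = torch.nn.functional.pad(torch.cumsum(kv_seq_lens, dim=0), (1, 0))

# max_kv_seq_len: maximum length of any key/value sequence -> 12
max_kv_seq_len = int(kv_seq_lens.max())

# These would then be forwarded to the paged prefill attention op, e.g.
# (illustrative call only, not the exact dlinfer API):
# out = paged_prefill_attention(q, k_cache, v_cache, block_table,
#                               cu_seq_lens_kv=cu_seq_lens_kv,
#                               max_kv_seq_len=max_kv_seq_len, ...)
```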

JackWeiw and others added 8 commits on December 30, 2024 at 10:50
* support w8a8 smooth_quant and loading

* optimize int8

* fix fp8 kernels

* update docs for w8a8

* resolve comments

* resolve comments

* fix ut

* disable not quant last norm

* disable quant last norm for cogvlm and minicpmv26 models

---------

Co-authored-by: grimoire <[email protected]>
* first

* better tuning

* restore tuning value
* remove threadsafe

* optimize performance

* 22.4

* 22.5

* delete jsonl

* add docs

* fix link

* rst

* remove sleep req step

* remove scheduler sleep

* fix ut

* recovery async engine
* Update ascend get_started.md

* Update ascend get_started.md

* fix Dockerfile_aarch64_ascend
@JackWeiw JackWeiw closed this Jan 14, 2025
@JackWeiw JackWeiw deleted the wt/camb/interface branch January 14, 2025 09:07