Support attention bias by lzhangzz · Pull Request #14 · InternLM/lmdeploy

lzhangzz · 2023-06-22T05:51:13Z

No description provided.

* support ascend using infer_ext
* fix(ascend): make infer_ext using TND format q,k,v in paged_token_attention
* support ascend using infer_ext
* feat: support ascend moe_gating_topk_softmax
* feat: change infer_ext ops function param order (#2)
* ascend: align attention mask to 32bytes (#7)
* fix attn args (#9)
* fix: expand shape of attn_mask (#10)
* feat: update infer_ext ops interface (#13)
* rename infer_ext to dlinfer
* format code
* Support internlm 2.5 (#14)
* refactor ascend pagedattention
* fix ascend apply_rotary_pos_emb
* fix import dlinfer (#16)
* fix: fix rms_norm params (#18)
* fix sync on ascend

---------

Co-authored-by: chenchiyu <chenchiyu@pjlab.org.cn>
Co-authored-by: CyCle1024 <ccy_justin@163.com>
Co-authored-by: Wei Tao <1136862851@qq.com>
Co-authored-by: jinminxi104 <jinminxi104@hotmail.com>
Co-authored-by: pdx1989 <pdx1989@gmail.com>

lzhangzz added 2 commits June 22, 2023 05:09

support attention bias

556cbe0

fix conflict

85e88bf

lvhan028 approved these changes Jun 24, 2023

lvhan028 merged commit 2700abb into InternLM:main Jun 24, 2023

lvhan028 merged 2 commits into InternLM:main from lzhangzz:attn-bias


