-
Notifications
You must be signed in to change notification settings - Fork 655
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Feat] Support MLP_TP feature, exclude MOE layer
documentation
Improvements or additions to documentation
module:core
module:ops
module:tests
#4999
opened Dec 14, 2025 by
zzhx1
Loading…
[CI][Bugfix] Fix scheduleroutput has no attr get error in prompt logprobs
#4998
opened Dec 14, 2025 by
MengqingCao
Loading…
【Feature】refactor npu_modelrunner for profile_run
module:core
module:tests
ready
read for review
ready-for-test
start test by label for PR
#4993
opened Dec 13, 2025 by
zhenwenqi2024
Loading…
[DoNotMerge]support qk_rmsnorm_rope_fusion
ci/build
merge-conflicts
module:core
module:ops
module:tests
#4987
opened Dec 13, 2025 by
Angazenn
Loading…
[WIP][Bugfix][MoE] Fix allgather in w4a8_dynamic
#4977
opened Dec 13, 2025 by
Pr0Wh1teGivee
Loading…
[Feat] Refactor rejection sampler
ready
read for review
ready-for-test
start test by label for PR
#4975
opened Dec 12, 2025 by
realliujiaxu
Loading…
3 tasks done
[ModelRunner] apply_grammer uses vllm function
ready
read for review
ready-for-test
start test by label for PR
#4974
opened Dec 12, 2025 by
zhenwenqi2024
Loading…
[Bugfix] fix async-scheduling with pipeline parallelism
merge-conflicts
#4973
opened Dec 12, 2025 by
lidenghui1110
Loading…
[perfermance] Eliminate the D2H synchronization operations in post-processing to resolve the Eagle partial fast/slow card issue
#4967
opened Dec 12, 2025 by
coder-fny
Loading…
Fix the accuracy arange change in normal scene is more than 7
module:tests
#4964
opened Dec 12, 2025 by
leo-pony
Loading…
[TEST]Update aisbench params for qwen2.5-vl-7b acc test
module:tests
#4961
opened Dec 12, 2025 by
jiangyunfan1
Loading…
Add model_runner pcp related UTs
merge-conflicts
module:tests
#4951
opened Dec 12, 2025 by
zhangsicheng5
Loading…
[bugfix] Fix dummy-run and multi-node issues in MoE routing and MTP
module:ops
ready
read for review
ready-for-test
start test by label for PR
#4947
opened Dec 12, 2025 by
kiscad
Loading…
[Perf] Optimize memory layout and vectorize PCP/DCP loops in attention_cp.py
#4944
opened Dec 12, 2025 by
ader47
Loading…
[Quant] Add support for Qwen3VL-MoE model quantization
module:quantization
#4942
opened Dec 12, 2025 by
starmountain1997
•
Draft
[Bugfix] Fix matmul allreduce precision issue by using original weight
module:ops
#4939
opened Dec 12, 2025 by
icerain-alt
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.