-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][Perf] Multi-stream attention, fuse rmsnorm add, fuse swiglu
#11362
opened Feb 8, 2026 by
suyoggupta
Loading…
1 task
[TRTLLM-8263][feat] Add ctx-only and gen-only Disagg Perf Tests
#11361
opened Feb 7, 2026 by
chenfeiz0326
Loading…
1 task
[TRTC-265][chore] Add CODEOWNERS coverage for serve/ and commands/ directories
#11359
opened Feb 7, 2026 by
venkywonka
Loading…
1 task done
[TRTC-264][doc] Add CLAUDE.md and AGENTS.md
#11358
opened Feb 7, 2026 by
venkywonka
Loading…
1 task done
# TensorRT-LLM: Enable Jetson Thor (sm_110) Support & Build Fixes
Community want to contribute
PRs initiated from Community
[Draft] AutoDeploy GLM4.7 flash bundle
#11356
opened Feb 6, 2026 by
bmarimuthu-nv
Loading…
1 task done
[#11146][feat] AutoDeploy: Add triton paged attention
#11355
opened Feb 6, 2026 by
nvchenghaoz
•
Draft
1 task
[https://nvbugs/5829097][fix] Disaggregated serving: Only send finished context requests to the KV cache transceiver
#11354
opened Feb 6, 2026 by
Funatiq
Loading…
1 task done
[TRTLLM-10030][perf] avoid syncs in beam search + other improvements
#11349
opened Feb 6, 2026 by
ixlmar
Loading…
1 task done
[TRTLLM-1234][feat] Fixed sharding for shared embedding projections
#11348
opened Feb 6, 2026 by
greg-kwasniewski1
Loading…
1 task done
[TRTLLM-9904][feat] KVCache V2 MTP support
#11346
opened Feb 6, 2026 by
liji-nv
Loading…
1 task done
[None][feat] Optimize mamba2 _chunk_scan_fwd_kernel
#11345
opened Feb 6, 2026 by
JadoTu
Loading…
1 task done
[None][feat] Optimize the q3n decode kernel with IO read
#11344
opened Feb 6, 2026 by
JadoTu
Loading…
1 task done
[None][feat] Optimize superv3 nvfp4 for better perf version3
#11343
opened Feb 6, 2026 by
Wanli-Jiang
•
Draft
1 task
[None][feat] Refactor time breakdown tool (visualization, generation breakdown, etc.)
#11340
opened Feb 6, 2026 by
luyiyun1021
Loading…
1 task done
[https://nvbugs/5866619][fix] Support PEFT-saved safetensors file loading
#11339
opened Feb 6, 2026 by
Wanli-Jiang
Loading…
1 task done
[TRTLLM-10866][feat] implement disaggregated harmony chat
#11336
opened Feb 6, 2026 by
reasonsolo
Loading…
1 task done
[None][chore] Reduce attention module repeated warnings.
#11335
opened Feb 6, 2026 by
yuxianq
Loading…
1 task done
[None][feat] Use new index api, add block scale support, fix max_seq_len esitmation, add flash mla support
#11334
opened Feb 6, 2026 by
yizhang-nv
Loading…
1 task done
Previous Next
ProTip!
Updated in the last three days: updated:>2026-02-05.