-
Notifications
You must be signed in to change notification settings - Fork 4k
Pull requests: microsoft/onnxruntime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CUDA] Enable CUDA GQA QK-Norm and XQA decode
#29186
opened Jun 20, 2026 by
tianleiwu
Contributor
Loading…
3 of 4 tasks
[CUDA] Add sliding-window support to non-quantized XQA decode
#29177
opened Jun 20, 2026 by
tianleiwu
Contributor
Loading…
3 of 4 tasks
[CUDA] update doc for cuda contrib op GroupQueryAttention
#29173
opened Jun 20, 2026 by
tianleiwu
Contributor
Loading…
Add
--build-mode to packaging pipeline orchestrator (trigger_and_wait_pipelines.py)
#29169
opened Jun 19, 2026 by
edgchen1
Contributor
Loading…
Add reference examples and guidance (especially for multi-device scenarios) for ValidateCompiledModelCompatibilityInfo
#29168
opened Jun 19, 2026 by
adrastogi
Contributor
Loading…
[CUDA]: Split-K2 QMoE SwiGLU GEMV kernel
#29167
opened Jun 19, 2026 by
tianleiwu
Contributor
Loading…
Bump vite and @vitejs/plugin-vue in /js/web/test/e2e/exports/testcases/vite-default
dependencies
Pull requests that update a dependency file
javascript
Pull requests that update Javascript code
#29165
opened Jun 19, 2026 by
dependabot
Bot
Loading…
[CPU] Enable pre-packed weights sharing for MatMulNBits
#29163
opened Jun 19, 2026 by
derdeljan-msft
Contributor
Loading…
Fix over-copy of packed sub-byte tensors in OrtApi::GetValue
#29157
opened Jun 18, 2026 by
neilmsft
Contributor
Loading…
Bump @babel/core from 7.29.0 to 7.29.6 in /js/react_native/e2e
dependencies
Pull requests that update a dependency file
javascript
Pull requests that update Javascript code
#29156
opened Jun 18, 2026 by
dependabot
Bot
Loading…
Add int8/uint8 CPU support for SpaceToDepth and int8 for DepthToSpace
#29154
opened Jun 18, 2026 by
ArsalanShakil
Loading…
Recover Conv/ConvTranspose rank from weight when input shape is unknown
#29149
opened Jun 18, 2026 by
fanchenkong1
Contributor
Loading…
Remove the dynamic WGSL generator (duktape/Node) path
#29141
opened Jun 17, 2026 by
danielsongmicrosoft
Contributor
Loading…
Guard large-head nonpad Attention MEA dispatch
ep:CUDA
issues related to the CUDA execution provider
#29140
opened Jun 17, 2026 by
Kevin-Li-2025
Loading…
Segregate IExecutionProvider optional capabilities into mix-in interfaces
#29087
opened Jun 17, 2026 by
GopalakrishnanN
Contributor
Loading…
Avoid small MatMul batch parameter heap allocations
#29085
opened Jun 17, 2026 by
GopalakrishnanN
Contributor
Loading…
Fix Shape→Gather→TopK regression: preserve rank-1 single-element index output in data propagation
#29084
opened Jun 16, 2026 by
titaiwangms
Contributor
Loading…
Add 2-bit quantization support to WebGPU GatherBlockQuantized operator
ep:WebGPU
ort-web webgpu provider
#29074
opened Jun 16, 2026 by
Shivani767
Contributor
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.