Skip to content

Pull requests: microsoft/onnxruntime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[CUDA] Enable CUDA GQA QK-Norm and XQA decode
#29186 opened Jun 20, 2026 by tianleiwu Contributor Loading…
3 of 4 tasks
[CUDA] Add sliding-window support to non-quantized XQA decode
#29177 opened Jun 20, 2026 by tianleiwu Contributor Loading…
3 of 4 tasks
[CUDA] update doc for cuda contrib op GroupQueryAttention
#29173 opened Jun 20, 2026 by tianleiwu Contributor Loading…
[CUDA]: Split-K2 QMoE SwiGLU GEMV kernel
#29167 opened Jun 19, 2026 by tianleiwu Contributor Loading…
Bump vite and @vitejs/plugin-vue in /js/web/test/e2e/exports/testcases/vite-default dependencies Pull requests that update a dependency file javascript Pull requests that update Javascript code
#29165 opened Jun 19, 2026 by dependabot Bot Loading…
[CPU] Enable pre-packed weights sharing for MatMulNBits
#29163 opened Jun 19, 2026 by derdeljan-msft Contributor Loading…
Address github issue 29071
#29158 opened Jun 18, 2026 by yuslepukhin Member Loading…
Fix over-copy of packed sub-byte tensors in OrtApi::GetValue
#29157 opened Jun 18, 2026 by neilmsft Contributor Loading…
Bump @babel/core from 7.29.0 to 7.29.6 in /js/react_native/e2e dependencies Pull requests that update a dependency file javascript Pull requests that update Javascript code
#29156 opened Jun 18, 2026 by dependabot Bot Loading…
Experimental C++ API update
#29142 opened Jun 17, 2026 by edgchen1 Contributor Loading…
Remove the dynamic WGSL generator (duktape/Node) path
#29141 opened Jun 17, 2026 by danielsongmicrosoft Contributor Loading…
Guard large-head nonpad Attention MEA dispatch ep:CUDA issues related to the CUDA execution provider
#29140 opened Jun 17, 2026 by Kevin-Li-2025 Loading…
Bump protobufjs from 7.6.0 to 7.6.3 in /js/node
#29090 opened Jun 17, 2026 by maoger Loading…
Avoid small MatMul batch parameter heap allocations
#29085 opened Jun 17, 2026 by GopalakrishnanN Contributor Loading…
Add 2-bit quantization support to WebGPU GatherBlockQuantized operator ep:WebGPU ort-web webgpu provider
#29074 opened Jun 16, 2026 by Shivani767 Contributor Loading…
ProTip! Add no:assignee to see everything that’s not assigned.