microsoft / onnxruntime Public

Notifications You must be signed in to change notification settings
Fork 4k
Star 20.9k

Code
Issues 830
Pull requests 577
Discussions
Actions
Projects
Models
Wiki
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Models
Wiki
Security and quality
Insights

Pull requests: microsoft/onnxruntime

Labels 81 Milestones 2

New pull request New

576 Open 18,788 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[CUDA] Enable CUDA GQA QK-Norm and XQA decode

#29186 opened Jun 20, 2026 by tianleiwu Contributor

Loading…

3 of 4 tasks

[CUDA] Add sliding-window support to non-quantized XQA decode

#29177 opened Jun 20, 2026 by tianleiwu Contributor

Loading…

3 of 4 tasks

[CUDA] update doc for cuda contrib op GroupQueryAttention

#29173 opened Jun 20, 2026 by tianleiwu Contributor

Loading…

[CUDA] Fuse GPT-OSS router bias into MatMulNBits GEMV

#29170 opened Jun 20, 2026 by tianleiwu Contributor • Draft

Add --build-mode to packaging pipeline orchestrator (trigger_and_wait_pipelines.py)

#29169 opened Jun 19, 2026 by edgchen1 Contributor

Loading…

Add reference examples and guidance (especially for multi-device scenarios) for ValidateCompiledModelCompatibilityInfo

#29168 opened Jun 19, 2026 by adrastogi Contributor

Loading…

[CUDA]: Split-K2 QMoE SwiGLU GEMV kernel

#29167 opened Jun 19, 2026 by tianleiwu Contributor

Loading…

Bump vite and @vitejs/plugin-vue in /js/web/test/e2e/exports/testcases/vite-default dependencies

Pull requests that update a dependency file

javascript

Pull requests that update Javascript code

#29165 opened Jun 19, 2026 by dependabot Bot

Loading…

[CPU] Enable pre-packed weights sharing for MatMulNBits

#29163 opened Jun 19, 2026 by derdeljan-msft Contributor

Loading…

Harden Android NDK setup against apt mirror outages

#29159 opened Jun 18, 2026 by Copilot AI • Draft

Address github issue 29071

#29158 opened Jun 18, 2026 by yuslepukhin Member

Loading…

Fix over-copy of packed sub-byte tensors in OrtApi::GetValue

#29157 opened Jun 18, 2026 by neilmsft Contributor

Loading…

Bump @babel/core from 7.29.0 to 7.29.6 in /js/react_native/e2e dependencies

Pull requests that update a dependency file

javascript

Pull requests that update Javascript code

#29156 opened Jun 18, 2026 by dependabot Bot

Loading…

Add int8/uint8 CPU support for SpaceToDepth and int8 for DepthToSpace

#29154 opened Jun 18, 2026 by ArsalanShakil

Loading…

Recover Conv/ConvTranspose rank from weight when input shape is unknown

#29149 opened Jun 18, 2026 by fanchenkong1 Contributor

Loading…

Experimental C++ API update

#29142 opened Jun 17, 2026 by edgchen1 Contributor

Loading…

Remove the dynamic WGSL generator (duktape/Node) path

#29141 opened Jun 17, 2026 by danielsongmicrosoft Contributor

Loading…

Guard large-head nonpad Attention MEA dispatch ep:CUDA

issues related to the CUDA execution provider

#29140 opened Jun 17, 2026 by Kevin-Li-2025

Loading…

Bump protobufjs from 7.6.0 to 7.6.3 in /js/node

#29090 opened Jun 17, 2026 by maoger

Loading…

Segregate IExecutionProvider optional capabilities into mix-in interfaces

#29087 opened Jun 17, 2026 by GopalakrishnanN Contributor

Loading…

Avoid small MatMul batch parameter heap allocations

#29085 opened Jun 17, 2026 by GopalakrishnanN Contributor

Loading…

Fix Shape→Gather→TopK regression: preserve rank-1 single-element index output in data propagation

#29084 opened Jun 16, 2026 by titaiwangms Contributor

Loading…

Add MLFloat16 QuickGelu CPU kernel for fused fp16 Swish/SiLU

#29080 opened Jun 16, 2026 by Copilot AI • Draft

Add 2-bit quantization support to WebGPU GatherBlockQuantized operator ep:WebGPU

ort-web webgpu provider

#29074 opened Jun 16, 2026 by Shivani767 Contributor

Loading…

Fix CPU GQA NaN output for right-padded batched prompts with rotary embeddings

#29069 opened Jun 16, 2026 by Copilot AI • Draft

Previous 1 2 3 4 5 … 23 24 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!