-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Insights: microsoft/onnxruntime
Overview
Could not load contribution data
Please try again later
50 Pull requests merged by 27 people
-
Delete tools/ci_build/github/azure-pipelines/win-gpu-ci-pipeline.yml
#21529 merged
Jul 27, 2024 -
[AIX]test failure fix using gtest-1.15.0 for AIX
#21497 merged
Jul 27, 2024 -
Security fuzz address sanitizer fix Bug #2 and #3
#21528 merged
Jul 27, 2024 -
Bump Sixlabors.ImageSharp from 2.1.8 to 2.1.9 in /csharp/sample/Microsoft.ML.OnnxRuntime.ResNet50v2Sample
#21444 merged
Jul 27, 2024 -
Fix conda failure for onnxruntime-directml
#21526 merged
Jul 27, 2024 -
[VitisAI] support vaip create ep context nodes & bug fix
#21506 merged
Jul 27, 2024 -
[VitisAI] 1. KernelDef supports StartVersion and EndVersion
#21519 merged
Jul 27, 2024 -
Set version and other info in the C# dll
#21517 merged
Jul 27, 2024 -
Update benchmark_mha.py to compare with PyTorch SDPA
#21449 merged
Jul 27, 2024 -
Add QNN EP option context_node_name_prefix to set EPContext node name prefix
#21236 merged
Jul 26, 2024 -
Separating all GPU stages into different Pipelines
#21521 merged
Jul 26, 2024 -
Potential fix for Adobe Analytics
#21512 merged
Jul 26, 2024 -
Update text formatting in generate_cgmanifest.py
#21489 merged
Jul 26, 2024 -
disables qnn in ort training cpu pipeline
#21510 merged
Jul 26, 2024 -
[WebNN EP] Update argMax/argMin to adapt to latest spec
#21452 merged
Jul 26, 2024 -
[DML EP] Register ReduceMin-20
#20477 merged
Jul 26, 2024 -
Fix SkipLayerNormFusion incorrectly setting modified every time it runs
#21502 merged
Jul 26, 2024 -
Allow cpplint to always be green
#21491 merged
Jul 25, 2024 -
CoreML: Aggregated changes to add all required ops for priority model
#21472 merged
Jul 25, 2024 -
Fix Android CI Pipeline code coverage failure
#21504 merged
Jul 25, 2024 -
Qnn batchnorm support input with rank 2
#21469 merged
Jul 25, 2024 -
Add community-contributed LightGlue blog
#21445 merged
Jul 25, 2024 -
Split ondevice training cpu packaging pipeline to a separated pipeline
#21485 merged
Jul 25, 2024 -
Set CUDA12 as default in GPU packages
#21438 merged
Jul 25, 2024 -
Update 05-performance.yml issue template to auto apply label
#21486 merged
Jul 25, 2024 -
[VitisAI] use binary mode for context ep
#21474 merged
Jul 25, 2024 -
OVEP - PR 1.19
#21443 merged
Jul 25, 2024 -
Ignore ruff rule
N813
#21477 merged
Jul 25, 2024 -
Fix security issue #22016 #22017 #22018
#21333 merged
Jul 25, 2024 -
Extend QDQPropagation transformer to handle multiple consumers
#21313 merged
Jul 24, 2024 -
Update ruff and clang-format versions
#21479 merged
Jul 24, 2024 -
[QNN EP] Update to QNN SDK 2.24.0
#21463 merged
Jul 24, 2024 -
Update copy_strip_binary.sh: use "make install" instead
#21464 merged
Jul 24, 2024 -
CoreML: Add ML Program ConvTranspose
#21416 merged
Jul 24, 2024 -
[QNN EP] Improve QNN error reporting using the error message
#21458 merged
Jul 24, 2024 -
[WebNN EP] ConvTranspose should calculate the pads or output shape
#21292 merged
Jul 24, 2024 -
CoreML: Add GridSample ML Program support
#21431 merged
Jul 24, 2024 -
[Fix] C++ API SetOutputShape for register custom op.
#21366 merged
Jul 23, 2024 -
fix python qnn pipelines issues
#21462 merged
Jul 23, 2024 -
[CUDA] Fix cuda provider fallback inconsistency
#21425 merged
Jul 23, 2024 -
Update nodejs's cmake file to fix a file copy issue
#21390 merged
Jul 23, 2024 -
Update C++ dependencies
#21410 merged
Jul 23, 2024 -
CoreML: ML Program Slice
#21433 merged
Jul 23, 2024 -
Update DirectML from 1.14.1 to 1.15.0
#21323 merged
Jul 22, 2024 -
Adds ATen fallback for scaled_dot_product_attention
#21107 merged
Jul 22, 2024 -
Fix typos according to reviewdog report.
#21335 merged
Jul 22, 2024 -
Replace inline pip install with pip install from requirements*.txt
#21106 merged
Jul 22, 2024 -
[WebNN EP] Add outputDataType option for the ArgMax/ArgMin ops
#21385 merged
Jul 22, 2024 -
[CUDA] FusedMHARunnerFP16v2 thread-safe
#21420 merged
Jul 22, 2024
24 Pull requests opened by 20 people
-
Fix typo: Complete the link symbol in pose-detection.md
#21437 opened
Jul 22, 2024 -
Phi3 Android App tutorial
#21446 opened
Jul 22, 2024 -
[WIP] Out-Tree EP feature
#21450 opened
Jul 23, 2024 -
hack ext data location to reduce qd matmul memory usage
#21451 opened
Jul 23, 2024 -
CoreML: Add ML Program Split Op
#21456 opened
Jul 23, 2024 -
[js/webgpu] Add activation for conv3d naive
#21466 opened
Jul 23, 2024 -
Update build from source instructions
#21468 opened
Jul 24, 2024 -
[Fix] ShapeInferContext GetAttrxxxs support empty value
#21471 opened
Jul 24, 2024 -
Update QNN pipeline pool
#21482 opened
Jul 24, 2024 -
Create new_labeler.ylm
#21488 opened
Jul 24, 2024 -
Add reduce kernels for bigger types
#21490 opened
Jul 25, 2024 -
Propagate NaNs in the CPU min and max operators
#21492 opened
Jul 25, 2024 -
Enable FP16 Clip and Handle Bias in FP16 Depthwise Conv
#21493 opened
Jul 25, 2024 -
Create new stale issue workflow
#21495 opened
Jul 25, 2024 -
[CUDA] Fix DecoderMaskedMultiHeadAttention bias input check
#21498 opened
Jul 25, 2024 -
Add topology sort and remove useless cast nodes to fp16 conversion script
#21499 opened
Jul 25, 2024 -
Bump torch from 1.13.1 to 2.2.0 in /tools/ci_build/github/windows/eager
#21505 opened
Jul 25, 2024 -
[WebNN EP] Create MLGraphBuilder for every model builder
#21514 opened
Jul 26, 2024 -
[WebNN EP] Add labels for all WebNN operators
#21516 opened
Jul 26, 2024 -
Remove references of use-android-ndk.yml
#21522 opened
Jul 26, 2024 -
Add Interactive Decoding support in GQA
#21523 opened
Jul 26, 2024 -
[CUDA] Special case for K==0 in CUDA MatMul
#21525 opened
Jul 26, 2024 -
Remove unused parameters from win-ci-vs-2022-job.yml
#21530 opened
Jul 27, 2024 -
[js/web] allow load WebAssembly binary from buffer
#21534 opened
Jul 28, 2024
17 Issues closed by 8 people
-
[Documentation] hopefully final logic app test - please ignore
#21511 closed
Jul 25, 2024 -
[Documentation] [IGNORE] testing logic app change
#21481 closed
Jul 25, 2024 -
[TEST ISSUE] Using this issue to test logic app - please ignore
#21509 closed
Jul 25, 2024 -
[Feature Request] onnxruntime-node support for FreeBSD.
#21508 closed
Jul 25, 2024 -
[Documentation] Community blog post contribution
#21389 closed
Jul 25, 2024 -
[Build] long paths in NuGet package breaking build on Windows
#21369 closed
Jul 25, 2024 -
Create Custom Node in CUDA
#21442 closed
Jul 25, 2024 -
Custom Op Library does not work for CUDA
#21417 closed
Jul 25, 2024 -
EP Error /onnxruntime_src/onnxruntime/core/providers/cuda/cuda_call.cc:123
#21435 closed
Jul 24, 2024 -
[Documentation] Please Ignore - Using this issue to test broken logic app
#21470 closed
Jul 24, 2024 -
[Documentation] Testing logic app update (please ignore)
#21465 closed
Jul 24, 2024 -
TensorRT EP failed to create engine from network.
#21415 closed
Jul 24, 2024 -
[Build] MSVC shared runtime linking error for TRT, CUDA, OpenVino and DML build
#19642 closed
Jul 23, 2024 -
[TensorRT] Caching to a dedicated ONNX file does not work
#21307 closed
Jul 23, 2024 -
[Build] Missing DirectML build in 1.18.1
#21460 closed
Jul 23, 2024 -
Unable to append DML Provider
#21432 closed
Jul 22, 2024 -
[Build] Cross compilation of the onnxruntime 1.5.1 for ARMv7 32bit target for gcc 4.9.2
#21439 closed
Jul 22, 2024
26 Issues opened by 25 people
-
Pushing Rust bindings forward
#21533 opened
Jul 28, 2024 -
[Build] Docerfile.cuda docker image build error
#21532 opened
Jul 28, 2024 -
MLAS failing with "Could not find an implementation for QLinearMatMul"
#21531 opened
Jul 28, 2024 -
CUDA_PATH is set but CUDA wasnt able to be loaded
#21527 opened
Jul 27, 2024 -
Model saved by ORT as external data format will not be aligned for mapfile support
#21524 opened
Jul 26, 2024 -
Error converting Microsoft Phi3 model to ONNX using Python and Transformers
#21518 opened
Jul 26, 2024 -
[Build] detect nothing.i use opencv4.9 onnxruntime 1.16.1, it detect nothing
#21513 opened
Jul 26, 2024 -
[Feature Request] Introduce get_available_initializers
#21503 opened
Jul 25, 2024 -
Onnxruntime LoadLibrary failed with error 126
#21501 opened
Jul 25, 2024 -
[Build] error cross compiling
#21500 opened
Jul 25, 2024 -
[Performance] DequantizeLinear, pad and QuantizeLinear operation is not fused
#21496 opened
Jul 25, 2024 -
Android build: Execution failed for task ':app:mergeExtDexDebug'.
#21494 opened
Jul 25, 2024 -
failing to find trt_timing_cache_path
#21484 opened
Jul 24, 2024 -
[CUDA, DML] MatMul does not properly handle matrices with inner dim == 0
#21483 opened
Jul 24, 2024 -
[Performance] The 16-bit quantization QDQ model cannot be accelerated by CUDA
#21478 opened
Jul 24, 2024 -
quant_pre_process failed on NonMaxSuppression
#21476 opened
Jul 24, 2024 -
Dll version of Microsoft.ML.OnnxRuntime.dll is 0.0.0.0
#21475 opened
Jul 24, 2024 -
[Web] Conv_token_460" failed. Error: Unsupported activation Tanh
#21467 opened
Jul 23, 2024 -
Activate thread pool will cause crash.
#21461 opened
Jul 23, 2024 -
TensorRT EP's inference results are abnormal.
#21457 opened
Jul 23, 2024 -
Incorrect NaN handling for Min and Max operators on CPU with a single element input
#21455 opened
Jul 23, 2024 -
[Web] Error: Tensor's size(512) does not match data length(1024)
#21454 opened
Jul 23, 2024
67 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[WebNN EP] Enable IO Bindings with MLBuffer
#21301 commented on
Jul 24, 2024 • 27 new comments -
Update react-native to 0.74 and run npm audit fix
#21122 commented on
Jul 27, 2024 • 6 new comments -
Enable export for inference when eval model is loaded from buffer
#21422 commented on
Jul 25, 2024 • 5 new comments -
Keep QDQ nodes w/ nonpositive scale around MaxPool
#21182 commented on
Jul 27, 2024 • 5 new comments -
Drop QDQ around more nodes
#21376 commented on
Jul 26, 2024 • 4 new comments -
Create CMake option `onnxruntime_USE_VCPKG`
#21348 commented on
Jul 27, 2024 • 2 new comments -
Enable AVX NE CONVERT for FP16 to FP32 cast
#21183 commented on
Jul 28, 2024 • 2 new comments -
Adding CUDNN Frontend and use for CUDA NN Convolution
#19470 commented on
Jul 25, 2024 • 1 new comment -
Added WebNN Intro and Tutorial
#20719 commented on
Jul 24, 2024 • 1 new comment -
Refactor onnxruntime_fetchcontent_makeavailable cmake function
#21328 commented on
Jul 24, 2024 • 1 new comment -
[Feature Request] MPS provider
#21271 commented on
Jul 28, 2024 • 0 new comments -
[Performance] CoreML not being used to it's fullest capacity - custom transformer
#19887 commented on
Jul 28, 2024 • 0 new comments -
Cannot create arena allocator with Environment::CreateAndRegisterAllocator on MAC M2 with clang
#21191 commented on
Jul 28, 2024 • 0 new comments -
[Build] How to build for Android armeabi platform?
#21192 commented on
Jul 28, 2024 • 0 new comments -
Can onnxruntime.quantization.quantize_dynamic() work with onnx-trt?
#21169 commented on
Jul 28, 2024 • 0 new comments -
Could not load library cudnn_cnn_infer64_8.dll. Error code 127
#18973 commented on
Jul 28, 2024 • 0 new comments -
[Web] WebGPU and WASM Backends Unavailable within Service Worker
#20876 commented on
Jul 28, 2024 • 0 new comments -
Support Numpy v2.0
#21063 commented on
Jul 28, 2024 • 0 new comments -
[Feature Request] Request grid_sample 5D support 🌟
#21382 commented on
Jul 28, 2024 • 0 new comments -
[Jvm] Native crash during createSession: std::bad_cast
#21147 commented on
Jul 28, 2024 • 0 new comments -
Enabling c++20 on linux
#17816 commented on
Jul 24, 2024 • 0 new comments -
Not able to load onnx model multilingual-e5-large
#21321 commented on
Jul 24, 2024 • 0 new comments -
[js/node] enable float16 support for Node.js binding
#20581 commented on
Jul 28, 2024 • 0 new comments -
Mlas int4 int8 with avx2/512
#20687 commented on
Jul 26, 2024 • 0 new comments -
Implementation of Set Membership in TreeEnsemble
#21222 commented on
Jul 24, 2024 • 0 new comments -
[JS/WegGPU] Initial changes to support wasm64.
#21260 commented on
Jul 26, 2024 • 0 new comments -
[WebNN EP] Support ConvTranspose for TFLite backend
#21291 commented on
Jul 26, 2024 • 0 new comments -
[VitisAI] Remove shape infer from bridge ort
#21331 commented on
Jul 26, 2024 • 0 new comments -
add registered custom op for perf test
#21336 commented on
Jul 25, 2024 • 0 new comments -
Fix wrong per-tensor quantized weight type for matmul
#21347 commented on
Jul 27, 2024 • 0 new comments -
Remove tools/ci_build/github/android/run_nnapi_code_coverage.sh
#21371 commented on
Jul 27, 2024 • 0 new comments -
Add support tensor element type for register custom op shape infer function
#21387 commented on
Jul 24, 2024 • 0 new comments -
Upgrade emsdk from 3.1.59 to 3.1.62
#21421 commented on
Jul 24, 2024 • 0 new comments -
feat(onnxruntime-web): Allow the WASM backend to import the emscripten Module via a user-land defined loader
#21430 commented on
Jul 22, 2024 • 0 new comments -
[Build] Build python interface for Onnxruntime-qnn on aarch64 Linux
#21203 commented on
Jul 24, 2024 • 0 new comments -
New restricted asymmetric quantization mode in QDQ mode with zero_point restricted to either 128 or 0
#21398 commented on
Jul 24, 2024 • 0 new comments -
[Feature Request] Mark as negative tests for minimal CUDA build
#21394 commented on
Jul 24, 2024 • 0 new comments -
[Feature Request] 4bit and 2bit and 1bit quantization support
#14997 commented on
Jul 24, 2024 • 0 new comments -
[CUDA] Acquiring a CUDA allocator without loading a session.
#19420 commented on
Jul 23, 2024 • 0 new comments -
Microsoft.ML.OnnxRuntime.Gpu not working in MAUI project
#14974 commented on
Jul 23, 2024 • 0 new comments -
CUDA Graph Error - CUDA failure 900: operation not permitted when stream is capturing
#15002 commented on
Jul 23, 2024 • 0 new comments -
[Feature Request] Missing optimization of DequantizeLinear ∘ Flatten ∘ QuantizeLinear?
#21375 commented on
Jul 23, 2024 • 0 new comments -
Trilu op still not work with INT32 input
#21400 commented on
Jul 23, 2024 • 0 new comments -
[Documentation] The documentation for early versions is missing
#20850 commented on
Jul 23, 2024 • 0 new comments -
TensorrtExecutionProvider slower than CUDAExecutionProvider: Faster-rcnn [Performance]
#17434 commented on
Jul 23, 2024 • 0 new comments -
[Feature Request] Support for Florence-2 model family
#21118 commented on
Jul 23, 2024 • 0 new comments -
onnxruntime 在C++上如何实现fp16的推理 yolov5模型
#20395 commented on
Jul 23, 2024 • 0 new comments -
[Mobile] React-native OnnxruntimeJSIHelper install segfaults when registering functions
#21003 commented on
Jul 22, 2024 • 0 new comments -
[Web] `Error: [WebGPU] Kernel "[Conv] /text_encoder/encoder/layers.0/feed_forward/conv_2/Conv" failed. Error: FILTER_IN_CHANNEL should be equal to DATA_CHANNEL`
#21108 commented on
Jul 22, 2024 • 0 new comments -
DirectML Exception 80070057 "The parameter is incorrect"
#20575 commented on
Jul 21, 2024 • 0 new comments -
[Performance] Mapfile support for certain external data files is not working
#21195 commented on
Jul 27, 2024 • 0 new comments -
[Training] [ShapeInferenceError] Dimension could not be inferred: incompatible shapes
#21327 commented on
Jul 26, 2024 • 0 new comments -
[Performance] Whisper model inference results incorrect after Transformer Optimizer
#21150 commented on
Jul 26, 2024 • 0 new comments -
[Feature Request] C# Float16 DEMO, and float convert api
#14303 commented on
Jul 26, 2024 • 0 new comments -
How to do multithreaded infer with onnxruntime
#21419 commented on
Jul 25, 2024 • 0 new comments -
CUDA provider fallback to CPU is not working when CUDA_PATH environment variable exists
#21424 commented on
Jul 25, 2024 • 0 new comments -
using TensorRT EP by nuget
#21428 commented on
Jul 25, 2024 • 0 new comments -
[Web] Failed to compile shader on WebGL
#12927 commented on
Jul 25, 2024 • 0 new comments -
How to convert quantized ONNX model from Tensor-Oriented format to Operator-Oriented format?
#21137 commented on
Jul 25, 2024 • 0 new comments -
Quantized ONNX Model Still Has Float32 Input/Output Tensors
#21138 commented on
Jul 25, 2024 • 0 new comments -
[Training] Onnxruntime-training 1.18.0 for windows not available
#21149 commented on
Jul 25, 2024 • 0 new comments -
[E:onnxruntime:, qnn_execution_provider.cc:591 GetCapability] QNN SetupBackend failed qnn_backend_manager.cc:334 InitializeBackend Failed to initialize backend
#21157 commented on
Jul 25, 2024 • 0 new comments -
[Feature Request] SpaceToDepth & DepthToSpace integer implementations
#21287 commented on
Jul 25, 2024 • 0 new comments -
[Feature Request] Implement dynamically sized CPU sets for linux.
#21241 commented on
Jul 25, 2024 • 0 new comments -
Importing onnxruntime on AWS Lambdas with ARM64 processor causes crash
#10038 commented on
Jul 25, 2024 • 0 new comments -
[Performance] Multiple Sessions on Same GPU is very slow
#21365 commented on
Jul 25, 2024 • 0 new comments -
Inference using the CUDA EP returns nan
#15752 commented on
Jul 24, 2024 • 0 new comments