-
Notifications
You must be signed in to change notification settings - Fork 21.5k
Insights: pytorch/pytorch
Overview
Could not load contribution data
Please try again later
30 Pull requests merged by 13 people
-
Cleanup build docker images
#129273 merged
Jun 21, 2024 -
moving conda builds from builder to pytorch
#129167 merged
Jun 21, 2024 -
[ROCm] Include hsa headers for rocm-triton whl
#129235 merged
Jun 21, 2024 -
[custom ops] Switch out references from old landing page to new landi…
#129237 merged
Jun 21, 2024 -
[docs] Redirect custom ops landing page to the correct place (#129177)
#129236 merged
Jun 21, 2024 -
Re-enable py3.12 nightly wheel builds and add triton dependency for ROCm
#129161 merged
Jun 21, 2024 -
[Release only] Temporary change to depend on pytorch-triton
#129232 merged
Jun 21, 2024 -
[inductor][ci] Fix torchbench dependency issue with numpy
#129074 merged
Jun 21, 2024 -
[ROCm] [Triton] - Include roctracer headers in triton whl
#129227 merged
Jun 21, 2024 -
[Release 2.4] Release only changes for triton 3.0.x build
#129143 merged
Jun 20, 2024 -
Revert "[Release 2.4] Release only changes - use pinned triton."
#129139 merged
Jun 20, 2024 -
Remove leftover warning causing log spew
#128837 merged
Jun 19, 2024 -
[Inductor] Fix the High Order Op layout issue (#128275)
#128834 merged
Jun 19, 2024 -
[Port][Quant][Inductor] Bug fix: mutation nodes not handled correctly for QLinearPointwiseBinaryPT2E
#128591 merged
Jun 19, 2024 -
[tp] refactor and fix PrepareModuleInput for DTensor inputs (#128431)
#128719 merged
Jun 19, 2024 -
[inductor] fix compile time regression by caching get_gpu_type (#128363)
#128717 merged
Jun 19, 2024 -
[Inductor] Update Intel GPU Triton commit pin. (#124842)
#128615 merged
Jun 19, 2024 -
Revert "Make torch_geometric models compatible with export (#123403)"…
#128511 merged
Jun 19, 2024 -
[custom_op] stop using nonlocals to store information (#128547)
#128616 merged
Jun 19, 2024 -
Clean up xpu ut to make CI happy (#128383)
#128614 merged
Jun 19, 2024 -
Change Dynamo's custom ops warning message to be less spammy (#128456)
#128581 merged
Jun 19, 2024 -
[inductor] fix linear add bias pattern (#128473)
#128577 merged
Jun 19, 2024 -
[ALI] Use lf runners for Lint
#128978 merged
Jun 19, 2024 -
Revert "Deprecate `torch._utils.is_compiling()` and `torch._dynamo.external_utils.is_compiling()` (#127690)"
#128542 merged
Jun 19, 2024 -
Revert "Set simdlen based on ATEN_CPU_CAPABILITY (#123514)"
#128541 merged
Jun 19, 2024 -
[dynamo] Fix for #127696
#128530 merged
Jun 18, 2024 -
Bump urllib3 from 2.2.1 to 2.2.2 in /tools/build/bazel
#128908 merged
Jun 18, 2024 -
Add build-conda-images.yml in pytorch/pytorch (#128563)
#128962 merged
Jun 18, 2024 -
[Inductor][FlexAttention] Tune backwards kernel block sizes
#128767 merged
Jun 17, 2024
190 Pull requests opened by 100 people
-
[Dynamic Shapes] fixed dynamic shape inference
#128807 opened
Jun 17, 2024 -
[cpp_extension][inductor] Fix sleef windows depends. (#128770)
#128811 opened
Jun 17, 2024 -
[inductor] refine loop split logic
#128812 opened
Jun 17, 2024 -
[ROCm] Tunableop record untuned
#128813 opened
Jun 17, 2024 -
Fix negative value in profier dump table
#128814 opened
Jun 17, 2024 -
[DOT NOT REVIEW] Update Intel Triton
#128820 opened
Jun 17, 2024 -
[Inductor][CPP] Enable Quantized Linear GEMM Template with FP32 output
#128825 opened
Jun 17, 2024 -
Scale XBLOCK in triton reduction configs to avoid hitting max grid
#128826 opened
Jun 17, 2024 -
[caffe2][be] migrate global static initializer SingletonUndefinedTensor
#128828 opened
Jun 17, 2024 -
[caffe2][be] migrate global static initializer - event_template
#128829 opened
Jun 17, 2024 -
[caffe2][be] migrate global static initializer - version_map
#128831 opened
Jun 17, 2024 -
[caffe2][be] [caffe2][be] migrate global static initializer - unused global initializer
#128832 opened
Jun 17, 2024 -
[caffe2][be] migrate global static initializer - EventTable
#128833 opened
Jun 17, 2024 -
[export] turn on runtime asserts by default
#128839 opened
Jun 17, 2024 -
[not for commit] Random perf opts
#128841 opened
Jun 17, 2024 -
Upload release cut source code to s3
#128842 opened
Jun 17, 2024 -
[torchbind] fix bug of mutating FakeScriptObjects twice in aot_export
#128844 opened
Jun 17, 2024 -
[export] experimental joint graph API.
#128847 opened
Jun 17, 2024 -
[dynamo] Remove torchrec skips
#128857 opened
Jun 17, 2024 -
[BE] enable UFMT for `torch/ao/nn/`
#128861 opened
Jun 17, 2024 -
[BE] enable UFMT for `torch/ao/pruning/`
#128862 opened
Jun 17, 2024 -
[BE] enable UFMT for `torch/ao/quantization/`
#128863 opened
Jun 17, 2024 -
[BE] enable UFMT for `torch/ao/`
#128864 opened
Jun 17, 2024 -
[BE][Easy] enable UFMT for `torch/nn/`
#128865 opened
Jun 17, 2024 -
[WIP][inductor]fallback all view operations
#128883 opened
Jun 17, 2024 -
[Traceable FSDP2] Fixes to preserve inplace ops in AOT joint graph and fwd graph
#128886 opened
Jun 17, 2024 -
[dtensor][debug] fixing CommDebugMode module collective tracing
#128887 opened
Jun 17, 2024 -
Updates to test packed layout
#128888 opened
Jun 17, 2024 -
[aota] Needs autograd if an input requires_grad, agnostic to enable_grad
#128890 opened
Jun 17, 2024 -
Forward fix to skip ROCm tests for #122836
#128891 opened
Jun 17, 2024 -
[inductor] Separate Buffer and Operation into two concepts
#128893 opened
Jun 17, 2024 -
[RFC][Not Aim for landing now] Add a dummy pp hang case for flight recorder
#128897 opened
Jun 17, 2024 -
Grouped Query Attention
#128898 opened
Jun 17, 2024 -
[Fix]: Internal test failures when testing the exportability
#128900 opened
Jun 17, 2024 -
Fix mm pad regresion - more conservative estimation of plannable inputs
#128909 opened
Jun 17, 2024 -
Delete unused line
#128913 opened
Jun 18, 2024 -
Add Strided Input test for flex attention
#128915 opened
Jun 18, 2024 -
__eq__ and __hash__ for SymNode
#128916 opened
Jun 18, 2024 -
[reland][ROCm] TunableOp for gemm_and_bias
#128919 opened
Jun 18, 2024 -
Always use high precision for SDPA math backend
#128922 opened
Jun 18, 2024 -
[inductor] Constant folding for dynamic shape node before pattern matching
#128937 opened
Jun 18, 2024 -
Update start_, end_ and retired only for the right entry when retire a work
#128948 opened
Jun 18, 2024 -
test_jit: Replace plain assert by test assert
#128950 opened
Jun 18, 2024 -
Set correct output dtype for dequantize op during convert_pt2e in decomposed mode
#128953 opened
Jun 18, 2024 -
[BE] Do not crash weight_norm on empty tensors
#128957 opened
Jun 18, 2024 -
Add weight_norm opinfo testing
#128958 opened
Jun 18, 2024 -
[pipelining] Support arbitrary stage ordering on ranks
#128976 opened
Jun 18, 2024 -
fix dynamo isinstance inlining for nn.Parameter + subclasses
#128981 opened
Jun 18, 2024 -
[Pipelining] Support separate dw_runner for PipelineStage
#128983 opened
Jun 18, 2024 -
temp run failing split build pull test
#128984 opened
Jun 18, 2024 -
[Not to be committed][AOTI] Add an option to return cpp file only
#128986 opened
Jun 18, 2024 -
pytorch slp
#128990 opened
Jun 18, 2024 -
[dynamo][easy] Rename NotNNModuleSource to UnspecializedNNModuleSource
#128992 opened
Jun 18, 2024 -
[After Rebase] Top of Traceable FSDP2 stack
#128996 opened
Jun 18, 2024 -
[WIP] Test running canary jobs
#129000 opened
Jun 18, 2024 -
[BE] update type annotations for basic utilities in `torch/__init__.py`
#129001 opened
Jun 18, 2024 -
[experiment] run_test: Unset cpp stacktraces after reruns
#129004 opened
Jun 18, 2024 -
[MPS] Fast math env var
#129007 opened
Jun 18, 2024 -
[RFC] scaffolding of the new B2B_GEMM pass
#129009 opened
Jun 18, 2024 -
Fix DEBUG=1 asserts with NJT ops
#129014 opened
Jun 18, 2024 -
torch._inductor.config.joint_graph_constant_folding = False
#129016 opened
Jun 19, 2024 -
[dtensor][debug] add operation tracing to comm_mode
#129017 opened
Jun 19, 2024 -
Update script path in .github/workflows/build-conda-images.yml
#129022 opened
Jun 19, 2024 -
Enable UFMT for numpy_test files, test_xnnpack_integration.py
#129023 opened
Jun 19, 2024 -
[halide-backend] Dimension-based indexing
#129026 opened
Jun 19, 2024 -
Back out "Remove circular import"
#129031 opened
Jun 19, 2024 -
[halide-backend] Support scan kernels
#129035 opened
Jun 19, 2024 -
[halide-backend] Enable bfloat16 support
#129036 opened
Jun 19, 2024 -
[CI] Enable AOT inductor FP32 accuracy test for CPU
#129040 opened
Jun 19, 2024 -
[wip][inductor] don't materialize the large sparse matrix in CE bwd
#129043 opened
Jun 19, 2024 -
fix add decomposition for complex numbers
#129044 opened
Jun 19, 2024 -
[Do Not Merge]include torch
#129047 opened
Jun 19, 2024 -
[Inductor][CPP] Enable Quantized Linear GEMM Template with INT8 output and Unary Post Op
#129048 opened
Jun 19, 2024 -
[Inductor][Quant] Change the schema of QLinear Binary
#129049 opened
Jun 19, 2024 -
Provide a method to unregister privateuse1
#129056 opened
Jun 19, 2024 -
Restore mixed dtypes GEMM auto-tuning for Ampere
#129058 opened
Jun 19, 2024 -
Don't install remaining caffe2 python files
#129067 opened
Jun 19, 2024 -
use shutil.which in check_compiler_ok_for_platform
#129069 opened
Jun 19, 2024 -
[codemod][lowrisk] Remove extra semi colon from caffe2/aten/src/ATen/functorch/BatchRulesRandomness.cpp
#129073 opened
Jun 19, 2024 -
[inductor] Remove comm-specific node attributes from scheduler
#129084 opened
Jun 19, 2024 -
[Fix]: TSConverter errors on dynamic shapes
#129087 opened
Jun 19, 2024 -
Relax constraints for creating a `GenericContextWrappingVariable`
#129091 opened
Jun 19, 2024 -
Prototype for export_for_training
#129092 opened
Jun 19, 2024 -
[ROCm] Use AOTriton as a dynamic library
#129094 opened
Jun 19, 2024 -
re-export torch.optim._multi_tensor in torch/__init__.py
#129095 opened
Jun 19, 2024 -
Fix scatter lowering when src is a Number
#129096 opened
Jun 19, 2024 -
Fix rot90 decomposition for no rotation
#129097 opened
Jun 19, 2024 -
[executorch hash update] update the pinned executorch hash
#129099 opened
Jun 20, 2024 -
[Inductor][CPP] Enable Quantized Linear GEMM Template with Binary Fusion
#129103 opened
Jun 20, 2024 -
[MPS] Generalize Fused optimizers
#129105 opened
Jun 20, 2024 -
Fix max_pool2d decomposition for empty list and integer limits
#129106 opened
Jun 20, 2024 -
[FSDP] Runtime Error on Checkpoint Loading for optimizer state
#129110 opened
Jun 20, 2024 -
[PT2][Observability] Change the string to dict type
#129112 opened
Jun 20, 2024 -
Refine the logic of device construction when only device index is given
#129119 opened
Jun 20, 2024 -
[Inductor][CPP] Support more than one LocalBuffer
#129121 opened
Jun 20, 2024 -
Fix typo in stack_module_state doc
#129126 opened
Jun 20, 2024 -
Fix integer overflow in quantization
#129127 opened
Jun 20, 2024 -
[aot] Keep backward mutations in backward
#129130 opened
Jun 20, 2024 -
[BE] use relative backwards references in torch.optim._multi_tensor
#129132 opened
Jun 20, 2024 -
[AOTI] Remove the epilogue for generating non-triggered kernels
#129134 opened
Jun 20, 2024 -
[AOTI] Introduce DeferredCudaKernelLine for cuda cpp wrapper
#129135 opened
Jun 20, 2024 -
Refine typing annotation for compile
#129136 opened
Jun 20, 2024 -
Skip ao_sparsity TestComposability for missing FBGEMM
#129137 opened
Jun 20, 2024 -
Move caffe2/serialize to torch/csrc/api
#129141 opened
Jun 20, 2024 -
[inductor] switch CppCodeCache to new cpp_builder. (take 2)
#129144 opened
Jun 20, 2024 -
Fixes T192448049
#129146 opened
Jun 20, 2024 -
[C10D] Avoid lazily creating P2P communicators
#129147 opened
Jun 20, 2024 -
Update README.md
#129149 opened
Jun 20, 2024 -
Update test_torch.py
#129151 opened
Jun 20, 2024 -
[nn-module] Use standard dict for _parameters, _modules and _buffers
#129164 opened
Jun 20, 2024 -
[experiment] build
#129170 opened
Jun 20, 2024 -
fix cpp compilation error
#129173 opened
Jun 20, 2024 -
Back out "Remove circular import"
#129180 opened
Jun 20, 2024 -
typing proxy_tensor.py
#129182 opened
Jun 20, 2024 -
[AOTI] Fix test_cond_non_tensor_predicates
#129183 opened
Jun 20, 2024 -
[CUDAGraph Trees] Enable input mutation support in OSS
#129184 opened
Jun 20, 2024 -
[3.13, WIP] directly use frame localsplus in guards
#129185 opened
Jun 20, 2024 -
Proof-of-concept: manage registered communication buffers with Inductor
#129186 opened
Jun 20, 2024 -
Add lowering for updated _scaled_mm
#129187 opened
Jun 21, 2024 -
[bazel] fix --config=shell
#129194 opened
Jun 21, 2024 -
[RFC] Add JSON logging
#129196 opened
Jun 21, 2024 -
Log whenever we sleep
#129197 opened
Jun 21, 2024 -
Pianpwk/dedup2
#129199 opened
Jun 21, 2024 -
[wip] merge_csv tool
#129202 opened
Jun 21, 2024 -
Add xpu to getAccelerator
#129205 opened
Jun 21, 2024 -
[Inductor] Draft version of block sparse mask for flex attention
#129216 opened
Jun 21, 2024 -
Remove more ONNX Caffe2 code
#129218 opened
Jun 21, 2024 -
Fix license metadata in setup.py
#129219 opened
Jun 21, 2024 -
[Inductor][CPP] Enable Quantized Linear with AMX MicroGEMM
#129220 opened
Jun 21, 2024 -
[Inductor][CPP] Pass weight dtype explicitly for cpp gemm template
#129221 opened
Jun 21, 2024 -
[inductor][cpp] support nested kernel with indirect indexing
#129223 opened
Jun 21, 2024 -
[easy][DCP] make BroadcastingTorchSaveReader device generic
#129231 opened
Jun 21, 2024 -
[pipelining] Support W action for schedules
#129233 opened
Jun 21, 2024 -
[sparse][bfloat16] bmm_sparse_cuda
#129234 opened
Jun 21, 2024 -
Add warning for weights_only
#129239 opened
Jun 21, 2024 -
[FSDP2] Fixed `unshard` without lazy init
#129241 opened
Jun 21, 2024 -
Enable dynamic rollout for pull workflow
#129243 opened
Jun 21, 2024 -
Fix allowlisting of builtins for weights_only unpickler
#129244 opened
Jun 21, 2024 -
[BE] Runner determinator: Expect usernames to be prefixed with '@'
#129246 opened
Jun 21, 2024 -
Make run_decomp work
#129249 opened
Jun 21, 2024 -
Have torch_key hash entire torch directory
#129250 opened
Jun 21, 2024 -
Allow BUILD/NEWOBJ instruction for items added via torch.serialization.add_safe_globals
#129251 opened
Jun 21, 2024 -
[DSD] Correctly handle shared parameters for optimizer state_dict (#1…
#129252 opened
Jun 21, 2024 -
Support HSDP + Monolith Checkpointing (#128446)
#129254 opened
Jun 21, 2024 -
[DSD] Add unittest to verify HSDP1 + broadcast_from_rank0 (#128755)
#129255 opened
Jun 21, 2024 -
[inductor] Fix TORCHINDUCTOR_FORCE_DISABLE_CACHES
#129257 opened
Jun 21, 2024 -
[FSDP2] Used multi-grad hook when no inputs require grad
#129259 opened
Jun 21, 2024 -
[export] Rewrite exportdb formatting.
#129260 opened
Jun 21, 2024 -
TCPStore: retry on validate errors
#129261 opened
Jun 21, 2024 -
Allow SAC policy_fn to return bool for backward compatibility
#129262 opened
Jun 21, 2024 -
[Pipelining] Add to/from CSV format and improved __repr__
#129264 opened
Jun 21, 2024 -
Pianpwk/dedup3
#129265 opened
Jun 21, 2024 -
Documentations for XPU functionality to PyTorch
#129266 opened
Jun 21, 2024 -
[AOTI][refactor] Move generate_user_defined_triton_kernel
#129267 opened
Jun 21, 2024 -
[AOTI] Introduce DeferredCudaGridLine
#129268 opened
Jun 21, 2024 -
TunableOp hotfix
#129281 opened
Jun 21, 2024 -
[C10D] Make new_group eager when used with comm_split
#129284 opened
Jun 21, 2024 -
[dynamo][compile-time][inlining-inbuilt-nn-modules] Manually implement nn.Module._call_impl
#129285 opened
Jun 21, 2024 -
[FSDP2] Added `set_reduce_scatter_divide_factor`
#129286 opened
Jun 21, 2024 -
Preserve _numeric_debug_handle throguh deepcopy
#129287 opened
Jun 22, 2024 -
Inductor to fail gracefully on Voltas for bf16 tensors
#129288 opened
Jun 22, 2024 -
Implement operator for micro-pipelined all-gather -> _scaled_mm
#129289 opened
Jun 22, 2024 -
3d Composability
#129290 opened
Jun 22, 2024 -
Skip signals from older runs of the same workflows
#129291 opened
Jun 22, 2024 -
[Fix]: TSConverter handles call ops with multiple outputs
#129294 opened
Jun 22, 2024 -
[Reopen #114036] Allow "must recompute" in torch.compile + selective checkpointing (SAC)
#129295 opened
Jun 22, 2024 -
Support tensor stride
#129297 opened
Jun 22, 2024 -
Add one more shard for CPU jobs
#129299 opened
Jun 22, 2024 -
[2/N] Fix clang-tidy warnings in torch/csrc/jit/serialization
#129300 opened
Jun 22, 2024 -
[aotinductor][UserDefinedTritonKernel] use appropriate expr printer when printing args
#129301 opened
Jun 22, 2024 -
Added host-side associative scan function
#129307 opened
Jun 22, 2024 -
[halide-backend] Random number generation
#129314 opened
Jun 22, 2024 -
[dynamo][compile-time] Manually implement nn.Module.__getattr__ to reduce compile time
#129315 opened
Jun 22, 2024 -
[docs] fix incorrect example in `convert_conv3d_weight_memory_format`
#129318 opened
Jun 22, 2024 -
[halide-backend] Disable split reductions for Halide
#129320 opened
Jun 23, 2024 -
[halide-backend] Support manual schedules
#129321 opened
Jun 23, 2024 -
fake_tensor - flatten keys
#129323 opened
Jun 23, 2024 -
WIP: fake tensor SymInt support, try 2
#129324 opened
Jun 23, 2024 -
[inductor] Make UserDefinedTritonKernel a multi-output operation
#129325 opened
Jun 23, 2024 -
Fix build error on s390x
#129326 opened
Jun 23, 2024 -
[aotinductor] Only autotune at compile time when enabled via config
#129335 opened
Jun 23, 2024 -
[Inductor][Intel GPU] Support reduction split. (#129120)
#129337 opened
Jun 24, 2024 -
[Easy][Traceable FSDP2] Skip rocm for the E2E tests
#129339 opened
Jun 24, 2024 -
Remove test_mps_allocator_module XFAIL
#129340 opened
Jun 24, 2024 -
[AOTI] Switch the CUDA codegen to one-pass
#129342 opened
Jun 24, 2024 -
[inductor] Add FileCheck to flex attention epilogue test
#129343 opened
Jun 24, 2024 -
[inductor] Use multiple outputs for flex-attention
#129344 opened
Jun 24, 2024 -
[AOTI][not for review] Test cpp_wrapper mode
#129345 opened
Jun 24, 2024 -
[inductor] Kill mark_node_as_mutating
#129346 opened
Jun 24, 2024
136 Issues closed by 43 people
-
dynamo eval the subfunction of a skiped frame with callback, bad performance and more error
#128928 closed
Jun 24, 2024 -
DISABLED test_post_localSGD_optimizer_step_reload (__main__.TestDistBackendWithSpawn)
#84886 closed
Jun 24, 2024 -
UNSTABLE inductor-A100-perf-nightly / cuda12.1-py3.10-gcc9-sm80 / test (inductor_torchbench_perf)
#128846 closed
Jun 23, 2024 -
UNSTABLE pull / linux-focal-cuda12.4-py3.10-gcc9-sm86 / build
#127104 closed
Jun 23, 2024 -
UNSTABLE rocm / linux-focal-rocm6.1-py3.8 / test (default)
#129208 closed
Jun 23, 2024 -
UNSTABLE periodic / linux-focal-rocm6.1-py3.8 / test (distributed)
#129209 closed
Jun 23, 2024 -
UNSTABLE trunk / linux-focal-rocm6.1-py3.8 / test (default)
#129210 closed
Jun 23, 2024 -
UNSTABLE trunk / linux-focal-rocm6.1-py3.8 / test (distributed)
#129211 closed
Jun 23, 2024 -
UNSTABLE inductor-periodic / cuda12.1-py3.10-gcc9-sm86-periodic-dynamo-benchmarks / test (aot_eager_torchbench)
#128929 closed
Jun 23, 2024 -
UNSTABLE inductor / cuda12.1-py3.10-gcc9-sm86 / test (inductor_torchbench)
#128901 closed
Jun 23, 2024 -
UNSTABLE inductor / cuda12.1-py3.10-gcc9-sm86 / test (dynamic_inductor_torchbench)
#128902 closed
Jun 23, 2024 -
UNSTABLE inductor / cuda12.1-py3.10-gcc9-sm86 / test (aot_inductor_torchbench)
#128903 closed
Jun 23, 2024 -
DISABLED test_comprehensive_nn_functional_huber_loss_cuda_float16 (__main__.TestInductorOpInfoCUDA)
#129238 closed
Jun 22, 2024 -
[ONNX] Migrate OpSignature to ONNX Script
#129278 closed
Jun 22, 2024 -
Illegal instruction (core dumped): PyTorch 2.3.0+rocm6.0
#125310 closed
Jun 21, 2024 -
nn.Linear outputs differ on the same input tensor #129029 answer does not match
#129111 closed
Jun 21, 2024 -
Saved variable packing unpacking incorrect aliases version counter
#128611 closed
Jun 21, 2024 -
Forward hooks not called when fast path is used in TransformerEncoderLayer
#128413 closed
Jun 21, 2024 -
[ONNX] Provide an option to not generate `report_dynamo_export.sarif`
#109137 closed
Jun 21, 2024 -
Torch compile does not work on python 3.12
#120233 closed
Jun 21, 2024 -
Gumbel Vector Quantizer produces NaN when using with torch.compile
#127749 closed
Jun 21, 2024 -
Huge solibs in Linux wheel for torch 2.3.1+rocm6.0
#129165 closed
Jun 21, 2024 -
Slow DataLoader in new version when num_workers>0 / objects in collate_fn slow down batching
#123439 closed
Jun 21, 2024 -
The function name `SGD()` should be `GD()` if it's still (classic) GD,
#129190 closed
Jun 21, 2024 -
¿Cómo llamar a United desde Estados Unidos?{1*844*499*2050} CALL NOW !!
#129212 closed
Jun 21, 2024 -
¿Cómo puedo hablar con una persona en Delta? [꜍D꜉🅔꜍L꜉🅣꜍A꜉ AiRlInEs]
#129214 closed
Jun 21, 2024 -
[TorchAO] fail to do fake_tensor_prop with freezing pass
#123522 closed
Jun 21, 2024 -
[dynamo] 'torch._C.ScriptFunction' object has no attribute '__defaults__'
#93698 closed
Jun 21, 2024 -
Is it possble to add a registerable api at the beginning of torch.save
#117840 closed
Jun 21, 2024 -
frombuffer() → "The given buffer is not writable" warning, tensor has some NaNs
#129077 closed
Jun 21, 2024 -
Incompatability between torch>=2.3 and torchdatasets==0.2.0
#129060 closed
Jun 21, 2024 -
DISABLED [WORKFLOW_NAME] / [PLATFORM_NAME] / [JOB_NAME]
#129195 closed
Jun 21, 2024 -
Ignore this
#129128 closed
Jun 21, 2024 -
UNSTABLE periodic / win-vs2019-cuda11.8-py3 / test (default)
#129064 closed
Jun 21, 2024 -
[Feature Request] switch amx isa detection in onednn to cpuinfo
#127368 closed
Jun 21, 2024 -
Failure with setup-ssh on Amazon Linux 2023 runners
#129152 closed
Jun 21, 2024 -
Outdated ncclResult code
#128756 closed
Jun 20, 2024 -
Using PyTorch with Transformers to run inference with 'MPS' backend causes poor results.
#128435 closed
Jun 20, 2024 -
auto_functionalized doesn't work with non-Tensor returns
#120490 closed
Jun 20, 2024 -
torch_dispatch mode silent incorrectness with torch.compile
#115653 closed
Jun 20, 2024 -
torch.compile hang/crashes with worker_start_method=spawn
#126311 closed
Jun 20, 2024 -
DISABLED test_quantization_doc_ptsq (__main__.TestQuantizationDocs)
#125669 closed
Jun 20, 2024 -
DISABLED test_creation_with_zeros_cuda_float8_e5m2 (__main__.TestFloat8DtypeCUDA)
#124474 closed
Jun 20, 2024 -
DISABLED test_graph_concurrent_replay (__main__.TestCuda)
#104055 closed
Jun 20, 2024 -
DISABLED test_tensor_subclasses (__main__.TestScript)
#119949 closed
Jun 20, 2024 -
DISABLED test_quantization_doc_custom (__main__.TestQuantizationDocs)
#125668 closed
Jun 20, 2024 -
DISABLED test_is_isnot (__main__.TestScript)
#120694 closed
Jun 20, 2024 -
DISABLED test_index (__main__.TestPythonBuiltinOP)
#119160 closed
Jun 20, 2024 -
DISABLED test_quantization_doc_ptdq (__main__.TestQuantizationDocs)
#125667 closed
Jun 20, 2024 -
DISABLED test_add_loggers_conv_bn_relu_fusion_quant (__main__.TestFXNumericSuiteNShadows)
#127814 closed
Jun 20, 2024 -
DISABLED test_quantization_doc_fx (__main__.TestQuantizationDocs)
#125670 closed
Jun 20, 2024 -
DISABLED test_quantization_doc_qat (__main__.TestQuantizationDocs)
#128118 closed
Jun 20, 2024 -
DISABLED test_comprehensive_special_bessel_y1_cuda_int32 (__main__.TestInductorOpInfoCUDA)
#127080 closed
Jun 20, 2024 -
DISABLED test_cusparse_multiple_threads_same_device (__main__.TestCuda)
#127536 closed
Jun 20, 2024 -
[nnc][perf] CPU fuser needs to support intra-op parallelism
#50853 closed
Jun 20, 2024 -
[Dynamo x torch_function] methods on torch_function objects require id_match guards, causing recompiles
#128964 closed
Jun 20, 2024 -
Torchrun / torch.distributed.run throws RendezvousConnectionError / DistNetworkError (Connection reset by peer)
#128970 closed
Jun 19, 2024 -
ONNX export for gelu at version 20
#128772 closed
Jun 19, 2024 -
UNSTABLE periodic / win-vs2019-cuda11.8-py3 / test (default)
#129065 closed
Jun 19, 2024 -
nn.Linear outputs differ on the same input tensor
#129029 closed
Jun 19, 2024 -
torch.Tensor.tolist() cancels torch.round()
#128943 closed
Jun 19, 2024 -
torch.compile crash - Aborted exit code 134
#125804 closed
Jun 19, 2024 -
[xpu] ERROR: Failed building wheel for triton when USE_XPU=1 make triton
#129042 closed
Jun 19, 2024 -
torch.gather can be slow on AMD with duplicated index
#128631 closed
Jun 19, 2024 -
[dynamo] Slow compile times for optimizers due to for loops
#110506 closed
Jun 18, 2024 -
[Inductor] [Distributed] DDP torch.compile model hangs on exit (python 3.8/3.9)
#125235 closed
Jun 18, 2024 -
torch.compile() bug in AOTAutograd or Dynamo
#103727 closed
Jun 18, 2024 -
Request: flag to know model is compiled after torch.compile()
#103553 closed
Jun 18, 2024 -
Dynamo trouble shooting dead link
#103276 closed
Jun 18, 2024 -
nondeterminism in torch.compile + custom op
#127995 closed
Jun 18, 2024 -
torch.compile + constructing an nn.Parameter + mutating it can give wrong results
#125284 closed
Jun 18, 2024 -
SyntaxError: unterminated string literal (detected at line 1) (<unknown>, line 1)
#127637 closed
Jun 18, 2024 -
torch._dynamo.export segfaults when calling nn.Parameter
#126109 closed
Jun 18, 2024 -
fullgraph=True doesn't actually raise error when you don't manage full graph inside DDP
#107639 closed
Jun 18, 2024 -
Tied Weight Embeddings Models Fail to Load on Torch 2.4 Nightly
#128011 closed
Jun 18, 2024 -
distributed.gather shape constraints
#103305 closed
Jun 18, 2024 -
DISABLED test_optimizer_parameters_sgd (__main__.TestTorchTidyProfiler)
#123624 closed
Jun 18, 2024 -
DISABLED test_isinstance (__main__.TestScript)
#123832 closed
Jun 18, 2024 -
DISABLED test_tensor_number_math (__main__.TestScript)
#123701 closed
Jun 18, 2024 -
DISABLED test_math_ops (__main__.TestScript)
#123693 closed
Jun 18, 2024 -
DISABLED test_index (__main__.TestScript)
#123635 closed
Jun 18, 2024 -
DISABLED test_number_math (__main__.TestScript)
#123660 closed
Jun 18, 2024 -
[export] Export warnings as no-ops
#113792 closed
Jun 18, 2024 -
Tracing per-param sharding FSDP: Dynamo tracing weakrefs
#114288 closed
Jun 18, 2024 -
`pytest test/dynamo/test_ctx_manager.py -v -k "test_autocast_graph_break_method"` fails locally
#117000 closed
Jun 18, 2024 -
Dynamo'ing Rprop, RMSprop, and Adadelta misses incrementing step due to skipping _init_group
#115679 closed
Jun 18, 2024 -
Performance impact of pre-division.
#128918 closed
Jun 18, 2024 -
[dynamo][eval frame] frame->f_locals is empty after call_callback
#118068 closed
Jun 18, 2024 -
[Dynamo] Better support DTensor
#117670 closed
Jun 18, 2024 -
Power and multiple multiplication don't give the same gradient
#128836 closed
Jun 18, 2024 -
Parameters out of sync over different ranks due to unused parameters
#128949 closed
Jun 18, 2024 -
compiling profiler with ExecutionTraceObserver breaks
#124500 closed
Jun 18, 2024 -
torchinductor error in torchao tests
#128263 closed
Jun 18, 2024 -
Inlining nn modules and FSDP
#128154 closed
Jun 18, 2024 -
address TODO: model is somehow not being freed when z3 is available
#127444 closed
Jun 18, 2024 -
Tracing through __getitem__ -> __len__ for ModuleList fails.
#126445 closed
Jun 18, 2024 -
UNSTABLE pull / win-vs2019-cpu-py3 / build
#103729 closed
Jun 18, 2024 -
UNSTABLE trunk / win-vs2019-cpu-py3 / build
#103732 closed
Jun 18, 2024 -
UNSTABLE trunk / win-vs2019-cuda11.8-py3 / build
#103733 closed
Jun 18, 2024 -
UNSTABLE periodic / win-vs2019-cuda11.8-py3 / build
#128855 closed
Jun 18, 2024 -
taking upper triangular of "-inf" matrix results in nan values
#128429 closed
Jun 18, 2024 -
heartbeatMonitor error after run script multiple times
#128680 closed
Jun 18, 2024 -
PT2 custom ops does not work with future annotations
#105157 closed
Jun 18, 2024 -
ImportError: cannot import name 'OrderedDict' from partially initialized module 'collections'
#128838 closed
Jun 18, 2024 -
OptimizedModule should call _orig_mod's load_state_dict()/state_dict() methods.
#123625 closed
Jun 17, 2024 -
[dynamo] Are we over guarding on `__defaults__`?
#123490 closed
Jun 17, 2024 -
Document the torch.cuda.profiler.profile function
#127901 closed
Jun 17, 2024 -
Can't compile torchaudio.transforms.Spectrogram
#121718 closed
Jun 17, 2024 -
[Dynamo] nn.Module forward hook ends up with a separate graph
#121695 closed
Jun 17, 2024 -
Using torch.compile, batch training step time takes long to converge when adding a LR Scheduler
#120934 closed
Jun 17, 2024 -
TypeError: unhashable type 'dict'
#120932 closed
Jun 17, 2024 -
InternalTorchDynamoError on KL Divergences
#120497 closed
Jun 17, 2024 -
inductor_torchbench_perf jobs are broken due to numpy 2.0 update
#128845 closed
Jun 17, 2024 -
`InternalTorchDynamoError: source code not available` when using multiple modules in ipynb
#119225 closed
Jun 17, 2024 -
Dynamo incorrectly classifies bound methods when used in a closure
#118988 closed
Jun 17, 2024 -
torch.compile() AssertionError: target must be of GPUTarget type
#128357 closed
Jun 17, 2024 -
Dynamo fails due to `TorchRuntimeError: slice step cannot be zero` with `dynamic=True`
#128827 closed
Jun 17, 2024 -
onnx.dynamo_export() fails on torch.numel()
#128882 closed
Jun 17, 2024 -
CuDNN Attention Kernel _scaled_dot_product_cudnn_attention unable to run.
#122695 closed
Jun 17, 2024 -
Error linking(name collision) to libtorch on Windows with OneAPI and Visual Studio
#128823 closed
Jun 17, 2024 -
HSDP + `set_optimizer_state_dict` errors with monolithic checkpointing
#128444 closed
Jun 17, 2024 -
UNSTABLE inductor-A100-perf-nightly / cuda12.1-py3.10-gcc9-sm80 / test (inductor_torchbench_perf)
#128848 closed
Jun 17, 2024 -
UNSTABLE inductor / cuda12.1-py3.10-gcc9-sm86 / test
#120841 closed
Jun 17, 2024 -
Document the torch.cuda.cudart function
#127908 closed
Jun 17, 2024 -
Document the torch.nn.parallel.scatter_gather.gather function
#127899 closed
Jun 17, 2024 -
Does torch have any features or future plans to improve performance on ARM?
#128817 closed
Jun 17, 2024 -
Link https://pytorch.org/docs/stable/nn.html#torch.nn.EmbeddingBag may not exist anymore
#128774 closed
Jun 17, 2024 -
Tensor.new_empty type annotation does not accept SymInt
#115456 closed
Jun 17, 2024 -
[dynamo] Trace through invalid bool tensor operations properly
#127003 closed
Jun 17, 2024 -
DISABLED test_pointwise_bessel_y1_cuda (__main__.GPUTests)
#127756 closed
Jun 17, 2024
113 Issues opened by 81 people
-
Error when loading a module file by calling torch::jit::load(..)
#129347 opened
Jun 24, 2024 -
NCCL Blocking Send/Recv are Non-blocking in practice
#129341 opened
Jun 24, 2024 -
`linspace()` can also use `complex` and `bool` type for `start` and `end` argument against the doc
#129338 opened
Jun 24, 2024 -
pytorch-nightly export KINETO_USE_DAEMON=1 Cannot initialize CUDA without ATen_cuda library
#129336 opened
Jun 23, 2024 -
`end`, `start` and `step` argument of `arange()` work with a 0D tensor against error messages
#129334 opened
Jun 23, 2024 -
`start` and `step` of `arange()` should be optional on the doc
#129333 opened
Jun 23, 2024 -
torch.Tensor.register_hook() source link does not work
#129332 opened
Jun 23, 2024 -
Exporting the operator 'aten::fft_fft' to ONNX opset version 12 is not supported.
#129331 opened
Jun 23, 2024 -
Fuyou Training Framework Integration for PyTorch
#129330 opened
Jun 23, 2024 -
_foreach_addc_
#129329 opened
Jun 23, 2024 -
Torch dynamo deep dive and overview discrepancy
#129328 opened
Jun 23, 2024 -
[export/dynamo] torch._check fails at compile time when the condition evaluates to False
#129327 opened
Jun 23, 2024 -
`repeat_interleave()` without `repeats` argument and `input` keyword works
#129322 opened
Jun 23, 2024 -
`int` type for `dims` of `tile()` without `dims=` works with a tensor against the doc
#129319 opened
Jun 22, 2024 -
`msort()` can use the 0D tensor of a complex type value against error message
#129312 opened
Jun 22, 2024 -
The unexpected behavior of `argsort()`
#129311 opened
Jun 22, 2024 -
Upgrade dependencies MKL and Intel OpenMP to 2024.2.0
#129310 opened
Jun 22, 2024 -
`argsort()` can use the 0D tensor of a complex type value against error message
#129309 opened
Jun 22, 2024 -
[ONNX][low pri] Move old (non-public) implementation into legacy/ and schedule for deprecation
#129308 opened
Jun 22, 2024 -
[ExecutionTraceObserver] Tracer gets stuck using Pytorch 2.2 versions for some models using torch.compile
#129306 opened
Jun 22, 2024 -
C++ API: add torch::manual_seed run error failed(-1073741819)
#129305 opened
Jun 22, 2024 -
`python3 setup.py bdist_wheel` tries to write to /usr/local/... during build
#129304 opened
Jun 22, 2024 -
Incorrect index from torch.mode
#129303 opened
Jun 22, 2024 -
The unexpected behavior of `sort()`
#129298 opened
Jun 22, 2024 -
`sort()` can use the 0D tensor of a `complex` type value against error message
#129296 opened
Jun 22, 2024 -
forward_ad ignores checkpoints
#129293 opened
Jun 22, 2024 -
DISABLED test_dummy_mha_with_nt_cuda (__main__.TestNestedTensorSubclassCUDA)
#129292 opened
Jun 22, 2024 -
Support for torch.Generator with JIT
#129282 opened
Jun 21, 2024 -
[ONNX] Create a new compiler in torchbench to start measuring torch-onnx
#129280 opened
Jun 21, 2024 -
[ONNX] Create unit tests for the new export path by adapting all existing tests
#129279 opened
Jun 21, 2024 -
[ONNX] Migrate logic from torch-onnx to torch.onnx
#129277 opened
Jun 21, 2024 -
[ONNX] Full support of dynamic axes
#129276 opened
Jun 21, 2024 -
[ONNX] Missing operator tracker
#129275 opened
Jun 21, 2024 -
[ONNX] Exporter improvement tasks
#129274 opened
Jun 21, 2024 -
RecursionError for MaskedTensor.where
#129272 opened
Jun 21, 2024 -
Move Memory Allocation for Autotuning out of the critical path
#129258 opened
Jun 21, 2024 -
UNSTABLE pull / linux-focal-py3.12-clang10-experimental-split-build / test (dynamo)
#129256 opened
Jun 21, 2024 -
UNSTABLE pull / linux-focal-py3.12-clang10-experimental-split-build / test (default)
#129248 opened
Jun 21, 2024 -
RuntimeError: get_parameter is not supported on ScriptModules
#129247 opened
Jun 21, 2024 -
Release corronspoding CUDA deps of `pytorch-nightly::pytorch-cuda` along with `pytorch-nightly::pytorch`
#129230 opened
Jun 21, 2024 -
Incorrect behavior of dtensor full_tensor for TP+FSDP2
#129229 opened
Jun 21, 2024 -
CVE-2024-5480 reported by security analyzers
#129228 opened
Jun 21, 2024 -
model.generate(..) slow and huge GPU memory consumption
#129226 opened
Jun 21, 2024 -
Why need to transpose when collate a sequence data in dataloader?
#129225 opened
Jun 21, 2024 -
_If pin_memory_thread is alive but dataqueue is empty, it will fall into a dead loop,
#129224 opened
Jun 21, 2024 -
ignore all target for CrossEntropyLoss 2d,return nan but 1d return 0
#129222 opened
Jun 21, 2024 -
`Conv1d` with out_channels > 65536 gives wrong result in MPS
#129207 opened
Jun 21, 2024 -
Bug in calling full_tensor() when model is sharded with tensor parallel and FSDP-2
#129206 opened
Jun 21, 2024 -
RuntimeError when using torch.ops.aten._jagged_to_padded_dense_forward with large jagged tensors
#129191 opened
Jun 21, 2024 -
extending forward-mode AD docs should really point to an example
#129176 opened
Jun 20, 2024 -
[custom_op] support str as default values
#129175 opened
Jun 20, 2024 -
[custom_op] support dtype as default values
#129174 opened
Jun 20, 2024 -
TORCHINDUCTOR_FORCE_DISABLE_CACHES=1 doesn't appear to clear file cache
#129159 opened
Jun 20, 2024 -
Example usage for `convert_conv3d_weight_memory_format` does not work anymore
#129158 opened
Jun 20, 2024 -
DISABLED test_nn_sequential_invocation (__main__.MiscTests)
#129156 opened
Jun 20, 2024 -
DISABLED test_metadata_parsing_with_layer_split (__main__.TestSerialize)
#129155 opened
Jun 20, 2024 -
[RFC][C10D] Avoid creating new nccl communicator for each P2P pair
#129140 opened
Jun 20, 2024 -
Torch compile initialises CUDA context, even compiling CPU functions
#129131 opened
Jun 20, 2024 -
Questions about CVE-2024-31583 and CVE-2024-31580
#129122 opened
Jun 20, 2024 -
interpolate nearest get values zero when outputs over 4G elements
#129118 opened
Jun 20, 2024 -
[inductor][perf] Inductor/Triton softmax kernel is slower than eager
#129104 opened
Jun 20, 2024 -
[inductor][perf] Suboptimal codegen for horizontally fused softmax
#129102 opened
Jun 20, 2024 -
Torch Threading causes Seg Fault in pygame.
#129100 opened
Jun 20, 2024 -
take_along_dim or gather unstable results on cpu with stride 1
#129093 opened
Jun 19, 2024 -
Adding betainc
#129085 opened
Jun 19, 2024 -
UNSTABLE pull / linux-focal-cuda12.1-py3.10-gcc9-experimental-split-build / test (default)
#129080 opened
Jun 19, 2024 -
Regression in loading optimizer learning rate
#129079 opened
Jun 19, 2024 -
return type of torch.nn.functional.interpolate not working
#129053 opened
Jun 19, 2024 -
Add comment for label_smoothing parameter in torch.nn.CrossEntropyLoss
#129050 opened
Jun 19, 2024 -
[PT2E Quantization] Graph with concatenation of the same node will raise RecursionError when prepare_pt2e
#129038 opened
Jun 19, 2024 -
torch parallel Broadcast inconsistency
#129032 opened
Jun 19, 2024 -
Extract some public APIs from torch::cuda::initModule(module) to torch::initModule()
#129027 opened
Jun 19, 2024 -
Expand Tag Set
#129020 opened
Jun 19, 2024 -
Zluda Support
#129019 opened
Jun 19, 2024 -
Spurious "socket cannot be initialized" error messages
#128998 opened
Jun 18, 2024 -
Look up tensor device member inside Tensor is_pinned() implementation instead of accepting an outside input
#128988 opened
Jun 18, 2024 -
[RFC][Pipelining] Support separate dW/dInput in Schedule and Stage
#128974 opened
Jun 18, 2024 -
RuntimeError: NCCL error: invalid usage when deploy LLM model by vllm. (torch version: 2.3.0+cu118)
#128963 opened
Jun 18, 2024 -
`torch.compile` fails with `fullgraph=True` when accessing `getitem` of a `Tensor` subclass
#128961 opened
Jun 18, 2024 -
[Dynamo x torch_function] magic methods and methods with regular Tensors don't seem to work
#128960 opened
Jun 18, 2024 -
caffe2 removal
#128959 opened
Jun 18, 2024 -
[Dynamo] Maybe we shouldn't attempt to recursively compile inside of a frame if the frame hit cache limit
#128954 opened
Jun 18, 2024 -
Question about torch.lstsq and torch.linalg.lstsq
#128952 opened
Jun 18, 2024 -
Memory consumption of conv3d grows too quickly with certain input shapes.
#128947 opened
Jun 18, 2024 -
[MAC] Convolution with kernel size 3 yields different results depending on whether gradient is enabled or not.
#128945 opened
Jun 18, 2024 -
torch.compile graph break due to unsupported builtin filter function
#128944 opened
Jun 18, 2024 -
torch.compile graph break with unsupported LOAD_BUILD_CLASS
#128942 opened
Jun 18, 2024 -
Inquiry Regarding PyTorch Data Mirroring and Proxy Services
#128940 opened
Jun 18, 2024 -
Unable to install PyTorch on M1 Macos with Python 3.10.14
#128939 opened
Jun 18, 2024 -
Performance degradation for certain input using Conv2D
#128936 opened
Jun 18, 2024 -
version inquiry
#128934 opened
Jun 18, 2024 -
[inductor][cpu]transformers models static/dynamic quant performance/accuracy crash in 2024-06-17 nightly release
#128933 opened
Jun 18, 2024 -
onnx.export() fails on aten::embedding_bag with padding_idx
#128930 opened
Jun 18, 2024 -
xpu: set of not implemented aten ops affecting huggingface tests
#128914 opened
Jun 18, 2024 -
[pipelining] Free memory in stage after use
#128910 opened
Jun 17, 2024 -
Unable to export Phi-3-vision model to exported program
#128906 opened
Jun 17, 2024 -
TypeError: Cannot convert symbols to int
#128895 opened
Jun 17, 2024 -
Improve concat fusion with matmuls when autotuning
#128889 opened
Jun 17, 2024 -
UNSTABLE inductor / rocm6.1-py3.8-inductor / test (inductor)
#128871 opened
Jun 17, 2024 -
Update PyTorch CI to numpy 2.0
#128860 opened
Jun 17, 2024 -
UNSTABLE inductor-cu124 / cuda12.4-py3.10-gcc9-sm86 / test (inductor_torchbench)
#128851 opened
Jun 17, 2024 -
UNSTABLE inductor-cu124 / cuda12.4-py3.10-gcc9-sm86 / test (dynamic_inductor_torchbench)
#128850 opened
Jun 17, 2024 -
UNSTABLE inductor-cu124 / cuda12.4-py3.10-gcc9-sm86 / test (aot_inductor_torchbench)
#128849 opened
Jun 17, 2024 -
Windows builds with VS2022
#128835 opened
Jun 17, 2024 -
TORCHDYNAMO_REPRO_AFTER=aot produces invalid repro code
#128830 opened
Jun 17, 2024 -
Questions about TCPStoreLibUV
#128821 opened
Jun 17, 2024 -
Environment gating of CUDA_VISIBLE_DEVICES returns a CUDA initialization error.
#128819 opened
Jun 17, 2024 -
ONNX Dynamo Export - Unsupported FX nodes: {'call_function': ['aten._upsample_bilinear2d_aa.default']}.
#128818 opened
Jun 17, 2024
502 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[runtime asserts] deduplicate runtime asserts & CSE
#128599 commented on
Jun 22, 2024 • 38 new comments -
Don't decompose functional composite ops in export inference IR
#128077 commented on
Jun 22, 2024 • 31 new comments -
Fix device propagation for checkpointing
#128671 commented on
Jun 24, 2024 • 23 new comments -
[vision hash update] update the pinned vision hash
#125806 commented on
Jun 24, 2024 • 21 new comments -
[Inductor][ROCm] Composable Kernel backend for Inductor
#125453 commented on
Jun 23, 2024 • 20 new comments -
[cuDNN] Graph-capturable cuDNN CTCLoss
#128271 commented on
Jun 22, 2024 • 20 new comments -
[inductor] Add lowering and codegen for aten.sort
#128458 commented on
Jun 22, 2024 • 19 new comments -
General MPS op coverage tracking issue
#77764 commented on
Jun 23, 2024 • 17 new comments -
[v.2.4.0] Release Tracker
#128436 commented on
Jun 21, 2024 • 15 new comments -
[BE] enable UFMT for `torch/nn/functional.py`
#128592 commented on
Jun 23, 2024 • 13 new comments -
[Inductor][CPP] Enable Local Buffer for Outer loop fusion
#126967 commented on
Jun 20, 2024 • 13 new comments -
Add decompositions for copy variants of view ops
#128416 commented on
Jun 21, 2024 • 12 new comments -
Created docs for make_fx in torch.fx.experimental.proxy_tensor
#128441 commented on
Jun 21, 2024 • 10 new comments -
[Nested Tensor]fix sdpa backward for the special case with ragged second batch dim and constant length
#128349 commented on
Jun 18, 2024 • 9 new comments -
Add `torch.put_along_dim` and `torch.put_along_dim_` like `np.put_along_axis`
#125601 commented on
Jun 20, 2024 • 9 new comments -
skip test_graph_capture_oom for jetson
#128661 commented on
Jun 19, 2024 • 9 new comments -
Fix tensor subclass + dynamic shapes in torch.compile + aot autograd
#125941 commented on
Jun 20, 2024 • 8 new comments -
[inductor] switch CppCodeCache to new cpp_builder.
#128303 commented on
Jun 21, 2024 • 8 new comments -
TorchInductor CPU Performance Dashboard
#93531 commented on
Jun 18, 2024 • 8 new comments -
Support allowlisted modules and op overloads in AOTAutogradCache
#128329 commented on
Jun 21, 2024 • 8 new comments -
Add aten._unsafe_masked_index
#116491 commented on
Jun 20, 2024 • 8 new comments -
Modularize aten parameter parser and checker
#125308 commented on
Jun 20, 2024 • 7 new comments -
Change index_put on GPU to accept FP8 inputs
#128758 commented on
Jun 22, 2024 • 7 new comments -
[ROCm] Check supported archs before setting preferred blas backend to hipblasLT
#128753 commented on
Jun 22, 2024 • 7 new comments -
[NT] Implementing Multi-Head Attention with NestedTensors
#125214 commented on
Jun 23, 2024 • 6 new comments -
[DO NOT MERGE] Testing cuDNN SDPA on sm80+
#128571 commented on
Jun 19, 2024 • 6 new comments -
[CI] add xpu test in periodic workflow
#126410 commented on
Jun 21, 2024 • 6 new comments -
Reduce all tensors to their metadata in AOTAutogradCache; add tests
#128583 commented on
Jun 21, 2024 • 6 new comments -
[ROCm] hipSPARSELt Integration
#124320 commented on
Jun 21, 2024 • 6 new comments -
[CI] Enable amp accuracy check for inductor cpu
#127758 commented on
Jun 24, 2024 • 6 new comments -
Write dynamo benchmarks performance result to csv when throw exceptions
#126764 commented on
Jun 17, 2024 • 6 new comments -
[RFC] Add support for device extension autoloading
#127074 commented on
Jun 21, 2024 • 6 new comments -
[ROCm] Unskip scaled_dot_product_attention tests on ROCm
#127966 commented on
Jun 20, 2024 • 6 new comments -
[1/N] Enable unused variable warnings on torch_cpu and fix some violations
#128670 commented on
Jun 23, 2024 • 6 new comments -
ROCm: `fatal error: aotriton/flash.h: No such file or directory` when building with `USE_ROCM=1`
#125230 commented on
Jun 22, 2024 • 5 new comments -
[AOTAutograd] Micro-optimize runtime_wrapper
#128188 commented on
Jun 20, 2024 • 5 new comments -
Feat: Updated torch.nn.Modules.set_submodules()
#127714 commented on
Jun 21, 2024 • 5 new comments -
[RFC][pipelining] PipelineStage should let user control send/recv endpoints
#128665 commented on
Jun 20, 2024 • 5 new comments -
fix torch.prod vectorized path for bool
#128009 commented on
Jun 22, 2024 • 5 new comments -
[3/N] Non-Tensor: Support string parameter for aten operations
#125831 commented on
Jun 20, 2024 • 5 new comments -
[halide-backend] Add GPU support
#127506 commented on
Jun 23, 2024 • 5 new comments -
sdp::SDPBackend::flash_attention support PrivateUse1
#126392 commented on
Jun 20, 2024 • 5 new comments -
2.6.0 Released a second time on the same version breaking production customers
#128653 commented on
Jun 21, 2024 • 4 new comments -
[RFC] Per-Parameter-Sharding FSDP
#114299 commented on
Jun 18, 2024 • 4 new comments -
Nested tensor subclass support
#127431 commented on
Jun 21, 2024 • 4 new comments -
[Inductor][Intel GPU] Support codegen empty_strided_xpu, align with #118255.
#126678 commented on
Jun 21, 2024 • 4 new comments -
Custom attention recompilations
#121504 commented on
Jun 18, 2024 • 4 new comments -
crash@sleef_tryVXE2 () while trying to run torch.compile() BERT model
#128503 commented on
Jun 19, 2024 • 4 new comments -
Fp8 support for item() with cuda, index_select, and fill_ with cpu
#128780 commented on
Jun 18, 2024 • 4 new comments -
Doc (nn): improve doc-string of class Linear.
#128792 commented on
Jun 22, 2024 • 4 new comments -
[sparse] Add cuSPARSELt as a backend
#128534 commented on
Jun 20, 2024 • 4 new comments -
Errors when 0-dim tensor of complex or bool type passed to aminmax.
#128404 commented on
Jun 22, 2024 • 4 new comments -
[docs] Urls changed => forum links would become invalid
#39007 commented on
Jun 17, 2024 • 4 new comments -
Make `torch.autograd.Function` support `vmap`
#128020 commented on
Jun 18, 2024 • 3 new comments -
[xla hash update] update the pinned xla hash
#126672 commented on
Jun 17, 2024 • 3 new comments -
Fixed CUDA randint generation for large ranges.
#126066 commented on
Jun 20, 2024 • 3 new comments -
Remove ProcessGroupCudaP2P and change async-TP to use SymmetricMemory
#128762 commented on
Jun 22, 2024 • 3 new comments -
[ONNX] Skip assertion nodes
#126889 commented on
Jun 22, 2024 • 3 new comments -
Remove unused type traits in torch/csrc/utils
#128799 commented on
Jun 21, 2024 • 3 new comments -
[pytree] update treespec `children_specs` access
#116374 commented on
Jun 22, 2024 • 3 new comments -
[WIP] mark NestedInts as symints instead of symfloats
#127587 commented on
Jun 17, 2024 • 3 new comments -
adjust thresholds for gluon_inception_v3, beit_base_patch16_224, phli…
#127664 commented on
Jun 19, 2024 • 3 new comments -
[inline-inbuilt-nn-modules] Torch compile with DDP errors on parameterized modules
#113415 commented on
Jun 23, 2024 • 3 new comments -
Separate AOTI Eager utils as a single file
#125819 commented on
Jun 20, 2024 • 3 new comments -
[CI] Add inductor cpu accuracy test running on AVX2 runners
#128682 commented on
Jun 18, 2024 • 3 new comments -
[cuDNN][Quantization] Don't print when plan finalization fails in cuDNN quantization backend
#128177 commented on
Jun 19, 2024 • 3 new comments -
[CI][BE] Update retry action to v3.0.0
#119403 commented on
Jun 18, 2024 • 3 new comments -
torch.compile not compatible with multiprocessing pool
#97992 commented on
Jun 17, 2024 • 3 new comments -
[Profiler] Add TSC Clock Callback to CUPTI
#125036 commented on
Jun 22, 2024 • 3 new comments -
`torch.special.gammainc` backward pass with respect to the first argument
#80025 commented on
Jun 17, 2024 • 3 new comments -
Let dynamo inline functional_call
#128646 commented on
Jun 20, 2024 • 3 new comments -
Support for expandable segments with cuda graph trees
#128068 commented on
Jun 19, 2024 • 3 new comments -
Add support for XPU accumulate type
#128579 commented on
Jun 21, 2024 • 3 new comments -
dynamo: use equality guards instead of id guards for Placement/DeviceMesh
#124401 commented on
Jun 22, 2024 • 3 new comments -
[cuDNN][SDPA] Remove `TORCH_CUDNN_SDPA_ENABLED=1`, enable cuDNN SDPA by default on H100 and 2nd on other archs >= sm80
#125343 commented on
Jun 21, 2024 • 3 new comments -
[Split Build][no commit] Test CI with builder changes
#127958 commented on
Jun 19, 2024 • 3 new comments -
`__getitem__` fails to vmap for one dimensional tensors
#124423 commented on
Jun 18, 2024 • 3 new comments -
Flex Decoding
#128678 commented on
Jun 21, 2024 • 3 new comments -
[WIP] Warn on future divergent behavior for conditional views
#126129 commented on
Jun 22, 2024 • 2 new comments -
2 Dynamo test are failing with "'int' object has no attribute 'wrapped'".
#120650 commented on
Jun 18, 2024 • 2 new comments -
Improve debugability of warnings/errors "Triggered internally at"
#128064 commented on
Jun 18, 2024 • 2 new comments -
Support "symmetric" reflection padding
#46240 commented on
Jun 21, 2024 • 2 new comments -
```FlopCounterMode``` returns 0 when inference mode is on during forwardpropagation.
#126268 commented on
Jun 23, 2024 • 2 new comments -
Dynamo should prune non-live captured variables
#127350 commented on
Jun 18, 2024 • 2 new comments -
Don't run addruntimeassertion pass
#125948 commented on
Jun 18, 2024 • 2 new comments -
2nd compile of deepcopy(model) fails on multiple ubuntu-pc (fatal error: Python.h: file not found)
#128121 commented on
Jun 18, 2024 • 2 new comments -
Expected grad_output types don't match available grad_output types when using tensor parallelism with DTensors
#128636 commented on
Jun 18, 2024 • 2 new comments -
Spectral Normalization can not be applied to Conv{1,2,3}d
#99149 commented on
Jun 24, 2024 • 2 new comments -
Pytorch ROCM windows builds
#109204 commented on
Jun 21, 2024 • 2 new comments -
Failed to compile: null in call to `__builtin_memmove(__result, __first, sizeof(_Tp) * _Num);` Debian 12, ppc64le, gcc 12.2
#112089 commented on
Jun 22, 2024 • 2 new comments -
Dynamo export: limited support in Torch.cond
#123972 commented on
Jun 21, 2024 • 2 new comments -
cudnn not found
#15167 commented on
Jun 21, 2024 • 2 new comments -
Bug in `torch.compile` with standard type checking tools beartype
#122093 commented on
Jun 18, 2024 • 2 new comments -
`torch.cuda.memory_summary()` can give `KeyError`
#117130 commented on
Jun 19, 2024 • 2 new comments -
[dtensor][test] test case suite for comm_mode features
#128729 commented on
Jun 21, 2024 • 2 new comments -
[cond] inlining into one of the branches when pred is a python constant
#128709 commented on
Jun 20, 2024 • 2 new comments -
[c10d][simple] increase the default heartbeat timeout to be larger
#128751 commented on
Jun 21, 2024 • 2 new comments -
[dynamo] Fakify result of delegate
#128752 commented on
Jun 18, 2024 • 2 new comments -
New example for preserve_node_meta
#128681 commented on
Jun 18, 2024 • 2 new comments -
[Fix] torch.numel() in TSCovnerter
#128761 commented on
Jun 22, 2024 • 2 new comments -
[Bug] The cuDNN version is too old!
#128207 commented on
Jun 17, 2024 • 2 new comments -
GradType: a subset of dtype that is differentiable, containing all float and complex dtypes
#128793 commented on
Jun 17, 2024 • 2 new comments -
partitioner doesn't appear to respect SAC region
#128730 commented on
Jun 20, 2024 • 2 new comments -
autograd with `is_grads_batched=True` fails on GroupNorm
#128703 commented on
Jun 17, 2024 • 2 new comments -
[DDP] DDP bucket memory release during fwd step
#128696 commented on
Jun 17, 2024 • 2 new comments -
Add MaskedTensor support to _is_any_true
#128574 commented on
Jun 18, 2024 • 2 new comments -
[C10d]: Work state in dump trace file is not deterministic.
#128805 commented on
Jun 19, 2024 • 2 new comments -
Unable to assign `nn.Parameter(DTensor)` (created outside of compile region) to an nn.Module param attribute during Dynamo tracing
#128742 commented on
Jun 18, 2024 • 2 new comments -
Tracing per-param sharding FSDP
#114286 commented on
Jun 18, 2024 • 2 new comments -
[inductor] use same method to handle exception with aten
#127868 commented on
Jun 19, 2024 • 2 new comments -
[Intel GPU] xpu-ops codegen via backend whitelist
#127865 commented on
Jun 19, 2024 • 2 new comments -
[Dynamo] Fix refleak in 3.12+ and Dynamic Shapes test_parameter_free
#124302 commented on
Jun 18, 2024 • 2 new comments -
Added hpu backend support in fsdp utils
#127757 commented on
Jun 18, 2024 • 2 new comments -
Long compilation time for hf_T5_generate inference cause timeout
#121989 commented on
Jun 17, 2024 • 2 new comments -
[caffe2][be][2/n] migrate gloabl static initializer
#127620 commented on
Jun 23, 2024 • 2 new comments -
Save quantization_tag in export graph serialization
#127473 commented on
Jun 20, 2024 • 2 new comments -
Using autograd.Functions defined in torch/ cause graph breaks
#118334 commented on
Jun 18, 2024 • 2 new comments -
[triton hash update] update the pinned triton hash
#115529 commented on
Jun 17, 2024 • 2 new comments -
[dynamo] Automatically convert loop bodies to function calls
#113538 commented on
Jun 21, 2024 • 2 new comments -
torch.cumprod will silently cast the output data type to int64
#128294 commented on
Jun 18, 2024 • 2 new comments -
[PT2D] Make the speedup benchmark works with DDP + CompiledAutograd
#121315 commented on
Jun 23, 2024 • 1 new comment -
turned on matrix-multiplication => matrix-vector multiplication always on if reduction-dim is contiguous
#120954 commented on
Jun 19, 2024 • 1 new comment -
Increase riscv implementation in DepthwiseConvKernel
#127867 commented on
Jun 18, 2024 • 1 new comment -
PyPy support
#17835 commented on
Jun 23, 2024 • 1 new comment -
`RuntimeError: invalid dtype for bias` when use compile + autocast
#124901 commented on
Jun 23, 2024 • 1 new comment -
[Dynamo] Check for __bool__ attribute before accessing it
#120943 commented on
Jun 18, 2024 • 1 new comment -
[FSDP] Removed clamp to `NO_SHARD` for world size 1
#120334 commented on
Jun 23, 2024 • 1 new comment -
[WIP]Intel GPU oneDNN upstreaming: Conv pointwise fusion
#118064 commented on
Jun 18, 2024 • 1 new comment -
[WIP]Intel GPU oneDNN upstreaming: Linear pointwise fusion
#117824 commented on
Jun 18, 2024 • 1 new comment -
[pytree] support PyStructSequence types for Python pytree
#113258 commented on
Jun 22, 2024 • 1 new comment -
SummaryWriter reports encoding error
#73909 commented on
Jun 20, 2024 • 1 new comment -
[RFC] A device-agnostic Python runtime API design for stream-based accelerators
#128403 commented on
Jun 20, 2024 • 1 new comment -
`torch.compile` with `reduce-overhead`: very long compile time + GPU memory continuously to grow
#128424 commented on
Jun 20, 2024 • 1 new comment -
does FSDP support AMSP (a new DP shard strategy)
#128706 commented on
Jun 20, 2024 • 1 new comment -
Fan out calculation broken for group (depthwise) convolution
#23854 commented on
Jun 20, 2024 • 1 new comment -
ROCm loses some supported GPUs by requiring hipblaslt
#119081 commented on
Jun 20, 2024 • 1 new comment -
PyTorch trunk is frequently broken
#128180 commented on
Jun 20, 2024 • 1 new comment -
DISABLED test_arange_dynamic_cuda (__main__.TestInductorDynamicCUDA)
#127067 commented on
Jun 20, 2024 • 1 new comment -
custom ops should have needs_fixed_stride_order by default
#124647 commented on
Jun 20, 2024 • 1 new comment -
Dynamo silently ignores TorchDispatchMode
#105929 commented on
Jun 20, 2024 • 1 new comment -
[compile] output does not match eager mode
#100075 commented on
Jun 20, 2024 • 1 new comment -
Torch.compile Error: RuntimeError: aten::_conj() Expected a value of type 'Tensor' for argument 'self' but instead found type 'complex'.
#105290 commented on
Jun 20, 2024 • 1 new comment -
torch.compile incorrect when imperative autograd APIs are used
#91468 commented on
Jun 20, 2024 • 1 new comment -
[RFC] Support reinplaceble ops for custom ops in Inductor
#124933 commented on
Jun 20, 2024 • 1 new comment -
Inductor generates unnecessary allocation + copy operations for custom ops with mutable inputs
#127660 commented on
Jun 20, 2024 • 1 new comment -
Add `int` type to `device` parameter of torch.set_default_device() on the doc
#126646 commented on
Jun 20, 2024 • 1 new comment -
custom ops with needs_fixed_stride_order doesn't work with auto_functionalized
#128084 commented on
Jun 20, 2024 • 1 new comment -
version libcudnn_ops_infer.so.8 not defined in file libcudnn_ops_infer.so.8 with link time reference
#104591 commented on
Jun 21, 2024 • 1 new comment -
Pytorch dataloader not loading first-available data with multiple workers
#105203 commented on
Jun 21, 2024 • 1 new comment -
custom_op API follow-ups
#101191 commented on
Jun 21, 2024 • 1 new comment -
Memory usage steadily increasing when using back propagation with sparse CSR parameter matrices on CPU
#109445 commented on
Jun 21, 2024 • 1 new comment -
Pytorch build from source failed with GCC 12.3
#127920 commented on
Jun 22, 2024 • 1 new comment -
compilation fails `error: invalid argument '-std=c++17' not allowed with 'C'`
#103222 commented on
Jun 22, 2024 • 1 new comment -
Dead link in `torch.compile` docs
#119272 commented on
Jun 22, 2024 • 1 new comment -
UserWarning: The TorchScript type system doesn't support instance-level annotations on empty non-base types in `__init__`.
#89064 commented on
Jun 22, 2024 • 1 new comment -
[dynamo] Dynamo traces through __torch_dispatch__ on custom tensor subclasses
#128160 commented on
Jun 22, 2024 • 1 new comment -
[feature request] New function `torch.slice(...)` mirroring TorchScript op signature or add step argument to `torch.narrow`
#41625 commented on
Jun 22, 2024 • 1 new comment -
RuntimeError: Expected to have finished reduction in the prior iteration before starting a new one.
#43259 commented on
Jun 22, 2024 • 1 new comment -
Add Swiglu activation function
#128712 commented on
Jun 23, 2024 • 1 new comment -
[FSDP2] Allowed `List[nn.Module]` as arg
#127786 commented on
Jun 22, 2024 • 1 new comment -
Supervisor as a torchrun rendezvous impl
#127515 commented on
Jun 20, 2024 • 1 new comment -
Implement a generic function scheduler
#127200 commented on
Jun 21, 2024 • 1 new comment -
[Split Build] Test split build in CI
#126699 commented on
Jun 19, 2024 • 1 new comment -
Use float data type for Half var_sum in batchnorm stats updating on CPU
#126525 commented on
Jun 24, 2024 • 1 new comment -
Re-implement pin_memory to be device-agnostic by leveraging the Accelerator concept
#126376 commented on
Jun 19, 2024 • 1 new comment -
[4/N] Non-Tensor: Support layout, device and dtype for aten operations
#125897 commented on
Jun 19, 2024 • 1 new comment -
bool inherited from number
#125577 commented on
Jun 24, 2024 • 1 new comment -
Add raise_last_usage memory optimization pass to Inductor
#125559 commented on
Jun 19, 2024 • 1 new comment -
add uuid in cudaDeviceProperties
#125083 commented on
Jun 19, 2024 • 1 new comment -
[rfc]: vendor in open-telemetry
#124800 commented on
Jun 24, 2024 • 1 new comment -
Allow device tensors that use numpy for serialization to use weights_only unpickler
#124763 commented on
Jun 23, 2024 • 1 new comment -
Fix issue 112919
#124746 commented on
Jun 23, 2024 • 1 new comment -
Prevent cuda:0 context initialization when working on another cuda device
#124722 commented on
Jun 23, 2024 • 1 new comment -
[pytorch] Add a c10::Bfloat16 ctor in OSS repo which takes __hip_bfloat16
#124713 commented on
Jun 22, 2024 • 1 new comment -
Skip `deepspeed` and `triton` in dynamo
#124273 commented on
Jun 18, 2024 • 1 new comment -
[Inductor] support masked vectorization for the tail_loop for integer datatypes and bool datatype
#128802 commented on
Jun 17, 2024 • 1 new comment -
[caffe2][be] migrate gloabl static initializer
#128784 commented on
Jun 21, 2024 • 1 new comment -
AutogradMeta is nullptr for non-differentiable tensors on creation
#128746 commented on
Jun 19, 2024 • 1 new comment -
Kineto profiler: collecting observer traces from C++ child threads
#128743 commented on
Jun 18, 2024 • 1 new comment -
Raise exception if torch.func.* calls torch.compile functions
#128736 commented on
Jun 18, 2024 • 1 new comment -
fix the decomposition of aten.threshold
#128707 commented on
Jun 18, 2024 • 1 new comment -
Remove caffe2 namespace from c10/macros/Macros.h
#128672 commented on
Jun 18, 2024 • 1 new comment -
[POC] Split before autograd allow in graph
#128647 commented on
Jun 20, 2024 • 1 new comment -
Allow get attributes on DDP similar to FSDP
#128620 commented on
Jun 18, 2024 • 1 new comment -
Add hooks for execution on intel gaudi devices - 1
#128584 commented on
Jun 24, 2024 • 1 new comment -
[ROCm] Enable F8 Inductor Unit tests
#128353 commented on
Jun 21, 2024 • 1 new comment -
Ignore functional tensor wrapper when caching
#128335 commented on
Jun 21, 2024 • 1 new comment -
update call map to allow multiple input parameters
#128282 commented on
Jun 20, 2024 • 1 new comment -
Wrap output with FakeTensor if input FakeTensor is not preserved
#128206 commented on
Jun 24, 2024 • 1 new comment -
Make Tensor's __dlpack__ and __dlpack_device__ account for XLA.
#128176 commented on
Jun 21, 2024 • 1 new comment -
[pipelining] enable inputs for all model stages
#128115 commented on
Jun 17, 2024 • 1 new comment -
[inductor] custom do_bench_gpu with smart cache flushing
#127953 commented on
Jun 22, 2024 • 1 new comment -
[sym_shapes][perf] Optimize bound_sympy avoiding sympy equals
#124211 commented on
Jun 22, 2024 • 1 new comment -
Remove seq support check in process group
#124138 commented on
Jun 18, 2024 • 1 new comment -
Add realize after pointwise lowering
#124118 commented on
Jun 22, 2024 • 1 new comment -
Hacks to work around that ScriptMethod does not have code/signature
#124115 commented on
Jun 18, 2024 • 1 new comment -
Update README.md
#124028 commented on
Jun 19, 2024 • 1 new comment -
[BE]: Update NCCL submodule to 2.21.5
#124014 commented on
Jun 23, 2024 • 1 new comment -
[FSDP] Move the flattened tensors back to GPU to prevent CPU OOM
#124008 commented on
Jun 19, 2024 • 1 new comment -
Aarch64 cd upgrade
#123747 commented on
Jun 17, 2024 • 1 new comment -
Fix numerical instability in vector_norm when receiving large size tensor
#123416 commented on
Jun 22, 2024 • 1 new comment -
Reenable dim for python 3.12
#123384 commented on
Jun 23, 2024 • 1 new comment -
prototype for graph transform observer
#123361 commented on
Jun 20, 2024 • 1 new comment -
Dynamo: support proxying tensor subclass constructors, including with non-fx types
#123350 commented on
Jun 17, 2024 • 1 new comment -
DTensor: avoiding crashing on dynamic shapes in a few places
#123349 commented on
Jun 17, 2024 • 1 new comment -
Add Gaudi support to benchmarks/dynamo/* benchmark.
#122960 commented on
Jun 23, 2024 • 1 new comment -
[Dynamic Shapes] Fix error handling for indirectly fully constrained dynamic dimensions
#122913 commented on
Jun 22, 2024 • 1 new comment -
[Quant][PT2E] enable qlinear post op fusion for dynamic quant & qat
#122667 commented on
Jun 20, 2024 • 1 new comment -
[Not for review] Collect cpp_wrapper dashboard status
#124691 commented on
Jun 22, 2024 • 1 new comment -
[WIP}[FSDP] Switch to more memory efficient impl of _sync_module_states
#124679 commented on
Jun 22, 2024 • 1 new comment -
[TESTING] Don't clamp upper to 2
#124631 commented on
Jun 22, 2024 • 1 new comment -
chore(quantization): Enable PT2E symmetric dynamic quantization
#124615 commented on
Jun 22, 2024 • 1 new comment -
Make the CI failures less noicy
#124558 commented on
Jun 20, 2024 • 1 new comment -
[testing] ... int(True) != 1 ??
#124539 commented on
Jun 19, 2024 • 1 new comment -
[do not review] Add API for setting backward stream
#124538 commented on
Jun 19, 2024 • 1 new comment -
modified documentation torch.histogramdd ISSUE#124435
#124537 commented on
Jun 21, 2024 • 1 new comment -
Update README.md
#124514 commented on
Jun 18, 2024 • 1 new comment -
[Environment Variable][2/N] Use thread-safe setenv wrapper
#124485 commented on
Jun 22, 2024 • 1 new comment -
Use thread-safe getenv wrapper
#124478 commented on
Jun 22, 2024 • 1 new comment -
[sym_shapes][perf] Skip repetitive check_is_size on same expr
#124471 commented on
Jun 18, 2024 • 1 new comment -
[comm] Ensure graceful shutdown by waiting watchdog thread to finish
#124467 commented on
Jun 18, 2024 • 1 new comment -
Pianpwk/dynamo qualname
#124434 commented on
Jun 18, 2024 • 1 new comment -
Add trace_via_export option and allow exporting funcs
#124431 commented on
Jun 18, 2024 • 1 new comment -
Update _custom_ops.py to accomodate renaming of impl_abstract
#124410 commented on
Jun 18, 2024 • 1 new comment -
[DONOTREVIEW][DTenosr][Test] DTensor 2D sharding
#124339 commented on
Jun 22, 2024 • 1 new comment -
DISABLED test_deepcopy_after_parametrization_swap_True (__main__.TestNNParametrization)
#127738 commented on
Jun 18, 2024 • 1 new comment -
RuntimeError using nested tensor in Apple M1 device MPS
#127743 commented on
Jun 18, 2024 • 1 new comment -
`CompileProfiler` reports graph breaks while `dynamo.explain` reports no graph breaks
#113443 commented on
Jun 18, 2024 • 1 new comment -
[pt2d] module register_pre_forward_hook and register_forward_hook triggered graph break when it's root module
#117584 commented on
Jun 18, 2024 • 1 new comment -
[RFC] PyTorch next wheel build platform: manylinux-2.28
#123649 commented on
Jun 18, 2024 • 1 new comment -
CUDA error in torch.cdist with compute_mode=donot_use_mm_for_euclid_dist
#128791 commented on
Jun 18, 2024 • 1 new comment -
MultiheadAttention returns NaNs when need_weights=False for long sequences with a mask that ignores old tokens
#127055 commented on
Jun 18, 2024 • 1 new comment -
Dynamo ignores frame using yield
#126360 commented on
Jun 18, 2024 • 1 new comment -
The "step unsupported" graph break will make dynamo can't completely trace code after break
#125141 commented on
Jun 18, 2024 • 1 new comment -
Change the type hint for nn.Module.__call__ to be friendly to overrides.
#74746 commented on
Jun 18, 2024 • 1 new comment -
upstream `apex.normalization.FusedRMSNorm`
#72643 commented on
Jun 18, 2024 • 1 new comment -
Support Exceptions in Pytorch Export
#123499 commented on
Jun 17, 2024 • 1 new comment -
masked_index_add
#122092 commented on
Jun 17, 2024 • 1 new comment -
ImportError: libcudnn.so.8: cannot open shared object file: No such file or directory
#104259 commented on
Jun 17, 2024 • 1 new comment -
Dynamo doesn't generate resume calls after graph breaking on log calls
#120375 commented on
Jun 17, 2024 • 1 new comment -
31 Dynamo test are failing with "'NoneType' object has no attribute 'profiler'".
#119783 commented on
Jun 17, 2024 • 1 new comment -
[feature request] `torch.to(obj, device)` supporting recursive lists/dicts/tuples of tensors probably by uplifting/promoting `torch.distributed.utils._recursive_to`
#69431 commented on
Jun 17, 2024 • 1 new comment -
[dynamo] Refactor switch statements to improve compile times
#119128 commented on
Jun 17, 2024 • 1 new comment -
xpu: can't build XPU backend without sourcing oneAPI environment variables (/opt/intel/oneapi/setvars.sh)
#127008 commented on
Jun 17, 2024 • 1 new comment -
lr_scheduler()
#127884 commented on
Jun 17, 2024 • 1 new comment -
[Inductor] Generate triton block pointers for discontiguous strided tensors
#125077 commented on
Jun 17, 2024 • 1 new comment -
Conformal Prediction framework to enhance reliability in risk sensitive industrial applications
#128380 commented on
Jun 17, 2024 • 1 new comment -
No factory functions for strided quantized tensors
#74540 commented on
Jun 17, 2024 • 1 new comment -
Cannot get deterministic Mask RCNN without running out of CUDA memory
#120240 commented on
Jun 17, 2024 • 1 new comment -
False INTERNAL ASSERT FAILED bug whilst training Neural Network
#128778 commented on
Jun 17, 2024 • 1 new comment -
Support for one-hot of dtypes besides torch.int64
#53785 commented on
Jun 17, 2024 • 1 new comment -
jacrev and jacfwd yield different results if one uses torch.no_grad blocks in module
#128600 commented on
Jun 17, 2024 • 1 new comment -
[export] `nn.GRU` fails to `torch.export` due to unimplemented operator
#120626 commented on
Jun 17, 2024 • 1 new comment -
binaries/dump_operator_names.cc missing iostream include
#125134 commented on
Jun 17, 2024 • 1 new comment -
Add RMS Norm layer
#128713 commented on
Jun 17, 2024 • 1 new comment -
`torch.sparse.sum` does not support boolean and int when summing over dense dimensions
#122711 commented on
Jun 18, 2024 • 1 new comment -
Don't populate f_locals to check guards
#93753 commented on
Jun 18, 2024 • 1 new comment -
Dynamo Export: Support for mutating module attributes
#123971 commented on
Jun 18, 2024 • 1 new comment -
Quantile is limited to 16 million elements and have poor performance.
#64947 commented on
Jun 19, 2024 • 1 new comment -
Import Error: cannot import name 'XNNPACKQuantizer' from 'torch.ao.quantization.quantizer'
#128114 commented on
Jun 18, 2024 • 1 new comment -
Backward pass over torch.nn.functional.pad is extremely slow with half tensors
#13058 commented on
Jun 19, 2024 • 1 new comment -
torch.triu() may returns wrong values using MPS
#100005 commented on
Jun 19, 2024 • 1 new comment -
Support AMD Ryzen Unified Memory Architecture (UMA)
#107605 commented on
Jun 19, 2024 • 1 new comment -
Dark mode please 🙏🏻
#120407 commented on
Jun 20, 2024 • 1 new comment -
RuntimeError: false INTERNAL ASSERT FAILED at "C:\\actions-runner\\_work\\pytorch\\pytorch\\builder\\windows\\pytorch\\aten\\src\\ATen\\native\\BatchLinearAlgebra.cpp":1538, please report a bug to PyTorch. torch.linalg.lstsq: (Batch element 0): Argument 6 has illegal value. Most certainly there is a bug in the implementation calling the backend library.
#125892 commented on
Jun 20, 2024 • 1 new comment -
How to enable XNNPACK instead of NNPACK/MKLDNN in Windows?
#128414 commented on
Jun 20, 2024 • 1 new comment -
[Reland2] Update NVTX to NVTX3
#109843 commented on
Jun 23, 2024 • 0 new comments -
[BE] enable UFMT for `torch/storage.py`
#127706 commented on
Jun 19, 2024 • 0 new comments -
inspect.signature.bind is not supported
#93760 commented on
Jun 18, 2024 • 0 new comments -
[DONT MERGE][dynamo] Turn on inlining of inbuilt nn modules
#128148 commented on
Jun 23, 2024 • 0 new comments -
torch.Tensor.random_ causes invalid syntax in InternalTorchDynamoError
#121621 commented on
Jun 17, 2024 • 0 new comments -
Attempting to copy from device cpu to device meta, but cross-device copies are not allowed!
#121619 commented on
Jun 17, 2024 • 0 new comments -
[autograd] Support GradientEdge as output for torch.autograd.grad
#127766 commented on
Jun 19, 2024 • 0 new comments -
[PT-D] Relaxed `contract` to allow `Sequence[nn.Module]`
#127773 commented on
Jun 22, 2024 • 0 new comments -
[Dynamo] einsum `ConstantVariable(str: 'i').has_unpack_var_sequence(tx)` returns True
#121551 commented on
Jun 17, 2024 • 0 new comments -
[experiment] batch files
#127787 commented on
Jun 17, 2024 • 0 new comments -
[Intel GPU] Dispatch Stub support
#127860 commented on
Jun 20, 2024 • 0 new comments -
torch.compile + ring attention
#121386 commented on
Jun 17, 2024 • 0 new comments -
Compiling lumiere-pytorch results in ~600 recompiles and cache size exceeded
#121369 commented on
Jun 17, 2024 • 0 new comments -
[AudioLM] Graph break: 'skip function zip_longest
#121348 commented on
Jun 17, 2024 • 0 new comments -
Adds support for accelerated sorting with x86-simd-sort
#127936 commented on
Jun 17, 2024 • 0 new comments -
[AudioLM] Graph break: call_method UserDefinedObjectVariable(dict) get [TorchVariable(<class 'torch.Tensor'>), ConstantVariable(NoneType)]
#121345 commented on
Jun 17, 2024 • 0 new comments -
2 Dynamo test are failing with "Global state changed while dynamo tracing, please report a bug".
#120648 commented on
Jun 17, 2024 • 0 new comments -
[AudioLM] Graph break: call_method UserDefinedObjectVariable(_lru_cache_wrapper)
#121344 commented on
Jun 17, 2024 • 0 new comments -
[Traceable FSDP2] Top of Traceable FSDP2 stack
#128103 commented on
Jun 17, 2024 • 0 new comments -
[autograd] Do not detach when unpacking tensors that do not require grad
#127959 commented on
Jun 22, 2024 • 0 new comments -
Dynamo cannot work with non-classmethod torch_function implementation
#120799 commented on
Jun 17, 2024 • 0 new comments -
[AudioLM] Graph break: const method call float.is_integer
#121334 commented on
Jun 17, 2024 • 0 new comments -
Accuracy mismatch with torch.compile(backend="eager") for float16
#121238 commented on
Jun 17, 2024 • 0 new comments -
DO NOT MERGE: Test ALI runner
#128024 commented on
Jun 18, 2024 • 0 new comments -
Support sum() forward and backward for NJT
#128031 commented on
Jun 21, 2024 • 0 new comments -
fake_tensor.py: annotate types
#128041 commented on
Jun 23, 2024 • 0 new comments -
operator.eq(Tensor, non-tensor-scalar) not handled correctly
#120907 commented on
Jun 17, 2024 • 0 new comments -
dont prune unused symint graphargs from inner subclass tensors
#128045 commented on
Jun 19, 2024 • 0 new comments -
Umbrella issue for PyTorch test suite failures from torch.* returned non-Tensor output unimplemented
#93479 commented on
Jun 18, 2024 • 0 new comments -
[torch.compile] torch._dynamo.exc.TorchRuntimeError: Failed running call_function <method 'numpy' of 'torch._C.TensorBase' objects>(*(FakeTensor(..., size=(32, 3, 64, 64)),), **{})
#124247 commented on
Jun 17, 2024 • 0 new comments -
Support `dynamic=True` in torch._dynamo.explain
#124163 commented on
Jun 17, 2024 • 0 new comments -
Dynamo-based ONNX Export: Failed to produce a graph during tracing as no tensor operations were found.
#123973 commented on
Jun 17, 2024 • 0 new comments -
[Inductor] support masked vectorization for the tail_loop
#126526 commented on
Jun 17, 2024 • 0 new comments -
Use return_and_correct_aliasing() for NJT + compatible storage setting
#126552 commented on
Jun 18, 2024 • 0 new comments -
torch.compiler.disable doesn't disable nested functions (also doesn't work as a context manager)
#123771 commented on
Jun 17, 2024 • 0 new comments -
Dynamo unsupported: Dynamic slicing on data-dependent value is not supported
#123592 commented on
Jun 17, 2024 • 0 new comments -
torch.compile dynamo fails indexing into array from internal mutable state
#123535 commented on
Jun 17, 2024 • 0 new comments -
support setattr of arbitrary user provided types in tracing
#93511 commented on
Jun 18, 2024 • 0 new comments -
Add decomposition for upsample_bicubic2d_backward
#126815 commented on
Jun 18, 2024 • 0 new comments -
dynamo/fx doesn't honor 'non-persistent' buffers
#123411 commented on
Jun 17, 2024 • 0 new comments -
[dynamo] incorrect error traceback for runtime errors when executing dynamo codegen
#123374 commented on
Jun 17, 2024 • 0 new comments -
`@functools.wraps` graph breaks in many cases where we should be able to handle it
#123365 commented on
Jun 17, 2024 • 0 new comments -
[ONNX] Use ExportedProgram in dynamo_exporter 1/n
#127096 commented on
Jun 21, 2024 • 0 new comments -
FakeTensor support of pin_memory
#123252 commented on
Jun 17, 2024 • 0 new comments -
UFMT format on test_fake_tesnor.py test_futures.py test_fx.py
#127369 commented on
Jun 24, 2024 • 0 new comments -
[dynamo] Unsupported calling 'getattr' + 'getitem' on custom class
#122649 commented on
Jun 17, 2024 • 0 new comments -
Recompiles and cache_size_limit from detectron2 CycleBatchNormList
#122578 commented on
Jun 17, 2024 • 0 new comments -
AOTInductor Does Not Recompile when Saving at Same Path Even if Model Definition Changes
#122487 commented on
Jun 17, 2024 • 0 new comments -
[Intel GPU]Enable fp64 double GEMM
#127508 commented on
Jun 19, 2024 • 0 new comments -
[PT2][DTensor] crash during compiling 1D TP or SP on MLP models
#122447 commented on
Jun 17, 2024 • 0 new comments -
`torch.compile` should result in an optimized module where `module.training` is the same as in the unoptimized module
#122414 commented on
Jun 17, 2024 • 0 new comments -
WIP: fake tensor SymInt support
#127596 commented on
Jun 19, 2024 • 0 new comments -
torch.export: Unsupported: call_function args: UserDefinedObjectVariable(BatchEncoding) on Gemma
#122340 commented on
Jun 17, 2024 • 0 new comments -
Deprecate `torch._utils.is_compiling()` and `torch._dynamo.external_utils.is_compiling()`
#127690 commented on
Jun 18, 2024 • 0 new comments -
[BE] sort imports in `torch.utils.data`
#127704 commented on
Jun 19, 2024 • 0 new comments -
[BE] enable UFMT in `torch.utils.data`
#127705 commented on
Jun 19, 2024 • 0 new comments -
38 Dynamo test are failing with "BuiltinVariable.tensor_args() got multiple values for argument 'self'".
#120643 commented on
Jun 17, 2024 • 0 new comments -
Do dynamic rollout for the pull workflow
#128597 commented on
Jun 17, 2024 • 0 new comments -
[cuDNN][cuDNN V8 API] cuDNN Flash-Attention Upstreaming RFC/tracking issue
#113713 commented on
Jun 17, 2024 • 0 new comments -
NotImplementedError: Operator aten.native_layer_norm_backward.default does not have a sharding strategy registered.
#128699 commented on
Jun 17, 2024 • 0 new comments -
Resolve circular dependence between `torch.autograd` and `torch.nn.parameter`
#128633 commented on
Jun 19, 2024 • 0 new comments -
[ONNX][dynamo_export] ONNX::Celu Half unsupported but export passed w/ invalid model when opmath disabled
#113808 commented on
Jun 20, 2024 • 0 new comments -
Enable UFMT on all files in PyTorch
#123062 commented on
Jun 17, 2024 • 0 new comments -
[ts migration] Support aten::tensor, prim::Enter, prim::Exit
#128660 commented on
Jun 17, 2024 • 0 new comments -
Refactor c10::DataPtr by subclassing from c10::detail::UniqueVoidPtr
#128669 commented on
Jun 21, 2024 • 0 new comments -
torch.save and torch.load is slow. Slower than numpy. Slower even than pickle.
#124195 commented on
Jun 17, 2024 • 0 new comments -
[RFC] Add new CPP builder for inductor on pytorch Windows
#124245 commented on
Jun 20, 2024 • 0 new comments -
put split memory block in the front of memory block set when stream and size equal.
#128674 commented on
Jun 17, 2024 • 0 new comments -
CUDA nightly docker actually includes CPU build of torch
#125879 commented on
Jun 17, 2024 • 0 new comments -
Pytorch nightly docker image invalidated layers
#125862 commented on
Jun 17, 2024 • 0 new comments -
xpu: a set of foreach ops not implemented for XPU backend affecting Huggingface examples
#127931 commented on
Jun 17, 2024 • 0 new comments -
Fix typo when using _check_tensor_list
#128697 commented on
Jun 17, 2024 • 0 new comments -
xpu: implement grid_sample op for XPU (fallback to CPU not possible for fp16 and bf16)
#127002 commented on
Jun 17, 2024 • 0 new comments -
Label tracking meta-issue (edit me to get automatically CC'ed on issues! cc bot)
#24422 commented on
Jun 17, 2024 • 0 new comments -
add x/0 gradient behaviour to documentation
#128796 commented on
Jun 17, 2024 • 0 new comments -
Any plans for a "torch.minmax" (min-max normalization) function?
#128785 commented on
Jun 17, 2024 • 0 new comments -
Assign `torch.Generator` in APIs like `torch.randn_like()`
#128786 commented on
Jun 17, 2024 • 0 new comments -
Flaky test page should include retry runs
#128735 commented on
Jun 17, 2024 • 0 new comments -
The name of the function `nn.L1Loss()` should be `nn.MAE()` or the name of the function `MSELoss()` should be `nn.L2Loss()`
#128779 commented on
Jun 17, 2024 • 0 new comments -
Automatically bind all DispatchKey to Python
#124083 commented on
Jun 17, 2024 • 0 new comments -
[inductor][cpu]hf_BigBird AMP multiple thread static/dynamic shape default/CPP wrapper performance regression
#128513 commented on
Jun 19, 2024 • 0 new comments -
Build clang18 image for ASAN tests
#128763 commented on
Jun 17, 2024 • 0 new comments -
[Inductor] matmuls in `test_cuda_cpp_wrapper.py` appear broken on A16/A2
#121562 commented on
Jun 17, 2024 • 0 new comments -
xpu: gradient checkpointing wrongly hits cuda path running on non-cuda devices
#128478 commented on
Jun 17, 2024 • 0 new comments -
Add hash function of std::string_view to torch/csrc/lazy/core/hash.h
#128800 commented on
Jun 20, 2024 • 0 new comments -
Ban or change behavior of TensorVariable.size
#120568 commented on
Jun 17, 2024 • 0 new comments -
botorch dynamo errors
#93633 commented on
Jun 18, 2024 • 0 new comments -
Link with MKL::MKL instead of MKL_LIBRARIES
#128195 commented on
Jun 17, 2024 • 0 new comments -
Dynamo support dataclasses with default_factory=list
#120108 commented on
Jun 17, 2024 • 0 new comments -
Set seed per sample for OpInfo tests + support for restricting to a single sample input
#128238 commented on
Jun 22, 2024 • 0 new comments -
6 Dynamo test are failing with "torch.utils.checkpoint: trying to save more tensors during recomputation than during the original forward pass.".
#119794 commented on
Jun 17, 2024 • 0 new comments -
9 Dynamo test are failing with "Failed running call_function <function interpolate at 0xDEADBEEF".
#119790 commented on
Jun 17, 2024 • 0 new comments -
[1/N] Change #include <c10/util/Optional.h> to #include <optional>
#128301 commented on
Jun 18, 2024 • 0 new comments -
10 Dynamo test are failing with "GetAttrVariable(NumpyVariable(), __name__) is not a constant".
#119789 commented on
Jun 17, 2024 • 0 new comments -
17 Dynamo test are failing with "Failed running call_function <function embedding_bag at 0xDEADBEEF".
#119786 commented on
Jun 17, 2024 • 0 new comments -
[Don't merge] Try to restructure code
#128330 commented on
Jun 19, 2024 • 0 new comments -
add TORCH_FORCE_SYNCHRONOUS_COLLECTIVES to force functional collectives to be synchronous
#128331 commented on
Jun 18, 2024 • 0 new comments -
63 Dynamo test are failing with "'QuantizationConfig' object has no attribute '__bool__'".
#119782 commented on
Jun 17, 2024 • 0 new comments -
78 Dynamo test are failing with "somehow causing hanging during python shutdown".
#119781 commented on
Jun 17, 2024 • 0 new comments -
110 Dynamo test are failing with "Failed running call_function <built-in method sparse_coo_tensor of type object at 0xDEADBEEF".
#119780 commented on
Jun 17, 2024 • 0 new comments -
[56+] Graph-break if we try to Fakeify an "unknown" Tensor with no data_ptr.
#119695 commented on
Jun 17, 2024 • 0 new comments -
torch._dynamo.exc.Unsupported: call_function args: UserDefinedObjectVariable(EasyDict)
#120219 commented on
Jun 17, 2024 • 0 new comments -
Add warpSize to Device properties
#128449 commented on
Jun 19, 2024 • 0 new comments -
Deprecate unsupported types in operator registration
#124863 commented on
Jun 20, 2024 • 0 new comments -
[pipelining] lazy shape inference for manual
#128527 commented on
Jun 17, 2024 • 0 new comments -
Report sizes/strides of input argument that raised an error
#119396 commented on
Jun 17, 2024 • 0 new comments -
Preserve storage size when generating functional tensor
#128546 commented on
Jun 17, 2024 • 0 new comments -
[traced-graph][sparse] propagate compressed sparsity (WIP)
#128549 commented on
Jun 18, 2024 • 0 new comments -
Dynamo does not support user-defined objects that define custom __new__
#119203 commented on
Jun 17, 2024 • 0 new comments -
Fast path detach()/alias() in FakeTensor
#128281 commented on
Jun 20, 2024 • 0 new comments -
dynamo graph breaks on DTensor.to_local(grad_placements=grad_placements)
#119023 commented on
Jun 17, 2024 • 0 new comments -
[hierarchical compilation] A way to designate a portion of torch.compile as a noinline block that is compiled/guarded separately, but less disruptive than a graph break (e.g., for loops)
#118966 commented on
Jun 17, 2024 • 0 new comments -
vmap fails to call torch.compiled function
#128711 commented on
Jun 17, 2024 • 0 new comments -
[pytree] add APIs to determine a class is a namedtuple or PyStructSequence
#113257 commented on
Jun 22, 2024 • 0 new comments -
[dynamo] we do not instantiate guards for ambient autocast mode
#112260 commented on
Jun 18, 2024 • 0 new comments -
Dynamo Compile samples should record file/line that raised exception
#111674 commented on
Jun 18, 2024 • 0 new comments -
[pytree] traverse `dict` in sorted key ordering
#114947 commented on
Jun 22, 2024 • 0 new comments -
[POC][pytree] test flattening dict in sorted order
#115014 commented on
Jun 22, 2024 • 0 new comments -
Automated submodule update: FBGEMM
#115316 commented on
Jun 24, 2024 • 0 new comments -
Custom `ModuleDict.__getitem__(key: tuple)` produces a graph break
#111551 commented on
Jun 18, 2024 • 0 new comments -
[pytree] update treespec dict keys access
#116372 commented on
Jun 22, 2024 • 0 new comments -
torch.compile support for SeamlessExpressivity/SeamlessM4T in fairseq2
#114373 commented on
Jun 18, 2024 • 0 new comments -
[pytree] make `context` and `children_specs` as private implementation details
#116375 commented on
Jun 22, 2024 • 0 new comments -
[PT2] Compile Cold Start - Async JIT compile with Eager fallback
#114346 commented on
Jun 18, 2024 • 0 new comments -
[1/N] Elimates c10::to_string and other STL string workarounds
#116571 commented on
Jun 22, 2024 • 0 new comments -
torch._dynamo.exc.Unsupported: call_method UserDefinedObjectVariable(FrozenDict) __contains__ [ConstantVariable(str)] {}
#114202 commented on
Jun 18, 2024 • 0 new comments -
Unexpected `None` value for stream with dynamo
#114105 commented on
Jun 18, 2024 • 0 new comments -
WIP Add 3D channels last tensor iterator support
#118377 commented on
Jun 23, 2024 • 0 new comments -
[MPS] Add SDPA implentation
#119200 commented on
Jun 19, 2024 • 0 new comments -
[ONNX] stft export fails with dynamo_export
#113067 commented on
Jun 21, 2024 • 0 new comments -
Implement Variable Tracker for Dataclasses
#113670 commented on
Jun 18, 2024 • 0 new comments -
[FSDP2] Eager-Mode Execution Tracker
#120003 commented on
Jun 21, 2024 • 0 new comments -
[dynamo,torch_function] __torch_function__ does not respect kwargs
#117971 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] Assigning result of Tensor in-place op destroys mutation tracking
#113271 commented on
Jun 18, 2024 • 0 new comments -
(WIP) to_padded_tensor() triton kernel for NJT
#121947 commented on
Jun 18, 2024 • 0 new comments -
[draft] python 3.13 test
#121979 commented on
Jun 21, 2024 • 0 new comments -
[dynamo] self-assigning operation causes `TensorVariable` to lose `mutable_local`, thus causing its attribute mutations to be untracked
#113160 commented on
Jun 18, 2024 • 0 new comments -
torch._dynamo.exc.InternalTorchDynamoError: DeviceMeshVariable() has no type
#117042 commented on
Jun 18, 2024 • 0 new comments -
torch._dynamo.exc.InternalTorchDynamoError: ListIteratorVariable() has no type
#117026 commented on
Jun 18, 2024 • 0 new comments -
Decompositions for upsample linear backward
#123222 commented on
Jun 18, 2024 • 0 new comments -
torch.compile fullgraph=True is failing for GPTJ model for toy_backend
#116835 commented on
Jun 18, 2024 • 0 new comments -
[Dynamo][DeepSpeed] torch._dynamo.exc.InternalTorchDynamoError: NestedUserFunctionVariable() has no type
#116766 commented on
Jun 18, 2024 • 0 new comments -
[dtensor][compile] assertion on placements causing trouble with torch.compile
#116712 commented on
Jun 18, 2024 • 0 new comments -
[ONNX] None as input to `aten::index_put` unsupported
#119363 commented on
Jun 21, 2024 • 0 new comments -
[ONNX] Support Fake Tensor Mode on new Dynamo based ONNX exporter
#105464 commented on
Jun 21, 2024 • 0 new comments -
[dynamo] Diffusers - Graph break on OrderedDict
#102878 commented on
Jun 18, 2024 • 0 new comments -
ONNX export fails for aten::full_like op when exporting UDOP model from transformers
#122898 commented on
Jun 21, 2024 • 0 new comments -
[ONNX] export() with dynamic shapes fails where dynamo_export(dynamic_shapes=True) succeeds
#126607 commented on
Jun 21, 2024 • 0 new comments -
[ONNX] beartype discovers previously undiscovered type annotation errors
#123203 commented on
Jun 21, 2024 • 0 new comments -
Dynamo should only unroll loops by a preset factor (unless otherwise explicitly instructed)
#102839 commented on
Jun 18, 2024 • 0 new comments -
[inlined-inbuilt-nn-modules][dynamo][BE] Revisit call_method of NNModuleVariable
#102063 commented on
Jun 18, 2024 • 0 new comments -
[Dynamo] Can't inline functions under torch.nn.parallel
#101609 commented on
Jun 18, 2024 • 0 new comments -
[Dynamo] TB hf_Reformer graph breaks
#101154 commented on
Jun 18, 2024 • 0 new comments -
Stop importing HuggingFace transformers in DataClassVariable
#100386 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] Investigate interop issues with torch_scatter/torch_sparse/pyg_lib
#111223 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] Add asserts to prevent user defined objects/classes from going into ConstantVariable
#110871 commented on
Jun 18, 2024 • 0 new comments -
torch._dynamo.exc.Unsupported: unexpected sourceless type bases: (<class 'torchrec.streamable.Pipelineable'>,)
#110315 commented on
Jun 18, 2024 • 0 new comments -
moco: torch._dynamo.exc.Unsupported: hasattr: TensorVariable()
#109895 commented on
Jun 18, 2024 • 0 new comments -
[DDP + Dynamo] Tracing DDP AllReduce (Compiled DDP)
#109774 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] torch._dynamo.exc.Unsupported: comparison SymNodeVariable() <built-in function is_> ListVariable()
#109504 commented on
Jun 18, 2024 • 0 new comments -
Support the `ExitStack` context manager (or a simplified version)
#109309 commented on
Jun 18, 2024 • 0 new comments -
'make html' will print 'duplicate object description' warnings when there are 1~5 CPUs in the running machine
#128495 commented on
Jun 22, 2024 • 0 new comments -
[RFC] Add third-party malloc library to improve pytorch memory performance on Windows
#102534 commented on
Jun 22, 2024 • 0 new comments -
Dynamo Swallowing Exception In Lambda
#108798 commented on
Jun 18, 2024 • 0 new comments -
ONNX Export - miscompilation for complex-valued operators
#113444 commented on
Jun 21, 2024 • 0 new comments -
__torch_dispatch__ + compile: extra guards
#114405 commented on
Jun 18, 2024 • 0 new comments -
`torch.cuda.is_bf16_compatible()` output inconsistent with with TorchInductor support
#118122 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] Implement enumerate fallback as polyfill
#112794 commented on
Jun 18, 2024 • 0 new comments -
torch._dynamo.export raises Unexpected type in sourceless builder <class 'nemo.core.neural_types.elements.VoidType'> for torchaudio model
#112745 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] Implement iter fallback (and possibly all iters/generators) as polyfill
#112727 commented on
Jun 18, 2024 • 0 new comments -
[Tracking] Follow ups for itertools infinite iterators
#112532 commented on
Jun 18, 2024 • 0 new comments -
[test/dynamo] BE: cleanup `test_misc.py`
#112344 commented on
Jun 18, 2024 • 0 new comments -
`set` of enums produces a graph break (no repro)
#112338 commented on
Jun 18, 2024 • 0 new comments -
Automated submodule update: kineto
#106149 commented on
Jun 21, 2024 • 0 new comments -
'FakeRootModule' object has no attribute 'self___aot_engines_0_short_term_memories_list_0_0_0'
#128251 commented on
Jun 18, 2024 • 0 new comments -
torch.compile Jamba: long compilation time with backend="eager"
#128153 commented on
Jun 18, 2024 • 0 new comments -
torch.compile with Custom tensor subclass doesn't inline the tensor subclass methods
#128149 commented on
Jun 18, 2024 • 0 new comments -
[inductor] Graph breaks in CohereForAI/aya-23-8b
#128095 commented on
Jun 18, 2024 • 0 new comments -
Verify that guards are well formed before concluding that Dynamo complication has succeeded
#128090 commented on
Jun 18, 2024 • 0 new comments -
[User Empathy Day 2] non-deterministic recompiles for ChatTTS model
#128074 commented on
Jun 18, 2024 • 0 new comments -
Map with multiple arguments not supported in Dynamo and causes graph breaks
#128072 commented on
Jun 18, 2024 • 0 new comments -
[user empathy day 2][based] torch.compile issues
#128071 commented on
Jun 18, 2024 • 0 new comments -
Dynamo Graph break in Unsupported: call_method ConstDictVariable()
#128067 commented on
Jun 18, 2024 • 0 new comments -
Add MaskedTensor passthrough: unfold, F.Unfold, F.Fold, stack
#125262 commented on
Jun 17, 2024 • 0 new comments -
Invalidate StorageImpl instances when tensor is overwritten with cudagraphs
#125264 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] DAC: 'AudioSignal' object has no attribute 'sample_rate'
#128065 commented on
Jun 18, 2024 • 0 new comments -
[Dynamo] torch.cuda.device context manager doesn't work
#128059 commented on
Jun 18, 2024 • 0 new comments -
Add line number to ` _warn_capture_scalar_outputs():`
#127667 commented on
Jun 18, 2024 • 0 new comments -
[FX] Refactor immutable collections implementation
#125470 commented on
Jun 22, 2024 • 0 new comments -
[inline-inbuilt-nn-modules] tensordict functional calls with nn.Module silently gives the wrong (non-functional) result
#127173 commented on
Jun 18, 2024 • 0 new comments -
Requesting dynamo support for fraction.Fraction
#126917 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] Handle inplace op aliasing errors
#126474 commented on
Jun 18, 2024 • 0 new comments -
[AOTI][not for review] Test cpp_wrapper mode
#125733 commented on
Jun 20, 2024 • 0 new comments -
torch.export 'inline in skipfiles: Signature.bind | bind /usr/lib/python3.10/inspect.py, skipped according trace_rules.lookup SKIP_DIRS'
#126242 commented on
Jun 18, 2024 • 0 new comments -
[inline-inbuilt-nn-modules] dynamo recompiles identical layers when they have (identical) hooks
#125836 commented on
Jun 18, 2024 • 0 new comments -
[Dynamo] Support tracing through _get_current_dispatch_mode_stack
#125694 commented on
Jun 18, 2024 • 0 new comments -
[inductor] Enable FX graph caching in OSS by default
#125863 commented on
Jun 21, 2024 • 0 new comments -
[autograd.Function] freevar lifting is too aggressive?
#106894 commented on
Jun 18, 2024 • 0 new comments -
torch._dynamo.allow_in_graph seems to silently no-op on staticmethods
#124735 commented on
Jun 18, 2024 • 0 new comments -
NJT <-> padded dense conversions
#125947 commented on
Jun 19, 2024 • 0 new comments -
inductor creates unnecessary buffers
#124653 commented on
Jun 18, 2024 • 0 new comments -
[dynamo][inlining-inbuilt-nn-modules] decide how dynamo/export should handle parametrizations
#124524 commented on
Jun 18, 2024 • 0 new comments -
Dynamo handling for all methods of torch.Generator
#88576 commented on
Jun 18, 2024 • 0 new comments -
Enable UFMT format on test/quantization
#126152 commented on
Jun 24, 2024 • 0 new comments -
[wip][inductor] move loop ordering after fusion
#126254 commented on
Jun 18, 2024 • 0 new comments -
Dynamo fails to track dataclass
#116264 commented on
Jun 18, 2024 • 0 new comments -
torch.compile(fullgraph=True): can't pass lambdas to hooks?
#116220 commented on
Jun 18, 2024 • 0 new comments -
Add auto-tuning for sparse semi-structured MM operator
#123742 commented on
Jun 23, 2024 • 0 new comments -
[Dynamo] bytecode transformed by Dynamo is not serializable by marshal
#116013 commented on
Jun 18, 2024 • 0 new comments -
torch._dynamo.exc.Unsupported: SETUP_WITH UserDefinedObjectVariable(TorchAutocast)
#115520 commented on
Jun 18, 2024 • 0 new comments -
torch.compile() breaks when using DeepSpeed ZeRO Level 3 sharding
#115484 commented on
Jun 18, 2024 • 0 new comments -
[Tracker] Move nested tensors to beta
#112398 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] Format string with __class__
#118675 commented on
Jun 18, 2024 • 0 new comments -
[dynamo][recompilation] test_set_get_descriptor
#118563 commented on
Jun 18, 2024 • 0 new comments -
Dynamo x autograd.Function: graph breaks on all the staticmethods on autograd.Function
#118397 commented on
Jun 18, 2024 • 0 new comments -
Make it more obvious when Dynamo is triggering on unexpected frames
#118262 commented on
Jun 18, 2024 • 0 new comments -
TorchDynamo mistranslates end of tensor slice
#118227 commented on
Jun 18, 2024 • 0 new comments -
streams x torch.compile: stream is treated as None sometimes
#118204 commented on
Jun 18, 2024 • 0 new comments -
Dynamo CI Shard naming proposal
#118127 commented on
Jun 18, 2024 • 0 new comments -
Dynamo: assert "source" in options and options["source"] is not None for default_generator.set_state call
#118072 commented on
Jun 18, 2024 • 0 new comments -
Supporting custom attributes with `__torch_function__` tensor subclasses
#117806 commented on
Jun 18, 2024 • 0 new comments -
Can't call allow_in_graph inside of a function being torch.compile'd
#103615 commented on
Jun 18, 2024 • 0 new comments -
Pt2 - Discussion around user defined type->behavior dispatching
#117321 commented on
Jun 18, 2024 • 0 new comments -
[dynamo][inline-inbuilt-nn-modules]torch.compile silently incorrect with full_backward_pre_hook
#117265 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] AssertionError for custom iterable nn.Module
#103831 commented on
Jun 18, 2024 • 0 new comments -
xpu: set of unimplemented ops affect huggingface examples performance
#127941 commented on
Jun 18, 2024 • 0 new comments -
tts_angular: fail_to_run, torch._dynamo.exc.Unsupported: call_method NNModuleVariable() flatten_parameters [] {}
#105532 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] calling __torch_function__ with dynamically created subclass of torch.Tensor fails compilation
#107143 commented on
Jun 18, 2024 • 0 new comments -
Extend dict and by extension __dict__ modeling in dynamo to support `setdefault`, `get`
#107054 commented on
Jun 18, 2024 • 0 new comments -
AssertionError: <class 'torch._dynamo.variables.torch.TorchInGraphFunctionVariable'> when compiling `torch.nn.functional.layer_norm`
#128797 commented on
Jun 18, 2024 • 0 new comments -
Dynamo: contextlib.contextmanager doesn't work
#128651 commented on
Jun 18, 2024 • 0 new comments -
Extend Dynamo support for arbitrary context managers
#128650 commented on
Jun 18, 2024 • 0 new comments -
Migrate linux-jammy-py3-clang12-mobile-build to ARC
#124605 commented on
Jun 17, 2024 • 0 new comments -
Migrate linux-jammy-cuda-11_8-cudnn8-py3_8-clang12-build to ARC
#124606 commented on
Jun 17, 2024 • 0 new comments -
torch._dynamo.exc.Unsupported: call_method GetAttrVariable(UnspecializedNNModuleVariable(CenterCrop), _transformed_types) __iter__ () {}
#128417 commented on
Jun 18, 2024 • 0 new comments -
[dynamo] Recompilation on a counter-like attribute of nn module
#128319 commented on
Jun 18, 2024 • 0 new comments