-
Notifications
You must be signed in to change notification settings - Fork 5.5k
Insights: ray-project/ray
Overview
Could not load contribution data
Please try again later
1 Release published by 1 person
-
ray-2.32.0 Ray-2.32.0
published
Jul 10, 2024
102 Pull requests merged by 28 people
-
[doc] correct a typo in the parameter server example
#45927 merged
Jul 12, 2024 -
[ci] fix horovod master test
#46602 merged
Jul 12, 2024 -
[test] fix test_pip_with_env_vars for python 3.12 support (#46591)
#46605 merged
Jul 12, 2024 -
[test] fix test_pip_with_env_vars for python 3.12 support
#46591 merged
Jul 12, 2024 -
[ci] split window serve jobs into two
#46595 merged
Jul 12, 2024 -
[RLlib] Fix
sample_timeout_s
setting being ignored by PPO/DQN/SAC/... on old API stack.#46594 merged
Jul 12, 2024 -
[test] disable test_runtime_env_cache_with_pip_check on python 3.12 (#46581)
#46600 merged
Jul 12, 2024 -
[Docs][Kuberay] Create an example doc for Modin
#46345 merged
Jul 12, 2024 -
[test] runtime env pip version pinned to 24.1.2 for python 3.12 support
#46584 merged
Jul 12, 2024 -
[image] upgrade to 22.04 for real
#46589 merged
Jul 12, 2024 -
[test] disable test_runtime_env_cache_with_pip_check on python 3.12
#46581 merged
Jul 12, 2024 -
setup-dev.py: remove outdated dashboard link
#46585 merged
Jul 12, 2024 -
[data] fix doc build
#46587 merged
Jul 12, 2024 -
Update shipped dependencies for 2.32.0 pre cut
#46558 merged
Jul 11, 2024 -
[Docs] Fix dark mode table visibility issues
#46203 merged
Jul 11, 2024 -
[test] skip test_run_in_virtualenv in py312
#46578 merged
Jul 11, 2024 -
[test] runtime env pip version pinned to 24.1.2 for python 3.12 support
#46574 merged
Jul 11, 2024 -
[lint] fix : add extra line between function
#46582 merged
Jul 11, 2024 -
[Data] Deprecate
Dataset.get_internal_block_refs()
#46455 merged
Jul 11, 2024 -
[data][doc] auto-generate dataset api documentation
#46557 merged
Jul 11, 2024 -
[Data] Replace lambda mutable default arguments
#46493 merged
Jul 11, 2024 -
[core][logging] Add a unix timestamp for the structured logger
#46074 merged
Jul 11, 2024 -
[ci] deflake serve window tests
#46560 merged
Jul 11, 2024 -
docs: update ray summit banner text
#46542 merged
Jul 11, 2024 -
[ci] deflake //python/ray/tests:test_implicit_resource
#46572 merged
Jul 11, 2024 -
[test] skip test_run_in_virtualenv in py312
#46577 merged
Jul 11, 2024 -
[Data] Add unit to progress bars
#46432 merged
Jul 11, 2024 -
[Serve] Fix return type of
_DeploymentHandleBase._get_or_create_router
#46480 merged
Jul 11, 2024 -
[release auto] block pre-release jobs when RAY_VERSION is specified
#46568 merged
Jul 11, 2024 -
[Core] Replace lambda default arguments
#46554 merged
Jul 11, 2024 -
Update starting-ray.rst to fix broken link
#46475 merged
Jul 11, 2024 -
[Serve] Add status code to retry when request timed out
#46527 merged
Jul 11, 2024 -
remove "dask-expr" from requirements_compiled.txt
#46566 merged
Jul 11, 2024 -
docs: update GTM tag
#46549 merged
Jul 11, 2024 -
compile dependencies with python 3.12
#46561 merged
Jul 11, 2024 -
[core] Add 1s timeout in RPC to CoreWorkerService.NumPendingTasks in GcsJobManager::HandleGetAllJobInfo
#46335 merged
Jul 11, 2024 -
upgrade cuda
#46555 merged
Jul 11, 2024 -
Update shipped dependencies for 2.32.0
#46559 merged
Jul 11, 2024 -
[Data] Set for better performance in loop
#46541 merged
Jul 11, 2024 -
[doc][api] fix check for auto-generated api docs
#46543 merged
Jul 11, 2024 -
[Data] Rename
_estimated_output_blocks
to_estimated_num_output_bundles
#46547 merged
Jul 10, 2024 -
[data] lint fixes on unnecessary comprehension
#46463 merged
Jul 10, 2024 -
[Data] Remove dead
InputDataBuffer._set_num_output_blocks
#46546 merged
Jul 10, 2024 -
[Data] Add
snowflake-connector-python
to test requirements#46544 merged
Jul 10, 2024 -
[Doc] Fix security doc link
#46352 merged
Jul 10, 2024 -
[doc][api] recursively get all rsts from the head file
#46530 merged
Jul 10, 2024 -
[Data] Add
decord
to test dependencies#46526 merged
Jul 10, 2024 -
[core] Add more methods to GcsClient Accessors.
#46359 merged
Jul 10, 2024 -
[Data] Update
Dataset.count()
to avoid unnecessarily keepingBlockRef
s in-memory#46369 merged
Jul 10, 2024 -
[spark] Fix Ray on Spark fractional GPU error
#46443 merged
Jul 10, 2024 -
[serve] deflake test_serve_dashboard
#46535 merged
Jul 10, 2024 -
[Core] Fix structured logging to use explicit value (#46523)
#46534 merged
Jul 10, 2024 -
Add perf metrics for 2.32.0
#46478 merged
Jul 10, 2024 -
[dashboard] revert back to pushd and popd for win build
#46532 merged
Jul 10, 2024 -
[ci] fix bazel version in window flaky tests
#46533 merged
Jul 10, 2024 -
[core] fix structured logging to use explicit value
#46523 merged
Jul 10, 2024 -
[core][experimental] Check whether the channel is closed for the shared memory write operation
#46508 merged
Jul 10, 2024 -
[core][experimental] Support multiple readers for IntraProcessChannel
#46431 merged
Jul 10, 2024 -
[ci] split window flaky tests into smaller jobs
#46531 merged
Jul 10, 2024 -
[Release] Use with clause to make sure result json file is closed properly
#46484 merged
Jul 10, 2024 -
[ci] do not block windows on release automation run
#46520 merged
Jul 10, 2024 -
Revert https://github.com/ray-project/ray/pull/46393
#46517 merged
Jul 10, 2024 -
[image] fix manual docker image building script
#46490 merged
Jul 10, 2024 -
[dashboard] remove symbolic link
#46461 merged
Jul 10, 2024 -
[Core] Rename state_ts to state_ts_ns to indicate that the unit is nanosecond
#46518 merged
Jul 10, 2024 -
[github action] limit branches of pr sync hook
#46487 merged
Jul 9, 2024 -
[ci] promote kevin to reviewer
#46514 merged
Jul 9, 2024 -
[wheel] exclude 3rd-party
__pycache__
andtests
dir#46405 merged
Jul 9, 2024 -
[doc] buildifier format doc/BUILD
#46458 merged
Jul 9, 2024 -
[core] add "last exception" to error message when GCS connection fails in ray.init().
#46516 merged
Jul 9, 2024 -
Revert "docs: warn about running multiple local Ray instances (#45836)"
#46515 merged
Jul 9, 2024 -
[core] catch exception in async_callback (#46488)
#46519 merged
Jul 9, 2024 -
[Dashboard] Correct the event time of failed tasks
#46439 merged
Jul 9, 2024 -
[rllib] only run rllib gpu tests on rllib changes
#46509 merged
Jul 9, 2024 -
[core] catch exception in async_callback
#46488 merged
Jul 9, 2024 -
[Data] Prevent
from_pandas
from combining input blocks#46363 merged
Jul 9, 2024 -
[Core] Task status should start with PENDING_ARGS_AVAIL when retry
#46494 merged
Jul 9, 2024 -
docs: warn about running multiple local Ray instances
#45836 merged
Jul 9, 2024 -
[ci] remove nvidia-disable-check
#46491 merged
Jul 9, 2024 -
Revert "[Data] Change offsets to int64 and change to LargeList for ArrowTensorArray"
#46511 merged
Jul 9, 2024 -
Add section on launching clusters on Kubernetes to Getting Started
#46149 merged
Jul 9, 2024 -
[core] move cluster_id to GcsClientOptions, and sends RPC to get it if absent.
#46358 merged
Jul 9, 2024 -
[rllib] skip rllib gpu test on premerge
#46504 merged
Jul 9, 2024 -
[RLlib] - Enable multi-learner setup for hybrid stack BC
#46436 merged
Jul 9, 2024 -
[Data] Change offsets to int64 and change to LargeList for ArrowTensorArray
#45352 merged
Jul 9, 2024 -
[dashboard] apply isort to dashboard dir
#46483 merged
Jul 8, 2024 -
[Core] Add object back to memory store when object recovery is skipped
#46460 merged
Jul 8, 2024 -
CI: enable jemalloc in release tests
#46393 merged
Jul 8, 2024 -
[doc] update read_csv api comment
#46466 merged
Jul 8, 2024 -
[Doc] Update visibility of the search button
#46237 merged
Jul 8, 2024 -
[release auto] fix bash styling stuff in osx wheel verify script
#46471 merged
Jul 8, 2024 -
[Serve] Avoid looping over all snapshot ids for each long poll request
#45881 merged
Jul 8, 2024 -
docs: setup unified GTM
#46290 merged
Jul 8, 2024 -
[doc] drop ray-ml reference in runtime env
#46388 merged
Jul 8, 2024 -
[RLlib] Replace all
Mapping
typehints withDict
.#46474 merged
Jul 8, 2024 -
[RLlib] Enable complex action spaces with stateful modules.
#46468 merged
Jul 8, 2024 -
[RLlib] Cleanup examples folder vol19: Add example script for custom loss function (new API stack).
#46445 merged
Jul 8, 2024 -
[release] change version to 2.32.0
#46470 merged
Jul 8, 2024 -
[ray client] lint: do not use comprehension
#46465 merged
Jul 7, 2024 -
[ray client] use is / is not on type check
#46464 merged
Jul 7, 2024 -
[release-auto] smartly detect ray version and commit
#46456 merged
Jul 7, 2024
29 Pull requests opened by 21 people
-
[doc test] tests hashes of external examples
#46457 opened
Jul 6, 2024 -
[Doc] Add Algolia search to docs
#46477 opened
Jul 8, 2024 -
[Serve/Logs] Break serve status details across multiple lines
#46486 opened
Jul 8, 2024 -
Accelerated DAG: Support channel writes larger than the max gRPC payload size
#46498 opened
Jul 9, 2024 -
python/ray/autoscaler/gcp/*.yaml: change scheduling from dict to list
#46500 opened
Jul 9, 2024 -
[RLlib] Optimize rnn_sequencing performance
#46502 opened
Jul 9, 2024 -
Bump zipp from 3.17.0 to 3.19.1 in /python
#46513 opened
Jul 9, 2024 -
[autoscaler][aws] Fix replace cloudwatch alarm config
#46537 opened
Jul 10, 2024 -
[Core]Fix the issue of actor tasks hanging during resubmission
#46539 opened
Jul 10, 2024 -
[Data] [1/n] Fix dependency related Ray Data tests for Python 3.12
#46545 opened
Jul 10, 2024 -
[Data] Enable streaming json read
#46550 opened
Jul 10, 2024 -
[dont review] release 2.10.0+pinterest1
#46551 opened
Jul 10, 2024 -
[RLlib] Replace lambda default argument
#46553 opened
Jul 10, 2024 -
[dashboard] don't prepend "0" or "1" on non streaming logs
#46556 opened
Jul 11, 2024 -
[Core] Fix ObjectFetchTimedOutError
#46562 opened
Jul 11, 2024 -
[Ray Dashboard] make memory profile href link relative path
#46564 opened
Jul 11, 2024 -
soloved ScalingConfig issue
#46565 opened
Jul 11, 2024 -
[WIP][core][experimental][1/n] Readers across different nodes
#46567 opened
Jul 11, 2024 -
Fix mlflow artifact logging
#46570 opened
Jul 11, 2024 -
[Train] Replace lambda default arguments
#46576 opened
Jul 11, 2024 -
Fix return type annotation for parse_placement_group_resource_str.
#46583 opened
Jul 11, 2024 -
[util] remove pygloo support
#46590 opened
Jul 12, 2024 -
[Tune] Replace lambda default arguments
#46596 opened
Jul 12, 2024 -
docs: update readme to include debugger
#46597 opened
Jul 12, 2024 -
small ui fixes to the serve page
#46599 opened
Jul 12, 2024 -
[Data] Allow unknown estimate of operator output bundles and `ProgressBar` totals
#46601 opened
Jul 12, 2024 -
[ADAG] Fix DAG input
#46604 opened
Jul 12, 2024 -
[Doc] Switch to RTD search
#46606 opened
Jul 12, 2024
52 Issues closed by 13 people
-
CI test linux:https://rllib:learning_tests_multi_agent_cartpole_appo_gpu is flaky
#46316 closed
Jul 13, 2024 -
CI test windows:https://python/ray/tests:test_actor_retry is flaky
#43845 closed
Jul 12, 2024 -
CI test linux:https://rllib:learning_tests_cartpole_appo_fake_gpus_old_api_stack is consistently_failing
#45402 closed
Jul 12, 2024 -
CI test linux:https://python/ray/dashboard:test_serve_dashboard is flaky
#46459 closed
Jul 12, 2024 -
CI test windows:https://python/ray/serve/tests:test_model_composition is flaky
#46449 closed
Jul 12, 2024 -
Ray Rllib+Tune does not check if the stopping condition metric exist
#46593 closed
Jul 12, 2024 -
Release test microbenchmark_unstable failed
#46291 closed
Jul 12, 2024 -
CI test linux:https://rllib:examples/offline_rl/pretrain_bc_single_agent_evaluate_as_multi_agent is flaky
#46338 closed
Jul 11, 2024 -
[Docs] First time users table is not easily visible in dark mode
#44072 closed
Jul 11, 2024 -
CI test windows:https://python/ray/tests:test_implicit_resource is consistently_failing
#43849 closed
Jul 11, 2024 -
CI test windows:https://python/ray/serve/tests:test_regression is consistently_failing
#46450 closed
Jul 11, 2024 -
CI test windows:https://python/ray/serve/tests:test_metrics is consistently_failing
#45843 closed
Jul 11, 2024 -
CI test windows:https://python/ray/serve/tests:test_long_poll is consistently_failing
#46044 closed
Jul 11, 2024 -
CI test windows:https://python/ray/serve/tests:test_kv_store is consistently_failing
#46042 closed
Jul 11, 2024 -
CI test windows:https://python/ray/serve/tests:test_enable_task_events is consistently_failing
#46026 closed
Jul 11, 2024 -
CI test windows:https://python/ray/serve/tests:test_deploy_app is consistently_failing
#46448 closed
Jul 11, 2024 -
[Ray Core] Add a `timestamp_ns` for the structured logger
#46050 closed
Jul 11, 2024 -
[Data] Unclear what "it" represents in progress bars
#46383 closed
Jul 11, 2024 -
[Core] Getting the job list from gcs might hang, causing /api/jobs to time out.
#40922 closed
Jul 11, 2024 -
CI test windows:https://python/ray/tests:test_reference_counting_2 is flaky
#45964 closed
Jul 11, 2024 -
[Data] Inefficient iteration over Arrow table columns
#46482 closed
Jul 11, 2024 -
【Ray doc】Can't access ray-security doc
#46350 closed
Jul 10, 2024 -
Ray on Spark fractional GPU error calling setup_ray_cluster
#39537 closed
Jul 10, 2024 -
Release test single_node_oom.aws failed
#46536 closed
Jul 10, 2024 -
[core][experimental] Support multiple readers for IntraProcessChannel
#46469 closed
Jul 10, 2024 -
Python 3.12: test `python/ray/tests/test_logging_2.py::TestTextModeE2E` failed
#46522 closed
Jul 10, 2024 -
Ray + pynndescent (numba) compatibility issue
#44714 closed
Jul 10, 2024 -
CI test linux:https://python/ray/data:test_streaming_integration is flaky
#43481 closed
Jul 10, 2024 -
Failed to start the dashboard , return code 1
#45871 closed
Jul 10, 2024 -
Ray installation in Azure fail for timeout
#44525 closed
Jul 10, 2024 -
[Core] ray.init() stuck at "Started a local Ray instance."
#37373 closed
Jul 10, 2024 -
CI test linux:https://rllib:examples/learners/custom_loss_fn_simple is flaky
#46510 closed
Jul 9, 2024 -
CI: `test_on_completed_callback_refcount` failed on Python 3.12
#46452 closed
Jul 9, 2024 -
Release test torch_batch_inference_1_gpu_10gb_parquet.aws failed
#46495 closed
Jul 9, 2024 -
Release test torch_batch_inference_16_gpu_300gb_parquet.aws failed
#46496 closed
Jul 9, 2024 -
Release test stable_diffusion_benchmark.aws failed
#46499 closed
Jul 9, 2024 -
[docs] [kuberay] Add instructions and link to RayClusters Quickstart in top level Getting Started
#42129 closed
Jul 9, 2024 -
Release test rllib_learning_tests_impala_old_api_stack_tf.aws failed
#44551 closed
Jul 9, 2024 -
Release test rllib_learning_tests_sac_tf.aws failed
#44550 closed
Jul 9, 2024 -
Release test rllib_learning_tests_appo_old_api_stack_tf.aws failed
#44549 closed
Jul 9, 2024 -
Release test rllib_learning_tests_ppo_old_stack_tf.aws failed
#44548 closed
Jul 9, 2024 -
Release test rllib_learning_tests_dqn_old_api_stack_tf.aws failed
#44547 closed
Jul 9, 2024 -
Release test rllib_learning_tests_cql_old_api_stack_tf.aws failed
#44521 closed
Jul 9, 2024 -
Release test rllib_learning_tests_bc_hybrid_api_stack_tf2.aws failed
#43703 closed
Jul 9, 2024 -
Release test rllib_learning_tests_appo_hybrid_api_stack_torch.aws failed
#43705 closed
Jul 9, 2024 -
Release test rllib_learning_tests_dqn_old_api_stack_torch.aws failed
#43710 closed
Jul 9, 2024 -
Release test rllib_learning_tests_marwil_old_api_stack_tf.aws failed
#44553 closed
Jul 9, 2024 -
[docs] Dark mode > search bar has low visibility
#43566 closed
Jul 8, 2024 -
[RLlib] Action masking example with new API stack error
#44452 closed
Jul 8, 2024 -
CI test linux:https://rllib:learning_tests_multi_agent_cartpole_crashing_appo_old_api_stack is flaky
#45693 closed
Jul 8, 2024 -
CI test windows:https://python/ray/tests:test_object_spilling_debug_mode is consistently_failing
#43796 closed
Jul 6, 2024
30 Issues opened by 29 people
-
Accelerated DAG: Ensure chunks are received in order on the receiver node
#46603 opened
Jul 12, 2024 -
[Core] Allow library to pass attributes to Ray's structured logger
#46598 opened
Jul 12, 2024 -
[RLlib] Configurable log path/directory when using the train() API
#46592 opened
Jul 12, 2024 -
[CORE][CLUSTER] Ray Autoscaler Overprovisioning Resources on AWS
#46588 opened
Jul 12, 2024 -
[Data] Update Data progress bars to use `row` as the iteration unit
#46579 opened
Jul 11, 2024 -
PP + DAG
#46573 opened
Jul 11, 2024 -
[air] MLFlowLoggerCallback Does not include Artifacts
#46569 opened
Jul 11, 2024 -
[<Ray component: Core>] num_gpus not working with ROCM devices
#46563 opened
Jul 11, 2024 -
[Cluster] ray running inside docker on a cloud VM losing GPU access after few hours
#46552 opened
Jul 10, 2024 -
[Core] The actor task hangs when it is re-submitted
#46538 opened
Jul 10, 2024 -
[core][experimental] Handle leaf node in accelerated DAG
#46528 opened
Jul 9, 2024 -
RLLib: Vectorized Mult Agent Envs
#46521 opened
Jul 9, 2024 -
[core] Using pybind11 to replace Cython bindings
#46512 opened
Jul 9, 2024 -
[Data / Train] Inconsistent type conversion when using `map_batches` and preprocessors
#46507 opened
Jul 9, 2024 -
[Ultralytics][Ray Tune] not reporting GPU usage
#46506 opened
Jul 9, 2024 -
Error in example tune_tensorflow_autoencoder_example.py
#46505 opened
Jul 9, 2024 -
[Core] Ray GCS should expose a KillNode rpc
#46503 opened
Jul 9, 2024 -
ray.tune.error.TuneError: ('Trials did not complete', [A3C_A3C_four_way_train-v0_ff741_00000])
#46501 opened
Jul 9, 2024 -
[Serve] Calculate autoscaling decisions over whole scale_delay_s period
#46497 opened
Jul 9, 2024 -
[Ray Autoscaler] Autoscaler kills working nodes unexpectedly
#46492 opened
Jul 9, 2024 -
[core] Fiber vs thread_local SIGSEGV in Actor async methods
#46489 opened
Jul 8, 2024 -
[Data] `read_json` reads whole file into memory
#46485 opened
Jul 8, 2024 -
[Data] Databricks UC Datasource schema() is not callable
#46481 opened
Jul 8, 2024 -
[Data] Enhancement Proposal: Persistent Actors for Ray Dataset Pipeline
#46479 opened
Jul 8, 2024 -
Ray Train incompatible broken with XGBoost 2.1.0
#46476 opened
Jul 8, 2024 -
[Ray autoscaler v2] Can't scaler up when using autoscaler v2
#46473 opened
Jul 8, 2024 -
[RLlib] Error when running pettingzoo_parameter_sharing.py on an environment in pettingzoo.mpe
#46472 opened
Jul 8, 2024 -
SET STATIC NODEPORT WHEN DEPLOY RAY CLUSTER
#46467 opened
Jul 7, 2024
104 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[Train] Update run status and actor status for train runs.
#46395 commented on
Jul 12, 2024 • 39 new comments -
[RLlib] Make most RLlib components implementers of the new `Checkpointable` API for unified checkpointing experience.
#46435 commented on
Jul 12, 2024 • 35 new comments -
[doc] Added MNIST training using KubeRay doc page
#46123 commented on
Jul 12, 2024 • 12 new comments -
fix dashboard process reporting on windows
#45578 commented on
Jul 12, 2024 • 4 new comments -
[Doc][KubeRay] Add KubeRay image resize example to Ray doc page
#46447 commented on
Jul 12, 2024 • 4 new comments -
[RFC][python 3.12] upgrade pytorch
#46256 commented on
Jul 12, 2024 • 2 new comments -
[Data] Add read API for Delta sharing tables
#46072 commented on
Jul 12, 2024 • 2 new comments -
add more execution and iteration metrics to prometheus
#44971 commented on
Jul 8, 2024 • 1 new comment -
[Dashboard] Add cleanup of `job_table` in `delete_job`
#46173 commented on
Jul 11, 2024 • 1 new comment -
[Data] Add support for objects to Arrow blocks
#45272 commented on
Jul 12, 2024 • 1 new comment -
[RLlib] Remove all remaining usages of `NestedDict`.
#46446 commented on
Jul 9, 2024 • 1 new comment -
fix performance bug in arrow to numpy transform
#46433 commented on
Jul 7, 2024 • 0 new comments -
[Serve] downscaling_factor logic is broken for no-scaling decisions
#46148 commented on
Jul 12, 2024 • 0 new comments -
[runtime_env] Passing NFS path to runtime_env raises error due to ValueError: ZIP does not support timestamps before 1980
#46379 commented on
Jul 11, 2024 • 0 new comments -
Python 3.12 alpha support
#45904 commented on
Jul 11, 2024 • 0 new comments -
[in-progress] Readers across different nodes - ADAG Developer Preview - Test Coverage
#46269 commented on
Jul 11, 2024 • 0 new comments -
[Data] Arguments for specifying ray_remote_args are not consistent among the APIs
#40673 commented on
Jul 11, 2024 • 0 new comments -
[Ray Dashboard] The format of the GPU profile file downloaded from the dashboard is incorrect
#46240 commented on
Jul 11, 2024 • 0 new comments -
[<Ray component: RLlib] module 'numpy' has no attribute 'product'
#46250 commented on
Jul 11, 2024 • 0 new comments -
[Workflow] Ray workflow does not work in kuberay rayjob
#37607 commented on
Jul 11, 2024 • 0 new comments -
CI test linux:https://rllib:learning_tests_multi_agent_cartpole_impala_multi_gpu is flaky
#46225 commented on
Jul 11, 2024 • 0 new comments -
[Dashboard] Decoupling dashboard and dashboard lifetime from Ray Cluster
#46444 commented on
Jul 11, 2024 • 0 new comments -
[Data] More GPUs in progress bar summary than actually running
#46384 commented on
Jul 10, 2024 • 0 new comments -
[Ray Air] ArrowVariableShapedTensorArray with LargeListArray
#46434 commented on
Jul 10, 2024 • 0 new comments -
[Core] ray.init() hangs/fails after "Started a local Ray instance."
#31897 commented on
Jul 10, 2024 • 0 new comments -
[RLLib] Evaluation duration doesn't match with results num episodes for evaluation
#46412 commented on
Jul 10, 2024 • 0 new comments -
Setup Commands not running with Ray Cluster on GCP
#46451 commented on
Jul 10, 2024 • 0 new comments -
Release test long_running_apex.aws failed
#40950 commented on
Jul 10, 2024 • 0 new comments -
EOFError error during remote_worker_envs flags
#46346 commented on
Jul 9, 2024 • 0 new comments -
[DAG] cpu tensor returned by DAG actor method gets automatically converted to gpu tensor
#46440 commented on
Jul 9, 2024 • 0 new comments -
[python 3.12][rllib] fix tests for python 3.12
#46221 commented on
Jul 9, 2024 • 0 new comments -
[RLlib] - `Algorithm.add_module` does not use the `module_state` argument.
#46247 commented on
Jul 9, 2024 • 0 new comments -
[Core] Use real CPU count available to a Ray process
#46424 commented on
Jul 11, 2024 • 0 new comments -
[serve] Update docs on max ongoing requests default change
#46414 commented on
Jul 9, 2024 • 0 new comments -
for wheel verification..
#46402 commented on
Jul 11, 2024 • 0 new comments -
[RLlib] Rename all np.product usage to np.prod
#46317 commented on
Jul 11, 2024 • 0 new comments -
[Dashboard] Fix the display of task/actor GPU info.
#46295 commented on
Jul 10, 2024 • 0 new comments -
[Dashboard] Display accelerators info on demand, add Huawei Ascend NPU monitoring.
#46287 commented on
Jul 10, 2024 • 0 new comments -
[core] The New GcsClient binding
#46186 commented on
Jul 12, 2024 • 0 new comments -
[train] Sort workers by node ID rather than by node IP
#46163 commented on
Jul 12, 2024 • 0 new comments -
[RLlib] Single-GPU, multi-CPU (Learner), multi-GPU tests for DQN/SAC single-/multi-agent setups in new API stack.
#46153 commented on
Jul 12, 2024 • 0 new comments -
Bump ws from 7.5.9 to 7.5.10 in /dashboard/client
#46096 commented on
Jul 10, 2024 • 0 new comments -
Bump braces from 3.0.2 to 3.0.3 in /dashboard/client
#46082 commented on
Jul 10, 2024 • 0 new comments -
[Serve] Expose Serve app source in `get_serve_instance_details`
#45522 commented on
Jul 8, 2024 • 0 new comments -
blind try on ubuntu upgrade ..
#45427 commented on
Jul 12, 2024 • 0 new comments -
[testing] cumulative pinterest upgrades for 2.10
#45000 commented on
Jul 8, 2024 • 0 new comments -
[RLlib] Initial design for Ray-Data based offline RL Algos (on new API stack).
#44969 commented on
Jul 12, 2024 • 0 new comments -
verify windows wheels.
#43442 commented on
Jul 13, 2024 • 0 new comments -
[Tune][Air] Fix MLflowLoggerCallback to enable its use with PBT (#27783)
#42182 commented on
Jul 8, 2024 • 0 new comments -
[Runtime Env] working dir refactor
#36953 commented on
Jul 8, 2024 • 0 new comments -
lonnie's workspace
#36406 commented on
Jul 10, 2024 • 0 new comments -
[air] Artifact syncing doesn't work for ddp workers
#34475 commented on
Jul 12, 2024 • 0 new comments -
[Core] Enormous memory usage scanning PyArrow Dataset in Ray worker
#46366 commented on
Jul 8, 2024 • 0 new comments -
[Serve] Make `max_num_models_per_replica` in `@serve.multiplexed` reconfigurable
#46422 commented on
Jul 8, 2024 • 0 new comments -
[Ray Head] How to ensures the High reliability of ray head?
#46354 commented on
Jul 8, 2024 • 0 new comments -
[ Core] cannot serialize polars.LazyFrame
#46343 commented on
Jul 8, 2024 • 0 new comments -
RPC issues with changing network topology
#46301 commented on
Jul 8, 2024 • 0 new comments -
[Serve] Expose internal VLLM metrics
#46360 commented on
Jul 8, 2024 • 0 new comments -
[SERVE] tracing_startup_hook doesn't work
#46357 commented on
Jul 8, 2024 • 0 new comments -
[Ray security] The redis password in ray head is plaintext password
#46351 commented on
Jul 8, 2024 • 0 new comments -
[Core] Allow customizing session name
#46281 commented on
Jul 8, 2024 • 0 new comments -
[Ray Core]: Cannot find gpu on Jetson AGX Orin
#46263 commented on
Jul 8, 2024 • 0 new comments -
[tune] Don't re-evaluate HyperOpt's points_to_evaluate
#46325 commented on
Jul 8, 2024 • 0 new comments -
[Data] Execution hangs when there are many operators in the pipeline
#46319 commented on
Jul 8, 2024 • 0 new comments -
[tune==2.10.0] TypeError: _setup_ray_cluster() missing 1 required keyword-only argument: 'num_worker_nodes'
#44295 commented on
Jul 8, 2024 • 0 new comments -
[Ray Core] Actor On Finish Function
#46258 commented on
Jul 8, 2024 • 0 new comments -
[Job] Ray job log streaming misses to report the last log line
#46413 commented on
Jul 8, 2024 • 0 new comments -
[<Ray component: data] `ray.data.read_text` raise `numpy.core._exceptions._ArrayMemoryError: Unable to allocate`
#46293 commented on
Jul 8, 2024 • 0 new comments -
Data: `PandasBlockAccessor` does not have the attribute `_munge_conflict`
#46275 commented on
Jul 8, 2024 • 0 new comments -
[Data] Support read Hudi table as a DataSource
#46272 commented on
Jul 8, 2024 • 0 new comments -
Create a graph that shows a ratio of cpu utilization / logical cpu per task name?
#45910 commented on
Jul 8, 2024 • 0 new comments -
[Data] Error converting dtype category to Arrow
#41974 commented on
Jul 8, 2024 • 0 new comments -
Release test dataset_shuffle_push_based_sort_1tb.aws failed
#44085 commented on
Jul 8, 2024 • 0 new comments -
[Data] Ray Data read_webdataset assumes files are ordered by key when reading a tarfile
#44068 commented on
Jul 8, 2024 • 0 new comments -
[RLlib + Tune] PermissionError: [WinError 5] Access is denied: '../.tmp_generator' -> '..basic-variant-state-..' while training with ``Tuner``
#43702 commented on
Jul 7, 2024 • 0 new comments -
[RLlib] 'PPOConfig' object has no attribute 'env_runners'
#45665 commented on
Jul 7, 2024 • 0 new comments -
Rllib: Fractional GPU setup
#46176 commented on
Jul 6, 2024 • 0 new comments -
[Dashboard] Kill a job
#30182 commented on
Jul 6, 2024 • 0 new comments -
Exception raised in creation task: The actor died because of an error raised in its creation task, ray::RolloutWorker.__init__()
#46322 commented on
Jul 9, 2024 • 0 new comments -
[RLLib] Want to update PolicyClient and PolicyServerInput to the new API stack
#46430 commented on
Jul 9, 2024 • 0 new comments -
[RLlib]: PPO agent training error: Invalid NaN values in Normal distribution parameters
#46442 commented on
Jul 9, 2024 • 0 new comments -
Feature Request: Python API to get current Ray cluster information
#14998 commented on
Jul 9, 2024 • 0 new comments -
[core] autoscaler occasionally goes into exception loop when using preemptible GCP instances
#29698 commented on
Jul 9, 2024 • 0 new comments -
[autoscaler][gcp] wrong values for scheduling in example gcp cluster yaml files
#46248 commented on
Jul 9, 2024 • 0 new comments -
Ray multiprocessing.Pool: core_worker_process.cc:278: The core worker has already been shutdown.
#46144 commented on
Jul 9, 2024 • 0 new comments -
specifying conda runtime_env using fullpath no longer works
#44373 commented on
Jul 9, 2024 • 0 new comments -
[data] autodoc mishandling type annotations
#45129 commented on
Jul 9, 2024 • 0 new comments -
[Core|Serve] Ray Serve inside Ray Remote introduces slowdowns related to number of available CPU's
#46362 commented on
Jul 9, 2024 • 0 new comments -
[Data] Progress bar estimates are initially incorrect it read tasks yields multiple outputs
#46420 commented on
Jul 8, 2024 • 0 new comments -
[RFC] Accelerated DAGs
#44288 commented on
Jul 8, 2024 • 0 new comments -
[core][serve] 1MB latency performance regression
#46428 commented on
Jul 8, 2024 • 0 new comments -
[Bug] Please add supports for 3.12.1 from pip pypi
#45931 commented on
Jul 8, 2024 • 0 new comments -
[Core] Extend chaos testing utility to more closely replicate spot instance preemptions
#46367 commented on
Jul 8, 2024 • 0 new comments -
[python 3.12][core] fix tests for python 3.12
#46197 commented on
Jul 8, 2024 • 0 new comments -
[Data] Ray Data prematurely closes progress bar
#44979 commented on
Jul 8, 2024 • 0 new comments -
[Ray Cluster on Google Cloud] Example YAML script breaks with A100 GPU type
#44308 commented on
Jul 8, 2024 • 0 new comments -
[Core] Ray's random generated port doesn't exclude already used ports
#46453 commented on
Jul 8, 2024 • 0 new comments -
[Ray-Clusters] Azure Cluster Autoscaler failing to start worker nodes when using non-spot instances.
#46198 commented on
Jul 8, 2024 • 0 new comments -
[core] lineage-reconstructed Non-deterministic generators hang callers
#46425 commented on
Jul 8, 2024 • 0 new comments -
[DAG] Incompatible `.execute()` API for DAG and Compiled DAG
#46441 commented on
Jul 8, 2024 • 0 new comments -
[Core] Optionally allow user code to directly produce data into a plasma store managed buffer
#46438 commented on
Jul 8, 2024 • 0 new comments -
[Core| Client] Ray client fail to reconnect after explicit disconnect
#46403 commented on
Jul 8, 2024 • 0 new comments -
Ray Core Hangs After Seconds of Parallel Execution
#46396 commented on
Jul 8, 2024 • 0 new comments -
[Ray Cluster] Resource monitoring of Ray Tasks and Actors
#46377 commented on
Jul 8, 2024 • 0 new comments