Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add controller deployment doc. #1

Open
wants to merge 448 commits into
base: master
Choose a base branch
from

Conversation

Antlera
Copy link

@Antlera Antlera commented Mar 29, 2023

Add the controller deployment doc to help users deploy the elastic job without knowing the k8s.

hxdtest and others added 30 commits March 2, 2023 18:39
…/fix-scale-down-ps

Bugfix: Scaler execute scaleplan to scaledown PS
…synchronization

[Bug Fix] Start Tensorflow Failover
…machine-learning/optimizer-manager-impl

Optimizer manager implementation
…/fix-migrate-ps

Remove codes to skip migrating PS
…machine-learning/design-virtual-env

Virtual env design
…/refacto-job-resource-opter

Remove the method to optimize configured worker resource.
…synchronization_for_scaling

Add synchronization for auto scaling
workingloong and others added 30 commits March 27, 2023 13:30
…/polist-deeprec-blog

Add the plan in the future.
…features/deeprec_update_doc

Update document of how to auto-scale a DeepRec distributed job.
Environment Test before Start
…/implement-deepctr-model

Implement deepctr models to train the CRITEO dataset.
…/fix-image-for-torch

Set the latest image to launch a torch job.
…/fix-estimator-tutorial

Add the tutorial to build the model image
Environment Test before Start
…tch-2

Environment Test before Start In One Step
…/fix-master-addr

Set MASTER_ADDR as the RDZV_ENDPOINT if necessary.
…deployment-yaml-kustomize

Orgnaize the controller deployment yaml in kustomization style.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants