Implement distributed training using Kubernetes #77

Merged
merged 17 commits into from
Jan 23, 2021
Fix deploy script to use right id
leogao2 committed Jan 23, 2021
commit 0e36734c7c10185bc061e63adf6c941c048e627a
4 changes: 2 additions & 2 deletions deploy_k8s.sh
@@ -15,10 +15,10 @@ mv id_rsa.pub authorized_keys
 for id in $(kubectl get pods | grep eleuther-neox | awk '{print $1}')
 do
 echo copying keys to $id
-kubectl cp $PWD/authorized_keys $MASTER_ID:/root/.ssh/
+kubectl cp $PWD/authorized_keys $id:/root/.ssh/
+echo 'chmod 600 ~/.ssh/authorized_keys && chmod 700 ~/.ssh && chown -R root /root/.ssh' | kubectl exec --stdin $id -- /bin/bash
 done
 rm authorized_keys hosts
 rm id_rsa*
-echo 'chmod 600 ~/.ssh/authorized_keys && chmod 700 ~/.ssh && chown -R root /root/.ssh' | kubectl exec --stdin --tty $MASTER_ID -- /bin/bash
 
 kubectl exec --stdin --tty $MASTER_ID -- /bin/bash
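
The point of this commit is that each loop iteration should target the pod named by the loop variable, not a fixed pod. A minimal sketch of that pattern follows. It is not part of the PR: the function name `copy_keys_to_pods` and the `KUBECTL` override variable are hypothetical, introduced only so the loop can be dry-run without a cluster.

```shell
#!/usr/bin/env bash
# Hypothetical override, not in deploy_k8s.sh: set KUBECTL=echo to print
# the commands instead of running them against a cluster.
KUBECTL="${KUBECTL:-kubectl}"

# Copy a key file into /root/.ssh/ on every pod passed as an argument.
# Each iteration uses "$id" (the loop variable), so every pod receives
# the file, rather than one fixed pod receiving it repeatedly.
copy_keys_to_pods() {
    local keyfile="$1"
    shift
    local id
    for id in "$@"; do
        "$KUBECTL" cp "$keyfile" "$id:/root/.ssh/"
    done
}
```

With `KUBECTL=echo`, calling `copy_keys_to_pods authorized_keys pod-a pod-b` prints one `cp` command per pod, which makes it easy to confirm that the destination changes on each iteration.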