-
Notifications
You must be signed in to change notification settings - Fork 981
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Better deployment #107
Better deployment #107
Conversation
I'm getting I also tried telling it to make 50 nodes and it just spun for half an hour with no feedback. |
No need for separate deployment script. Let me look into the issues |
Simple mistake. That should work now.
This is printed when you haven't already got a deployment running. I have now suppressed this message.
Here are some useful commands to determine whats going on (your deployment will be called
I also just tried to run a deployment with 50 nodes. Only 4 pods started. If you then do
Is there a limit to the number of resources we can use? |
right now, we can have 8 nodes total. So when you asked it to make 50 and it made 4, we probably had 4 other nodes already made. That’s also why I chose to test 50; I wanted to see what would happen if I over-requested resources. |
I just tried to deploy a two-pod deployment (six previously existed) and got an interesting message when I ran I also opened up Lens and saw something interesting. Despite the deployments being listed separately (see left), when I click on either |
# Conflicts: # Dockerfile
…to upload files to all machines
@@ -20,21 +20,27 @@ spec: | |||
command: ["/usr/sbin/sshd"] | |||
args: ["-D"] | |||
tty: true | |||
image: leogao2/gpt-neox | |||
image: leogao2/gpt-neox:synced-deployment |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be changed to leogao2/gpt-neox:main
once merged into main
branch
I have added two util scripts to be used when logged into the main node of a deployment:
|
requirements.txt
versions have been pinned. This is to prevent version mismatch issues in the future.requirements.txt
Script usage: