ServerTool

The tool developed to collect the gpus info of the linux cluster in my lab and display them in wechat mini-program "考研备忘录"

Info collected

hostname
ram_available
gpu_name, gpu_memory_total, gpu_memory_free, gpu_memory_used

hostname
cat /proc/meminfo | grep MemAvailable | tr -cd "[0-9]"
nvidia-smi --query-gpu=name,memory.total,memory.free,memory.used --format=csv,noheader,nounits

The info listed above are collected using the project.
Check the code for more details
About nvidia-smi

Info displayed

hostname
ram_available
gpu_name, gpu_memory_total, gpu_memory_free, gpu_memory_used

The info listed above are displayed in mini-program "考研备忘录", which is not an open-source project yet.

Package requirements

numpy
requests

Setup And Run

git clone https://github.com/2812427914/ServerTool.git
or
git clone https://github.com.cnpmjs.org/2812427914/ServerTool.git
(faster, mirror)

cd ServerTool

before run bash wgpu.sh

set the python_path in wgpu.sh . (The python version contains the packages required)
set the cron_freq in wgpu.sh .(Optional; the time in crontab tasks, in minutes; default 3 minutes and recommended)
set the group_name = "bdaa_edu" in main_v1.py.(Later feature; "bdaa_edu" supported only currently)

bash wgpu.sh

Python path examples

sis cluster python_path=~/anaconda3/envs/learn/bin/python3.6
pangpang clusterpython_path=/usr/bin/python3.6
huzx clusterpython_path=/usr/bin/python3.7

Update changes from remote branch

git clean -f -d
git fetch --all
git reset --hard origin/master

Then do not forget to:

set the python_path in wgpu.sh(!!!).
set the cron_freq in wgpu.sh .
set the group_name in main_v1.py.

Delete extra files and git push to make contributions

rm access_token.txt record_id.txt cron ; rm -r gpustat_v1/ __pycache__/
git add -A
git commit -m 'fix bugs'
git pull origin master
git push -u origin master

Later features

Add linux group feature. (one can set the group_name and check the gpus info in the "考研备忘录" according to group_name)
Change the way of setting the python_path to the level of naive
Support setting the variables (cron_freq, alive_servers) of a linux cluster through logining in one of the them.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
LICENSE		LICENSE
README.md		README.md
ServerChan.py		ServerChan.py
main_v1.py		main_v1.py
wgpu.sh		wgpu.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ServerTool

Info collected

Info displayed

Package requirements

Setup And Run

Python path examples

Update changes from remote branch

Delete extra files and git push to make contributions

Later features

About

Releases

Packages

Languages

License

2812427914/ServerTool

Folders and files

Latest commit

History

Repository files navigation

ServerTool

Info collected

Info displayed

Package requirements

Setup And Run

Python path examples

Update changes from remote branch

Delete extra files and git push to make contributions

Later features

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages