Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ASoC 2022] Metrics visualization and health scoring model for job #253

Open
hoaresky opened this issue May 30, 2022 · 0 comments
Open

[ASoC 2022] Metrics visualization and health scoring model for job #253

hoaresky opened this issue May 30, 2022 · 0 comments
Labels
asoc2022 Alibaba Summer of Code, 2022 community Community discussions enhancement New feature or request

Comments

@hoaresky
Copy link
Contributor

hoaresky commented May 30, 2022

Background

For now, KubeDL dashboard supports displaying basic informations such as jobs, logs and events, and users are able to manipulate objects through some build-in buttons. However, dashboard can help users digging more insights with visualization of core metrics such as resources utilization, I/O tracing. Usually, system metrics will be collected and gathered in Prometheus protocol, which is a good entry point.

Goals to be achieved

  1. Implement data/metrics visualization leveraging prometheus.
  2. Based on the job information and data metrics, design a job health model to quantify degree of job runtime healthiness.

Additional context

This issue is part of our #249.

Difficulty: Normal
Mentor: Xuelin Hong (@hoaresky )

@SimonCqk SimonCqk added enhancement New feature or request asoc2022 Alibaba Summer of Code, 2022 community Community discussions labels May 30, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
asoc2022 Alibaba Summer of Code, 2022 community Community discussions enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants