[dont merge]: make dashboard more robust by wz1qqx · Pull Request #56 · novitalabs/pegaflow

wz1qqx · 2026-01-25T07:08:21Z

Copilot

Pull request overview

This pull request enhances the metrics and monitoring infrastructure by adding comprehensive Grafana dashboards and expanding Prometheus scrape configurations.

Changes:

Added vLLM and Router-Int scrape targets to Prometheus configuration with 8 vLLM instances and 1 Router-Int instance
Created comprehensive vLLM monitoring dashboard with 36 panels covering token throughput, request latency, cache metrics, and performance indicators
Created Router-Int monitoring dashboard with 31 panels tracking HTTP, router, worker, circuit breaker, and retry metrics

Reviewed changes

Copilot reviewed 2 out of 3 changed files in this pull request and generated 7 comments.

File	Description
examples/metric-prometheus/prometheus.yml	Expanded scrape configuration to include vLLM (8 instances on ports 9000-9007) and Router-Int (port 22000) services
examples/metric-prometheus/grafana/dashboards/vllm.json	New comprehensive dashboard monitoring vLLM inference server with metrics for tokens, latency, cache, requests, and performance
examples/metric-prometheus/grafana/dashboards/router-int.json	New dashboard for Router-Int service covering HTTP layer, router layer, worker layer, circuit breaker, and retry metrics

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-01-25T07:11:01Z

examples/metric-prometheus/prometheus.yml

  evaluation_interval: 5s

 scrape_configs:
+  # Pegaflow 服务


The comment contains Chinese text "Pegaflow 服务" (Pegaflow service). For consistency and accessibility, configuration comments should be in English. Consider changing this to "# Pegaflow service" to match the pattern used in other parts of the codebase.

Suggested change

# Pegaflow 服务

# Pegaflow service

Copilot · 2026-01-25T07:11:02Z

examples/metric-prometheus/prometheus.yml

    static_configs:
      - targets: ['host.docker.internal:9091']
+
+  # vLLM 服务 (8个实例，端口 9000-9007)


The comment contains Chinese text "vLLM 服务 (8个实例，端口 9000-9007)" (vLLM service, 8 instances, ports 9000-9007). For consistency and accessibility, configuration comments should be in English. Consider changing this to "# vLLM service (8 instances, ports 9000-9007)".

Copilot · 2026-01-25T07:11:02Z

examples/metric-prometheus/prometheus.yml

+          - 'host.docker.internal:9006'
+          - 'host.docker.internal:9007'
+
+  # Router-int 服务


The comment contains Chinese text "Router-int 服务" (Router-int service). For consistency and accessibility, configuration comments should be in English. Consider changing this to "# Router-int service".

Suggested change

# Router-int 服务

# Router-int service

Copilot · 2026-01-25T07:11:02Z

examples/metric-prometheus/grafana/dashboards/router-int.json

+      }
+    ],
+    "preload": false,
+    "refresh": "15m",


The refresh interval is set to "15m" which seems unusually long for a monitoring dashboard. This means the dashboard will only refresh every 15 minutes, which may not provide timely insights for performance monitoring. Consider using a shorter interval like "5s", "10s", or "30s" to match the monitoring nature of the dashboard and be consistent with the vLLM dashboard which uses "5s".

Suggested change

"refresh": "15m",

"refresh": "5s",

Copilot · 2026-01-25T07:11:02Z

examples/metric-prometheus/grafana/dashboards/vllm.json

+        "type": "prometheus",
+        "uid": "Prometheus"
+      },
+      "description": "분당 성공 요청 수",


The description contains Korean text "분당 성공 요청 수" (Success requests per minute). For consistency and accessibility, dashboard descriptions should be in English. Consider changing this to "Success requests per minute" to match the panel title.

Suggested change

"description": "분당 성공 요청 수",

"description": "Success requests per minute",

Copilot · 2026-01-25T07:11:03Z

examples/metric-prometheus/grafana/dashboards/vllm.json

+        "type": "prometheus",
+        "uid": "Prometheus"
+      },
+      "description": "분당 전체 요청 수 (성공+실패 포함)",


The description contains Korean text "분당 전체 요청 수 (성공+실패 포함)" (Total requests per minute including success and failures). For consistency and accessibility, dashboard descriptions should be in English. Consider changing this to "Total requests per minute (including success and failures)" to match the panel title.

Suggested change

"description": "분당 전체 요청 수 (성공+실패 포함)",

"description": "Total requests per minute (including success and failures)",

Copilot · 2026-01-25T07:11:03Z

examples/metric-prometheus/grafana/dashboards/vllm.json

+        "type": "prometheus",
+        "uid": "Prometheus"
+      },
+      "description": "누적 성공 요청 수",


The description contains Korean text "누적 성공 요청 수" (Cumulative success requests). For consistency and accessibility, dashboard descriptions should be in English. Consider changing this to "Cumulative success requests" to match the panel title.

Suggested change

"description": "누적 성공 요청 수",

"description": "Cumulative Success Requests",

make dashboard more robust

6452c32

wz1qqx requested review from Copilot and xiaguan January 25, 2026 07:08

Copilot started reviewing on behalf of wz1qqx January 25, 2026 07:08 View session

Copilot AI reviewed Jan 25, 2026

View reviewed changes

jimmy-evo changed the title ~~metrics:make dashboard more robust~~ feat(metrics): make dashboard more robust Jan 26, 2026

wz1qqx changed the title ~~feat(metrics): make dashboard more robust~~ [dont merge]: make dashboard more robust Feb 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[dont merge]: make dashboard more robust#56

[dont merge]: make dashboard more robust#56
wz1qqx wants to merge 1 commit intonovitalabs:masterfrom
wz1qqx:make-more-dashboard

wz1qqx commented Jan 25, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jan 25, 2026

Uh oh!

Copilot AI Jan 25, 2026

Uh oh!

Copilot AI Jan 25, 2026

Uh oh!

Copilot AI Jan 25, 2026

Uh oh!

Copilot AI Jan 25, 2026

Uh oh!

Copilot AI Jan 25, 2026

Uh oh!

Copilot AI Jan 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	"description": "분당 성공 요청 수",
	"description": "Success requests per minute",

	"description": "분당 전체 요청 수 (성공+실패 포함)",
	"description": "Total requests per minute (including success and failures)",

	"description": "누적 성공 요청 수",
	"description": "Cumulative Success Requests",

Conversation

wz1qqx commented Jan 25, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Jan 25, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 25, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 25, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 25, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 25, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 25, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 25, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants