Conversation
[FEATURE] GPU μ 보 μμ§μ μν λͺ¨λν°λ§ λμ λ³κ²½
Summary of ChangesHello @youyeon11, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request implements a significant update to the project's monitoring stack, shifting from direct FastAPI application instrumentation to a more specialized GPU monitoring solution. The core changes involve removing the Highlights
Using Gemini Code AssistThe full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips. Invoking Gemini You can request assistance from Gemini at any point by creating a comment using either
Customization To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a Limitations & Feedback Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with π and π on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here. You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension. Footnotes
|
There was a problem hiding this comment.
Code Review
μ΄ PRμ μ ν리μΌμ΄μ
λ 벨μ Prometheus λͺ¨λν°λ§μ μ κ±°νκ³ , dcgm-exporterλ₯Ό λμ
νμ¬ GPU λ©νΈλ¦ λͺ¨λν°λ§μΌλ‘ μ ννλ κ²μ λͺ©νλ‘ νλ κ²μΌλ‘ 보μ
λλ€. κ΄λ ¨νμ¬ docker-compose.yml, prometheus.yml, requirements.txt νμΌλ€μ΄ μΌκ΄μ± μκ² μμ λμμ΅λλ€. μ λ°μ μΈ λ³κ²½ λ°©ν₯μ λͺ
ννμ§λ§, λͺ κ°μ§ κ°μ μ μ μ μν©λλ€. Docker μ΄λ―Έμ§μ latest νκ·Έλ₯Ό μ¬μ©νλ λΆλΆκ³Ό requirements.txtμ λΆνμν logging ν¨ν€μ§κ° ν¬ν¨λ μ μ μ μ¬μ μΈ λ¬Έμ λ₯Ό μΌκΈ°ν μ μμΌλ―λ‘ μμ μ΄ νμν©λλ€. λν, μ€μ νμΌμ μ μ°μ±κ³Ό νμ€ μ€μλ₯Ό μν΄ λͺ κ°μ§ μμ μ μ μνμ΅λλ€. μμΈν λ΄μ©μ κ° νμΌμ μ£Όμμ νμΈν΄μ£ΌμΈμ.
| - mynetwork | ||
|
|
||
| dcgm-exporter: | ||
| image: nvcr.io/nvidia/k8s/dcgm-exporter:latest |
There was a problem hiding this comment.
Docker μ΄λ―Έμ§μ latest νκ·Έλ₯Ό μ¬μ©νλ κ²μ νλ‘λμ
νκ²½μμ μνν μ μμ΅λλ€. latest νκ·Έλ μΈμ λ μ§ λ³κ²½λ μ μμ΄, μκΈ°μΉ μμ λ²μ μ΄ λ°°ν¬λμ΄ μ₯μ λ₯Ό μ λ°ν μ μμ΅λλ€. μμ μ μΈ λ°°ν¬λ₯Ό μν΄ 3.3.5-3.4.2-ubuntu22.04μ κ°μ΄ ꡬ체μ μΈ λ²μ νκ·Έλ₯Ό μ¬μ©ν΄μ£ΌμΈμ.
image: nvcr.io/nvidia/k8s/dcgm-exporter:3.3.5-3.4.2-ubuntu22.04| logging | ||
| prometheus-client==0.19.0 | ||
| prometheus-fastapi-instrumentator==6.1.0 No newline at end of file | ||
| logging No newline at end of file |
There was a problem hiding this comment.
|
|
||
| networks: | ||
| mynetwork: | ||
| driver: bridge No newline at end of file |
| - targets: ['127.0.0.1:9090'] | ||
| labels: | ||
| service: 'monitor-1' | ||
| - job_name: 'node' | ||
| static_configs: | ||
| - targets: ['${REMOTE_HOST}:9090'] | ||
| - job_name: 'fastapi-actuator-blue' | ||
| metrics_path: '/actuator/prometheus' | ||
| scrape_interval: 1m | ||
| - targets: ['127.0.0.1:9090'] |
There was a problem hiding this comment.
127.0.0.1:9090μΌλ‘ νκ²μ νλμ½λ©νλ©΄ λ€λ₯Έ νκ²½μμ μ¬μ¬μ©νκ±°λ ꡬμ±μ λ³κ²½ν λ μ μ°μ±μ΄ λ¨μ΄μ§λλ€. μ΄μ μ²λΌ ${REMOTE_HOST}μ κ°μ νκ²½ λ³μλ₯Ό μ¬μ©νκ±°λ, μ΄κ²μ΄ μλλ λ³κ²½μ΄λΌλ©΄ μ νλμ½λ©μ΄ νμνμ§μ λν μ£Όμμ μΆκ°νλ κ²μ κ³ λ €ν΄ λ³΄μΈμ.
| - targets: [ 'app-green:8001' ] No newline at end of file | ||
| - targets: ['dcgm-exporter:9400'] | ||
| labels: | ||
| exporter: 'dcgm' No newline at end of file |
π μμ λͺ©μ