Add support for configurable inference device for the model #5478

A-Artemis · 2026-02-09T15:29:53Z

Summary

The emitted event is now handled, and the model is reloaded with the desired inference device.

How to test

Start an inference pipeline, check the logs:

2026-02-10 11:32:30.270 | INFO | model_api.adapters.openvino_adapter:load_model:240 - The model data/projects/9d6af8e8-6017-4ebe-9126-33aae739c5fa/models/977eeb18-eaac-449d-bc80-e340fbe052ad/model.xml is loaded to CPU

Change the device:

curl -s -X PATCH localhost:7860/api/projects/${PROJECT_ID}/pipeline -H "Content-Type: application/json" --data '{"device": "xpu-0"}'

Check the logs again:

2026-02-10 11:32:45.835 | INFO | model_api.adapters.openvino_adapter:load_model:240 - The model data/projects/9d6af8e8-6017-4ebe-9126-33aae739c5fa/models/977eeb18-eaac-449d-bc80-e340fbe052ad/model.xml is loaded to GPU.0

Checklist

The PR title and description are clear and descriptive
I have manually tested the changes
All changes are covered by automated tests
All related issues are linked to this PR (if applicable)
Documentation has been updated (if applicable)

Copilot

Pull request overview

Adds support for selecting an inference device and ensures the model reloads when the device changes.

Changes:

Extend model activation / loaded model state to include an inference device.
Trigger model reload on INFERENCE_DEVICE_CHANGED events (including new unit coverage).
Reload the inference model when the configured device differs from the currently loaded one.

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
application/backend/app/models/model_activation.py	Adds `device` to persisted activation state.
application/backend/app/services/active_model_service.py	Loads device from active pipeline and reloads model when device changes.
application/backend/app/services/event/event_bus.py	Emits model-reload signal on inference device change.
application/backend/app/workers/dispatching.py	Subscribes dispatching worker to inference device change event.
application/backend/app/repositories/active_model_repo.py	Adds DB query for active pipeline’s configured device.
application/backend/tests/unit/services/test_active_model_service.py	Asserts `LoadedModel.device` matches activation state.
application/backend/tests/unit/services/event/test_event_bus.py	Adds test ensuring device-change triggers model reload.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

application/backend/app/models/model_activation.py

application/backend/app/workers/dispatching.py

application/backend/app/services/event/event_bus.py

github-actions · 2026-02-09T15:34:17Z

📊 Test coverage report

Metric	Coverage
Lines	34.7%
Functions	74.1%
Branches	88.3%
Statements	34.7%

github-actions · 2026-02-09T15:36:54Z

Docker Image Sizes

CPU

Image	Size
geti-tune-cpu:pr-5478	2.97G
geti-tune-cpu:sha-06d3032	2.97G

GPU

Image	Size
geti-tune-gpu:pr-5478	10.95G
geti-tune-gpu:sha-06d3032	10.95G

XPU

Image	Size
geti-tune-xpu:pr-5478	9.76G
geti-tune-xpu:sha-06d3032	9.76G

…instead.

leoll2

LGTM, preferably ask for an additional review since I pushed some changes myself.

Add support for configurable inference device for the model

46ace71

A-Artemis self-assigned this Feb 9, 2026

A-Artemis linked an issue Feb 9, 2026 that may be closed by this pull request

Inference model is always loaded on CPU instead of the selected device #5423

Open

github-actions bot added TEST Any changes in tests Geti Tune Backend Issues related to Geti Tune backend labels Feb 9, 2026

A-Artemis requested a review from Copilot February 9, 2026 15:30

Copilot AI reviewed Feb 9, 2026

View reviewed changes

application/backend/app/models/model_activation.py Show resolved Hide resolved

application/backend/app/workers/dispatching.py Show resolved Hide resolved

application/backend/app/services/event/event_bus.py Outdated Show resolved Hide resolved

A-Artemis and others added 3 commits February 10, 2026 10:46

streamline model reload event notifications

40a960b

Fix server error (500) on requests with invalid 'device'; return 400 …

2039209

…instead.

Map device names to OV names

58557e0

A-Artemis marked this pull request as ready for review February 10, 2026 11:12

A-Artemis requested a review from a team as a code owner February 10, 2026 11:12

leoll2 approved these changes Feb 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for configurable inference device for the model #5478

Add support for configurable inference device for the model #5478

Uh oh!

A-Artemis commented Feb 9, 2026 •

edited by leoll2

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Feb 9, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 9, 2026 •

edited

Loading

Uh oh!

leoll2 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add support for configurable inference device for the model #5478

Are you sure you want to change the base?

Add support for configurable inference device for the model #5478

Uh oh!

Conversation

A-Artemis commented Feb 9, 2026 • edited by leoll2 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

How to test

Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📊 Test coverage report

Uh oh!

github-actions bot commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Docker Image Sizes

CPU

GPU

XPU

Uh oh!

leoll2 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

A-Artemis commented Feb 9, 2026 •

edited by leoll2

Loading

github-actions bot commented Feb 9, 2026 •

edited

Loading

github-actions bot commented Feb 9, 2026 •

edited

Loading