[AINode] Refactoring of Model Storage, Loading, and Inference Pipeline #16819
base: master
Conversation
Codecov Report ❌

Patch coverage is …. Additional details and impacted files:

@@             Coverage Diff              @@
##             master    #16819      +/-  ##
=============================================
+ Coverage     38.87%    38.98%    +0.11%
  Complexity      207       207
=============================================
  Files          5022      5009       -13
  Lines        333113    332063     -1050
  Branches      42390     42260      -130
=============================================
- Hits         129488    129449       -39
+ Misses       203625    202614     -1011

☔ View full report in Codecov by Sentry.
Force-pushed from 9a726eb to 3335e1e (Compare)
CRZbulabula left a comment:
PTAL.
Review threads (all outdated and resolved):
- iotdb-core/ainode/iotdb/ainode/core/inference/inference_request_pool.py
- iotdb-core/ainode/iotdb/ainode/core/model/sundial/pipeline_sundial.py
- iotdb-core/ainode/iotdb/ainode/core/model/timer_xl/pipeline_timer.py
Commits:
- remove useless codes in IoTDB
- fix ci
- Update AINodeInstanceManagementIT.java (fix CI)
- * stash
  * Support loading inference pipelines for user-defined models
  * Support loading inference pipelines for different models
- …ient model loading (#16865)
Force-pushed from 09f98a0 to ef84301 (Compare)
This PR significantly improves model storage, loading, and inference pipeline management in AINode, with the goals of better extensibility, efficiency, and ease of use. It refactors model storage to support a wider range of models, streamlines the model loading process, and introduces a unified inference pipeline. Together, these changes simplify model management, reduce memory usage, and improve the overall inference workflow.
Model Storage Refactoring
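The PR body does not spell out the new storage layout here, but a minimal sketch of per-model directory storage, one common way to make storage extensible across model types, might look like the following. All names and paths below are hypothetical illustrations, not this PR's actual API:

```python
# Hypothetical sketch: each model gets its own directory holding a weights
# file and a JSON config, so new model types need no storage-code changes.
import json
from pathlib import Path

MODEL_ROOT = Path("data/ainode/models")  # assumed storage root, not from the PR

def save_model(model_id: str, weights: bytes, config: dict) -> Path:
    """Persist a model's weights and config in its own directory."""
    model_dir = MODEL_ROOT / model_id
    model_dir.mkdir(parents=True, exist_ok=True)
    (model_dir / "weights.bin").write_bytes(weights)
    (model_dir / "config.json").write_text(json.dumps(config, indent=2))
    return model_dir

def list_models() -> list[str]:
    """Enumerate stored models by directory name."""
    if not MODEL_ROOT.exists():
        return []
    return sorted(p.name for p in MODEL_ROOT.iterdir() if p.is_dir())
```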
Model Loading Refactoring
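One plausible way to streamline loading and reduce memory usage, consistent with the PR's stated goals, is to load models on demand and cache the resident instances. A minimal sketch under that assumption; the registry and loader names are hypothetical, not taken from the PR:

```python
# Hypothetical sketch: loaders are registered per model type, and a model is
# read from disk only on first use, then served from an LRU cache.
from functools import lru_cache
from typing import Callable

_LOADERS: dict[str, Callable] = {}  # model type -> loader (illustrative registry)

def register_loader(model_type: str):
    """Decorator registering a loader function for one model type."""
    def wrap(fn: Callable) -> Callable:
        _LOADERS[model_type] = fn
        return fn
    return wrap

@lru_cache(maxsize=8)  # keep at most 8 models resident to bound memory use
def load_model(model_id: str, model_type: str):
    """Load a model once; subsequent calls return the cached instance."""
    try:
        loader = _LOADERS[model_type]
    except KeyError:
        raise ValueError(f"no loader registered for model type {model_type!r}")
    return loader(model_id)
```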
Inference Pipeline Addition
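The reviewed files (pipeline_sundial.py, pipeline_timer.py, inference_request_pool.py) suggest per-model pipelines sitting behind a common abstraction. A minimal sketch of what such a unified interface could look like, with hypothetical names rather than the PR's actual classes:

```python
# Hypothetical sketch: every model-specific pipeline implements the same
# preprocess/forward/postprocess contract, so the request pool can drive any
# model through one uniform entry point.
from abc import ABC, abstractmethod

class InferencePipeline(ABC):
    """Shared contract implemented by each model-specific pipeline."""

    @abstractmethod
    def preprocess(self, inputs):
        """Convert raw request data into model-ready tensors."""

    @abstractmethod
    def forward(self, batch):
        """Run the underlying model on a preprocessed batch."""

    @abstractmethod
    def postprocess(self, outputs):
        """Convert model outputs back into the response format."""

    def run(self, inputs):
        """Uniform entry point a request pool could call for any model."""
        return self.postprocess(self.forward(self.preprocess(inputs)))
```

A design like this keeps model-specific logic (e.g. a Sundial or Timer-XL pipeline) in subclasses while the scheduling layer depends only on the shared `run` method.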