45 add interpolation methods for input output #51

SiAndRo2002 · 2025-11-20T23:13:18Z

Pull Request

Description

Restructure and individualise shift and add interpolation methods

Type of Change

Bug fix (non-breaking change fixing an issue)
New feature (non-breaking change adding functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Refactoring (code change that neither fixes a bug nor adds a feature)
Documentation update

Required Checklist

Testing

Unit tests have been created/updated for new/modified functionality
CI/CD pipeline passes all tests (pytest, coverage, executables, installation)

Examples

Add examples for new features and functionality

Compatibility

Changes are backward compatible OR deprecation warnings added
No breaking changes to public APIs
New dependencies added to pyproject.toml (required in dependencies or optional in [project.optional-dependencies])

Documentation

Docstrings added/updated for new/modified public methods (Google style)
Type hints added for new functions/methods

Optional

GitHub Copilot Review

Request Copilot review via GitHub UI (add 'copilot' as a reviewer)

that specifies the shift for each input individually

…olation-methods-for-input-output

output wasn't considered, error occurred when using pinn

…olation-methods-for-input-output

…://github.com/RWTH-EBC/physXAI into 45-add-interpolation-methods-for-input-output

updated docstrings

PatrickHenkel1 · 2025-12-04T12:34:35Z

executables/bestest_hydronic_heat_pump/Dummy_shifting.py

+shift = {
+    'reaTZon_y': 'previous',  # for all lags of reaTZon_y, the shift will be set automatically
+    'weaSta_reaWeaHDirNor_y': 'mean_over_interval',
+    '_default': 0,
+}


We should include this as part of Feature with attribute sampling_method. Default should be None, and there should be a class attribute default_sampling_method that is used if None
Shift should be removed from preprocessing (not backwards compatible, but necessary to maintain usability)

PatrickHenkel1 · 2025-12-04T12:44:32Z

physXAI/preprocessing/preprocessing.py

+        if (len(self.inputs) != len(self.shift.keys())) or not all(inp in self.shift.keys() for inp in self.inputs):
+            self.shift = convert_shift_to_dict(self.shift, self.inputs, custom_default=self.shift_default)
+
+        assert len(self.inputs) == len(self.shift.keys()), (
+            f"Something went wrong, number of inputs ({len(self.inputs)})"
+            f" doesn't match number of inputs defined in shift ({len(self.shift.keys())})")


Is this not already done in the init function?

L291-292 are necessary in case of recursive feature selection since preprocessing object is only created once but inputs are changed

PatrickHenkel1 · 2025-12-04T12:51:07Z

physXAI/preprocessing/preprocessing.py

-        FeatureConstruction.process(df)
+        # Only apply for those features that are not lags since lags must be constructed after sampling the data
+        # according to the given time step
+        FeatureConstruction.process(df, feature_names=inputs_without_lags + [out for out in self.output if out not in inputs_without_lags])


It might be easier to implement a process_without_lags and process_only_lags function, to avoid too much code in high level functions

PatrickHenkel1 · 2025-12-04T12:55:15Z

physXAI/preprocessing/preprocessing.py

    """

-    def __init__(self, inputs: list[str], output: Union[str, list[str]], shift: int = 1,
+    def __init__(self, inputs: list[str], output: Union[str, list[str]], shift: Union[int, str, dict] = 'previous',


Can probably be removed if sampling_method is part of feature

PatrickHenkel1 · 2025-12-04T14:43:17Z

physXAI/preprocessing/preprocessing.py

 os.environ['TF_CPP_MIN_LOG_LEVEL'] = '0'


+def convert_shift_to_dict(s: Union[int, str, dict], inputs: list[str], custom_default: Union[int, str] = None) -> dict:


Function should (hopefully) get much easier if shift is a attribute of Feature

PatrickHenkel1 · 2025-12-04T14:50:52Z

physXAI/preprocessing/preprocessing.py

+        if all('current' == self.shift[k] for k in inputs_without_lags):
+            # filter / sample data
+            X = self.filter_df_according_to_timestep(X)
+            # nothing more to do here
+        elif all('previous' == self.shift[k] for k in inputs_without_lags):
+            # filter / sample data
+            X = self.filter_df_according_to_timestep(X)
+
+            # shift data by 1 and shorten DataFrames accordingly
+            X = X.shift(1)
+            y = y.iloc[1:]
+            X = X.iloc[1:]
+        elif all('mean_over_interval' == self.shift[k] for k in inputs_without_lags):
+            X = get_mean_over_interval(y, X)
+            # synchronize length between X and y
+            y = y.iloc[1:]
+
+        else:  # different inputs have different shifts


Is this not a duplicate of the more general "else: # different inputs have different shifts" statement?

Yes, in fact, the else would do the same thing. The idea was that it might be faster in case all inputs have the same shift

I see, i think the preprocessing is not really a performance bottleneck, so i would sacrifice a little performance for a more general implementation

executables/bestest_hydronic_heat_pump/Dummy_shifting.py

PatrickHenkel1 · 2025-12-04T14:59:41Z

physXAI/preprocessing/preprocessing.py

+        if isinstance(shift, dict) and '_default' in shift.keys():
+            self.shift_default = shift['_default']
+            shift.__delitem__('_default')


Should get easier if Feature has a default sampling_method (as class attribute, so it is changeable)

PatrickHenkel1 · 2025-12-04T15:00:24Z

physXAI/preprocessing/preprocessing.py

        df = df.loc[first_valid_index:last_valid_index]
-        if df.isnull().values.any():
+
+        def get_mean_over_interval(y: pd.DataFrame, x: pd.DataFrame):


Is this similar to how the sampling is done in agentlib/agentlib-mpc?

The get_mean_over_interval function is mostly copied from agentlib-mpc/utils/sampling.py. However, small adaptions were necessary due to differing data structures

PatrickHenkel1 · 2025-12-04T15:04:52Z

physXAI/preprocessing/preprocessing.py

-            y = y.iloc[:-self.shift]
-            X = X.iloc[:-self.shift]
+        # Applies feature constructions defined in `FeatureConstruction` to the lagged inputs
+        FeatureConstruction.process(res_df, feature_names=lagged_inputs)


How is it handeld if there is a constructed feature that is based on a lagged input?

Resolved in commit 367624f

fixing review issue #51 (comment)

before: only str allowed

…mple

deleted deprecated code and test for shift conversion

sampling_method of constructed features determined based on corresponding base_features(s)

resetting FeatureConstruction.features also affected p_hp_data

…olation-methods-for-input-output

…://github.com/RWTH-EBC/physXAI into 45-add-interpolation-methods-for-input-output

…olation-methods-for-input-output

ross.simon added 3 commits November 21, 2025 00:04

Added function to convert shift to dict

1ffcccb

that specifies the shift for each input individually

Added unittests for function preprocessing.convert_shift_to_dict

b8706cc

Small import improvement

499b4c7

SiAndRo2002 linked an issue Nov 20, 2025 that may be closed by this pull request

Add Interpolation Methods for Input / Output #45

Open

ross.simon and others added 18 commits November 21, 2025 09:18

Bug fix for backwards compatibility with python 3.9

f404396

Corrected bug fix for backwards compatibility with python 3.9

15c2ed7

Merge remote-tracking branch 'remotes/origin/main' into 45-add-interp…

690cb96

…olation-methods-for-input-output

partly integrated new structure for shifting inputs and outputs

828c64d

Merge remote-tracking branch 'remotes/origin/main' into 45-add-interp…

95ff10f

…olation-methods-for-input-output

Fixed error occurring with recursive_feature_elimination

e2986de

Implemented new structure and methods for shifting input data

9973f0c

Fixed small error with feature selection test script

0d3783b

Fixed error in feature construction

2d23228

output wasn't considered, error occurred when using pinn

Update coverage badge [skip ci]

5ba2f22

Merge remote-tracking branch 'remotes/origin/main' into 45-add-interp…

4b12e2b

…olation-methods-for-input-output

Merge branch '45-add-interpolation-methods-for-input-output' of https…

3e0a8ad

…://github.com/RWTH-EBC/physXAI into 45-add-interpolation-methods-for-input-output

Update coverage badge [skip ci]

3dec440

implemented custom default for shift

88b1ccc

Updated docstrings

ef9a8d4

Implemented test and example for different shifts

b458803

updated docstrings

Update coverage badge [skip ci]

23ac15b

reduce number of epochs for more efficient testing

80815ee

SiAndRo2002 marked this pull request as ready for review December 3, 2025 15:07

SiAndRo2002 requested a review from PatrickHenkel1 December 3, 2025 15:08

PatrickHenkel1 requested changes Dec 4, 2025

View reviewed changes

ross.simon and others added 4 commits December 5, 2025 11:11

implemented handling of constructed features including lagged features

367624f

fixing review issue #51 (comment)

Partly integrated shift as attribute sampling_method in Feature

567efcd

Implemented input list as list of Features and str

e0fc769

before: only str allowed

Update coverage badge [skip ci]

be1365b

ross.simon and others added 24 commits December 7, 2025 11:58

Fix SyntaxError in python versions earlier than 3.12

7d439f8

Added DeprecationWarning for shift parameter, updated testing and exa…

f160948

…mple

Fixed small mistake regarding DataFrame length

11d1719

Fixed testing bug

8d9af45

Update coverage badge [skip ci]

8f8c377

Updated testing for sampling method as attribute of Feature

c42f424

deleted deprecated code and test for shift conversion

Fixed error: char not allowed in folder name

8323ee5

fixed bug with property & added testing for deprecated shift

8fd0093

Fixed small error in testing script

0496761

fixed small syntax error with older python versions

03335f6

Moved default_sampling_method from FeatureConstruction to Feature

9450d0f

Deleted deprecated code

93068a0

Implemented handling of constructed outputs

096a863

Update coverage badge [skip ci]

203c601

restructured sampling_method

bafeb3c

sampling_method of constructed features determined based on corresponding base_features(s)

fixed testing bug

79db1dc

resetting FeatureConstruction.features also affected p_hp_data

Refactoring of sampling, corrected use of UserWarnings

c2f002a

Merge remote-tracking branch 'remotes/origin/main' into 45-add-interp…

ae0e378

…olation-methods-for-input-output

Updated

3cbcdba

Merge branch '45-add-interpolation-methods-for-input-output' of https…

6dbe18a

…://github.com/RWTH-EBC/physXAI into 45-add-interpolation-methods-for-input-output

Merge remote-tracking branch 'remotes/origin/main' into 45-add-interp…

69a27f6

…olation-methods-for-input-output

Updated

1af0fb4

Updated

bcef26e

corrected usage of input list

095516e

		os.environ['TF_CPP_MIN_LOG_LEVEL'] = '0'


		def convert_shift_to_dict(s: Union[int, str, dict], inputs: list[str], custom_default: Union[int, str] = None) -> dict:

45 add interpolation methods for input output #51

Are you sure you want to change the base?

45 add interpolation methods for input output #51

Uh oh!

Conversation

SiAndRo2002 commented Nov 20, 2025 • edited by PatrickHenkel1 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request

Description

Type of Change

Required Checklist

Testing

Examples

Compatibility

Documentation

Optional

GitHub Copilot Review

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

SiAndRo2002 commented Nov 20, 2025 •

edited by PatrickHenkel1

Loading