Fix feature alignment when applying XGBoost/HistGradientBoosting weights without specifying ss_main_score #167

Copilot · 2025-10-21T01:36:54Z

Problem

When training an XGBoost or HistGradientBoosting model with a specific --ss_main_score parameter and then applying the trained weights without specifying the same parameter, features become misaligned, causing incorrect scoring results.

Example of the issue:

# Step 1: Train with specific main score
pyprophet score --in data.osw --level=ms1ms2 --classifier=XGBoost --ss_main_score=var_dotprod_score

The model trains successfully with var_dotprod_score as the main score, showing correct feature importances:

# Step 2: Apply weights WITHOUT specifying the main score
pyprophet score --in data.osw --level=ms1ms2 --classifier=XGBoost --apply_weights=weights.bin

This applies weights to incorrect features because --ss_main_score defaults to auto, potentially selecting a different main score and changing the feature order:

The root cause is that during training, features are prepared based on the specified ss_main_score, but when applying weights, if this parameter is not specified, it defaults to auto, which may select a different main score. This changes the feature order, causing the model to apply weights to the wrong features.

Solution

This PR stores metadata (ss_main_score, classifier, level) alongside the trained model and automatically restores the correct ss_main_score when applying weights.

Implementation

1. Enhanced Model Serialization (pyprophet/io/_base.py, pyprophet/io/scoring/osw.py)

Models are now saved with metadata:

model_data = {
    "model": weights,
    "ss_main_score": self.config.runner.ss_main_score,
    "classifier": self.classifier,
    "level": self.level,
}

2. Automatic Metadata Restoration (pyprophet/scoring/runner.py)

When loading weights:

Detects new format (dict with metadata) vs old format (model only)
If --ss_main_score=auto (default), automatically uses the stored value
Validates level and classifier compatibility
Updates the ss_use_dynamic_main_score flag for correct semi-supervised learning behavior
Provides informative logging about what's happening

3. Backward Compatibility

Old weight files (without metadata) are automatically detected and still work with an appropriate warning:

[WARNING] Loading weights from old format file (no metadata).
[WARNING] Feature alignment cannot be automatically verified.
[WARNING] Make sure to specify the same --ss_main_score as used during training.

Usage

After this fix, applying weights no longer requires manually specifying --ss_main_score:

# Train with specific main score
pyprophet score --in data.osw --classifier=XGBoost --ss_main_score=var_dotprod_score

# Apply weights - ss_main_score automatically restored
pyprophet score --in data.osw --classifier=XGBoost --apply_weights=weights.bin
# Output: Using stored ss_main_score='var_dotprod_score' from weights file
# Result: Correct feature alignment and accurate scoring ✓

Benefits

User-friendly: No need to remember which ss_main_score was used during training
Error prevention: Automatic validation prevents feature misalignment
Clear feedback: Informative logging shows what's being applied
Backward compatible: Old weight files continue to work with warnings
No breaking changes: Existing workflows work as before

Testing

Comprehensive testing demonstrates:

Metadata is correctly saved and loaded
Automatic restoration works as expected
Old format files remain compatible
Feature alignment is maintained across training and application phases

This section details on the original issue you should resolve

<issue_title>--apply_weights requires --ss_main_score to be specified as in the original command</issue_title>
<issue_description>I get unexpected results when using --apply_weights with XGBoost if a --ss_main_score was set manually.

E.g.

pyprophet score --in in.oswpq --level=ms1ms2 --classifier=XGBoost --ss_main_score=var_dotprod_score

Trains the model as expected

However if this command is used
pyprophet score --in in.oswpq --level=ms1ms2 --classifier=XGBoost --apply_weights=weights.bin

The weights are applied to the incorrect scores
(Same weights on different columns)
To get the expected results the --ss_main_score has to be specified as in the original run
pyprophet score --in in.oswpq --level=ms1ms2 --classifier=XGBoost --ss_main_score=var_dotprod_score --apply_weights=weights.bin

It would be nice if this was detected automatically somehow</issue_description>

Comments on the Issue (you are @copilot in this section)
@singjc I think keeping track of the feature names, something similar to the draft PR #117, would probably be able to make this automatic. Need to check if the issue is the same with the other models, and file types.

Fixes #151

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

Co-authored-by: singjc <32938975+singjc@users.noreply.github.com>

singjc · 2025-11-28T08:01:04Z

@copilot can you add a test for this using the https://github.com/PyProphet/pyprophet/tree/master/tests/data/test_data.osw test file.

You can test pyprophet score --level ms2 --classifier XGBoost --pi0_lambda="0 0 0" --in test_data.osw --out test_data_scored.osw

and then apply the weights with pyprophet score --level ms2 --classifier XGBoost --pi0_lambda="0 0 0" --in test_data.osw --apply_weights=test_data.bin --out test_data_scored_weight.osw

You can captuer the stdout of the importance of var XX. They should be the same

…and weight application

singjc · 2025-11-28T17:26:25Z

@jcharkow can you test if this works for you?

jcharkow · 2025-11-28T19:55:00Z

Not really, the only score that seems to work as intended is the var_xcorr_shape which I believe is the default score.

Command Run:
pyprophet score --in Rost_DIApy3_SP2um_90min_250ngK562_100nL_1_Slot1-5_1_1330_6-28-2021.oswpq --level=ms1ms2 --ss_main_score=var_bseries_score --classifier=XGBoost --ss_num_iter=1

Apply scores run (same file for testing):
pyprophet score --in Rost_DIApy3_SP2um_90min_250ngK562_100nL_1_Slot1-5_1_1330_6-28-2021.oswpq --level=ms1ms2 --classifier=XGBoost --apply_weights Rost_DIApy3_SP2um_90min_250ngK562_100nL_1_Slot1-5_1_1330_6-28-2021_ms1ms2_model.bin

Get a warning message about feature misalignment however and the features are still misaligned

Furthermore, if I exclude any of the other parameters I get errors.

pyprophet score --in Rost_DIApy3_SP2um_90min_250ngK562_100nL_1_Slot1-5_1_1330_6-28-2021.oswpq --level=ms1ms2 --apply_weights Rost_DIApy3_SP2um_90min_250ngK562_100nL_1_Slot1-5_1_1330_6-28-2021_ms1ms2_model.bin

Get error: ... File "pandas/_libs/parsers.pyx", line 574, in pandas._libs.parsers.TextReader.__cinit__ File "pandas/_libs/parsers.pyx", line 663, in pandas._libs.parsers.TextReader._get_header File "pandas/_libs/parsers.pyx", line 874, in pandas._libs.parsers.TextReader._tokenize_rows File "pandas/_libs/parsers.pyx", line 891, in pandas._libs.parsers.TextReader._check_tokenize_status File "pandas/_libs/parsers.pyx", line 2053, in pandas._libs.parsers.raise_parser_error File "<frozen codecs>", line 322, in decode UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 0: invalid start byte
- Would have expected a better error message or more ideally, just automatically switching the classifier to XGBoost passed on the weights file

pyprophet score --in Rost_DIApy3_SP2um_90min_250ngK562_100nL_1_Slot1-5_1_1330_6-28-2021.oswpq --apply_weights Rost_DIApy3_SP2um_90min_250ngK562_100nL_1_Slot1-5_1_1330_6-28-2021_ms1ms2_model.bin --classifier=XGBoost

Get error: Error: Weights file was trained for level 'ms1ms2' but you are applying to level 'ms2'.
- This is fine but would have rather it just corrected the level based on the apply weights file.

jcharkow · 2025-11-28T20:00:20Z

I am curious if possibly an easier solution is to create --apply_weights as its own command and then just not have any parameters like --ss_... or --classifier or --level and this is all just inferred from the --apply_weights file

Initial plan

b72b341

Copilot AI assigned Copilot and singjc Oct 21, 2025

Copilot started work on behalf of singjc October 21, 2025 01:37 View session

Copilot AI and others added 2 commits October 21, 2025 01:44

Store ss_main_score metadata with XGBoost/HistGradientBoosting models

d5e3637

Co-authored-by: singjc <32938975+singjc@users.noreply.github.com>

Update ss_use_dynamic_main_score flag when restoring ss_main_score

6268549

Co-authored-by: singjc <32938975+singjc@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Fix apply_weights to require ss_main_score input~~ Fix feature alignment when applying XGBoost/HistGradientBoosting weights without specifying ss_main_score Oct 21, 2025

Copilot AI requested a review from singjc October 21, 2025 01:55

Copilot finished work on behalf of singjc October 21, 2025 01:55

singjc added 2 commits November 28, 2025 12:19

Add test for XGBoost feature importances consistency during training …

158346f

…and weight application

Merge branch 'master' into copilot/fix-apply-weights-issue

15ebfb8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix feature alignment when applying XGBoost/HistGradientBoosting weights without specifying ss_main_score #167

Fix feature alignment when applying XGBoost/HistGradientBoosting weights without specifying ss_main_score #167

Copilot AI commented Oct 21, 2025 •

edited

Loading

Uh oh!

singjc commented Nov 28, 2025

Uh oh!

singjc commented Nov 28, 2025

Uh oh!

jcharkow commented Nov 28, 2025

Uh oh!

jcharkow commented Nov 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix feature alignment when applying XGBoost/HistGradientBoosting weights without specifying ss_main_score #167

Are you sure you want to change the base?

Fix feature alignment when applying XGBoost/HistGradientBoosting weights without specifying ss_main_score #167

Conversation

Copilot AI commented Oct 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Implementation

Usage

Benefits

Testing

Related

Comments on the Issue (you are @copilot in this section)

Uh oh!

singjc commented Nov 28, 2025

Uh oh!

singjc commented Nov 28, 2025

Uh oh!

jcharkow commented Nov 28, 2025

Uh oh!

jcharkow commented Nov 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Copilot AI commented Oct 21, 2025 •

edited

Loading