Clarification on input normalization parameters when reloading trained scPred model

Hi, thank you for the great scPred model implementation!

I have a question regarding the input normalization step during model training and prediction.
In the training script, standardization is done manually by computing the mean and standard deviation of the training data as follows:
train_mean = train_epi.mean(dim=0)
train_std = train_epi.std(dim=0)
train_epi = (train_epi - train_mean) / train_std
However, in Section 5* of the tutorial (“Re-load the trained model”), it is mentioned:

“When use the model to do new predictions, if you run the input normalization during the model training, use the same scaler of training data to normalize your new input data.”

Could you please clarify:

Is it necessary to manually save train_mean and train_std during training to apply the same normalization when making new predictions?

If so, do you have any recommended way to store and reuse those parameters (e.g., torch.save() or pickle)?

Would you consider adding this saving/loading process into the codebase or tutorial for reproducibility?

Thanks again for your help and for providing this model to the community!

Best regards,
Bunny

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarification on input normalization parameters when reloading trained scPred model #5

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Clarification on input normalization parameters when reloading trained scPred model #5

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions