Skip to content

A reusability pipeline for the original study "Delineating the effective use of self-supervised learning in single-cell genomics"

License

Notifications You must be signed in to change notification settings

CSHCY/Reusability_SSL_in_SCG

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reusability Report: Evaluating the Transferability of Self-Supervised Learning Models from Single-Cell to Spatial Transcriptomics

This repository contains the code associated with our reusability study, entitled "Evaluating the Transferability of Self-Supervised Learning Models from Single-Cell to Spatial Transcriptomics".

Overview

This work builds upon the research presented in Richter et al. (2024), "Delineating the effective use of self-supervised learning in single-cell genomics", Nature Machine Intelligence. We evaluate the transferability of their pre-trained self-supervised learning (SSL) models (Random Mask, GP Mask, and Barlow Twins) to the domain of spatial transcriptomics. Our study investigates the performance of these models on cell type prediction and spatial clustering tasks, using various spatial transcriptomics datasets generated by different technologies (MERSCOPE, Xenium, and Slide-seqV2). Reusability Report Fig1 250125

Key Findings

  • Pre-trained SSL models show limited transferability to spatial transcriptomics data for cell type prediction, showcasing a significant domain gap between non-spatial single-cell transcriptomics and spatial transcriptomics.
  • Random Mask embeddings enhance the performance of spatial clustering methods (STAGATE-RM and GraphST-RM).
  • Gene imputation can negatively impact SSL model performance in spatial transcriptomics in our exprimental case.
  • Cross-species transfer presents significant challenges for SSL models.

Getting Started

Clone the repository:

  git clone https://github.com/CSHCY/Reusability_SSL_in_SCG

Create and activate the conda environment:

conda create -n ssl-transferability python==3.10
conda activate ssl-transferability

Install requirements:

pip install -r requirements-gpu.txt

Pretrained Models:

Download the pre-trained models from https://huggingface.co/TillR/sc_pretrained/tree/main/Pretrained%20Models.

Reproduce our experiments:

Jupyter notebooks can be used to reproduce the experiments and generate the figures in our paper. Before reproducing our expriments, please first git clone the code from the orginal study at https://github.com/theislab/ssl_in_scg.

Original Study

Richter T, Bahrami M, Xia Y, et al. Delineating the effective use of self-supervised learning in single-cell genomics[J]. Nature Machine Intelligence, 2024: 1-11.

Original Code

https://github.com/theislab/ssl_in_scg

Contact

For any questions or issues, please contact ChuangyiHan at hanchuangyi22@m.fudan.edu.cn.

About

A reusability pipeline for the original study "Delineating the effective use of self-supervised learning in single-cell genomics"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published