S³F-Net: A Multi-Modal Approach to Medical Image Classification via Spatial-Spectral Summarizer Fusion Network
Official TensorFlow/Keras implementation for the paper: "S³F-Net: A Multi-Modal Approach to Medical Image Classification via Spatial-Spectral Summarizer Fusion Network"
Summary: Standard Convolutional Neural Networks (CNNs) have become a cornerstone of medical image analysis due to their proficiency in learning hierarchical spatial features. However, this focus on a single domain is inefficient at capturing global, holistic patterns and fails to explicitly model an image's frequency-domain characteristics. To address these challenges, we propose the Spatial-Spectral Summarizer Fusion Network (S³F-Net), a dual-branch framework that learns from both spatial and spectral representations simultaneously. The S³F-Net combines a deep spatial CNN for local morphology with our proposed SpectraNet branch. A key component within SpectraNet is the SpectralFilter Layer, which leverages the Convolution Theorem by applying a bank of learnable filters directly to an image's Fourier spectrum via a parameter-efficient element-wise multiplication. This allows the SpectraNet to attain a global receptive field from its first layer, with its output being distilled by a lightweight "funnel" head. We conduct a comprehensive investigation across four diverse medical imaging datasets to demonstrate S³F-Net's efficacy and show that the optimal fusion strategy is task-dependent. With a powerful Bilinear Fusion, S³F-Net achieves a state-of-the-art-competitive accuracy of 98.76% on the BRISC2025 brain tumor dataset. In contrast, a simpler Concatenation Fusion proves superior on the texture-dominant Chest X-Ray Pneumonia dataset, achieving up to 93.91% accuracy, a result that also surpasses many top-performing, much deeper models. In all evaluated cases, the S³F-Net framework significantly outperforms its strong spatial-only baseline, demonstrating that our dual-domain approach is a powerful and generalizable paradigm for medical image analysis.
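To make the SpectralFilter idea concrete, here is a minimal sketch of such a layer (illustrative only, not the paper's exact implementation; the filter count, initialization, and channel-merging step are assumptions). It shows the pattern the summary describes: FFT of the input, learnable element-wise filtering of the spectrum, inverse FFT back to the spatial domain.

```python
import tensorflow as tf

class SpectralFilterLayer(tf.keras.layers.Layer):
    """Sketch of a learnable spectral filter bank (illustrative).

    Per the Convolution Theorem, element-wise multiplication in the
    Fourier domain is equivalent to convolution with a kernel as large
    as the image, so the layer has a global receptive field from the
    very first layer. Assumes a fixed input size (H and W known).
    """

    def __init__(self, num_filters=8, **kwargs):
        super().__init__(**kwargs)
        self.num_filters = num_filters

    def build(self, input_shape):
        _, h, w, c = input_shape
        # One complex filter per (output filter, input channel), stored as
        # separate real/imaginary weights; initialized to the identity filter.
        self.w_real = self.add_weight(
            name="w_real", shape=(self.num_filters, c, h, w), initializer="ones")
        self.w_imag = self.add_weight(
            name="w_imag", shape=(self.num_filters, c, h, w), initializer="zeros")

    def call(self, x):
        x = tf.transpose(x, [0, 3, 1, 2])                 # (B, C, H, W)
        spec = tf.signal.fft2d(tf.cast(x, tf.complex64))  # 2-D spectrum per channel
        filt = tf.complex(self.w_real, self.w_imag)       # (F, C, H, W)
        prod = spec[:, tf.newaxis] * filt[tf.newaxis]     # element-wise, (B, F, C, H, W)
        feat = tf.math.abs(tf.signal.ifft2d(prod))        # back to the spatial domain
        feat = tf.reduce_sum(feat, axis=2)                # merge input channels
        return tf.transpose(feat, [0, 2, 3, 1])           # (B, H, W, F)
```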
The S³F-Net is an asymmetric two-tower architecture. It consists of:
- A deep, conventional Spatial Branch (CNN) that acts as a powerful expert on local, morphological features.
- Our novel, shallow, and parameter-efficient SpectraNet Branch that acts as an expert on global, textural, and frequency-based features.
The final, refined feature vectors from these two independent experts are then combined using a task-dependent fusion strategy to produce the final classification.
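The following functional-API sketch shows how the two towers can be wired together (branch depths, widths, and the `build_s3fnet_sketch` name are placeholders; the actual architectures are defined in the notebooks). It reuses the `SpectralFilterLayer` sketch above and shows concatenation as the fusion step:

```python
from tensorflow.keras import layers, Model

def build_s3fnet_sketch(input_shape=(224, 224, 1), num_classes=3):
    inputs = layers.Input(shape=input_shape)

    # Spatial branch: a deep CNN expert on local morphology.
    x = inputs
    for filters in (32, 64, 128):
        x = layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
        x = layers.MaxPooling2D()(x)
    spatial_vec = layers.GlobalAveragePooling2D()(x)

    # SpectraNet branch: shallow, global receptive field from layer one,
    # distilled into a compact vector by a lightweight "funnel" head.
    s = SpectralFilterLayer(num_filters=8)(inputs)
    s = layers.GlobalAveragePooling2D()(s)
    spectral_vec = layers.Dense(64, activation="relu")(s)

    # Task-dependent fusion (concatenation shown here).
    fused = layers.Concatenate()([spatial_vec, spectral_vec])
    outputs = layers.Dense(num_classes, activation="softmax")(fused)
    return Model(inputs, outputs)
```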
A central finding of this research is that the optimal fusion strategy is task-dependent. Our S³F-Net framework consistently and significantly outperforms its strong spatial-only baseline across all four datasets.
| Dataset | Baseline (Spatial-Only) | S³F-Net (Best Fusion) | Performance Gain | Best Fusion Method |
|---|---|---|---|---|
| BRISC2025 (MRI) | 98.27% Acc. | 98.76% Acc. | +0.49% | Bilinear Fusion |
| Chest X-Ray | 87.98% Acc. | 93.11% Acc. | +5.13% | Concatenation |
| BUSI (Ultrasound) | 84.30% Acc. | 87.82% Acc. | +3.52% | Concatenation |
| HAM10000 | 0.689 W. F1 | 0.704 W. F1 | +0.015 W. F1 | Bilinear Fusion |
The Bilinear Fusion excels on feature-complex tasks, while the simpler Concatenation Fusion is superior for texture-dominant modalities.
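For reference, bilinear fusion can be realized as the flattened outer product of the two branch vectors, capturing all pairwise interactions between spatial and spectral features. A minimal sketch (an assumption about the head's form, not the paper's exact implementation):

```python
import tensorflow as tf
from tensorflow.keras import layers

class BilinearFusion(layers.Layer):
    """Sketch of bilinear fusion: flattened outer product of two vectors."""
    def call(self, inputs):
        a, b = inputs                              # (B, Da) and (B, Db)
        outer = tf.einsum("bi,bj->bij", a, b)      # all pairwise products
        return tf.reshape(outer, (-1, a.shape[-1] * b.shape[-1]))
```

Swapping `layers.Concatenate()` for `BilinearFusion()` in the sketch above switches strategies; note that the fused dimension grows from Da + Db to Da × Db, which is why a compact "funnel" output keeps the bilinear head affordable.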
This project is built using TensorFlow 2.18.
1. Clone the repository:
   ```bash
   git clone https://github.com/Saiful185/S3F-Net.git
   cd S3F-Net
   ```
2. Install dependencies (a virtual environment is recommended):
   ```bash
   pip install -r requirements.txt
   ```
Key dependencies include: `tensorflow`, `pandas`, `opencv-python`, `scikit-learn`, and `seaborn`.
The experiments are run on four publicly available datasets. For fast I/O, it is highly recommended to download the datasets, zip them, upload the zip file to your Google Drive, and then use the unzipping cells in the Colab notebooks.
- HAM10000 (skin lesions): download from the Kaggle dataset page or the ISIC archive.
- BRISC2025 (brain MRI): download from the Kaggle dataset page.
- Chest X-Ray (Pneumonia): download from the Kaggle dataset page.
- BUSI (breast ultrasound): download from the Kaggle dataset page.
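A typical Colab unzipping cell looks like the following (the Drive paths and archive name are placeholders; adjust them to wherever you uploaded the zip):

```python
# Colab cell: mount Google Drive, then extract the dataset to fast local disk.
from google.colab import drive
drive.mount("/content/drive")

!unzip -q "/content/drive/MyDrive/datasets/BRISC2025.zip" -d "/content/data"
```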
The code is organized into Jupyter/Colab notebooks (`.ipynb`) for each key experiment.
- Open a notebook (e.g., `S3F-Net_Bilinear_SN1_BRISC2025.ipynb`).
- Update the paths in the first few cells to point to your dataset's location (either on Google Drive for unzipping or a local path).
- Run the cells sequentially to perform data setup, model training, and final evaluation.
The pre-trained weights for our state-of-the-art-competitive models are available for download from the v1.0.0-sn1 release of this repository.
| Model | Trained On | Description | Download Link |
|---|---|---|---|
| S³F-Net (Bilinear, SN1 variant) | BRISC2025 | The best-performing model on the BRISC2025 dataset. | Link |
| S³F-Net (Concatenation, SN1 variant) | Chest X-Ray (Pneumonia) | The best-performing model on the Chest X-Ray dataset. | Link |
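After downloading, the weights can be restored into a matching architecture with Keras's standard `load_weights` API. A short sketch (the builder is the hypothetical one from above, and the file name is a placeholder for the actual release asset; the class count must match the dataset the weights were trained on):

```python
# Rebuild a matching architecture, then load the downloaded weights.
model = build_s3fnet_sketch(num_classes=4)  # hypothetical builder from the sketch above
model.load_weights("S3F-Net_Bilinear_SN1_BRISC2025.weights.h5")  # placeholder file name
model.summary()
```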
If you find this work useful in your research, please consider citing our paper:
```bibtex
@article{siddiqui2025s3fnet,
  title={S³F-Net: A Multi-Modal Approach to Medical Image Classification via Spatial-Spectral Summarizer Fusion Network},
  author={Md. Saiful Bari Siddiqui and Mohammed Imamul Hassan Bhuiyan},
  year={2025},
  eprint={2509.23442},
  archivePrefix={arXiv},
  primaryClass={eess.IV},
  url={https://arxiv.org/abs/2509.23442},
}
```
This project is licensed under the MIT License. See the `LICENSE` file for details.