Swin Transformer with Inception-ResNet Patch Merging

This project investigates the performance gains achievable by integrating an Inception-ResNet-based feature extraction module into the patch merging stage of the Swin Transformer architecture.

Motivation

  • Swin Transformers use a simple linear projection to reduce dimensionality during patch merging. Could a more sophisticated feature extraction step improve performance?
  • Inception-ResNet modules excel at capturing features at multiple scales. This might enrich the representation learned during patch merging.

Modifications

  • The original linear embedding layer in the Swin Transformer patch-merging stage is replaced with an Inception-ResNet module, as sketched below.
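The sketch below illustrates one way such a replacement could look in PyTorch. It is a minimal, hypothetical illustration rather than the repository's actual code: the module name InceptionResNetMerging, the branch widths, and the choice of GELU activation are assumptions made for the example.

```python
# Hypothetical sketch (not the repository's exact code): Swin patch merging
# whose Linear(4C -> 2C) reduction is replaced by an Inception-ResNet-style block.
import torch
import torch.nn as nn


class InceptionResNetMerging(nn.Module):
    """Patch merging: 2x2 neighborhood concat -> Inception-ResNet block -> 2*dim."""

    def __init__(self, input_resolution, dim):
        super().__init__()
        self.input_resolution = input_resolution  # (H, W) of the incoming token grid
        self.dim = dim
        in_ch, out_ch = 4 * dim, 2 * dim
        self.norm = nn.LayerNorm(in_ch)

        # Parallel Inception-style branches over the merged 2x2 patches
        # (branch widths are illustrative and sum to out_ch).
        c = out_ch // 4
        self.branch1 = nn.Conv2d(in_ch, out_ch - 2 * c, kernel_size=1)
        self.branch3 = nn.Sequential(
            nn.Conv2d(in_ch, c, kernel_size=1),
            nn.Conv2d(c, c, kernel_size=3, padding=1),
        )
        self.branch5 = nn.Sequential(
            nn.Conv2d(in_ch, c, kernel_size=1),
            nn.Conv2d(c, c, kernel_size=3, padding=1),
            nn.Conv2d(c, c, kernel_size=3, padding=1),
        )
        # 1x1 fusion plus a projected residual connection, as in Inception-ResNet.
        self.fuse = nn.Conv2d(out_ch, out_ch, kernel_size=1)
        self.shortcut = nn.Conv2d(in_ch, out_ch, kernel_size=1)
        self.act = nn.GELU()

    def forward(self, x):
        H, W = self.input_resolution
        B, L, C = x.shape
        assert L == H * W and C == self.dim

        # Gather each 2x2 neighborhood into the channel dimension: C -> 4C.
        x = x.view(B, H, W, C)
        x = torch.cat(
            [x[:, 0::2, 0::2], x[:, 1::2, 0::2], x[:, 0::2, 1::2], x[:, 1::2, 1::2]],
            dim=-1,
        )                                   # (B, H/2, W/2, 4C)
        x = self.norm(x)
        x = x.permute(0, 3, 1, 2)           # (B, 4C, H/2, W/2)

        # Inception-ResNet block in place of the original Linear(4C -> 2C).
        branches = torch.cat([self.branch1(x), self.branch3(x), self.branch5(x)], dim=1)
        x = self.act(self.fuse(branches) + self.shortcut(x))

        return x.flatten(2).transpose(1, 2)  # (B, H/2 * W/2, 2C)
```

Under these assumptions, the module keeps the same input and output shapes as the standard Swin patch-merging layer, so it could be swapped into a Swin implementation stage by stage while the attention blocks remain unchanged.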

Performance on Retinal OCT Dataset

Epoch  Train Loss  Valid Loss  Accuracy  F1-Score  Precision  Recall  Time
0      0.0783      0.0175      0.9959    0.9959    0.9959     0.9959  09:04
1      0.0789      0.0619      0.9783    0.9784    0.9798     0.9783  09:07
2      0.0600      0.0262      0.9948    0.9948    0.9949     0.9948  09:05
  • These results show that the modification improves medical image classification on this dataset and suggest directions for further architectural refinements.
  • On this Retinal OCT dataset, the modified Swin model outperforms well-known baselines such as Efficient-Swin, VGG16, ResNet18, and AlexNet.
