This repository contains code for detecting kidney diseases using CT scan images. The project explores how changes in input image resolution affect the algorithm's output. Various resolutions, ranging from 2x2 to 256x256 pixels, are considered.
The repository has the following structure:
dataset/
resized dataset/
.DS_Store/
128x128.ipynb
16x16.ipynb
256x256.ipynb
2x2.ipynb
32x32.ipynb
4x4.ipynb
64x64.ipynb
8x8.ipynb
LICENSE
README.md
requirements.txt
To use this code, follow these steps:
- Clone the repository to your local machine:
git clone https://github.com/Danish-Jamil-01/kidney-tumor-detection.git
- Setup:
- Ensure you have Python 3.x installed.
- Install the required libraries using 'pip install -r requirements.txt'.
- Data Preprocessing:
- The 'dataset' folder contains the original CT scan images.
- Run the notebook corresponding to the desired input resolution (e.g., '2x2.ipynb') to preprocess the images.
- The 'resized dataset' folder will contain the resized images.
- Model Training:
- Execute the notebook related to the desired input resolution (e.g., '2x2.ipynb') to train the deep learning model.
- Experiment with different resolutions to observe their impact on model performance.
- Evaluation:
- Run the notebook related to the desired input resolution (e.g., '2x2.ipynb') to evaluate the trained model using test data.
- Compare the evaluation metrics across different input resolutions.
- Sample Prediction:
- Use the trained model from the corresponding notebook to make predictions on new CT scan images of the same resolution.
The project demonstrates how changes in input image resolution affect the algorithm's output. By comparing the performance metrics (e.g., accuracy, F1 score) across different resolutions, insights can be gained into the optimal resolution for kidney disease detection.
The F1 Score across different input resolutions is as follows:
- 2x2 --> 0.445
- 4x4 --> 0.62
- 8x8 --> 0.68
- 16x16 --> 0.76
- 32x32 --> 0.865
- 64x64 --> 0.92
- 128x128 --> 0.93
- 256x256 --> 0.945
Potential improvements to enhance the project:
- Explore additional image preprocessing techniques.
- Incorporate advanced deep learning algorithms for improved performance.
- Conduct further analysis on the impact of input resolution on model interpretability.
Contributions to this project are welcome. Feel free to open an issue or submit a pull request.
This project is licensed under the MIT License.