-
Notifications
You must be signed in to change notification settings - Fork 284
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
62cfd81
commit 82e50bd
Showing
1 changed file
with
56 additions
and
0 deletions.
There are no files selected for viewing
56 changes: 56 additions & 0 deletions
56
PW41_2024_MIT/Projects/SelfSupervisedDepthEstimationSurgery/README.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,56 @@ | ||
--- | ||
layout: pw41-project | ||
|
||
permalink: /:path/ | ||
|
||
project_title: Methods for self-supervised depth estimation and motion estimation in coloscopy under deformation | ||
category: Uncategorized | ||
presenter_location: In-person | ||
|
||
key_investigators: | ||
- name: Megha Kalia | ||
affiliation: Brigham and Women’s Hospital, Harvard Medical School, Boston, MA | ||
|
||
--- | ||
|
||
# Project Description | ||
|
||
Estimating Depth and localizing endoscope in surgical environment is critical in many tasks such as, intra-operative registration, augmented reality, surgical automation, among many others. Monocular self-supervised depth and pose estimation methods can estimate depth and camera pose without requiring labels. However, how do these methods perform in presence of deformation while endoscope moves through the lumen is not known. Therefore, through this project we want to evaluate the effect of addition of primarily two modules on depth and pose estimation accuracy. These modules are TransUnet and Optical Flow Module. Optical Flow can capture the image intensity changes in the scene because of deformation. And TransUnet can potentially capture the temporal correlations between the image frames to give better pose and depth predictions. For the project open sources dataset and github codes will be utilized. | ||
|
||
|
||
## Objective | ||
|
||
|
||
1. Objective A. To build and run and train the FlowNet on colonoscopy dataset | ||
1. Objective B. To integrate the flowNet module in the Monodepth2 framework | ||
1. Objective C. To integrate and evaluate TransUnet blocks in Monodepth2 framework. | ||
|
||
## Approach and Plan | ||
|
||
|
||
1. Run the Monodepth2 on the colonoscopy dataset. | ||
1. Train the optical flow network on the coloscopic dataset | ||
1. | ||
|
||
## Progress and Next Steps | ||
|
||
1. Run the model on the coloscopic dataset. | ||
2. Run TransUnet | ||
3. Replace the TransUnet in Monodepth2 and run on the dataset | ||
|
||
1. Describe specific steps you **have actually done**. | ||
1. Initial implementation of optical flow network | ||
1. Replace the Unet with TransUnet and see its effect of depth and pose prediction | ||
|
||
# Illustrations | ||
|
||
Coming Soon! | ||
|
||
# Background and References | ||
|
||
Dataset: http://cmic.cs.ucl.ac.uk/ColonoscopyDepth/Data/ | ||
https://data.mendeley.com/datasets/cd2rtzm23r/1 | ||
Monodepth2: https://github.com/nianticlabs/monodepth2 | ||
TansUnet: https://github.com/Beckschen/TransUNet/tree/main/networks | ||
|
||
|