Skip to content

yuxl2023/ADINet

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

81 Commits
 
 
 
 

Repository files navigation

Asymmetric Deep Interaction Network for RGB-D Salient Object Detection

1. Overview

1.1. Introduction

In recent years, most of the existing RGB-D SOD models use summation or splicing strategies to directly aggregate information from different modalities and decode features from different layers to predict saliency maps. However, they ignore the complementary properties of depth images and RGB images and the effective use of features between the same layers, resulting in a degraded model performance. To address this issue, we propose an asymmetric deep interaction network (ADINet) with three indispensable components with a focus on information fusion & embedding. Specifically, we design a cross-modal fusion encoder for enhancing the information fusion & embedding on semantic signals that is employed to benefit from the mutual interaction of RGB and depth information. Then, we propose a global-and-local feature decoder to enrich the global & local information for improving the recognition of salient objects. We have conducted the experiments on seven RGB-D benchmarks, and the results demonstrate that the proposed method is superior to or competitive with the state-of-the-art works.

1.2. Framework Overview

image

Figure 1: Overall architecture of the proposed ADINet

1.3. Quantitative Results

alt

Figure 2: Quantitative Results

1.4. PR Curves

alt

Figure 3: PR Curves

1.5. Qualitative Results

alt

Figure 4: Qualitative Results

2. Preparing the necessary data

The training and testing experiments were conducted using PyTorch with a single Tesla T4 GPU 16GB.\

2.1. Requirements\

python 3.9 pytorch 1.11.0

2.2. downloading training datasets from Baidu Drive(extraction code: o3o4).\

2.3. downloading testing datasets from Baidu Drive(extraction code: 211k).\

2.4. downloading Swin V2 weights (Swin V2(extraction code: 6hyq)) and move it into [./pretrain/swinv2_base_patch4_window16_256.pth].

Training

python train_ADINet.py

Testing

python test_ADINet.py

Training and Testing

python run_ADINet.py

Results

We provide saliency maps of ADINet on seven benchmark datasets, including: DUT-RGBD, NJU2K, NLPR, SIP, SSD, LFSD and RedWeb-S from Baidu Drive(extraction code: ADIN).

Evaluating Results

When training is complete, the predictions for the test set are saved in . /test_maps. We provided python versions(extraction code: dr6d) for evaluation.
python main.py

Note: Our core code is being organized and will be uploaded later!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%