In recent years, most existing RGB-D SOD models have aggregated information from different modalities by direct summation or concatenation and decoded features from different layers to predict saliency maps. However, they ignore the complementary properties of RGB and depth images and the effective use of features within the same layer, resulting in degraded model performance. To address this issue, we propose an asymmetric deep interaction network (ADINet) with three indispensable components, focusing on information fusion and embedding. Specifically, we design a cross-modal fusion encoder that enhances the fusion and embedding of semantic signals and benefits from the mutual interaction of RGB and depth information. We then propose a global-and-local feature decoder that enriches global and local information to improve the recognition of salient objects. We conduct experiments on seven RGB-D benchmarks, and the results demonstrate that the proposed method is superior to or competitive with state-of-the-art works.
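The actual layer definitions of the cross-modal fusion encoder live in the repository code. Purely as a rough illustration of the idea of mutual RGB-depth interaction, the sketch below shows a generic fusion block in which each modality is re-weighted by channel attention computed from the other modality before the two are combined. The module name `CrossModalFusion`, the channel sizes, and the attention design are assumptions for illustration, not the ADINet implementation.

```python
# Illustrative sketch only: a generic cross-modal fusion block.
# CrossModalFusion and its design are assumed for illustration and are
# NOT the actual ADINet modules.
import torch
import torch.nn as nn

class CrossModalFusion(nn.Module):
    def __init__(self, channels):
        super().__init__()
        # Channel attention predicted from the opposite modality.
        self.rgb_gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Conv2d(channels, channels, 1), nn.Sigmoid())
        self.depth_gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Conv2d(channels, channels, 1), nn.Sigmoid())
        self.fuse = nn.Conv2d(channels * 2, channels, 3, padding=1)

    def forward(self, rgb_feat, depth_feat):
        # Each modality is re-weighted by attention computed from the other one.
        rgb_enhanced = rgb_feat * self.depth_gate(depth_feat)
        depth_enhanced = depth_feat * self.rgb_gate(rgb_feat)
        # Concatenate the mutually enhanced features and project back.
        return self.fuse(torch.cat([rgb_enhanced, depth_enhanced], dim=1))

if __name__ == "__main__":
    fusion = CrossModalFusion(channels=64)
    rgb = torch.randn(1, 64, 32, 32)
    depth = torch.randn(1, 64, 32, 32)
    print(fusion(rgb, depth).shape)  # torch.Size([1, 64, 32, 32])
```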
Python 3.9, PyTorch 1.11.0
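A quick way to confirm the environment matches the listed versions (the CUDA check reflects our assumption that training uses a GPU):

```python
# Environment sanity check (optional).
import torch

print(torch.__version__)          # expect 1.11.0
print(torch.cuda.is_available())  # assumption: training runs on a CUDA-capable GPU
```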
2.2. Download the training datasets from Baidu Drive (extraction code: o3o4).\
2.3. Download the testing datasets from Baidu Drive (extraction code: 211k).\
2.4. Download the Swin V2 weights (Swin V2 (extraction code: 6hyq)) and move them to ./pretrain/swinv2_base_patch4_window16_256.pth.
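Once the checkpoint is in place, a quick sanity check like the one below can confirm that the file loads from the expected path. The key layout printed depends on how the checkpoint was exported, so treat this as illustrative only.

```python
# Optional: verify that the pretrained Swin V2 checkpoint is readable.
import torch

ckpt_path = "./pretrain/swinv2_base_patch4_window16_256.pth"
state = torch.load(ckpt_path, map_location="cpu")
print(type(state))
print(list(state.keys())[:5])  # inspect the first few keys
```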
python train_ADINet.py
python test_ADINet.py
python run_ADINet.py
We provide the saliency maps of ADINet on seven benchmark datasets, including DUT-RGBD, NJU2K, NLPR, SIP, SSD, LFSD, and ReDWeb-S, from Baidu Drive (extraction code: ADIN).
When training is complete, the predictions for the test sets are saved in ./test_maps. We also provide a Python version of the evaluation code (extraction code: dr6d).
python main.py
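The evaluation code linked above computes the standard SOD metrics. As a minimal illustration of what such an evaluation does, the sketch below computes the mean absolute error (MAE) between predicted saliency maps and ground-truth masks; the directory names and file matching are assumptions for illustration, not the toolbox's actual interface.

```python
# Minimal MAE evaluation sketch (illustrative only; the provided evaluation
# code computes the full metric suite). Directory names are assumptions.
import os
import numpy as np
from PIL import Image

def mae(pred_dir, gt_dir):
    """Mean absolute error between saliency predictions and ground truth."""
    errors = []
    for name in sorted(os.listdir(gt_dir)):
        gt = np.array(Image.open(os.path.join(gt_dir, name)).convert("L"),
                      dtype=np.float64) / 255.0
        pred_img = Image.open(os.path.join(pred_dir, name)).convert("L")
        # Resize the prediction to the ground-truth resolution (PIL expects W x H).
        pred_img = pred_img.resize((gt.shape[1], gt.shape[0]))
        pred = np.array(pred_img, dtype=np.float64) / 255.0
        errors.append(np.abs(pred - gt).mean())
    return float(np.mean(errors))

if __name__ == "__main__":
    # Hypothetical paths; adjust to the actual prediction / ground-truth folders.
    print(mae("./test_maps/NLPR", "./datasets/test/NLPR/GT"))
```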