Aerial-Object-Detection

This repository contains our research work on Aerial Object Detection.

Introduction

This work proposes a novel deep learning approach which optimizes the detection of objects in aerial scenes captured by UAVs. In our setup, the power-constrained drone is used only for data collection, while the computationally intensive tasks are offloaded to a GPU edge server. Our work first categorises the current methods for aerial object detection using deep learning techniques and discusses how the task is different from general object detection scenarios. We delineate the specific challenges involved and experimentally demonstrate the key design decisions which significantly affect the accuracy and robustness of model. We further propose an optimized architecture which utilizes these optimal design choices along with the recent ResNeSt backbone in order to achieve superior performance in aerial object detection. Finally, we reflect on what we have achieved and further propose several shining directions of future work to inspire further research and advancement in aerial object detection.

Instructions

To train RetinaNet with VGG16 or ResNet50 feature extractor :

python keras-retinanet/keras_retinanet/bin/train.py --gpu <gpu_id> --backbone <vgg16 | resnet50> --epochs <total_epochs> --tensorboard-dir <tensorboard_dir> --compute-val-loss --config <path_to_config> --snapshot-path <snapshot_save_dir> --random-transform --snapshot <resume_snapshot> csv <train_csv> <class_mapping_csv> --val-annotations <val_csv>

To train RetinaNet with ResNeSt50 feature extractor:

python detectron2-ResNeSt/tools/train_net.py --num-gpus <num_gpus> --config-file <path_to_config>

Citation

Bibtex to be uploaded soon!

Nice Readings

Object Tracking

https://arxiv.org/pdf/1707.00569.pdf

Retinanet

RRNet

Paper : http://openaccess.thecvf.com/content_ICCVW_2019/papers/VISDrone/Chen_RRNet_A_Hybrid_Detector_for_Object_Detection_in_Drone-Captured_Images_ICCVW_2019_paper.pdf

Feature Pyramid Network

Anchor Boxes

Receptive Fields

https://medium.com/mlreview/a-guide-to-receptive-field-arithmetic-for-convolutional-neural-networks-e0f514068807

RetinaMask

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
ResNeSt_VisDrone		ResNeSt_VisDrone
annotations		annotations
docs		docs
output_predictions		output_predictions
university-report		university-report
utils		utils
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Aerial-Object-Detection

Introduction

Instructions

Citation

Nice Readings

Object Tracking

Retinanet

RRNet

Feature Pyramid Network

Anchor Boxes

Receptive Fields

RetinaMask

Retinanet Exemplar Implementation

Aiskyeye Dataset

VisDrone2019 Paper

VisDrone2018 Paper

Loss Functions

Feature Visualization

About

Contributors 3

Languages

ayushjain1144/Aerial-Object-Detection

Folders and files

Latest commit

History

Repository files navigation

Aerial-Object-Detection

Introduction

Instructions

Citation

Nice Readings

Object Tracking

Retinanet

RRNet

Feature Pyramid Network

Anchor Boxes

Receptive Fields

RetinaMask

Retinanet Exemplar Implementation

Aiskyeye Dataset

VisDrone2019 Paper

VisDrone2018 Paper

Loss Functions

Feature Visualization

About

Resources

Stars

Watchers

Forks

Contributors 3

Languages