M5 Project: Scene Understanding for Autonomous Vehicles

The goal of this project is to learn the basic concepts and techniques to build deep neural networks to detect, segment and recognize specific objects, focusing on the self-driving car application. With the aim to solve the problem of automatic image understanding, the tasks performed include object recognition, detection and semantic segmentation in images recorded by an on-board vehicle camera.

Team members

Daniel Azemar (daniel.azemar@e-campus.uab.cat)
María Gil Aragones (maria.gilaragones@gmail.com)
Laura Mora Ballestar (lmoraballestar@gmail.com)
Richard Segovia (richard.segovia@e-campus.uab.cat)

Applications

This repository creates a PyTorch based framework to achieve three goals:

Get Started

Object Recognition and Semantic Segmentation

Installation

Environment Set Up:

Python 3.7
Pytorch -- cudatoolkit, torchvision

pip install -r requirements.txt

Run the code

# --exp_name: directory where results are stored
# --config_dile: file where the configuration for code is set up
python3 main.py --exp_name dir_name --exp_folder ./ --config_file config/configFile.yml

Object Detection

Installation

In order to execute the framework for object detection, different steps have to be followed. First, see source repository

1. Prerequisits

Python 3.6
Pytorch 1.0
Cuda 8 or hihger

2. Data preparation

The framework requires COCO and PASCAL to be installed in order to work properly

PASCAL_VOC 07+12: Please follow the instructions in py-faster-rcnn to prepare VOC datasets. After downloading the data, create softlinks in the folder object_detection/faster-rcnn.pytorch/data/.
COCO: Download from the respository COCOAPI and store in folder object_detection/faster-rcnn.pytorch/data/
UDACITY and other nonVoc Datasets
- First make a folder inside of the data folder with the name of the dataset.
- Create a folder called annotations_cache
- Create a folder called results
- Create a folder called nameOfDatasetYear
- Inside the nameOfDatasetYear folder, create the following structure:
```
/Annotations 
/ImageSets/Layout 
/ImageSets/Main 
/ImageSets/Segmentation 
/JPEGImages 
/test 
/train 
/valid 
```
- Copy the images and the txt files of the dataset to the test, train and valid folders.
- Copy all the images to the JPEGImages folder
- Copy the convert_to_voc.py file to the /nameofDataset/nameOfDatasetYear and execute it with python
- Clone /lib/datasets/pascal_voc.py and make the modifications to adapt it to your dataset
- Go to /lib/datasets/factory.py and add the cll to your clone of the /lib/datasets/pascal_voc.py
- Add the name of dataset to the options in the /test_net.py and /trainval_net.py

3. Pretrained Models

The framework uses VGG16 or Restnet101 as baseline architectures. The weights of the networks, trained with Caffe, must be stored in the folder object_detection/framework/pretrained_models/

Link to download the models from the source repository:

VGG16: Dropbox
ResNet101: Dropbox

4. Compilation

pip install -r requirements.txt

cd lib
python setup.py build develop

Run the code

Train

LEARNING_RATE=lr
BATCH_SIZE=batchSize
DECAY_STEP=decayStep
DATASET=udacity_voc #udacity_voc or pascal_voc
NETWORK=res101 #res101 or vgg16 
EPOCHS=numberEpochs

python3 trainval_net.py --dataset $DATASET --net $NETWORK \
                       --bs $BATCH_SIZE --nw 1 \
                       --lr $LEARNING_RATE --lr_decay_step $DECAY_STEP \
                       --cuda --mGPUs --epochs $EPOCHS

Test

python3 test_net.py --dataset  $DATASET --net $NETWORK \
                       --cuda --mGPUs --checksession $CHECK_SESSION --checkepoch $CHECK_EPOCH --checkpoint $CHECK_MODEL

Demo

Script which loads the trained model and saves the result image detection in the folder object_detection/framework/images/

python demo.py --net res101 \
               --checksession $SESSION --checkepoch $EPOCH --checkpoint $CHECKPOINT --cuda --load_dir models/

Report

Object Recognition	Semantic Segmentation	Object Detection
Presentation	Presentation	Presentation

Name		Name	Last commit message	Last commit date
Latest commit History 168 Commits
config		config
dataloader		dataloader
dataset_analysis		dataset_analysis
devkit_kitti_txt		devkit_kitti_txt
fonts		fonts
metrics		metrics
models		models
obj_detection/faster-rcnn.pytorch		obj_detection/faster-rcnn.pytorch
object_detection		object_detection
papers		papers
tasks		tasks
test		test
utils		utils
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

M5 Project: Scene Understanding for Autonomous Vehicles

Team members

Index

Applications

Get Started

Object Recognition and Semantic Segmentation

Installation

Run the code

Object Detection

Installation

Run the code

Report

Complete Report

State of the Art publications

Weights Folder

About

Releases

Packages

Languages

mgilar/MCV_CNN_framework

Folders and files

Latest commit

History

Repository files navigation

M5 Project: Scene Understanding for Autonomous Vehicles

Team members

Index

Applications

Get Started

Object Recognition and Semantic Segmentation

Installation

Run the code

Object Detection

Installation

Run the code

Report

Complete Report

State of the Art publications

Weights Folder

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages