Monocular Neural Image-based Rendering with Continuous View Control

This is the code base for our paper Monocular Neural Image-based Rendering with Continuous View Control. We propose an approach to generate novel views of objects from only one view, with fine-grained control over the virtual viewpoints. With our method, a single 2D Internet image can be viewed as if it were in 3D:

Car

Street View

The videos are generated by our method in real time (20 FPS), following the user's cursor. The only input is a single 2D image.

Prerequisites

Ubuntu 16.04
Python 3.6
NVIDIA GPU + CUDA 8.0
PyTorch 1.1.0
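
To compare a local setup against the list above, a small check like the following may help. It is only a sketch and not part of this repository: the function name check_environment is hypothetical, and the script only reports what it finds rather than enforcing the versions listed here.

```python
import importlib.util
import sys


def check_environment():
    """Report the Python version and, if PyTorch is installed, its version and CUDA status."""
    report = {
        "python": "{}.{}".format(*sys.version_info[:2]),
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "torch_version": None,
        "cuda_available": False,
    }
    if report["torch_installed"]:
        try:
            # Imported lazily so the check also runs before dependencies are installed.
            import torch
            report["torch_version"] = torch.__version__
            report["cuda_available"] = bool(torch.cuda.is_available())
        except ImportError:
            # A broken installation is reported as "not installed".
            report["torch_installed"] = False
    return report


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
```

The printed values can then be compared by hand with the versions listed above.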

Installation

Clone this repo:

git clone https://github.com/xuchen-ethz/continuous_view_synthesis.git
cd continuous_view_synthesis

Install dependencies:

pip install -r requirements.txt

Interactive Demonstration

Download a pre-trained model from our Google Drive;
Unzip the model under the ./checkpoints/ folder;
Run ./demo_car.sh or ./demo_kitti.sh to start the interactive demonstration.
In the car demo, you can drag the image to move the car as if it were in 3D.
In the KITTI demo, you can move through the scene by pressing w, a, s, d. After a single click on the image, the viewing angle follows the cursor.
Note that the interactive demonstration only works within a certain movement range because of the training data and disocclusions.
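
The unzip step can also be done from Python, for example on systems without an unzip tool. The following is a minimal sketch under stated assumptions: the helper unzip_checkpoint and the archive name in the usage comment are hypothetical, and the actual file name depends on what you download from Google Drive.

```python
import zipfile
from pathlib import Path


def unzip_checkpoint(archive_path, checkpoints_dir="./checkpoints"):
    """Extract a downloaded model archive into the checkpoints folder.

    Returns the sorted top-level entries of the archive so you can see
    which folder was created.
    """
    target = Path(checkpoints_dir)
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(target)
        names = archive.namelist()
    # Zip member names always use forward slashes, regardless of platform.
    return sorted({name.split("/")[0] for name in names if name})


# Example usage ("model.zip" is a placeholder, not the real archive name):
# print(unzip_checkpoint("model.zip"))
```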

Testing

Download and unzip the pre-trained weights in the same way as for the interactive demonstration.
Run ./test_car.sh, ./demo_chair.sh or ./demo_kitti.sh. The test results are saved as .gif files and an HTML file in ./results/car/latest_test/.
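
To confirm that a test run produced output, the results folder can be listed with a few lines of Python. This is a sketch only: list_test_outputs is a hypothetical helper, and the default path is simply the one mentioned above. Nothing is assumed about the names of the individual files.

```python
from pathlib import Path


def list_test_outputs(results_dir="./results/car/latest_test"):
    """Collect the .gif and .html files found under the results folder."""
    root = Path(results_dir)
    if not root.is_dir():
        # Nothing has been written yet (or the path is different on your machine).
        return {"gif": [], "html": []}
    return {
        "gif": sorted(str(p) for p in root.rglob("*.gif")),
        "html": sorted(str(p) for p in root.rglob("*.html")),
    }


if __name__ == "__main__":
    outputs = list_test_outputs()
    print(len(outputs["gif"]), "gif file(s),", len(outputs["html"]), "html file(s)")
```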

Training

Download a dataset from our Google Drive;
Unzip the dataset under the ./datasets/ folder;
Train a model by running ./train_car.sh, ./demo_chair.sh or ./train_kitti.sh.
To view training results and loss plots, run python -m visdom.server and open http://localhost:8097 in a browser. To see more intermediate results, check out ./checkpoints/$name/.
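
If the loss plots do not show up, it can help to check whether the visdom server is listening on the port mentioned above. The following sketch uses only the standard library. The helper name visdom_is_reachable is hypothetical, and the check only tests that something accepts TCP connections on that port, not that it is actually visdom.

```python
import socket


def visdom_is_reachable(host="localhost", port=8097, timeout=1.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    if visdom_is_reachable():
        print("A server is listening on http://localhost:8097")
    else:
        print("No server found; start one with: python -m visdom.server")
```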

Citation

If you find this repository useful for your research, please consider citing our paper.

@inproceedings{chen2019mono,
  title={Monocular Neural Image Based Rendering with Continuous View Control},
  author={Chen, Xu and Song, Jie and Hilliges, Otmar},
  booktitle={International Conference on Computer Vision (ICCV)},
  year={2019}
}

Acknowledgments

The code is based on pytorch-CycleGAN-and-pix2pix, written by Jun-Yan Zhu and Taesung Park, and SfmLearner-Pytorch, written by Clément Pinard.
