The project uses AI techniques to generate new frames in a video. The core of the system is a neural network built mostly from convolutional layers that predicts the missing frame.
The key motivation is to build a system that generates new frames more accurately than existing methods such as interpolation or optical flow. The idea is to create a neural network that properly moves the objects in the scene without leaving blank spots behind them.
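The sketch below only illustrates the general idea of a convolutional frame predictor: two neighbouring frames go in, the predicted in-between frame comes out. It is a toy Keras example; the layer counts, filter sizes, and input wiring are assumptions and do not reflect the actual FBNet architectures.

```python
# Toy illustration of a convolutional frame predictor - NOT the FBNet architecture.
# Two neighbouring frames are concatenated along the channel axis and a small
# convolutional stack predicts the frame that lies between them.
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

def build_toy_frame_predictor(height=144, width=256):
    frame_before = keras.Input(shape=(height, width, 3), name="frame_before")
    frame_after = keras.Input(shape=(height, width, 3), name="frame_after")
    x = layers.Concatenate(axis=-1)([frame_before, frame_after])  # (H, W, 6)
    for filters in (32, 64, 32):
        x = layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
    frame_middle = layers.Conv2D(3, 3, padding="same", activation="sigmoid",
                                 name="frame_middle")(x)
    return keras.Model(inputs=[frame_before, frame_after], outputs=frame_middle)

model = build_toy_frame_predictor()
model.compile(optimizer="adam", loss="mae")
model.summary()
```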
The FBNet models v1, v2, v3, and v4 are written in TensorFlow/Keras, while models v5, v6, and v7 are written in PyTorch. Each model is pre-trained and ready to use; the corresponding model files can be found in the 'models' directory. Every model was trained on the Vimeo90K triplet dataset. The graph below compares models v5-v7 with different modifications applied.
Currently, the best model is version 5. Its PSNR on the Vimeo90K test set is around 32.33 dB. The comparison to other SOTA models is shown below (the 'mix' column is simply a linear combination of the 2 input frames from the sequence).
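For reference, PSNR between a predicted frame and the ground truth can be computed as in the following generic NumPy sketch (this is not the project's own evaluation code):

```python
# Generic PSNR between two 8-bit frames; higher means more similar.
import numpy as np

def psnr(ground_truth: np.ndarray, prediction: np.ndarray, max_value: float = 255.0) -> float:
    mse = np.mean((ground_truth.astype(np.float64) - prediction.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")  # identical frames
    return 10.0 * np.log10((max_value ** 2) / mse)
```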
The learning progress of model v5.
The comparison of results produced by model v5 (upper right corner), model v6_7 (lower left corner), model v7_1 (lower right corner), and the ground truth image (upper left corner).
The video below shows the original video on the left and the result of boosting the frame rate 4x with model v1 on the right.
The project uses the following technologies/frameworks:
- Python 3.10
- Keras/TensorFlow (models v1-v4)
- PyTorch (models v5-v7)
- OpenCV
- NumPy
The net can process any video with no restriction on its length. The only restriction is that the generated video always has a size of 144x256 px. The net can also be used to predict a single in-between image from 2 similar images rather than boosting a whole sequence of frames, as in the sketch below.
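A minimal sketch of that single-image use case, assuming a Keras .h5 model that takes the two outer frames as separate inputs scaled to [0, 1]; the model path, preprocessing, and input layout expected by the shipped models may differ:

```python
# Sketch only: the model path, preprocessing, and input layout are assumptions;
# the shipped models may expect a different format.
import cv2
import numpy as np
from tensorflow import keras

model = keras.models.load_model("models/FBNet.h5", compile=False)  # example path

def load_frame(path, width=256, height=144):
    frame = cv2.imread(path)                                        # BGR, uint8
    frame = cv2.resize(frame, (width, height), interpolation=cv2.INTER_CUBIC)
    return frame.astype(np.float32) / 255.0                         # scale to [0, 1]

before = load_frame("frame_0.png")
after = load_frame("frame_2.png")

# Assumed input layout: two separate batched inputs, one per frame.
middle = model.predict([before[None, ...], after[None, ...]])[0]
cv2.imwrite("frame_1_predicted.png", (middle * 255).astype(np.uint8))
```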
- Clone the repository
mkdir frame_booster
cd frame_booster
git clone https://github.com/kbarszczak/Frame_booster .
- Set up the Python environment
- Either install the requirements
pip install -r requirements.txt
- or create a conda environment and install the requirements inside it
conda create --name frame_booster
conda activate frame_booster
pip install -r requirements.txt
After these steps, everything is set up and ready to use.
- Frame boosting may be performed by launching the 'src/frame_generator.py' script with the following switches (supports models in versions 1-3):
- -s the filename of the source video (absolute or relative)
- -m the path to the trained model saved in a .h5 format
- -t the path to a directory where the result files will be created
- -vn the name of the created video
- -c the boosting rate (2, 4, 8, 16, 32). Example: rate 4 will add 3 new frames between each pair of consecutive frames in the original video (see the sketch after the example command below)
- -e the extension of the result file (mp4, avi)
- -md the mode of the generator (fast, low_mem)
- -iw the width of the net input
- -ih the height of the net input
Example:
python src/frame_generator.py -s 'test.mp4' -m 'FBNet.h5' -t 'C:/Users/kamil/test' -c 2 -vn 'test_result_2x' -md fast -e avi
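A small worked example of how the boosting rate relates to the number of frames in the result, assuming each supported rate (2, 4, 8, 16, 32) fills every gap between consecutive frames with rate - 1 new frames (the script's internal strategy may differ):

```python
# Assumption: a boost rate of r inserts r - 1 new frames between every pair of
# consecutive frames, so rate 4 adds 3 new frames per gap.
def boosted_frame_count(original_frames: int, rate: int) -> int:
    assert rate in (2, 4, 8, 16, 32)
    return (original_frames - 1) * rate + 1

print(boosted_frame_count(100, 2))  # 199
print(boosted_frame_count(100, 4))  # 397
```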
- To train your own net, one may use the 'train.py' script from either the src/tensorflow or src/pytorch directory with the following switches (see the data-reading sketch after the example command below):
- -tr the path to the training .tfrecords file
- -trc the number of training samples
- -ts the path to the testing .tfrecords file
- -tsc the number of testing samples
- -v the path to the validation .tfrecords file
- -vc the number of validation samples
- -t the path where trained models will be saved
- -n the name of the saved model
- -b the batch size
- -e the epochs
- -iw the input images width
- -ih the input images height
Example:
python src/tensorflow/train.py -tr train_144x256_19000.tfrecords -trc 19000 -ts test_144x256_1000.tfrecords -tsc 1000 -v valid_144x256_500.tfrecords -vc 500 -t C:/Users/kamil/test -n model -b 5 -e 10
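A rough sketch of how the triplet .tfrecords referenced above could be read back with tf.data; the feature keys and PNG encoding are assumptions, so check the data_generator and train scripts for the actual schema:

```python
# Sketch only: the feature names ('frame_before', 'frame_middle', 'frame_after')
# and PNG encoding are assumptions, not the project's actual schema.
import tensorflow as tf

FEATURES = {
    "frame_before": tf.io.FixedLenFeature([], tf.string),
    "frame_middle": tf.io.FixedLenFeature([], tf.string),
    "frame_after": tf.io.FixedLenFeature([], tf.string),
}

def parse_triplet(record):
    example = tf.io.parse_single_example(record, FEATURES)
    def decode(key):
        image = tf.io.decode_png(example[key], channels=3)
        return tf.image.convert_image_dtype(image, tf.float32)
    # Inputs: the two outer frames; target: the middle frame.
    return (decode("frame_before"), decode("frame_after")), decode("frame_middle")

dataset = (tf.data.TFRecordDataset("train_144x256_19000.tfrecords")
           .map(parse_triplet, num_parallel_calls=tf.data.AUTOTUNE)
           .batch(5)
           .prefetch(tf.data.AUTOTUNE))
```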
- Create your dataset for TensorFlow by running 'src/tensorflow/data_generator.py' with the following switches (see the serialization sketch after the example command below):
- -s the source path of the raw dataset to process
- -t the target path where the result files will be created
- -l the loader to use (vimeo90k, custom). For vimeo90k, the -d parameter is ignored and the source path has to point to the Vimeo90K triplet dataset containing the following files/dirs: sequences, tri_trainlist.txt, tri_testlist.txt. For custom, the source path has to point to a directory containing only video files; each file will be loaded and processed
- -tr the limit of the train data
- -ts the limit of the test data
- -tv the split ratio between the training and validation datasets
- -i the interpolation method (bilinear, bicubic)
- -iw the width of the target images
- -ih the height of the target images
- -d the delay parameter; d - 1 frames will be skipped between generated samples (applies only when -l is set to custom)
Example:
python src/tensorflow/data_generator.py -s C:/Users/kamil/raw/240fps_horizontal -t data -l custom -tr 1000 -ts 500 -tv 0.9 -i bicubic -iw 256 -ih 144 -d 5
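As a rough sketch of how a single triplet could be serialized into a .tfrecords file, using the same assumed feature names and PNG encoding as the reading sketch above (the actual output format of data_generator.py may differ):

```python
# Sketch only: mirrors the assumed schema from the reading sketch above;
# the actual data_generator.py output format may differ.
import cv2
import numpy as np
import tensorflow as tf

def _bytes_feature(value: bytes) -> tf.train.Feature:
    return tf.train.Feature(bytes_list=tf.train.BytesList(value=[value]))

def encode_png(frame: np.ndarray) -> bytes:
    ok, buffer = cv2.imencode(".png", frame)
    assert ok
    return buffer.tobytes()

def triplet_example(before, middle, after) -> tf.train.Example:
    return tf.train.Example(features=tf.train.Features(feature={
        "frame_before": _bytes_feature(encode_png(before)),
        "frame_middle": _bytes_feature(encode_png(middle)),
        "frame_after": _bytes_feature(encode_png(after)),
    }))

# Example with dummy 144x256 frames (replace with frames extracted from a video):
before = middle = after = np.zeros((144, 256, 3), dtype=np.uint8)
with tf.io.TFRecordWriter("triplets.tfrecords") as writer:
    writer.write(triplet_example(before, middle, after).SerializeToString())
```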
- Clone the repository
- Make your changes
- Create a pull request with a detailed description of your changes
@article{xue2019video,
title={Video Enhancement with Task-Oriented Flow},
author={Xue, Tianfan and Chen, Baian and Wu, Jiajun and Wei, Donglai and Freeman, William T},
journal={International Journal of Computer Vision (IJCV)},
volume={127},
number={8},
pages={1106--1125},
year={2019},
publisher={Springer}
}