Research possible DataFrame implementation

Right now we use RDD, but need to research if that would possible to improve performance using DataFrames 

It will depend on un-publish artifact 
```
  // for DataFrames
  //compile "tapanalyticstoolkit:spark-tensorflow-connector:1.0.0-s_$scalaVersion"
```