Right now we use RDD, but need to research if that would possible to improve performance using DataFrames It will depend on un-publish artifact ``` // for DataFrames //compile "tapanalyticstoolkit:spark-tensorflow-connector:1.0.0-s_$scalaVersion" ```