Skip to content

Latest commit

 

History

History
25 lines (13 loc) · 1.52 KB

spark-sql-catalyst.adoc

File metadata and controls

25 lines (13 loc) · 1.52 KB

Catalyst Query Optimizer

Catalyst is a framework for manipulating trees of relational operators and expressions. It contains logical query plans that it can translate to executable code (as Spark RDDs).

Catalyst is an extensible query plan optimizer with constant folding, predicate pushdown, nullability (NULL value) propagation, vectorized Parquet decoder and whole-stage code generation optimization techniques.

Catalyst supports both rule-based and cost-based optimization.

Among the rule-based optimizations is nullability (NULL value) propagation.

SparkOptimizer

Caution
FIXME

RowEncoder

RowEncoder is a factory object that maps StructType to ExpressionEncoder[Row]

It belongs to org.apache.spark.sql.catalyst.encoders package.