Version 0.6.0
Fixes:
- Fixed an error that could cause records to be dropped when uncompressed files are read and XML tags happen to span an input split boundary, but fit within the stream read buffer (#400)
- Fixed issue with nested tags names in attributes (#374)
Improvements:
inferSchema
can now be set tofalse
during parsing to leave all values as string type (#393)- Also treat empty values as null if the nullValue is "" (#381)
- Log malformed records for debugging (#372)
Changes: https://github.com/databricks/spark-xml/issues?utf8=%E2%9C%93&q=milestone%3A0.6.0+is%3Aclosed+