python-v0.15.2: predicate overwrite, improved table state replay
ion-elgreco released this on 05 Feb 12:41
New features
- feat: allow merge_execute to release the GIL by @emcake in #2091
- feat: arrow backed log replay and table state by @roeap in #2037
- feat: update table config to contain new config keys by @roeap in #2127
- feat: expose stats schema on Snapshot by @roeap in #2128
- feat: implementation for replaceWhere by @r3stl355 in #1996
- feat: implement clone for DeltaTable struct by @mightyshazam in #2160
- feat: introduce schema evolution on RecordBatchWriter by @rtyler in #2024
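The replaceWhere entry above (#1996) is the "predicate overwrite" named in the release title. As a rough illustration of the semantics only, and not the delta-rs implementation, the sketch below models a table as a list of dicts. The `replace_where` helper and the sample rows are hypothetical, and the check that incoming rows match the predicate is an assumption about the intended behaviour rather than something stated in these notes.

```python
from typing import Callable, Dict, List

Row = Dict[str, object]


def replace_where(
    existing: List[Row],
    new_rows: List[Row],
    predicate: Callable[[Row], bool],
) -> List[Row]:
    """Return the table contents after a predicate overwrite.

    Hypothetical helper for illustration only; not part of deltalake.
    """
    # Assumption: incoming rows must satisfy the predicate, so the write
    # cannot touch data outside the targeted slice.
    offending = [row for row in new_rows if not predicate(row)]
    if offending:
        raise ValueError(
            f"{len(offending)} new row(s) do not match the predicate"
        )

    # Keep every existing row that the predicate does not select ...
    kept = [row for row in existing if not predicate(row)]
    # ... and put the new rows in place of the ones it did select.
    return kept + new_rows


# Made-up sample data: overwrite only the rows where country == "NL".
table = [
    {"id": 1, "country": "NL"},
    {"id": 2, "country": "US"},
    {"id": 3, "country": "NL"},
]
updated = replace_where(
    table,
    [{"id": 4, "country": "NL"}],
    lambda row: row["country"] == "NL",
)
```

Rows outside the predicate (here the "US" row) are left untouched, which is what distinguishes a predicate overwrite from a full-table overwrite.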
Bug Fixes
- fix: properly deserialize percent-encoded file paths of Remove actions, to make sure tombstone and file paths match by @sigorbor in #2035
- fix: reinstate copy-if-not-exists passthrough by @emcake in #2083
- refactor: add deltalake-gcp crate by @ion-elgreco in #2061
- fix: schema issue within writebuilder by @universalmind303 in #2106
- fix: temporarily skip s3 roundtrip test by @roeap in #2124
- fix: set partition values for added files when building compaction plan by @alexwilcoxson-rel in #2119
- fix: clean-up paths created during tests by @roeap in #2126
- fix: add missing pandas import by @Tim-Haarman in #2116
- fix: order logical schema to match physical schema by @Blajda in #2129
- fix: do not write empty parquet file/add on writer close; accurately … by @alexwilcoxson-rel in #2123
- fix: prevent empty stats struct during parquet write by @alexwilcoxson-rel in #2125
- fix(#2143): keep specific error type when writing fails by @abaerptc in #2144
- fix(s3): restore working test for DynamoDb log store repair log on read by @dispanser in #2120
- fix: made generalize_filter less permissive, also added more cases by @emcake in #2149
- fix: allow loading of tables with identity columns by @rtyler in #2155
- fix: replace BTreeMap with IndexMap to preserve insertion order by @roeap in #2150
Other Changes
- chore(deps): update serial_test requirement from 2 to 3 by @dependabot in #2052
- chore: update documentation for S3 / DynamoDb log store configuration by @dispanser in #2041
- docs: make an overview tab visible in docs by @r3stl355 in #2080
- docs: update docs for rust print statement by @skariyania in #2077
- docs: add usage guide for check constraints by @hntd187 in #2079
- docs: add page on why to use delta lake by @MrPowers in #2076
- docs: how delta lake transactions work by @MrPowers in #2089
- docs: move dynamo docs into new docs page by @ion-elgreco in #2093
- docs: delta lake file skipping by @MrPowers in #2096
- chore: removed unnecessary print statement from update method by @LilMonk in #2111
- chore: temporarily ignore the repair on update test by @rtyler in #2114
- chore: bump python by @ion-elgreco in #2092
- docs: add dask page to integration docs by @avriiil in #2122
- docs: fix arg indent by @wchatx in #2103
- docs: delta lake is great for small data by @MrPowers in #2113
- docs: use transparent logo in README by @roeap in #2132
- chore: shorten up the crate folder names in the tree by @rtyler in #2145
- refactor(python): drop custom filesystem in write_deltalake by @ion-elgreco in #2137
- chore: upgrade to DataFusion 35.0 by @philippemnoel in #2121
- chore: cleanup minor clippies and other warns by @rtyler in #2161
- fix: allow checkpoints to contain metadata actions without a createdTime value by @rtyler in #2059
New Contributors
- @skariyania made their first contribution in #2077
- @LilMonk made their first contribution in #2111
- @alexwilcoxson-rel made their first contribution in #2119
- @Tim-Haarman made their first contribution in #2116
- @avriiil made their first contribution in #2122
- @wchatx made their first contribution in #2103
- @abaerptc made their first contribution in #2144
- @philippemnoel made their first contribution in #2121
- @mightyshazam made their first contribution in #2160
Full Changelog: python-v0.15.1...python-v0.15.2