Skip to content

Commit

Permalink
Make CTAS with replace honor column mapping when writing first set of…
Browse files Browse the repository at this point in the history
… parquet files (#3704)

<!--
Thanks for sending a pull request!  Here are some tips for you:
1. If this is your first time, please read our contributor guidelines:
https://github.com/delta-io/delta/blob/master/CONTRIBUTING.md
2. If the PR is unfinished, add '[WIP]' in your PR title, e.g., '[WIP]
Your PR title ...'.
  3. Be sure to keep the PR description updated to reflect all changes.
  4. Please write your PR title to summarize what this PR proposes.
5. If possible, provide a concise example to reproduce the issue for a
faster review.
6. If applicable, include the corresponding issue number in the PR title
and link it in the body.
-->

#### Which Delta project/connector is this regarding?
<!--
Please add the component selected below to the beginning of the pull
request title
For example: [Spark] Title of my pull request
-->

- [x] Spark
- [ ] Standalone
- [ ] Flink
- [ ] Kernel
- [ ] Other (fill in here)

## Description

Make CTAS with replace honor column mapping when writing first set of
parquet files; Otherwise the uniform table will end up with parquet
files without column mapping and thus have null in query results.



## How was this patch tested?

manual test with Spark

## Does this PR introduce _any_ user-facing changes?

<!--
If yes, please clarify the previous behavior and the change this PR
proposes - provide the console output, description and/or an example to
show the behavior difference if possible.
If possible, please also clarify if this is a user-facing change
compared to the released Delta Lake versions or within the unreleased
branches such as master.
If no, write 'No'.
-->
  • Loading branch information
lzlfred authored Sep 24, 2024
1 parent 19374e2 commit 971deda
Showing 1 changed file with 5 additions and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -732,7 +732,11 @@ case class CreateDeltaTableCommand(
if (txn.readVersion > -1L && isReplace && !dontOverwriteSchema) {
// When a table already exists, and we're using the DataFrameWriterV2 API to replace
// or createOrReplace a table, we blindly overwrite the metadata.
val newMetadata = getProvidedMetadata(table, schema.json)
var newMetadata = getProvidedMetadata(table, schema.json)
val updatedConfig = UniversalFormat.enforceDependenciesInConfiguration(
newMetadata.configuration,
txn.snapshot)
newMetadata = newMetadata.copy(configuration = updatedConfig)
txn.updateMetadataForNewTableInReplace(newMetadata)
}
}
Expand Down

0 comments on commit 971deda

Please sign in to comment.