I checked the answer here: #623
but I'm wondering whether `inputFiles` is a reliable enough method of getting all the files: https://spark.apache.org/docs/3.1.3/api/python/reference/api/pyspark.sql.DataFrame.inputFiles.html
The docs are a bit vague on how reliable this is. We are hosting our tables on S3.
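For context, the linked PySpark page describes `inputFiles` as returning a best-effort snapshot of the files that compose the DataFrame, and says that depending on the source relations it may not find all input files. Below is a minimal sketch of treating the result as exactly that kind of snapshot. The helper name `snapshot_input_files` is hypothetical, and the S3 path in the comment is a placeholder, not a real table.

```python
from typing import Any, List


def snapshot_input_files(df: Any) -> List[str]:
    """Return the paths reported by df.inputFiles(), de-duplicated and sorted.

    `df` is expected to be a pyspark.sql.DataFrame (anything exposing
    inputFiles() works). The result is only a best-effort snapshot, so it
    should not be treated as a guaranteed-complete file listing.
    """
    return sorted(set(df.inputFiles()))


# Hypothetical usage, assuming an existing SparkSession named `spark`
# and a placeholder table location:
#   df = spark.read.format("delta").load("s3://<bucket>/<table-path>")
#   for path in snapshot_input_files(df):
#       print(path)
```

Sorting is only there to make two snapshots easy to compare; it does not make the listing any more complete.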
Came across this answer regarding reliability with Delta: https://stackoverflow.com/a/77107953
I think this is much more useful for what I need: https://docs.databricks.com/en/ingestion/file-metadata-column.html#metadata-examples
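A rough sketch of how the file metadata column from that page could be used to list the files behind the rows actually read. `_metadata.file_path` is the field shown in the linked docs; the helper name `files_seen_in_rows` is hypothetical, and the sketch assumes the runtime and source format expose the `_metadata` column, which may not hold for every setup.

```python
from typing import Any, List


def files_seen_in_rows(df: Any) -> List[str]:
    """Collect the distinct source file paths recorded for each row.

    Assumes `df` is a pyspark.sql.DataFrame read from a file-based source
    that exposes the hidden `_metadata` column. The distinct paths are
    collected to the driver, which is fine for a file list but would not
    be appropriate for row-level data.
    """
    rows = df.select("_metadata.file_path").distinct().collect()
    # Each collected row has a single field: the file path.
    return sorted(row[0] for row in rows)


# Hypothetical usage, assuming an existing SparkSession named `spark`
# and a placeholder table location:
#   df = spark.read.format("delta").load("s3://<bucket>/<table-path>")
#   print(files_seen_in_rows(df))
```

Unlike a snapshot taken from the query plan, this derives the list from the rows themselves, so it triggers a job and reflects only files that contributed at least one row.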