Skip to content

Intermittent error when creating tables: NOT_FOUND: Requested entity was not found #1324

Closed
@MasterDDT

Description

@MasterDDT

We are using Spark 3.3, BigQuery connector 0.30.0, Scala 2.12, Java 17. We are creating tables via DataFrame write() API with these options:

    "viewsEnabled" -> "true",
    "writeMethod" -> "direct",
    "dataset" -> datasetName,
    "materializationDataset" -> datasetName,
    "table" -> s"${datasetName}.${tableName}",

When writing large tables sometimes (unclear if number of rows or number of columns), we see this error:

Caused by: java.util.concurrent.ExecutionException: com.google.cloud.spark.bigquery.repackaged.com.google.api.gax.rpc.NotFoundException: com.google.cloud.spark.bigquery.repackaged.io.grpc.StatusRuntimeException: NOT_FOUND: Requested entity was not found.
	at com.google.cloud.spark.bigquery.repackaged.com.google.common.util.concurrent.AbstractFuture.getDoneValue(AbstractFuture.java:588)
	at com.google.cloud.spark.bigquery.repackaged.com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:547)
	at com.google.cloud.bigquery.connector.common.BigQueryDirectDataWriterHelper.retryCallable(BigQueryDirectDataWriterHelper.java:131)
	at com.google.cloud.bigquery.connector.common.BigQueryDirectDataWriterHelper.retryCreateWriteStream(BigQueryDirectDataWriterHelper.java:98)
	at com.google.cloud.bigquery.connector.common.BigQueryDirectDataWriterHelper.<init>(BigQueryDirectDataWriterHelper.java:77)
	... 19 more

Full callstack here https://gist.github.com/MasterDDT/da7e93cbf14d34a6e88acfe7213d1360. We cannot repro this consistently. Any ideas how to debug it?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions