[BUG] ColumnarFileFormat does not have a pretty string #12226

Open
amahussein opened this issue Feb 25, 2025 · 0 comments
Labels
bug Something isn't working good first issue Good for newcomers

Comments

@amahussein
Collaborator

Describe the bug

The RAPIDS plugin does not write a readable value for the format field in the eventlog.
Instead of a pretty string for the format, it writes the default object string (e.g. com.nvidia.spark.rapids.GpuParquetFileFormat@9f5022c).

Fixing this would make the eventlog easier to analyze. Otherwise, any consumer of the eventlog has to keep track of every class-name change in the RAPIDS code, which is not ideal.

This impacts the following ColumnarFileFormat types:

GpuHiveParquetFileFormat
GpuHiveTextFileFormat
GpuOrcFileFormat
GpuParquetFileFormat
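
For illustration, here is a minimal Java sketch of why the object string appears and what a fix could look like. On the JVM, a class that does not override toString() falls back to Object.toString(), which prints the class name, an '@', and the hex hash code. The interface and classes below are hypothetical stand-ins, not the plugin's actual code, and the name "GpuParquet" is an assumed pretty string chosen only for this example.

```java
public class PrettyFormatDemo {
    // Hypothetical stand-in for the plugin's ColumnarFileFormat trait.
    interface ColumnarFileFormat {}

    // No toString() override: falls back to Object.toString(),
    // which yields "<class name>@<hex hash code>".
    static final class PlainFormat implements ColumnarFileFormat {}

    // With an override, string interpolation and logging pick up a stable name.
    static final class PrettyFormat implements ColumnarFileFormat {
        @Override
        public String toString() {
            return "GpuParquet"; // assumed name, for illustration only
        }
    }

    public static void main(String[] args) {
        // Prints something of the form PrettyFormatDemo$PlainFormat@<hash>
        System.out.println("format = " + new PlainFormat());
        // Prints the overridden, stable name
        System.out.println("format = " + new PrettyFormat());
    }
}
```

The equivalent change in Scala would be a toString override on each of the listed format classes; which exact strings to emit is a decision for the maintainers.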

Sample eventlog:

Execute GpuInsertIntoHadoopFsRelationCommand /path/to/warehouse/db/tableName, false, [dt#23, lvl#30], com.nvidia.spark.rapids.GpuParquetFileFormat@4be4680b, [serialization.format=1, mergeSchema=false, __hive_compatible_bucketed_table_insertion__=true, partitionOverwriteMode=DYNAMIC], Overwrite, CatalogTable(\nDatabase: databaseName\nTable: tableName\nOwner: root\nCreated Time: Tue Feb 25 16:58:00 UTC 2025\nLast...
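
Until this is fixed, a consumer of the eventlog has to recover the format from the class name itself. The sketch below shows one way that workaround might look; the class EventlogFormatName and the method simpleFormatClass are hypothetical helpers written for this example, and the regular expression assumes that format class names end in "FileFormat" and are followed by '@' and a hex hash, as in the sample above.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class EventlogFormatName {
    // Assumption: the format shows up as <qualified class name ending in FileFormat>@<hex hash>.
    private static final Pattern FORMAT_OBJECT =
        Pattern.compile("([\\w.$]+FileFormat)@[0-9a-f]+");

    // Returns the simple class name of the first format object string found, if any.
    static Optional<String> simpleFormatClass(String planDescription) {
        Matcher m = FORMAT_OBJECT.matcher(planDescription);
        if (!m.find()) {
            return Optional.empty();
        }
        String qualified = m.group(1);
        // Drop the package prefix; lastIndexOf returns -1 when there is no dot.
        return Optional.of(qualified.substring(qualified.lastIndexOf('.') + 1));
    }

    public static void main(String[] args) {
        String sample = "Execute GpuInsertIntoHadoopFsRelationCommand /path/to/warehouse/db/tableName, "
            + "false, [dt#23, lvl#30], com.nvidia.spark.rapids.GpuParquetFileFormat@4be4680b";
        System.out.println(simpleFormatClass(sample).orElse("unknown"));
    }
}
```

This kind of parsing breaks whenever a class is renamed or moved, which is the maintenance burden described above.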
@amahussein amahussein added ? - Needs Triage Need team to review and classify bug Something isn't working labels Feb 25, 2025
@sameerz sameerz added good first issue Good for newcomers and removed ? - Needs Triage Need team to review and classify labels Feb 25, 2025