[WIP] feat: Add TPC-H generator CLI #118
Draft
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This tool is useful for generating TPC-H data to run some of the benchmarks included with bolt
What problem does this PR solve?
There is a TPC-H benchmark tool, but it requires an existing TPC-H dataset. The codebase doesn't seem to provide an easy way to generate a TPC-H dataset.
Type of Change
Description
Add a new executable which generates a set of TPC-H tables at a given scale factor. It re-uses existing TPC-H generation infrastrucutre, but just exposes it as an easy to use CLI interface so users can generate data. Then a benchmark executable like
bolt_tpch_benchmarkcan be used and pointed to the data directory without having to generate the data another way.Performance Impact
Release Note
Release Note:
Checklist (For Author)
Breaking Changes