
Small INSERTs #89


Open
dpxcc opened this issue Jan 17, 2025 · 13 comments

@dpxcc
Contributor

dpxcc commented Jan 17, 2025

What feature are you requesting?

Make columnstore tables good for small INSERTs

Why are you requesting this feature?

Small INSERTs into columnstore tables are currently very inefficient:

  1. Each query creates a new Parquet file
  2. Each transaction generates a new Delta Lake log record

What is your proposed implementation for this feature?

  • Use a rowstore table (Postgres heap table) to store small INSERTs
  • Small INSERTs will go into the rowstore table, while large INSERTs will be written directly to Parquet files
  • ColumnstoreScan will scan the union of the rowstore table and Parquet files
  • Add a command, e.g. mooncake.flush_table(), to flush the rowstore table to a new Parquet file (see the sketch after this list)
  • When a transaction ends, if the rowstore table is too large, it will also be flushed
  • For now, we will focus on the mostly append-only use case. UPDATE/DELETE operations will not be optimized in this version; they will trigger a flush of the rowstore table, followed by UPDATE/DELETE operations as they are currently handled
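To make the intended workflow concrete, here is a minimal sketch of how this could look from SQL. It assumes the existing CREATE TABLE ... USING columnstore syntax; mooncake.flush_table() is the command proposed above, not an existing function, and its signature here is only a guess:

```sql
-- Illustrative sketch of the proposed design, not an existing API.
CREATE TABLE t (id bigint, v text) USING columnstore;

-- Small INSERTs would land in the backing rowstore (heap) table instead of
-- each one creating a new Parquet file and Delta Lake log record.
INSERT INTO t VALUES (1, 'a');
INSERT INTO t VALUES (2, 'b');

-- ColumnstoreScan would return the union of the rowstore rows and the Parquet
-- files, so both rows are visible immediately.
SELECT count(*) FROM t;

-- Proposed command: flush the accumulated rowstore rows into a new Parquet file
-- (hypothetical signature).
SELECT mooncake.flush_table('t');
```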
@dpxcc dpxcc added the feature label Jan 17, 2025
@dpxcc dpxcc added this to the 0.2.0 milestone Jan 17, 2025
@dpxcc dpxcc pinned this issue Jan 17, 2025
@franz101

Question: Would this mean the scan will load both tables into memory/RAM? Or is this something DuckDB can separate?

I think that, depending on the insert rate, a buffer table / async inserts that batch every x seconds would be a great feature. See the following diagram:

@dpxcc
Contributor Author

dpxcc commented Jan 23, 2025

No

The main difference between our design and ClickHouse's is that our table scan will scan not only the parts stored in cloud object storage but also the rows in the buffer. The way we scan the parts stored in cloud object storage is not much different.

In ClickHouse, data only becomes visible after a delay following the transaction commit, which is undesirable. With our design, queries access the most up-to-date data immediately. This added functionality comes at minimal cost, as scanning the buffer is negligible compared to scanning the parts.

You can learn more at https://dl.acm.org/doi/pdf/10.1145/3514221.3526055. The design presented here is a simplified version for workloads that are mostly append-only.

@franz101

Wow, thanks for linking that paper. Makes sense. In terms of coding effort, the delay seemed easier. Looking forward to the progress.

@nicosuave

Looking forward to this. Seeing insert performance roughly the same for 1k records vs 1M today.

@nbiscaro
Contributor

I can take this one!

@tbanys

tbanys commented Mar 9, 2025

What is the state of this feature? Really looking forward to using it as soon as it's implemented.

@dpxcc
Contributor Author

dpxcc commented Mar 10, 2025

We are actively working on this and other v0.2 features. Stay tuned!

@tbanys

tbanys commented Mar 17, 2025

Hello,

I'm working on a project that requires tracking user interactions (widget impressions, orders, etc.) which will involve numerous small writes to a database followed by analytics queries. While the queries won't be complex, I'm expecting to accumulate a large volume of data over time.

I'm considering using Neon.tech's serverless Postgres along with the pg_mooncake columnar extension, but I've noticed that pg_mooncake is still in early development (v0.1.2) and has performance issues with small writes (per GitHub issue #89).

As a primarily frontend/app developer without extensive database experience, I'm wondering:

Would it be advisable to start with a standard Neon Postgres database for now, and later migrate to a new Neon instance with pg_mooncake when it becomes more stable?

How complicated would this migration process be?

Are there any alternative approaches you'd recommend for my use case?

Neon's serverless features like autoscaling and scale-to-zero seem particularly appealing for my variable workload, but I want to ensure I'm making the right architectural decision from the start.

Thank you for your guidance!

@zhousun
Contributor

zhousun commented Mar 17, 2025

(Quoting @tbanys's comment above.)

If you have really small inserts (concurrent single-row inserts), the recommended approach is to use a regular Postgres table as a write buffer and use pg_cron to batch-insert the buffered rows into the columnstore table; this should work well enough on the existing versions of pg_mooncake and Neon. If you prefer not to do the manual workaround, I would suggest waiting until v0.2, which will remove this restriction and allow small inserts directly into the columnstore table without any performance penalty.
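For anyone who wants a concrete starting point, here is a minimal sketch of that workaround. The table names, columns, and schedule are illustrative, and it assumes pg_cron is installed alongside pg_mooncake:

```sql
-- Heap table that absorbs the small, frequent writes.
CREATE TABLE events_buffer (id bigint, payload jsonb, created_at timestamptz);

-- Columnstore table that holds the analytical copy of the data.
CREATE TABLE events (id bigint, payload jsonb, created_at timestamptz) USING columnstore;

-- Every minute, move whatever has accumulated in the buffer into the
-- columnstore table as a single batched insert.
SELECT cron.schedule('flush_events_buffer', '* * * * *', $$
  WITH moved AS (
    DELETE FROM events_buffer RETURNING *
  )
  INSERT INTO events SELECT * FROM moved
$$);
```

Each scheduled run turns many tiny writes into one larger insert against the columnstore table, so only one Parquet file and Delta Lake log record are created per batch instead of per insert.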

@zhang-wenchao

I'm using it now and can't wait for the new version. I'd like to know how small an insert has to be to count as a small insert. My use case is between 10 and 2000 rows per insert.

@dpxcc
Contributor Author

dpxcc commented Apr 30, 2025

@zhang-wenchao
v0.1 will try to create a 1GB Parquet file by default (https://github.com/Mooncake-Labs/pg_mooncake/blob/main/src/columnstore/columnstore_table.cpp#L83). I would say an insert is considered small if it's less than 100MB. 10 to 2000 rows per insert is pretty small and inefficient in v0.1.

As a workaround, I would recommend using a Postgres heap table as a staging area to accumulate smaller inserts. You can then periodically flush this data into the columnstore table in batches. Also, you can run a no-op UPDATE like UPDATE t SET a=a to manually merge small Parquet files into larger ones.
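As a rough sketch of that merge step, using the same illustrative table t with a column a as in the snippet above (the periodic flush itself can follow the pg_cron sketch in the earlier comment):

```sql
-- No-op UPDATE: rewrites rows without changing any values. In v0.1 this has the
-- useful side effect of rewriting many small Parquet files into larger ones.
UPDATE t SET a = a;
```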

v0.2 is coming very soon and will support small INSERT/UPDATE/DELETE.

@zhang-wenchao

Is there a rough release plan for v0.2? Thanks for your work.

@dpxcc
Contributor Author

dpxcc commented Apr 30, 2025

The current plan is to have a preview version available in mid-to-late May, with a release in June.
