fix: optimize memory for stacks tsv import into rocksdb #634

rafaelcr · 2024-08-07T22:19:56Z

This PR changes the way chainhook imports a Stacks node TSV into rocksdb.

Before, it loaded the entire canonical chinstate (including the full block JSON messages) onto a VecDeque in memory and then drained that data into rocksdb. This was a very memory intensive process which crashed our dev pods ever time it ran.

Now, the process was changed to a VecDeque that only keeps the line numbers of the TSV where the block data exists, so it can later read blocks from the file 1 by 1 and insert them into rocksdb.

tippenein

Does this have a noticeable difference in processing speed? Does that even matter?

Code looks good. Just curious if you've run this

rafaelcr · 2024-08-12T16:39:18Z

Thanks @tippenein . Yep, it does, it runs a bit faster I assume because it doesn't have to deal with the very large data structure that was in place before but perhaps that's only in the dev env I was using to test

fix: tsv conn

fb9304f

rafaelcr had a problem deploying to Development-mainnet August 7, 2024 22:57 — with GitHub Actions Error

rafaelcr had a problem deploying to Development-testnet August 7, 2024 22:57 — with GitHub Actions Error

fix: use line numbers for a tsv canonical fork

0b8f549

rafaelcr had a problem deploying to Development-mainnet August 8, 2024 18:16 — with GitHub Actions Failure

rafaelcr had a problem deploying to Development-testnet August 8, 2024 18:16 — with GitHub Actions Failure

rafaelcr changed the title ~~fix: refresh rocksdb connection after importing TSV blocks~~ fix: optimize memory for stacks tsv import into rocksdb Aug 8, 2024

rafaelcr marked this pull request as ready for review August 8, 2024 19:25

rafaelcr requested a review from tippenein August 8, 2024 19:26

tippenein approved these changes Aug 12, 2024

View reviewed changes

rafaelcr merged commit dcf545c into develop Aug 12, 2024
10 of 12 checks passed

rafaelcr deleted the fix/tsv-read branch August 12, 2024 16:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: optimize memory for stacks tsv import into rocksdb #634

fix: optimize memory for stacks tsv import into rocksdb #634

rafaelcr commented Aug 7, 2024 •

edited

Loading

tippenein left a comment

rafaelcr commented Aug 12, 2024

fix: optimize memory for stacks tsv import into rocksdb #634

fix: optimize memory for stacks tsv import into rocksdb #634

Conversation

rafaelcr commented Aug 7, 2024 • edited Loading

tippenein left a comment

Choose a reason for hiding this comment

rafaelcr commented Aug 12, 2024

rafaelcr commented Aug 7, 2024 •

edited

Loading