
Ingester behavior when disk is full #5589

Open
fulmicoton opened this issue Dec 17, 2024 · 2 comments
Labels
bug Something isn't working

Comments

@fulmicoton
Contributor

fulmicoton commented Dec 17, 2024

Currently, ingesters may end up accepting persist requests when their disk is full.
If the OS buffer is not full, no error may be returned.

We need to poll-check disk usage and change Quickwit's behavior when it goes above
a threshold.

The behavior is yet to be decided. The closest analogue is probably decommissioning: close all shards and refuse the creation of new shards. In addition, it might not be possible to run indexing/merge pipelines, which could make the control plane's task genuinely hard.
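The poll-check described above could look something like the following sketch, assuming a `df`-based probe. `WAL_DIR` and `WAL_USAGE_THRESHOLD_PCT` are hypothetical names for illustration, not actual Quickwit configuration:

```shell
#!/bin/sh
# Sketch only: poll the WAL mount's usage and report when it crosses a
# threshold, at which point the ingester would close shards.
WAL_DIR="${WAL_DIR:-wal}"
WAL_USAGE_THRESHOLD_PCT="${WAL_USAGE_THRESHOLD_PCT:-90}"

check_wal_usage() {
  # df --output=pcent prints a header and e.g. " 42%"; keep only the digits.
  usage=$(df --output=pcent "$WAL_DIR" 2>/dev/null | tail -n 1 | tr -dc '0-9')
  if [ "${usage:-0}" -ge "$WAL_USAGE_THRESHOLD_PCT" ]; then
    echo "WAL usage ${usage}% >= ${WAL_USAGE_THRESHOLD_PCT}%: close shards, refuse new ones"
    return 1
  fi
  echo "WAL usage ${usage}%: OK"
}

check_wal_usage || true
```

In a real ingester this check would run on a timer inside the process rather than shelling out, but the decision logic (usage above threshold implies stop accepting new shards) would be the same.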

@fulmicoton fulmicoton added the bug Something isn't working label Dec 17, 2024
@rdettai
Contributor

rdettai commented Jan 6, 2025

@fulmicoton How did you identify that the main problem comes from records accumulating in the OS buffer? I thought the OS buffer would usually be quite small (a few MBs).

It seems to me that the problem might also come from the persist policy that is configured on mrecordlogs. A full disk is only detected after the persist delay (5s), and when that happens, the error is bubbled up and converted to a persist failure here. The problem is that when that happens, a transient error is returned to the user, but meanwhile the shard is closed, a new one is opened, and records are accepted again during the mrecordlog persist delay. I didn't manage to reproduce it yet, but does this seem like a plausible explanation to you?

EDIT: I tried to mimic the WAL disk being full using a small loop device mounted on wal/

# Create a 10 MiB backing file, format it, and loop-mount it as the WAL directory.
sudo dd if=/dev/zero of=virtual_disk.img bs=1M count=10
sudo mkfs.ext4 virtual_disk.img
mkdir wal
sudo mount -o loop virtual_disk.img wal/

The error I get (and I get it consistently) when the disk is full is:

{
  "message": "ingest service is unavailable (no shards available)"
}
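As an aside (not from the thread): a lighter-weight way to provoke the write-path error, without root or a loop device, is Linux's `/dev/full`, which fails every write with ENOSPC and can stand in for a full WAL device when probing error handling:

```shell
# /dev/full always returns ENOSPC on write, standing in for a full WAL device.
if echo "mrecord" > /dev/full 2>/dev/null; then
  echo "write unexpectedly succeeded"
else
  echo "write failed with ENOSPC, as on a full WAL"
fi
```

Unlike the loop device, this only exercises the immediate write error, not the "small writes land in the page cache, error surfaces later at fsync" scenario hypothesized above.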

@fulmicoton
Contributor Author

How did you identify that the main problem comes from records accumulating in the OS buffer? I thought the OS buffer would usually be quite small (a few MBs).

Just a hypothesis to explain how we could accept messages and eventually lose them.

The problem is that when that happens, a transient error is returned to the user, but meanwhile the shard is closed, a new one is opened, and records are accepted again during the mrecordlog persist delay. I didn't manage to reproduce it yet, but does this seem like a plausible explanation to you?

Plausible, yes, but we still need to know by which mechanism we sometimes end up accepting writes. Mezmo mentions they lost data.
