Don't call file.length() for every append if we can avoid it #790

axiak · 2024-03-14T14:38:21Z

This change introduces a way for the ResiliantFileOutputStream to track its own position in the file. This means that instead of calling File.length() (which ends up being a syscall) for each event, we only call it when we're recovering a stream.

ceki · 2024-03-16T19:41:59Z

@axiak Hi Michael,

Thank you for this contribution. When you say that File.length() is called for each event, on which execution path do you think this occurs?

griffinjm · 2024-03-17T04:51:57Z

logback-core/src/main/java/ch/qos/logback/core/recovery/CountingOutputStream.java

+
+  private final OutputStream delegate;
+
+  private long count;


This should probably be volatile. While there is external synchronization using various locks when calling write(), the new state of the variable may not be reflected immediately in other threads after this thread updates it. The increment operations += and ++ are actually each 2 operations, a read and then a write, this means the count may be incorrectly updated by another thread afterwards, as the value it reads could be stale.

griffinjm · 2024-03-17T05:01:55Z

logback-core/src/test/java/ch/qos/logback/core/recovery/ResilientOutputStreamTest.java


        spy.getChannel().close();
        spy.write("b".getBytes());
        spy.flush();
+        // we have 2 in our countingoutput stream
+        // but the 'b' write failed due to the channel closing


This means the count can become inaccurate if any IOException occurs, I think it would be best to mitigate that by either only updating the count after the operation succeeds, or alternatively, if we still want to eagerly increment the count we should catch any exceptions, decrement the count, then rethrow the exception again. This would ensure the count remains accurate.

griffinjm

Accidentally clicked "start review", see my previous comments regarding the count atomicity and error handling.

griffinjm · 2024-03-17T19:05:35Z

@axiak Hi Michael,

Thank you for this contribution. When you say that File.length() is called for each event, on which execution path do you think this occurs?

From my read through the file length check occurs only when the rollover check interval is passed. For all standard TriggeringPolicy implementations it is not on every append.

This would be an optimization when checking for rollover if someone has a low rollover check interval. Depending on the triggering policy this can vary, for SizeBased it is by default 60 seconds in DefaultInvocationGate, but configurable, TimeBased are derived from the filename pattern provided. If someone has a filename date pattern which causes the policy to use a very low periodicity, e.g. MILLISECONDS or SECONDS, or someone configures the InvocationGate for SizeBased with a very low value.

In these limited cases this could be an improvement, in exchange for extra state and complexity.

griffinjm reviewed Mar 17, 2024

View reviewed changes

jaredstehler mentioned this pull request Oct 22, 2024

Don't call file.length() for every append if we can avoid it HubSpot/logback#6

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't call file.length() for every append if we can avoid it #790

Don't call file.length() for every append if we can avoid it #790

axiak commented Mar 14, 2024

ceki commented Mar 16, 2024

griffinjm Mar 17, 2024

griffinjm Mar 17, 2024

griffinjm left a comment

griffinjm commented Mar 17, 2024

Don't call file.length() for every append if we can avoid it #790

Are you sure you want to change the base?

Don't call file.length() for every append if we can avoid it #790

Conversation

axiak commented Mar 14, 2024

ceki commented Mar 16, 2024

griffinjm Mar 17, 2024

Choose a reason for hiding this comment

griffinjm Mar 17, 2024

Choose a reason for hiding this comment

griffinjm left a comment

Choose a reason for hiding this comment

griffinjm commented Mar 17, 2024