Optimize decorator parsing #458

akiselev98 · 2025-10-17T20:43:58Z

Profiling showed that a relatively high percentage of CPU time and memory allocations comes from decorator parsing. This makes sense, since this parsing is applied to every unified log line that passes through gctoolkit.

To speed things up, we:

- Fold the tags regex into the broader decorator regex, which avoids a second matcher and removes the expensive negative lookbehind from the tag pattern
- Add a start-of-line anchor to the decorator regex
- Defer sanitization of tags until getTags() is called for the first time

To reduce unnecessary memory allocations, match groups are retrieved once and stored in an array.

In testing, these optimizations cut the running time of gctoolkit roughly in half.
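
As a rough illustration of the anchored single-pattern match and the deferred tag sanitization described above, here is a minimal sketch. It is not the actual `Decorators` code: the class name `LazyTagDecorators`, the toy pattern (uptime, level, and tags only), and the whitespace-stripping "sanitization" are all assumptions made for the example.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical, simplified stand-in for the real Decorators class.
final class LazyTagDecorators {
    // Assumed toy pattern: "[uptime][level][tags]". The tags group lives in the
    // same pattern as the other decorators, so no second matcher is needed.
    // The leading ^ pins the match to the start of the line, so the engine does
    // not retry at every later position when a line has no decorators.
    private static final Pattern DECORATORS = Pattern.compile(
            "^\\[(\\d+\\.\\d+)s\\]\\[(\\w+)\\s*\\]\\[([\\w, ]+)\\]");

    private final boolean matched;
    private final String level;
    private final String rawTags;  // kept unsanitized until someone asks for it
    private String tags;           // sanitized form, computed on first getTags()

    LazyTagDecorators(String line) {
        Matcher m = DECORATORS.matcher(line);
        matched = m.find();
        level = matched ? m.group(2) : null;
        rawTags = matched ? m.group(3) : null;
    }

    boolean matched() { return matched; }

    String getLevel() { return level; }

    String getTags() {
        // Pay for sanitization only if the tags are actually requested.
        if (tags == null && rawTags != null) {
            tags = rawTags.replaceAll("\\s+", "");
        }
        return tags;
    }
}
```

Callers that only look at the level never trigger the `replaceAll` call, which is the point of deferring the work.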


parser/src/main/java/com/microsoft/gctoolkit/parser/unified/UnifiedLoggingTokens.java

parser/src/main/java/com/microsoft/gctoolkit/parser/jvm/Decorators.java


obourgain · 2025-10-23T10:57:23Z

This is indeed faster: about twice as fast for my use case too.

karianna · 2025-10-27T23:21:05Z

@johnoliver can you review as well?

johnoliver · 2025-10-28T13:22:42Z

parser/src/main/java/com/microsoft/gctoolkit/parser/jvm/Decorators.java

-            if ( decoratorMatcher.group(i) != null)
+        // Retrieving a group from a matcher calls substring each time
+        // Store all the groups in an array ahead of time to avoid paying this cost unnecessarily
+        decoratorGroups = new String[11];


new String[decoratorMatcher.groupCount()] might be a bit more robust here
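
One detail worth keeping in mind with this suggestion: `Matcher.groupCount()` does not count group 0 (the whole match), so an array indexed by group number needs `groupCount() + 1` slots. A minimal sketch of sizing the array from the matcher follows; the helper name `GroupCache.snapshot` is hypothetical and not part of the PR.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, shown only to illustrate sizing the array from the matcher.
final class GroupCache {
    // Copies every group of a successful match into an array indexed by group number.
    static String[] snapshot(Matcher m) {
        // groupCount() excludes group 0, hence the + 1.
        String[] groups = new String[m.groupCount() + 1];
        for (int i = 0; i < groups.length; i++) {
            groups[i] = m.group(i);  // null for groups that did not participate
        }
        return groups;
    }

    public static void main(String[] args) {
        Matcher m = Pattern.compile("^(\\d+)(?:-(\\w+))?").matcher("42 rest");
        if (m.find()) {
            String[] groups = snapshot(m);
            System.out.println(groups.length);  // two capturing groups plus group 0
        }
    }
}
```

Sizing the array this way means it tracks the pattern automatically if a capturing group is added or removed later.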

karianna previously approved these changes Oct 19, 2025


parser/src/main/java/com/microsoft/gctoolkit/parser/unified/UnifiedLoggingTokens.java (resolved)

parser/src/main/java/com/microsoft/gctoolkit/parser/jvm/Decorators.java (outdated, resolved)

Update parser/src/main/java/com/microsoft/gctoolkit/parser/jvm/Decorators.java (13b132d)

karianna dismissed their stale review via 13b132d October 19, 2025 20:31

Update parser/src/main/java/com/microsoft/gctoolkit/parser/unified/UnifiedLoggingTokens.java (cf5795b)

karianna approved these changes Oct 27, 2025


johnoliver reviewed Oct 28, 2025


4 participants
