Fixed UnpackerConfig.bufferSize to work #658
Conversation
@@ -518,7 +518,7 @@ public boolean isStr8FormatSupport()

     private int stringSizeLimit = Integer.MAX_VALUE;

-    private int bufferSize = 8192;
+    private int bufferSize = 100 * 1024 * 1024;
Don't change the default buffer size. msgpack-java is frequently used even for the small size of data.
Especially, creating a large buffer is not free as Java will use CPU resources for filling zeros to the array contents.
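The cost the reviewer refers to is real: Java guarantees that a freshly allocated array is zero-filled, so creating a 100 MiB buffer does work proportional to its length up front, even if only a few bytes of it are ever used. A minimal standalone illustration (plain Java, not msgpack-java code):

```java
public class ZeroFillCost {
    public static void main(String[] args) {
        // The JVM zero-initializes every new array, which takes time
        // proportional to its length. A 100 MiB default buffer would pay
        // this cost on every unpacker creation, even for tiny inputs.
        byte[] small = new byte[8192];               // current default
        byte[] large = new byte[100 * 1024 * 1024];  // proposed default

        long sum = 0;
        for (int i = 0; i < large.length; i += 1024 * 1024) {
            sum += large[i];  // every sampled element is zero
        }
        System.out.println(sum);  // prints 0
    }
}
```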
     * @throws IOException when underlying input throws IOException
     */
    public byte[] readPayload(int length)
            throws IOException
    {
+       if (length > bufferSize) {
This part is a good catch. If that happens, instead of throwing an error, we need to enlarge the buffer by creating a new one large enough to store the expected data size. This implementation would be similar to Java's ArrayList implementation.
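The growth strategy the reviewer has in mind might look like the following standalone sketch (field and method names are illustrative, not msgpack-java's actual internals): double the capacity until the requested size fits, the way ArrayList's internal grow() does.

```java
import java.util.Arrays;

public class GrowableBuffer {
    private byte[] buffer = new byte[8192];  // small default, as suggested

    // Enlarge the buffer only when a payload would not fit,
    // doubling ArrayList-style to amortize reallocation cost.
    void ensureCapacity(int required) {
        if (required <= buffer.length) {
            return;
        }
        int newSize = buffer.length;
        while (newSize < required) {
            newSize = Math.multiplyExact(newSize, 2);  // fail fast on overflow
        }
        buffer = Arrays.copyOf(buffer, newSize);
    }

    int capacity() {
        return buffer.length;
    }

    public static void main(String[] args) {
        GrowableBuffer b = new GrowableBuffer();
        b.ensureCapacity(20_000);          // 8192 -> 16384 -> 32768
        System.out.println(b.capacity());  // prints 32768
    }
}
```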
I'm sorry, I don't quite understand here. Am I correct that the purpose of UnpackerConfig.bufferSize is to limit the size of the new byte array in readPayload(int)? Do I need to change this code?
- Do not change the default buffer size from 8K
- Setting the default buffer size from config is ok
- Do not throw an exception in unpackXXXheader. Only readPayload needs to be fixed to support large volumes of data
- readPayload should swap the existing buffer to a large buffer if the payload length exceeds the current bufferSize
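Taken together, the points above might translate into something like this simplified, self-contained sketch (class and field names are hypothetical; the real msgpack-java reader is more involved): the buffer keeps its configured size until a payload exceeds it, at which point it is swapped for a larger one instead of an exception being thrown.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.Arrays;

public class PayloadReader {
    private byte[] buffer;

    // Buffer size comes from config; the default stays at 8 KiB.
    public PayloadReader(int bufferSize) {
        this.buffer = new byte[bufferSize];
    }

    // Instead of throwing when length exceeds the current buffer size,
    // swap in a buffer large enough for this payload (per the review).
    public byte[] readPayload(InputStream in, int length) throws IOException {
        if (length > buffer.length) {
            buffer = new byte[length];
        }
        int offset = 0;
        while (offset < length) {
            int n = in.read(buffer, offset, length - offset);
            if (n < 0) {
                throw new IOException("insufficient data");
            }
            offset += n;
        }
        return Arrays.copyOf(buffer, length);
    }

    public static void main(String[] args) throws IOException {
        byte[] payload = new byte[20_000];  // larger than the 8 KiB buffer
        Arrays.fill(payload, (byte) 7);
        PayloadReader r = new PayloadReader(8192);
        byte[] out = r.readPayload(new ByteArrayInputStream(payload), payload.length);
        System.out.println(out.length);  // prints 20000
    }
}
```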
+        int len;
         if (Code.isFixedArray(b)) { // fixarray
-            return b & 0x0f;
+            len = b & 0x0f;
         }
-        switch (b) {
-            case Code.ARRAY16: { // array 16
-                int len = readNextLength16();
-                return len;
-            }
-            case Code.ARRAY32: { // array 32
-                int len = readNextLength32();
-                return len;
+        else {
+            switch (b) {
+                case Code.ARRAY16: { // array 16
+                    len = readNextLength16();
+                    break;
+                }
+                case Code.ARRAY32: { // array 32
+                    len = readNextLength32();
+                    break;
+                }
+                default: {
+                    throw unexpected("Array", b);
+                }
             }
         }
-        throw unexpected("Array", b);
+
+        if (len > bufferSize) {
+            throw new MessageSizeException(String.format("cannot unpack a Array of size larger than %,d: %,d", bufferSize, len), len);
+        }
+
+        return len;
Throwing error here is unnecessary if readPayload checks the buffer size.
+        int len;
         if (Code.isFixedMap(b)) { // fixmap
-            return b & 0x0f;
+            len = b & 0x0f;
         }
-        switch (b) {
-            case Code.MAP16: { // map 16
-                int len = readNextLength16();
-                return len;
-            }
-            case Code.MAP32: { // map 32
-                int len = readNextLength32();
-                return len;
+        else {
+            switch (b) {
+                case Code.MAP16: { // map 16
+                    len = readNextLength16();
+                    break;
+                }
+                case Code.MAP32: { // map 32
+                    len = readNextLength32();
+                    break;
+                }
+                default: {
+                    throw unexpected("Map", b);
+                }
             }
         }
-        throw unexpected("Map", b);
+
+        if (len > bufferSize / 2) {
+            throw new MessageSizeException(String.format("cannot unpack a Map of size larger than %,d: %,d", bufferSize / 2, len), len);
+        }
+
+        return len;
Unnecessary change for the same reason
Buffer size limits now work, but some tests no longer pass.
Upon re-reading the PR, I noticed some misunderstandings. UnpackerConfig.bufferSize is used to control the internal buffer size, but it is not meant to limit the size of the unpacked data. In order to prevent loading excessively large data into memory, we need to introduce another configuration. I will close this pull request because it has become outdated. The requirement to limit the unpacked data size and fail the Unpacker is still valid, but it should be addressed in a different context.
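The separate configuration the author describes could be a check decoupled from the internal buffer size. A hypothetical sketch (the maxUnpackSize name is invented for illustration; no such option is part of msgpack-java here):

```java
public class SizeLimitCheck {
    // Hypothetical, illustration-only limit, independent of bufferSize.
    private final int maxUnpackSize;

    public SizeLimitCheck(int maxUnpackSize) {
        this.maxUnpackSize = maxUnpackSize;
    }

    // Fail fast before allocating memory for an oversized payload.
    public void check(int declaredLength) {
        if (declaredLength > maxUnpackSize) {
            throw new IllegalArgumentException(
                    "payload of " + declaredLength
                    + " bytes exceeds limit of " + maxUnpackSize);
        }
    }

    public static void main(String[] args) {
        SizeLimitCheck limit = new SizeLimitCheck(1024 * 1024);
        limit.check(4096);  // small payload passes
        try {
            limit.check(10 * 1024 * 1024);  // oversized payload is rejected
        } catch (IllegalArgumentException e) {
            System.out.println("rejected oversized payload");
        }
    }
}
```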
This resolves #657.
It limits internal buffer size when unpacking Array and Map.
However, I think the default bufferSize of 8192 is too small.
I changed the default buffer size to 100MiB following msgpack-python.