Add optional gzip compression #1389

kannibalox · 2025-01-22T17:48:30Z

This introduces a single new value command, network.gzip_response_min_size, that determines whether a response should be gzipped (I'm very open to better names for that variable). The SCGI client must additionally announce support for gzip responses in the ACCEPT_ENCODING header. Since this is of dubious benefit to most use cases, it's disabled by default by setting it to a value less than 0. This also technically makes zlib a new dependency for rtorrent, but since it's already a hard dependency for libtorrent that didn't seem unreasonable.

A fun little interaction is that calling network.gzip_response_min_size over RPC impacts the response immediately.
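For readers following along, the decision described above can be sketched roughly as below. This is a minimal illustration, not the code in the PR: accepts_gzip and should_gzip are hypothetical names, and the header parsing is deliberately simplified (it drops any ";q=..." parameter, so "gzip;q=0" would still count as accepted).

```cpp
#include <cstddef>
#include <string_view>

// Hypothetical helper: true if a comma-separated ACCEPT_ENCODING value lists "gzip".
// Simplified: case-sensitive, and q-values are dropped rather than interpreted.
bool accepts_gzip(std::string_view accept_encoding) {
  while (!accept_encoding.empty()) {
    std::size_t      comma = accept_encoding.find(',');
    std::string_view token = accept_encoding.substr(0, comma);

    // Drop any ";q=..." parameter, then trim surrounding spaces.
    token = token.substr(0, token.find(';'));
    while (!token.empty() && token.front() == ' ') token.remove_prefix(1);
    while (!token.empty() && token.back() == ' ') token.remove_suffix(1);

    if (token == "gzip")
      return true;

    if (comma == std::string_view::npos)
      break;
    accept_encoding.remove_prefix(comma + 1);
  }
  return false;
}

// A min_size below 0 disables compression entirely (the default described above).
bool should_gzip(std::string_view accept_encoding, int min_size, std::size_t length) {
  if (min_size < 0)
    return false;
  return accepts_gzip(accept_encoding) && length > static_cast<std::size_t>(min_size);
}
```

Because the threshold is read on every response, changing it over RPC would affect the very response that acknowledges the change, which matches the interaction noted above.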

rakshasa · 2025-01-23T09:42:06Z

network.scgi.use_gzip and network.scgi.gzip.min_size

rakshasa · 2025-01-23T09:53:30Z

src/rpc/scgi_task.cc

+      should_compress = false;
+    }
+  }
+  header += "\r\n";


This use of std::strings causes unnecessary copying of buffers.

Rewrite both the gzip compressor and this to write directly to m_buffer; e.g. pass a lambda function to gzip_compressor that does the writing.

kannibalox · 2025-01-23

This causes a chicken-and-egg problem: we need to know the Content-Length to work out where the write cursor in m_buffer should start (there's a more than decent chance the size string loses a digit once the body is compressed), but we don't know the length until compression is complete.

The only workaround I can think of is to write the compressed output a little way into m_buffer first and std::memmove it back to the correct position once the headers have been written. I'll go in that direction unless you tell me otherwise.

rakshasa · 2025-01-26

Don't see why you need to do it in such a convoluted way; just pass a lambda function that does the writing and have it resize m_buffer if more space is needed.

You can change m_buffer to std::vector to make it cleaner.
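A rough sketch of the kind of writing lambda suggested here, assuming m_buffer becomes a std::vector<char>. The write_sink signature, emit_in_chunks, and append_via_sink are hypothetical names, and emit_in_chunks only stands in for a compressor (it copies the input through in small pieces rather than deflating it).

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <functional>
#include <vector>

// Hypothetical sink signature: receives one chunk of output, returns false to abort.
using write_sink = std::function<bool(const char* data, uint32_t length)>;

// Stand-in for a compressor: forwards the input in small chunks, the way a
// deflate loop would hand over its output buffer each time it fills up.
bool emit_in_chunks(const char* input, uint32_t length, const write_sink& sink) {
  constexpr uint32_t chunk_size = 4;

  for (uint32_t offset = 0; offset < length; offset += chunk_size) {
    uint32_t n = length - offset < chunk_size ? length - offset : chunk_size;

    if (!sink(input + offset, n))
      return false;
  }
  return true;
}

// Appends everything the producer emits to `buffer` starting at `position`,
// growing the vector when needed. Returns the position after the last byte written.
std::size_t append_via_sink(std::vector<char>& buffer, std::size_t position,
                            const char* input, uint32_t length) {
  auto sink = [&buffer, &position](const char* data, uint32_t n) {
    if (position + n > buffer.size())
      buffer.resize(position + n);

    std::memcpy(buffer.data() + position, data, n);
    position += n;
    return true;
  };

  emit_in_chunks(input, length, sink);
  return position;
}
```

The producer never sees the buffer itself, so no intermediate std::string is needed; the only copy is the one into the final buffer.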

github-actions

⚠️ Clang-Tidy found issue(s) with the introduced code (1/1)

src/rpc/scgi_task.h

src/rpc/scgi_task.cc

rakshasa · 2025-02-06T15:55:27Z

src/rpc/scgi_task.h

-  static const          int max_content_size    = (2 << 23);
+  static constexpr unsigned int default_buffer_size = 2047;
+  static constexpr int          max_header_size     = 2000;
+  static constexpr int          max_content_size    = (2 << 23);


unsigned int

rakshasa · 2025-02-06T15:55:59Z

src/rpc/scgi_task.h

+  static constexpr int          max_content_size    = (2 << 23);
+
+  static int                    gzip_min_size() { return m_min_compress_response_size; }
+  static void                   set_gzip_min_size(int size) { m_min_compress_response_size = size; }


Should throw input_error on bad size.
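One way the setter could validate its argument, as a sketch only. The comment does not say which sizes count as bad; this assumes that, with a separate on/off switch such as the proposed network.scgi.use_gzip, a negative size is meaningless. The input_error type below is a local stand-in for torrent::input_error so the snippet compiles on its own, and SCgiTaskSketch is a hypothetical class name.

```cpp
#include <stdexcept>
#include <string>

// Stand-in for libtorrent's torrent::input_error so the sketch compiles alone.
struct input_error : std::runtime_error {
  using std::runtime_error::runtime_error;
};

class SCgiTaskSketch {
public:
  static int gzip_min_size() { return m_min_compress_response_size; }

  // Assumption: negative sizes are rejected; the stored value is left untouched.
  static void set_gzip_min_size(int size) {
    if (size < 0)
      throw input_error("gzip min size must not be negative: " + std::to_string(size));

    m_min_compress_response_size = size;
  }

private:
  static inline int m_min_compress_response_size = 0; // C++17 inline static member
};
```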

rakshasa · 2025-02-06T15:56:49Z

src/rpc/scgi_task.h

+  unsigned int m_buffer_size;
+
+  ContentType  m_content_type{XML};
+  bool         m_client_accepts_compressed_response = false;


{false} to be consistent.

rakshasa · 2025-02-06T16:32:15Z

src/rpc/scgi_task.cc

+// indeterminate state, but m_position and m_bufferSize remain the
+// same.
+bool
+SCgiTask::gzip_compress_response(const char* buffer, uint32_t length, std::string_view header_template) {


This is a bit too ugly a solution, not what I had in mind.

Add to utils a function like this:

gzip_compress(const char* buffer, uint32_t length, std::function<bool(char*,uint32_t,uint32_t)>)

The lambda function will write from m_buffer+max_header_size (where max_header_size is calculated from the current header template plus max length of the printed response size).

It will receive deflateBound value so it can resize on the first call.

Once done writing, copy the bytes from the header and response size to m_buffer so they end at the start of the written gzip'ed data. Then event_write starts sending from m_buffer+m_header_offset.

Don't use snprintf for a whole template; instead the header should be two static const strings, and you copy them and the response size.
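The layout described in this comment might look roughly like the sketch below. It is not the PR's code: the header strings are illustrative guesses, build_response and the response struct are hypothetical, and produce_body stands in for the gzip callback (any callable that writes the body at the given offset and returns its length).

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <functional>
#include <string>
#include <string_view>
#include <vector>

// Illustrative header halves; the real strings in scgi_task.cc may differ.
constexpr std::string_view header_prefix =
    "Status: 200 OK\r\nContent-Type: text/xml\r\nContent-Encoding: gzip\r\nContent-Length: ";
constexpr std::string_view header_suffix = "\r\n\r\n";

// Room for both halves plus the longest decimal uint32_t (10 digits).
constexpr std::size_t max_header_size = header_prefix.size() + 10 + header_suffix.size();

struct response {
  std::vector<char> buffer;
  std::size_t       header_offset = 0; // sending starts at buffer.data() + header_offset
};

// `produce_body` writes the body into the buffer starting at the given offset
// (resizing as needed) and returns the body length.
response build_response(
    const std::function<uint32_t(std::vector<char>&, std::size_t)>& produce_body) {
  response r;
  r.buffer.resize(max_header_size);

  uint32_t    body_length = produce_body(r.buffer, max_header_size);
  std::string digits      = std::to_string(body_length);

  // Back-fill the header so that it ends exactly where the body begins.
  std::size_t header_size = header_prefix.size() + digits.size() + header_suffix.size();
  r.header_offset         = max_header_size - header_size;

  char* out = r.buffer.data() + r.header_offset;
  std::memcpy(out, header_prefix.data(), header_prefix.size());
  out += header_prefix.size();
  std::memcpy(out, digits.data(), digits.size());
  out += digits.size();
  std::memcpy(out, header_suffix.data(), header_suffix.size());

  return r;
}
```

Reserving the worst-case header size up front sidesteps the Content-Length ordering problem: the body never moves, and only the short header is copied once the final length is known.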

rakshasa · 2025-02-06T16:33:07Z

src/rpc/scgi_task.cc

+  if (m_client_accepts_compressed_response &&
+      gzip_enabled() &&
+      length > gzip_min_size() &&
+      gzip_compress_response(buffer, length, header)) {


Don't like how a failure to compress falls back to plaintext; this should only happen if there's a serious bug, so fail it completely.
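A small sketch of the control flow this implies, assuming "fail it completely" means raising an error rather than sending anything. The internal_error type is a local stand-in, and choose_and_encode and its compress callable are hypothetical.

```cpp
#include <cstdint>
#include <functional>
#include <stdexcept>

// Local stand-in for an internal error type, so the sketch compiles alone.
struct internal_error : std::logic_error {
  using std::logic_error::logic_error;
};

enum class encoding { plain, gzip };

// `compress` is a hypothetical callable that returns false on failure.
encoding choose_and_encode(bool client_accepts_gzip, bool gzip_enabled, int min_size,
                           uint32_t length, const std::function<bool()>& compress) {
  // Plaintext is chosen only when compression was never attempted.
  // The cast avoids a signed/unsigned comparison between length and min_size.
  if (!client_accepts_gzip || !gzip_enabled || static_cast<int64_t>(length) <= min_size)
    return encoding::plain;

  // A compression failure indicates a bug, so it is not papered over.
  if (!compress())
    throw internal_error("gzip compression of SCGI response failed");

  return encoding::gzip;
}
```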

rakshasa · 2025-02-06T16:33:40Z

src/rpc/scgi_task.cc

+  m_buffer_size = length + header_size;
+
+  snprintf(m_buffer, m_buffer_size, header.c_str(), length);
+  std::memcpy(m_buffer + header_size, buffer, length);


If we have a compressed path in a separate function, also put the plaintext in one.

rakshasa reviewed Jan 23, 2025

github-actions bot reviewed Jan 26, 2025

kannibalox force-pushed the feature/gzip-response branch from e74b4fe to 8de5eca Compare January 27, 2025 05:47

Add optional gzip compression

8de5eca

rakshasa reviewed Jan 27, 2025

Reorder headers correctly

bd87b4d

rakshasa reviewed Feb 6, 2025