Open
Description
To solve #288, #235 and #222, we decided to stop compressing (optimizing) ePubs on our own.
Recent runs and the analysis done in #374 showed that the optimization we were doing on ePubs was not that useless after all.
See for example book ID 63630 or book ID 68838.
It is important to note that many ePubs from 2023-08 were missing all images (including the two examples above, due to #222), but so far this is not sufficient to explain the whole file size increase.
I assume it would be safe to:
- first fix "Book HTML images are not always sufficiently compressed" (#375)
- adapt it to also optimize ePub images (while we could theoretically reuse the same images, I am not sure it is feasible; at least use some compression settings)
- confirm expected size difference (should save about 3G on Gutenberg DE)
- if size difference is not there, check what else could be optimized in ePub
- if not yet done, decide how to handle optimization cache invalidation (see "Scraper takes ages to complete EN version and books are regularly updated", #288)
- put optimization cache back in place
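To confirm the size difference and see what else could be optimized, it may help to break down an ePub by file type. The following is only a minimal sketch using Python's standard `zipfile` module; the function name `epub_size_by_extension` is hypothetical and is not part of the scraper.

```python
import zipfile
from collections import defaultdict
from pathlib import PurePosixPath


def epub_size_by_extension(epub_path):
    """Return {extension: (compressed_bytes, uncompressed_bytes)} for one ePub.

    Hypothetical helper: an ePub is a ZIP container, so the per-entry sizes
    recorded in the archive show which file types take up the space.
    """
    totals = defaultdict(lambda: [0, 0])
    with zipfile.ZipFile(epub_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            # Entries without a suffix (e.g. "mimetype") are grouped together.
            ext = PurePosixPath(info.filename).suffix.lower() or "(none)"
            totals[ext][0] += info.compress_size
            totals[ext][1] += info.file_size
    return {ext: tuple(sizes) for ext, sizes in totals.items()}
```

Running this on the same book from two scraper runs and comparing the image extensions against the rest should show whether images alone account for the increase.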
Keep in mind that we've moved to the ePub3 format, so the optimization logic will probably differ from what we used to have.
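One candidate step, independent of image handling, is recompressing the ePub container itself. The sketch below is an assumption about what such a step could look like, not the logic we used before; `repack_epub` is a hypothetical name. It keeps the container rule that the `mimetype` entry comes first and is stored uncompressed, and deflates everything else.

```python
import zipfile


def repack_epub(src_path, dst_path, level=9):
    """Rewrite an ePub with deflate compression on every entry except mimetype.

    Hypothetical sketch: entry contents are copied unchanged, only the ZIP
    compression differs. Directory entries are dropped.
    """
    with zipfile.ZipFile(src_path) as src, zipfile.ZipFile(dst_path, "w") as dst:
        names = [i.filename for i in src.infolist() if not i.is_dir()]
        # The mimetype entry must be the first one and must not be compressed.
        if "mimetype" in names:
            dst.writestr(
                "mimetype", src.read("mimetype"), compress_type=zipfile.ZIP_STORED
            )
        for name in names:
            if name == "mimetype":
                continue
            dst.writestr(
                name,
                src.read(name),
                compress_type=zipfile.ZIP_DEFLATED,
                compresslevel=level,
            )
```

Already-compressed images (JPEG, PNG) gain little from this, so it complements image optimization rather than replacing it.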