Book ePubs are not always sufficiently compressed #376

@benoit74

In order to solve #288, #235 and #222, we decided to stop compressing (optimizing) ePubs on our own.

Recent runs and the analysis done in #374 showed that the optimization we were doing on ePubs was not useless after all.

See for example book ID 63630:

| | 2023-08 | 2025-10 |
|---|---|---|
| Size | 265K | 520K |
| URL | https://dev.library.kiwix.org/content/gutenberg_de_all_2023-08/Der%20Einzige%20auf%20der%20weiten%20Welt:%20Ein%20Menschenleben.63630.epub | https://browse.library.kiwix.org/content/gutenberg_de_all_2025-10/Der%20Einzige%20auf%20der%20weiten%20Welt:%20Ein%20Menschenleben.63630.epub |

Or book ID 68838:

| | 2023-08 | 2025-10 |
|---|---|---|
| Size | 438K | 4.6M |
| URL | https://dev.library.kiwix.org/content/gutenberg_de_all_2023-08/Der%20Graf%20von%20Saint-Germain:%20Das%20Leben%20eines%20Alchimisten.68838.epub | https://browse.library.kiwix.org/content/gutenberg_de_all_2025-10/Der%20Graf%20von%20Saint-Germain:%20Das%20Leben%20eines%20Alchimisten.68838.epub |

It is important to note that many ePubs from 2023-08 were missing all their images (including the two examples above, due to #222), but so far this does not seem sufficient to explain the whole file size increase.
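To check how much of the increase comes from images rather than text, one option is to break an ePub's size down by file extension. Below is a minimal sketch using only Python's standard `zipfile` module. The function name `size_by_extension` is hypothetical and not part of the scraper.

```python
import io
import zipfile
from collections import defaultdict
from pathlib import PurePosixPath


def size_by_extension(epub_file):
    """Return {extension: (compressed_bytes, uncompressed_bytes)} for an ePub.

    `epub_file` is a path or a binary file-like object. An ePub is a ZIP
    archive, so the sizes come straight from the ZIP directory entries.
    """
    totals = defaultdict(lambda: [0, 0])
    with zipfile.ZipFile(epub_file) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue
            # Entries without a suffix (e.g. "mimetype") are grouped together.
            ext = PurePosixPath(info.filename).suffix.lower() or "(none)"
            totals[ext][0] += info.compress_size
            totals[ext][1] += info.file_size
    return {ext: tuple(sizes) for ext, sizes in totals.items()}
```

Running this on the 2023-08 and 2025-10 versions of the same book and comparing the per-extension totals should show whether the growth is concentrated in image entries or also affects text and stylesheets.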

I assume it would be safe to:

Keep in mind that we've moved to the ePub3 format, so the optimization logic will probably differ from what we used to have.
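As a starting point for discussion, here is a minimal sketch of a container-level recompression pass, using only Python's standard `zipfile` module. It is not the optimization logic we used to have, and `recompress_epub` is a hypothetical name. It only re-deflates entries and keeps `mimetype` as the first, uncompressed entry, as the EPUB container format requires. It does not touch entry contents, so it will not shrink already-compressed images such as JPEG or PNG by much.

```python
import io
import zipfile

MIMETYPE = "mimetype"


def recompress_epub(src, dst, level=9):
    """Rewrite ePub `src` into `dst`, deflating every entry at `level`.

    `src` and `dst` are paths or binary file-like objects.
    """
    with zipfile.ZipFile(src) as zin, zipfile.ZipFile(dst, "w") as zout:
        names = [i.filename for i in zin.infolist() if not i.is_dir()]
        if MIMETYPE in names:
            # Must be the first entry and stored without compression.
            zout.writestr(
                zipfile.ZipInfo(MIMETYPE),
                zin.read(MIMETYPE),
                compress_type=zipfile.ZIP_STORED,
            )
        for name in names:
            if name == MIMETYPE:
                continue
            zout.writestr(
                name,
                zin.read(name),
                compress_type=zipfile.ZIP_DEFLATED,
                compresslevel=level,
            )
```

Any image-level optimization (resizing, re-encoding) would be a separate step applied to the entry bytes before they are written back.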
