Investigate storing notebook images as files, rather than inline #1672

Eric-Arellano · 2024-07-09T20:06:06Z

Potential benefits to explore:

If Sharp can optimize the images better. https://nextjs.org/docs/messages/install-sharp
If it makes yarn dev and yarn build (with static site builder for preview app) faster
If it reduces the HTML file size, so that the page text arrives before the images finish loading. That would let readers on low bandwidth reach our content sooner.
Right now, users can't expand inline images by clicking on them.
We don't have alt text for these images. Would storing them as files make that possible?

Look at our largest notebook files when investigating, such as by running dust guides/.
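
As a complement to dust, here is a minimal sketch for seeing how much of each notebook is inline image data. It assumes the standard nbformat layout where a code cell's outputs carry a `data` dict keyed by MIME type; the function names `inline_image_chars` and `report` are made up for illustration.

```python
import json
from pathlib import Path


def inline_image_chars(notebook: dict) -> int:
    """Count the characters of image payloads stored inline in cell outputs."""
    total = 0
    for cell in notebook.get("cells", []):
        for output in cell.get("outputs", []):
            for mime, payload in output.get("data", {}).items():
                if mime.startswith("image/"):
                    # nbformat allows either a string or a list of strings.
                    text = payload if isinstance(payload, str) else "".join(payload)
                    total += len(text)
    return total


def report(folder: str) -> list[tuple[str, int, int]]:
    """Return (path, file size in bytes, inline image chars), largest file first."""
    rows = []
    for path in Path(folder).rglob("*.ipynb"):
        notebook = json.loads(path.read_text(encoding="utf-8"))
        rows.append((str(path), path.stat().st_size, inline_image_chars(notebook)))
    return sorted(rows, key=lambda row: row[1], reverse=True)
```

Calling `report("guides")` from the repository root would list the notebooks that are most worth inspecting first.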

Eric-Arellano · 2024-09-06T20:07:47Z

Update: Frank thinks this will be non-trivial to do via Jupyter itself. One complication with storing the image as a file is that the notebook still needs to display it, so the author would need to write code to save the image and then load it for display.

Frank had an idea that we could explore adding a script during the closed source sync that extracts the images from the notebook, saves them as files, and updates the notebook to point to them. Given that complexity and that the pain point isn't that bad, we think this is low priority.

Eric-Arellano · 2024-12-23T22:20:33Z

I agree with Frank's assessment: there is no easy way to always save an image output to a file without the author writing code for it. See https://discourse.jupyter.org/t/cell-magic-to-save-image-output-as-a-png-file/11906/4 and https://discourse.jupyter.org/t/any-way-to-automatically-save-some-outputs-as-separate-files/17651. Jupytext's idea of paired notebooks is an interesting way to decouple inputs from outputs, but it still requires an .ipynb file to hold the outputs.

We could add a post-process step in qiskit/documentation to extract the outputs of a Jupyter notebook and save them in other files. However, I don't think we should pursue it. Downsides:

Adds another step for content authors
Makes output in VSCode and GitHub less useful during development and code review

--

Instead, I agree with Frank's suggestion that this type of post-process step should happen in closed source when we sync the open source content. There, no humans are manually editing the notebook file, so we could split the image outputs out of each cell into dedicated files. Then, we could use Next.js's <Image> component to load each file.
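
A rough sketch of what that sync-time extraction could look like, assuming image outputs are base64 strings under `image/png` or `image/jpeg` in each output's `data` dict. The `extracted_image` metadata key and the file naming scheme are hypothetical, and the real sync step may well be written in another language.

```python
import base64
from pathlib import Path

# MIME types handled by this sketch, mapped to file extensions.
EXTENSIONS = {"image/png": "png", "image/jpeg": "jpg"}


def extract_images(notebook: dict, stem: str, out_dir: Path) -> dict:
    """Write base64 image outputs to files and leave a file reference behind.

    Mutates and returns `notebook`. The "extracted_image" key is a made-up
    convention for this example, not part of nbformat.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    counter = 0
    for cell in notebook.get("cells", []):
        for output in cell.get("outputs", []):
            data = output.get("data", {})
            for mime, ext in EXTENSIONS.items():
                if mime not in data:
                    continue
                payload = data.pop(mime)
                # nbformat allows either a string or a list of strings.
                text = payload if isinstance(payload, str) else "".join(payload)
                file_name = f"{stem}-{counter}.{ext}"
                (out_dir / file_name).write_bytes(base64.b64decode(text))
                output.setdefault("metadata", {})["extracted_image"] = file_name
                counter += 1
    return notebook
```

The renderer would then read the recorded file name and pass it to the image component instead of embedding the base64 payload in the HTML.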

This post-processing would complicate the download notebooks button because our local copy of the Jupyter notebook in the closed source repo would have had its images stripped out. However, I think it is perfectly acceptable for the downloaded notebook to be "clean" and not have any outputs at all inside. The user can refer back to the website or qiskit/documentation GitHub to see the outputs we have. So, our code for downloading notebooks should strip all outputs, both images (already removed during the open source sync process) and text.
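
Stripping every output before download can be sketched as follows, assuming the standard nbformat structure where code cells have `outputs` and `execution_count` fields. The function name `strip_outputs` is illustrative only.

```python
import copy


def strip_outputs(notebook: dict) -> dict:
    """Return a copy of the notebook with all code-cell outputs removed."""
    clean = copy.deepcopy(notebook)
    for cell in clean.get("cells", []):
        if cell.get("cell_type") == "code":
            cell["outputs"] = []
            # Reset the counter so the notebook looks like it was never run.
            cell["execution_count"] = None
    return clean
```

Working on a copy keeps the synced notebook untouched, so the same source can still feed the page renderer.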

--

We need to add alt text to the images. I think we have two options:

Require setting alt text as cell metadata. Lint for this in qiskit/documentation, and have iqp-channel-docs apply the alt text.
Auto-generate a description like "output from previous code". I'm skeptical this is helpful enough; it's useful to distinguish between things like "circuit diagram" and "sampler output".
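
The first option's lint could look roughly like this. The `alt_text` metadata key is an assumption for illustration, since the actual key name has not been decided.

```python
ALT_TEXT_KEY = "alt_text"  # hypothetical cell metadata key


def cells_missing_alt_text(notebook: dict) -> list[int]:
    """Return indexes of cells that output an image but have no alt text."""
    missing = []
    for index, cell in enumerate(notebook.get("cells", [])):
        has_image = any(
            mime.startswith("image/")
            for output in cell.get("outputs", [])
            for mime in output.get("data", {})
        )
        alt = cell.get("metadata", {}).get(ALT_TEXT_KEY, "")
        # Treat whitespace-only alt text as missing.
        if has_image and not str(alt).strip():
            missing.append(index)
    return missing
```

A CI check would fail when the returned list is non-empty and report the cell indexes to the author.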

--

Update Dec 26, 2024: Consider whether #2520 changes anything, given that we now expect users to take a semi-manual step to use AVIF images in MDX files.

Eric-Arellano added the infra 🏗️ label Jul 9, 2024

Eric-Arellano added this to Docs Planning Jul 9, 2024

Eric-Arellano mentioned this issue Jul 10, 2024

Consider adding a check for notebook (and image?) file size #1678

Open

Eric-Arellano mentioned this issue Dec 26, 2024

Convert all images to avif file format #2520

Open
