Add a `--debug` flag to the CLI to help retrieve more logs. #941

almet · 2024-10-07T14:34:05Z

While adding Github Issue Templates on this repository (#939) we found that the commands we require the user to enter in their terminals can be quite complex. Some of them might require some bash-fu, for instance to define the location of the custom seccomp profile we are using, as it differs depending the OS.

This is a proposal to add a --debug flag to the dangerzone-cli to simplify the process of generating proper logs.

When the flag is set:

The RUNSC_DEBUG=1 environment variable is added to the outer container ;
the stderr from the outer container is attached to the exception, and displayed to the user on failures.

Info: tests are currently failing and I didn't put too much effort in making them pass, as this is here mainly to see if it could make sense to add this in the first place.

apyrgio

That's a nice step towards getting debug logs from users without asking them to run weird Docker/Podman commands. I've commented on a few things that we can improve, but overall I like where this is going.

dangerzone/cli.py

dangerzone/conversion/errors.py

dangerzone/cli.py

almet · 2024-10-14T11:00:56Z

This seems correlated to #442.

apyrgio · 2024-11-21T13:45:13Z

We had a discussion with Alexis. He mentioned that adding the --debug flag makes the Dangezone CLI hang. The reason behind this bug is the fact that we read from stderr only after the conversion is done. However, with RUNSC_DEBUG=1, gVisor logs to stderr early, and thus the container hangs, because we don't read from the pipe.

almet · 2024-11-26T16:34:06Z

I've rebased this branch on top of the latest main (actually replayed the changes manually) and it shows that it hangs.

almet · 2024-12-18T16:03:46Z

This is currently blocking, because the docker process writes to stderr but the pipe is never called by the other side, resulting in a process hang (the buffer is full and so it stops there).

Our conversion process is currently running in a thread pool,

After a discussion with @apyrgio this morning on the matter, there seem to be two different outcomes for this:

One "involved" way where we change all of the caller code to be async;
We start a thread reading stderr

I went with the second option for now, but the first one could also make sense.

dangerzone/isolation_provider/base.py

dangerzone/isolation_provider/container.py

dangerzone/logic.py

dangerzone/isolation_provider/base.py

apyrgio

Thanks a lot for the changes. I think we're 🤏 close to merging it. I've left a few comments, but overall I like the new approach.

Oh, let's also have an issue to ask people to use the --debug flag, once we have a release out.

apyrgio · 2025-01-08T15:13:41Z

dangerzone/isolation_provider/base.py

+                self.stderr.seek(0)
+                debug_log = read_debug_text(self.stderr, MAX_CONVERSION_LOG_CHARS)


Hm, if the stderr thread is still writing to the BytesIO buffer (e.g., due to a race), then seeking to 0 risks overwriting the first bytes in the buffer. Also, we may get incomplete logs, without us realizing, which was not the case before (because we use p.poll() is not None.

I propose we make some minor adjustments to the above logic:

Check if the thread is alive after waiting for 1 second (.join(timeout=1) and .is_alive())

Grab the BytesIO buffer with BytesIO.getvalue(), which doesn't require seeking.

Adjust read_debug_text() to sanitize the bytes buffer of (2), instead of reading the BytesIO object.

In case the thread is still alive, add an (incomplete) indication to the ----- DOC TO PIXELS LOG START ----- message.

How does that sound, do you have a better idea?

I've added this in 3a56f51, let me know if that suits you.

Nice, thanks a lot for the change. I have one important comment, and then some nits that you can ignore if you want to:

I think that MAX_CONVERSION_LOG_CHARS will actually truncate the debug output of gVisor. It amounts to 150 lines of 50 chars roughly, but I think gVisor's debug logs are more than that. Now that we have a non-blocking way to read the stderr, I think we can drop the MAX_CONVERSION_LOG_CHARS constant altogether, since it no longer serves the original purpose.

Nits:

I'd maybe rename read_debug_text() to sanitize_debug_text().

It would be nice to know if the debug log is incomplete, without scrolling to the end.

About MAX_CONVERSION_LOGS_CHARS, I felt exactly the same so thanks for the feedback! Dropped now and renamed the function in 8532688

About the "incomplete" log, I feel that having in the end is enough as it's where we would expect to have some more info. I might be missing some practical information you have. Do you have anything specific in mind?

About MAX_CONVERSION_LOGS_CHARS, I felt exactly the same so thanks for the feedback! Dropped now and renamed the function in 8532688

Great, thanks!

About the "incomplete" log, I feel that having in the end is enough as it's where we would expect to have some more info. I might be missing some practical information you have. Do you have anything specific in mind?

Nothing fancy, something like:

DOC_TO_PIXELS_LOG_START = "----- DOC TO PIXELS LOG START -----" DOC_TO_PIXELS_LOG_START_INCOMPLETE = "----- DOC TO PIXELS LOG START (incomplete) -----" log_header = DOC_TO_PIXELS_LOG_START if stderr_thread.is_alive(): log_header = DOC_TO_PIXELS_LOG_START_INCOMPLETE log.info( "Conversion output (doc to pixels)\n" f"{log_header}\n" f"{debug_log}" # no need for an extra newline here f"{DOC_TO_PIXELS_LOG_END}" )

But I'll leave it to you, really. I've approved the PR, so you can resolve this thread and merge it.

dangerzone/isolation_provider/container.py

dangerzone/logic.py

dangerzone/isolation_provider/base.py

When the flag is set, the `RUNSC_DEBUG=1` environment variable is added to the outer container, and stderr is captured in a separate thread, before printing its output.

almet mentioned this pull request Oct 7, 2024

Use github issue templates #939

Merged

apyrgio reviewed Oct 7, 2024

View reviewed changes

dangerzone/cli.py Outdated Show resolved Hide resolved

dangerzone/conversion/errors.py Outdated Show resolved Hide resolved

dangerzone/cli.py Show resolved Hide resolved

apyrgio mentioned this pull request Oct 17, 2024

Perform on-host conversion for the pixels to PDF stage #748

Merged

almet force-pushed the debug-cli branch from 12fa830 to 981fcd4 Compare October 17, 2024 14:20

almet force-pushed the debug-cli branch from 981fcd4 to 9810ae4 Compare November 26, 2024 16:22

almet force-pushed the debug-cli branch from 9810ae4 to d66af44 Compare December 18, 2024 10:11

almet force-pushed the debug-cli branch 2 times, most recently from bea256d to 4f4c523 Compare December 19, 2024 09:14

apyrgio reviewed Jan 7, 2025

View reviewed changes

almet force-pushed the debug-cli branch 3 times, most recently from 9e86988 to a4e34ad Compare January 8, 2025 10:46

apyrgio reviewed Jan 8, 2025

View reviewed changes

almet added 2 commits January 13, 2025 15:07

Add a --debug flag to the CLI to help retrieve more logs

9705e92

When the flag is set, the `RUNSC_DEBUG=1` environment variable is added to the outer container, and stderr is captured in a separate thread, before printing its output.

FIXUP: join the thread before writing output

3a56f51

almet force-pushed the debug-cli branch from a4e34ad to 3a56f51 Compare January 13, 2025 14:54

almet added 3 commits January 13, 2025 16:08

FIXUP: Do not attach the stderr to the base object

3b961e6

FIXUP: use proc_stderr when needed

f6a616f

FIXUP: remove string tailcut, rename function

8532688

almet force-pushed the debug-cli branch from f76ec36 to 8532688 Compare January 14, 2025 13:31

apyrgio approved these changes Jan 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a `--debug` flag to the CLI to help retrieve more logs. #941

Add a `--debug` flag to the CLI to help retrieve more logs. #941

almet commented Oct 7, 2024 •

edited

Loading

apyrgio left a comment

almet commented Oct 14, 2024

apyrgio commented Nov 21, 2024

almet commented Nov 26, 2024

almet commented Dec 18, 2024

apyrgio left a comment

apyrgio Jan 8, 2025

almet Jan 13, 2025

apyrgio Jan 14, 2025

almet Jan 14, 2025

apyrgio Jan 14, 2025 •

edited

Loading

		self.stderr.seek(0)
		debug_log = read_debug_text(self.stderr, MAX_CONVERSION_LOG_CHARS)

Add a --debug flag to the CLI to help retrieve more logs. #941

Are you sure you want to change the base?

Add a --debug flag to the CLI to help retrieve more logs. #941

Conversation

almet commented Oct 7, 2024 • edited Loading

apyrgio left a comment

Choose a reason for hiding this comment

almet commented Oct 14, 2024

apyrgio commented Nov 21, 2024

almet commented Nov 26, 2024

almet commented Dec 18, 2024

apyrgio left a comment

Choose a reason for hiding this comment

apyrgio Jan 8, 2025

Choose a reason for hiding this comment

almet Jan 13, 2025

Choose a reason for hiding this comment

apyrgio Jan 14, 2025

Choose a reason for hiding this comment

almet Jan 14, 2025

Choose a reason for hiding this comment

apyrgio Jan 14, 2025 • edited Loading

Choose a reason for hiding this comment

Add a `--debug` flag to the CLI to help retrieve more logs. #941

Add a `--debug` flag to the CLI to help retrieve more logs. #941

almet commented Oct 7, 2024 •

edited

Loading

apyrgio Jan 14, 2025 •

edited

Loading