WIP: Assume MOK is already enrolled, add tests #19

crichez · 2025-01-05T16:47:04Z

Objectives

Don't enroll machine owner keys as part of the role, pass them in as an argument. Also add tests.

Detailed Changes

uki_config is renamed to uki: this simplifies argument names.
Machine owner keys must be enrolled before the role starts: we still verify enrollment to avoid bricking a host, but it is now the caller's responsibility to enroll. This was impossible to automate, so doesn't belong in an Ansible role.
make test now runs integration tests: using libvirt, xorriso, GNU make, and a single configuration file. This closes Pick a testing platform #18.

This has been tested with `python -m pip -r requirements.txt` and has been proven to build an enviornment sufficient to run the test suite.

Still don't have an interactive VM at this point, I fear it may be crashing on boot. We will try using the default OVMF firmware images and see if that solves our problems.

This commit configures a testing environment for the 'uki' role. To run these tests, libvirt and qemu are required. The machine runs and configures well, but this commit is still missing configuration instructions, and tests still do not pass. This is a milestone commit that accomplishes the environment setup, most future commits will focus on the tests and the collection themselves.

This commit does not provide a complete testing setup. We are running into a limitation on ansible-test, where we can't provide a custom inventory file without using the libvirt inventory plugin, but we can't use SELinux with that plugn. We need to switch to a Makefile or something else to generate an inventory file, then we can delegate properly.

This commit changes the test VM configuration to not require become or sudo. This only works with SELinux disabled due to a bug in the passt policy that prevents the user from writing to the passt socket. Tests still don't run, we don't inherit the tempdir variable between the two plays we try to run in setup.yml. This will be fixed by writing it externally, but we should do our work in a separate temporary directory instead of the workspace directory to make tracking changes easier.

This is meant to be set by the user and should not have been committed.

This commit changes paths and variable inheritance logic to ensure we can run make test through 'ansible-test' instead of a raw script. Currently this has lead to some odd behavior where libvirt is spawning the qemu, passt and other supporting processes, but then loses track of the domain entirely. I went from using staff to unconfined, so we may need to reset the system for this to work properly. This is still a milestone commit since we have a spawning VM.

This change avoids issues we were having with libvirt-python starting machine processes but losing track of them. We now have a working machine that we can authenticate with, and a makefile that has (almost) perfectly working dependencies. There are still oddities related to the machine rule, since the name of the machine file is not predictable. But it mostly works, and this is another WIP commit, since we don't actually have the role passing yet. The next step is to copy the MOK files to the test machine, and re-read the role to ensure we didn't make any mistakes or depend on things we didn't respect during machine setup.

This commit now uses the PID file of the VM to detect whether it needs to be created.

We might have found a bug in libvirt? If I template the firmware image file that is generated by virt-fw-vars, the signature database for the MokList variable gets mangled somehow, and the certificate fingerprint changes. If I use the immediate output of virt-fw-vars, everything works fine.

We now install pip from the ensurepip module, since we were getting some missing library errors using ansible.builtin.package.

Note there is a bug right now where the openssh server become available before the python3-libdnf5 package is installed by cloud-init. This results in the package module failing due to an unmet dependency. The fix is to wait longer or for a different condition, but we're committing this as is because it's our first ever successful test run.

This is a dependency for Ansible that isn't included in the image for some reason.

This commit moves most of the test setup logic out of the makefile and into a new setup playbook. We still use make, but mostly as a convenience tool to avoid passing all those arguments to ansible-playbook. There is now a teardown target that stops machines, but doesn't remove images and configuration. The testimage.yml file is now a platforms.yml file that lists platforms, the url to download a cloud image, and the prefix directory before shim (fedora, centos, redhat, etc). The looping mechanics in ansible are used to generate an inventory file using all of these platforms, which allows us to run tests in parallel.

Add a new teardown rule to Makefile that removes test machines without removing built files sot hey can quickly be spun up again. Fix detection of UKI rebuild need. Note we should also inspect whether the certificate and/or private key changed, not just their file paths are those will likely be the same. Added conditions to only reboot if the UKI was rebuilt.

tests/integration/targets/role_uki/setup.yml

We now verify that key identity is our condition for idempotency, not key nickname.

Previously unnecessarily inserted uki signature verification tasks between two postinst script installation tasks.

This adds an additional pre-reboot check: that BootNext is set to the UKI for the kernel version we installed. We now call kernel-install directly in case the package for the kernel version we need is not available, in which case apt will fail silently.

This avoids verification issues with pesigcheck.

pesigcheck is used in favor of osslsigncode since it is available on CentOS and RHEL. Detection of the BootNext EFI variable was broken on ubuntu since it apparently doesn't include the 'UKI' tag, so a special case was added.

This should avoid reusing addresses.

This allows the system to upgrade itself instead of relying on cloning a repository.

roles/uki/files/zz-kernel-install

crichez · 2025-02-04T00:40:42Z

roles/uki/tasks/main.yml

+        - name: Reboot the host to try and boot from the UKI
+          become: true
+          ansible.builtin.reboot:
+            reboot_timeout: 300


It doesn't really make much sense to try and recover from this error, we should print some helpful debug information and exit.

crichez · 2025-02-04T00:42:43Z

roles/uki/tasks/main.yml

+    - name: Backup virt-firmware boot validation service
+      when:
+        - ansible_facts.distribution_file_variety == 'Debian'
+        - boot_svc_search.stat.exists
+      ansible.builtin.slurp:
+        src: "{{ boot_svc_path }}"
+      register: boot_svc_backup
+      changed_when: false
+
+    - name: Install virt-firmware boot validation service from git
+      when: ansible_facts.distribution_file_variety == 'Debian'
+      ansible.builtin.copy:
+        src: "{{ clone_dir.path }}/systemd/{{ boot_svc_name }}"
+        dest: "{{ boot_svc_path }}"
+        owner: root
+        group: root
+        mode: "0644"
+        setype: systemd_unit_file_t


All of these Debian-conditioned tasks should likely be moved to a different tasks file so we don't pollute the playbook run output with skipped tasks. It's getting a little difficult to figure out what's going on.

tests/integration/targets/role_uki/test.yml

this is a debian-only change

crichez · 2025-02-08T04:31:38Z

roles/uki/vars/main.yml

+---
+uki_mok:
+  database_path: /etc/pki/pesign
+  friendly_name: mok


It turns out that this method causes some interface stability issues. If we nest all of our variables in a dictionary, the user needs to declare each key, not just the ones they want to modify. There is likely a way around this by merging the default dictionary and the one provided by the caller, but it's bad form in my opinion. I think we need to change this, and it's a good opportunity to revisit some naming ideas.

The role should be renamed to kernel, since what it essentially does is modify the kernel layout and desired signature state. We then have to add the "kernel_" prefix to all of our arguments. We should flatten the structure as discussed, and it might look like this:

kernel_layout: uki kernel_sign: true kernel_signing_tool: pesign kernel_pesign_key: mok kernel_pesign_db: /etc/pki/pesign

Eventually as we reintroduce support for sbsign, other UKI initramfs generators:

kernel_signing_tool: sbsign kernel_sbsign_key: /etc/kernel/mok.priv kernel_sbsign_cert: /etc/kernel/mok.pem kernel_initrd_generator: dracut kernel_uki_generator: ukify

This change aligns the layout of the integration tests directory with the expected structure of collection tests, even though we don't use ansible-test. This lays the groundwork for splitting test components into their own tasks files and reusing more tasks.

Christopher Palmer-Richez and others added 30 commits September 3, 2024 16:51

Add new "uki" role that doesn't enroll MOKs.

77b5e1b

Add libvirt-python and libxml as project requirements.

d6b98ff

This has been tested with `python -m pip -r requirements.txt` and has been proven to build an enviornment sufficient to run the test suite.

Add requirements file

727a926

Fix splitext call to extract index

08a9d74

Improve tests to the point that our machine starts.

04ffbad

Still don't have an interactive VM at this point, I fear it may be crashing on boot. We will try using the default OVMF firmware images and see if that solves our problems.

Remove VM template, will use virt-install instead.

14374dd

Remove customization from test_image variable.

cb0c9bd

This is meant to be set by the user and should not have been committed.

Remove unused config.yml in integration test dir.

334c3a2

Remove workflows.

92749b7

Fixed detection of VM dependency state.

6e9ec3d

This commit now uses the PID file of the VM to detect whether it needs to be created.

Fix quoting, remove unnecessary dependencies

679e091

Fix mok der file extension, use become

e6ef983

Reduce delay to 25 sec, copy MOK files before test.

9413236

Fixed cert enrollment detection, pip install

c22b22f

We now install pip from the ensurepip module, since we were getting some missing library errors using ansible.builtin.package.

Don't re-create vm, use ssh-keygen for host keys

7790116

Install python3-libdnf5 on first boot

9b8ede9

This is a dependency for Ansible that isn't included in the image for some reason.

Wait for cloud-init to install libdnf5 bindings.

c44b833

Clean known_hosts, rebuild machine on xml change

0259728

Write console to file, extract and trust host keys

56cb01f

Remove unused task files and playbooks

3b57223

Removed unused test script

f7505b3

crichez commented Jan 22, 2025

View reviewed changes

tests/integration/targets/role_uki/setup.yml Outdated Show resolved Hide resolved

Christopher Palmer-Richez added 17 commits January 22, 2025 16:31

Remove dep on arp tables, get ips from console

63aa35e

Add more failure mode and idempotency tests

2167bd6

We now verify that key identity is our condition for idempotency, not key nickname.

Fix task ordering

e12d476

Previously unnecessarily inserted uki signature verification tasks between two postinst script installation tasks.

Add a test to ensure UKIs are removed on Debian

fdb659d

Refactoring and cleanup

55195f0

Fix host key removal to match new host detection

668fc0c

Remove unnecessary block from host key add

3290494

Linting pass

e8210c4

Remove old uki_config role

1d2a857

Add key usage attributes to generated mok

fe20e3d

This avoids verification issues with pesigcheck.

Remove osslsigncode dependency

5aa825b

Use pesigcheck, not osslsigncode, fix nextboot re

3328006

pesigcheck is used in favor of osslsigncode since it is available on CentOS and RHEL. Detection of the BootNext EFI variable was broken on ubuntu since it apparently doesn't include the 'UKI' tag, so a special case was added.

Remove short_mac, start randomizing addresses

0d4372e

This should avoid reusing addresses.

Don't track workspace directories

9ec5f5d

Remove pip dependency

b398ff6

Get uki-direct from dnf on RedHat

b6511e6

This allows the system to upgrade itself instead of relying on cloning a repository.

crichez commented Feb 4, 2025

View reviewed changes

roles/uki/files/zz-kernel-install Outdated Show resolved Hide resolved

crichez commented Feb 4, 2025

View reviewed changes

tests/integration/targets/role_uki/test.yml Outdated Show resolved Hide resolved

Christopher Palmer-Richez added 6 commits February 3, 2025 21:39

Update README.md

35d70b4

Formatting fixes

d8b034e

Add recovery instructions, more references.

ef7b615

Fix callout syntax

e59036a

Fix incorrect platforms.yml file example

367634e

Only run kernel-install when layout is uki

84ee4ae

this is a debian-only change

crichez commented Feb 8, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Assume MOK is already enrolled, add tests #19

WIP: Assume MOK is already enrolled, add tests #19

crichez commented Jan 5, 2025 •

edited

Loading

crichez Feb 4, 2025

crichez Feb 4, 2025

crichez Feb 8, 2025

crichez Feb 8, 2025

WIP: Assume MOK is already enrolled, add tests #19

Are you sure you want to change the base?

WIP: Assume MOK is already enrolled, add tests #19

Conversation

crichez commented Jan 5, 2025 • edited Loading

Objectives

Detailed Changes

crichez Feb 4, 2025

Choose a reason for hiding this comment

crichez Feb 4, 2025

Choose a reason for hiding this comment

crichez Feb 8, 2025

Choose a reason for hiding this comment

crichez Feb 8, 2025

Choose a reason for hiding this comment

crichez commented Jan 5, 2025 •

edited

Loading