-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rocky Linux images resulting in PAM sudo error #56
Comments
Heya Jeff - Thanks for the detailed report. I'll check this out -- nothing springs to mind but it's totally possible something has changed due to our use of kiwi to build the container root filesystems since 9.4. |
Thanks for your patience. We had an outage with our powerpc cluster yesterday I had to work on. The error here indicates that pam isn't able to resolve the user running the container to anything in its database(s). Would it be possible to get the contents of /etc/shadow and /etc/passwd on an affected instance, as well as the output of |
This has been happening for a few weeks to a couple of months but...I can no longer reproduce this with any of the test cases I put into geerlingguy's repo (geerlingguy/docker-rockylinux9-ansible#6 (comment)) besides a GitHub Actions runner. Going to go pull out that troubleshooting info you asked for, but wanted to add that new nuance. Trying on a Ubuntu 22.04 cloud instance, leaving notes: Don't see PAM errors in the rocky linux container now. I see docker-ce released a new minor version 2 days ago (27.5.0). But pinning back to the older docker-ce ( |
ansible/molecule#4365 is reporting this same error running on GitHub Actions with |
Looks like the Action runners use Docker-CE 26.x. My suspicion is this is from a kernel/syscall error on the Docker end.
id -u
cat /etc/shadow
cat /etc/passwd
|
Although I'm not sure why but we are seeing the same issue even when running the GitHub Action using Podman instead of Docker. |
Heya folks - Have not forgotten about this but looks like it's just moving into other areas and doesn't feel deterministic. For example, in the OpenStack-Ansible project, we've been having failures like this due to AppArmor when running CentOS Stream 9 containers on Ubuntu hosts, but not Rocky. Has anyone seen any root cause analysis on this yet? I'm struggling to see common threads to look down. |
I have not seen anything more, unfortunately :( I haven't had time to dig any deeper. |
It looks like (ansible task) - name: Debugging
changed_when: false
register: thing
ansible.builtin.shell:
cmd: |
echo ""
ls -l /etc/shadow
ls -l /etc/passwd
echo ""
- name: Print debug
ansible.builtin.debug:
var: thing.stdout @andtra realized that in geerlingguy/docker-rockylinux9-ansible#6 (comment) I can reproduce that as the source issue as well FWIW this also seems to affect RHEL and Oracle Linux, but not AlmaLinux. |
When I'm running my
geerlingguy/docker-rockylinux9-ansible
containers in CI in GitHub Actions to test my Ansible projects, I have been seeing the following errors whenever running a task withsudo
/`become:Other users have reported the same, for both Rocky Linux 8 and 9, for the past few weeks. For example: geerlingguy/docker-rockylinux9-ansible#6
This error is not reproducible on a Mac running Docker Desktop, but it is in instances running
docker-ce
or on GitHub Actions. We usesudo
in the container because it is testing/verifying playbooks that are run against instances wheresudo
may be required.In the past this was never an issue; it seems like it could be also related to the
yum install sudo
command that I run that updates PAM (perhaps?) in my Ansible/Docker project: https://github.com/geerlingguy/docker-rockylinux9-ansible/blob/master/Dockerfile#L22Is there something that's changed in Rocky Linux lately that could be causing this?
The text was updated successfully, but these errors were encountered: