Auotmate resource auditing for account #2148

berryd · 2024-03-25T19:30:33Z

Description

The purpose of this PR is to prototype a scheduled job which produces an automated report. The report is scheduled to run once a week, on Monday morning, and will create a downloadable zip file containing four distinct CSV files. The job also posts JSON output for each of the four reports within the build log, allowing viewing without needing to download and subsequently open the spreadsheets.

Under the hood, this report is scanning the tagging API output which returns all active resources. Two other approaches were investigated and the result of that work can be reviewed in comments in this ticket. The tagging API is a relatively new feature and allows for querying and applying tags to resources.

In our application stack, our CI pipeline applies a tag,{"Key":"STAGE","Value":"[stage_name]"}, to every resource directly created by the generated Cloudformation template. Some child resources, such as rules, do not inherit these mapping and will subsequently be reported in the untagged report. What this audit is effectively doing is comparing tags against branch names, using the derived branch name for Snyk and Dependabot generated stages. There are effectively four categories of reports for this resource audit:

CI Active

These resources were created by our CI pipeline, and have an active branch in the repository.

CI Inctive

These resources were created by our CI pipeline, but there is no matching branch. This is likely infrastructure that was not cleaned up during a destroy action, but in any case, this is orphaned infrastructure.

CF Other

These resources have tags and are assumed to have been created by a Cloudformation template but they do not have the STAGE tag and therefore were not created by our CI pipeline. These were likely created by CMS Cloud Engineering teams. It's possible to manually tag a resource, and those resources would also appear in this report.

Untagged

These resources have no tags. They could have been manually created, or they could be child resources that simply don't inherit tags.

Related ticket(s)

CMDCT-3410

How to test

Workflow (note that this will vanish after this PR is merged)

Download the artifacts from the above link, and verify the contents.

Important updates

At some point in the future we may want to automate reporting of certain items produced by these reports, such as active stages (did you push a branch and forget about it), and we could also amend this script to check for open PRs against branches. We could possibly introduce a culture change where we're expecting engineers to create draft PRs for any active branch and use this to report on forgotten work, but that's not something I'm necessarily advocating for at this time. At the minimum, this should improve observability to orphaned and neglected resources in any account to which this automated workflow is operating against.

Author checklist

I have performed a self-review of my code
I have added thorough tests, if necessary
I have updated relevant documentation, if necessary

convert to a different template: test → val | val → prod

.github/workflows/audit-account.yml

gmrabian

I'm not strong enough in bash to know if this works as intended but I do like the idea and see no harm in trying it

berryd · 2024-03-28T13:40:32Z

I'm not strong enough in bash to know if this works as intended but I do like the idea and see no harm in trying it

It really doesn't affect an app in any manner other than providing some observability. We still have to go hunt for the report, but I'm hoping once this is proven, we can use it to start automating notifications. One thing that is apparent, is that our destroy scripts are not very reliable and we're leaking a lot of infrastructure.

codeclimate · 2024-03-28T13:54:43Z

Code Climate has analyzed commit 8b66f06 and detected 0 issues on this pull request.

The test coverage on the diff in this pull request is 100.0% (90% is the threshold).

This pull request will bring the total coverage in the repository to 73.4%.

View more on Code Climate.

Auotmate resource auditing for account

396b712

github-actions bot assigned berryd Mar 25, 2024

berryd added 2 commits March 25, 2024 16:24

Set retention policy for artifacts

05372c0

Remove event triggers that enable job to run on ephems

8333a2e

berryd requested a review from dwhitestratiform March 26, 2024 16:25

berryd added the ready for review Ready for all the reviews! label Mar 26, 2024

berryd marked this pull request as ready for review March 26, 2024 16:48

berryd requested review from BearHanded, braxex and ailZhou as code owners March 26, 2024 16:48

dwhitestratiform reviewed Mar 26, 2024

View reviewed changes

.github/workflows/audit-account.yml Outdated Show resolved Hide resolved

dwhitestratiform reviewed Mar 26, 2024

View reviewed changes

.github/workflows/audit-account.yml Outdated Show resolved Hide resolved

gmrabian previously approved these changes Mar 27, 2024

View reviewed changes

Up retention, set file to be able to test, and remove jq install

d7b553a

berryd dismissed gmrabian’s stale review via d7b553a March 28, 2024 13:33

dwhitestratiform previously approved these changes Mar 28, 2024

View reviewed changes

Set job to be isoalted to the integration branch

8b66f06

berryd dismissed dwhitestratiform’s stale review via 8b66f06 March 28, 2024 13:35

berryd closed this Mar 28, 2024

berryd reopened this Mar 28, 2024

dwhitestratiform approved these changes Mar 28, 2024

View reviewed changes

berryd merged commit 85ec899 into master Mar 28, 2024
181 checks passed

berryd deleted the automate-audit branch March 28, 2024 14:09

berryd mentioned this pull request Apr 1, 2024

Add workflow that creates scheduled job to audit AWS account Enterprise-CMCS/macpro-mdct-mfp#514

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auotmate resource auditing for account #2148

Auotmate resource auditing for account #2148

berryd commented Mar 25, 2024 •

edited

Loading

gmrabian left a comment

berryd commented Mar 28, 2024

codeclimate bot commented Mar 28, 2024

Auotmate resource auditing for account #2148

Auotmate resource auditing for account #2148

Conversation

berryd commented Mar 25, 2024 • edited Loading

Description

CI Active

CI Inctive

CF Other

Untagged

Related ticket(s)

How to test

Important updates

Author checklist

gmrabian left a comment

Choose a reason for hiding this comment

berryd commented Mar 28, 2024

codeclimate bot commented Mar 28, 2024

berryd commented Mar 25, 2024 •

edited

Loading