Fix spider: cuya_audit #81

haileyhoyat · 2024-01-25T16:29:51Z

What's this PR do?
Fixes our Cuyahoga County Audit Advisory Committee spider (aka. cuya_audit), which broke due to URL and page structure changes across Cuyahoga County's website.

Why are we doing this?
To ensure our Cuyahoga County Audit spider works. The changes in this PR include use of a new class mixin for scraping the Cuyahoga County website.

Steps to manually test
Clone this repo, run the command: cd city_scrapers && scrapy crawl cuya_audit

Are there any smells or added technical debt to note?
No

haileyhoyat · 2024-01-25T16:45:38Z

@SimmonsRitchie I think you'll be proud of me on this one. I utilized the CuyaMixin2, DRY, all tests pass, no dead code, noqa only for long lines.

SimmonsRitchie · 2024-01-29T16:07:04Z

city_scrapers/spiders/cuya_audit.py

    location = {
        "name": "County Headquarters, 4-407 Conference Room B",
        "address": "2079 East 9th St Cleveland, OH 44115",
    }


Small request here, Hails. Could you please delete this class variable? In this case, CuyaCountyMixin2 is handling the parsing of location info from the page – so this variable is essentially unused.

Just in case there's any confusion here: I know in the city-scrapers-indianapolis PR we discussed treating location info as a constant via class variables. I think that approach makes sense for certain agencies that don't display location information on their meeting detail pages and we generally know a meeting is always going to be in the same place. In cases where location information is reliably displayed on meeting detail pages – which appears to be the case with Cuyahoga County agencies – I think it makes sense to parse it from the page. Since meeting venues can occasionally shift around (especially between years), I think this increases the chance we're scraping good data longterm.

SimmonsRitchie

This looks great, @haileyhoyat! Great job. The code looks nice and clean and the new tests pass 🥇

I have only one little comment, re: class var. If you can remove that var, I'll approve the PR and you can merge it. 🚀

SimmonsRitchie

LGTM! Merge away 🚀

haileyhoyat added 2 commits January 25, 2024 11:23

revise cuya_audit spider

8ce3ceb

add pytest files

3825ca8

haileyhoyat requested a review from SimmonsRitchie January 25, 2024 16:43

SimmonsRitchie reviewed Jan 29, 2024

View reviewed changes

SimmonsRitchie suggested changes Jan 29, 2024

View reviewed changes

remove location variable

1fd2f4b

haileyhoyat requested a review from SimmonsRitchie January 29, 2024 20:56

SimmonsRitchie approved these changes Jan 29, 2024

View reviewed changes

haileyhoyat merged commit c2cb3fa into City-Bureau:main Jan 29, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix spider: cuya_audit #81

Fix spider: cuya_audit #81

haileyhoyat commented Jan 25, 2024

haileyhoyat commented Jan 25, 2024

SimmonsRitchie Jan 29, 2024 •

edited

Loading

SimmonsRitchie left a comment •

edited

Loading

SimmonsRitchie left a comment

Fix spider: cuya_audit #81

Fix spider: cuya_audit #81

Conversation

haileyhoyat commented Jan 25, 2024

haileyhoyat commented Jan 25, 2024

SimmonsRitchie Jan 29, 2024 • edited Loading

Choose a reason for hiding this comment

SimmonsRitchie left a comment • edited Loading

Choose a reason for hiding this comment

SimmonsRitchie left a comment

Choose a reason for hiding this comment

SimmonsRitchie Jan 29, 2024 •

edited

Loading

SimmonsRitchie left a comment •

edited

Loading