1972 replace covidcast #2004

aysim319 · 2024-07-26T20:29:50Z

Description

refactored covidcast that under the hood uses a for loop for each day to grab signals

Changelog

Itemize code/test/documentation changes and files added/removed.

replaced instances of covidcast.signal and covidcast.metadata with respective epidata api calls

Associated Issue(s)

Addresses #(1972)
Addresses #(1931)
Addresses #(1987)

aysim319 · 2024-07-30T18:59:31Z

~~The refactoring takes more time than the original, possibly because the refactored grabs the whole data instead of truncated data, so longer processing time to go through the validator~~

~~cprofile_google_symptoms.txt~~
~~refactored_cprofile_google_symptoms.txt~~

edited:
removing the issues as it's not used in the other signal calls in google symptoms and sircomplainsalot and it did go after and the refactored optimizer has the same result as main

runs: 171.824 secs
opt_cprofile_google_symptoms.txt
opt_profile_google_symptoms.log

runs: 265.703
profile_google_symptoms.log
cprofile_google_symptoms.txt

melange396 · 2024-07-31T21:47:50Z

@aysim319 can you explain your previous comment a little more? what do each of those files represent? the first two files appear to be telling me that this branch runs a little bit slower (281s) than the dev branch (267s)... but what operation are they performing (ie, what commands and arguments did you use to run these samples)?

_delphi_utils_python/delphi_utils/covidcast_wrapper.py

_delphi_utils_python/delphi_utils/validator/datafetcher.py

_delphi_utils_python/delphi_utils/validator/dynamic.py

_delphi_utils_python/tests/test_covidcast_wrapper.py

aysim319 · 2024-07-31T23:39:18Z

@aysim319 can you explain your previous comment a little more? what do each of those files represent? the first two files appear to be telling me that this branch runs a little bit slower (281s) than the dev branch (267s)... but what operation are they performing (ie, what commands and arguments did you use to run these samples)?

The first time I ran the comparison the profiler was taking more time because of the issues param that I thought I also needed to figure out to format to make the call to epidata. I later found out I didn't need to pass along the issues param which fixed both the speed and the difference between the validator result.

_delphi_utils_python/tests/test_covidcast_wrapper.py

_delphi_utils_python/delphi_utils/validator/datafetcher.py

dshemetov

Want to make sure the tests we have are solid, so made some a few change requests.

testing_utils/check_covidcast_port.py

google_symptoms/delphi_google_symptoms/run.py

aysim319 · 2024-08-09T02:51:08Z

Couldn't find a way to elegantly run the whole thing while having 2 seperate logs so I ran the first half where it's calling with covidcast api and then saved the resulted into parquet with the rest commented it out,
covidcast_signal.log this took about an hour

then ran the whole thing to get the logs: where the only logs except the initial metadata run is just from the epidata
epidata_signal.log this took about 30 minutes

dshemetov

As far as correctness goes, this is looking good to me. I tested locally and the new port is parsing things identically to the old covidcast function (FWIW, the client logs aren't that important here, we just really want to make sure the DataFrame outputs from our API calls are absolutely identical).

I convinced myself that we don't need to test every signal: the covidcast response schema is the same for every signal, so testing a single signal API query gets you 99% coverage (as long as the query returns a representative data subset). The only snag is that time_value can be in two different formats (date or epiweek), so as long as we test NCHS along with the other sources, we get full coverage. I tweaked the test to test a single signal per source, so it runs much faster now (thank you for doing the comprehensive runs nonetheless, it's nice to have that extra safety!).

~~I also found and fixed some anti-patterns in the covidcast code you ported over, specifically _parse_datetimes. Should be a bit faster.~~ I made an error here and fixing it made the code a lot uglier. I think clarity is more important, so I reverted it.

TODO:

fix conflicts
test with CI (for some reason CI isn't running?)

sir_complainsalot/delphi_sir_complainsalot/check_source.py

aysim319 · 2024-08-21T20:48:47Z

_delphi_utils_python/delphi_utils/validator/datafetcher.py

+
+        response = Epidata.covidcast_meta()
+
+        if response["result"] != 1:


@melange396 would this do the trick? or still add the conditional anyway?

aysim319 · 2024-09-13T15:21:06Z

Screwed up with rebasing instead of merge: Continuation from last comment #2056

aysim319 force-pushed the 1972-replace-covidcast branch 2 times, most recently from 1ff5866 to 6e22db8 Compare July 30, 2024 22:45

aysim319 requested review from melange396 and dshemetov July 30, 2024 22:47

aysim319 linked an issue Jul 31, 2024 that may be closed by this pull request

replace the python covidcast client in validator #1972

Open

melange396 requested changes Jul 31, 2024

View reviewed changes

dshemetov reviewed Aug 2, 2024

View reviewed changes

_delphi_utils_python/tests/test_covidcast_wrapper.py Outdated Show resolved Hide resolved

dshemetov reviewed Aug 2, 2024

View reviewed changes

_delphi_utils_python/tests/test_covidcast_wrapper.py Outdated Show resolved Hide resolved

aysim319 force-pushed the 1972-replace-covidcast branch from 323f481 to 9269621 Compare August 5, 2024 21:21

aysim319 commented Aug 5, 2024

View reviewed changes

_delphi_utils_python/delphi_utils/validator/datafetcher.py Outdated Show resolved Hide resolved

dshemetov reviewed Aug 6, 2024

View reviewed changes

_delphi_utils_python/delphi_utils/validator/datafetcher.py Outdated Show resolved Hide resolved

dshemetov requested changes Aug 6, 2024

View reviewed changes

dshemetov reviewed Aug 8, 2024

View reviewed changes

google_symptoms/delphi_google_symptoms/run.py Outdated Show resolved Hide resolved

aysim319 force-pushed the 1972-replace-covidcast branch from 8ed8c6b to 76f1519 Compare August 8, 2024 19:30

dshemetov force-pushed the 1972-replace-covidcast branch from fa2208e to 7f60275 Compare August 10, 2024 00:05

dshemetov reviewed Aug 10, 2024

View reviewed changes

aysim319 commented Aug 21, 2024

View reviewed changes

sir_complainsalot/delphi_sir_complainsalot/check_source.py Outdated Show resolved Hide resolved

aysim319 commented Aug 21, 2024

View reviewed changes

dshemetov mentioned this pull request Sep 9, 2024

refactor: bundle covidcast==0.2.2 in delphi_utils #1985

Closed

aysim319 closed this Sep 13, 2024

aysim319 force-pushed the 1972-replace-covidcast branch from 4d6117f to d5be1bd Compare September 13, 2024 15:01

aysim319 mentioned this pull request Sep 13, 2024

1972 replace covidcast #2056

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

1972 replace covidcast #2004

1972 replace covidcast #2004

aysim319 commented Jul 26, 2024 •

edited

Loading

aysim319 commented Jul 30, 2024 •

edited

Loading

melange396 commented Jul 31, 2024

aysim319 commented Jul 31, 2024

dshemetov left a comment

aysim319 commented Aug 9, 2024 •

edited

Loading

dshemetov left a comment •

edited by aysim319

Loading

aysim319 Aug 21, 2024

aysim319 commented Sep 13, 2024 •

edited

Loading


		response = Epidata.covidcast_meta()

		if response["result"] != 1:

1972 replace covidcast #2004

1972 replace covidcast #2004

Conversation

aysim319 commented Jul 26, 2024 • edited Loading

Description

Changelog

Associated Issue(s)

aysim319 commented Jul 30, 2024 • edited Loading

melange396 commented Jul 31, 2024

aysim319 commented Jul 31, 2024

dshemetov left a comment

Choose a reason for hiding this comment

aysim319 commented Aug 9, 2024 • edited Loading

dshemetov left a comment • edited by aysim319 Loading

Choose a reason for hiding this comment

aysim319 Aug 21, 2024

Choose a reason for hiding this comment

aysim319 commented Sep 13, 2024 • edited Loading

aysim319 commented Jul 26, 2024 •

edited

Loading

aysim319 commented Jul 30, 2024 •

edited

Loading

aysim319 commented Aug 9, 2024 •

edited

Loading

dshemetov left a comment •

edited by aysim319

Loading

aysim319 commented Sep 13, 2024 •

edited

Loading