We will use pubmed_parser to extract all PI names from MEDLINE. MEDLINE author names come in the format "KP Kording" (initials followed by last name). The NIH data currently has separate first_name and last_name columns, so we will transform the NIH author names into the same format as MEDLINE. Plain string matching is good enough for a first attempt. Later, we will check affiliations to make sure that matched names refer to the same person.
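The name transformation and first-pass matching could be sketched roughly as follows. This is only an illustration: the function names to_medline_name and match_names are hypothetical, and the rule used here (first letter of each part of first_name, followed by last_name, compared case-insensitively) is an assumption about how the NIH columns map onto the MEDLINE format.

```python
import re


def to_medline_name(first_name: str, last_name: str) -> str:
    """Build a MEDLINE-style name such as 'KP Kording' from NIH columns.

    Assumption: initials are the first letter of each whitespace-,
    period-, or hyphen-separated part of first_name.
    """
    parts = [p for p in re.split(r"[\s.\-]+", first_name.strip()) if p]
    initials = "".join(p[0].upper() for p in parts)
    return f"{initials} {last_name.strip()}".strip()


def match_names(nih_rows, medline_names):
    """Return the NIH (first_name, last_name) pairs whose transformed
    name appears among the MEDLINE author strings.

    Matching is exact but case-insensitive; affiliations are not checked.
    """
    medline_keys = {name.casefold() for name in medline_names}
    return [
        (first, last)
        for first, last in nih_rows
        if to_medline_name(first, last).casefold() in medline_keys
    ]


if __name__ == "__main__":
    nih = [("Konrad P.", "Kording"), ("Jane", "Doe")]
    medline = ["KP Kording", "A Smith"]
    print(to_medline_name("Konrad P.", "Kording"))
    print(match_names(nih, medline))
```

Initials plus last name will collide for different people who share them, which is why the affiliation check is still needed afterwards.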
The MEDLINE file is located in S3 and can be downloaded with:
aws s3 sync s3://science-of-science-bucket/medline/pmid_author_affil.csv/ pmid_author_affil/