Skip to content

Link unique NIH PIs with MEDLINE data #18

@titipata

Description

@titipata

We will use pubmed_parser in order to grab all PIs name from MEDLINE. This will be in format: KP Kording. Right now, in NIH, we have first_name, last_name columns. We will transform author string into same format as in MEDLINE. String matching would be nice for the first attempt. We will check affiliation later to make sure they are same person.

File from MEDLINE is located in S3, downloading by using,

aws s3 sync s3://science-of-science-bucket/medline/pmid_author_affil.csv/ pmid_author_affil/

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions