Skip to content

CADRE_COVID_2020_07_06

XiaoranYan edited this page Aug 15, 2020 · 1 revision

Data

CORD-WOS mapping table

  • /N/project/rcsc/shared_space/RCSCdata/cord/WOS-cord-mapping.csv
Column Description
UID WoS Paper ID
cord_paper_id:ref_id "(paper id):(ref_id)", where paper id is the id of paper in the cord dataset. Ref id is the id of the reference, e.g., BIBREF0
confidence Matching confidence. Range [0, 100]. >90: Nearly perfect match. >80: Weak match. Calculated by 0.9 * score_title_0 + 0.1 * score_author_0
score_title_0 Levenshtein distance similarity ratio for the titles
score_author_0 Levenshtein distance similarity ratio for the author names

CORD-MAG mapping table

  • /N/project/rcsc/raw_data/mag-2020-07-02/CORD-19-07-06-map/CORD-19-MAG-Inst-Full.csv

It contains all columns in /N/project/rcsc/raw_data/2020-07-06/metadata.csv plus the following additional columns:

Column Description
pmcAuthors Author information from pmc full text scans
pmcAffiliation Affiliation information from pmc full text scans (mostly empty)
pdfAuthors Author information from pdf full text scans
pdfAffiliation Affiliation information from pmc full text scans (might have department info)
MAGids MAGid for recorresponding CORD-19 paper, might have multiple matches
authorids MAG authorIDs for recorresponding CORD-19 paper, delaminated list
authorOrders MAG author orders for recorresponding authors (matched with authorids, affiliationids), delaminated list
affiliationids MAG affiliationIDs for recorresponding CORD-19 paper, one for each author, delaminated list
affiliationNames MAG affiliationNames for recorresponding CORD-19 paper, one for each author, delaminated list
Latitudes MAG Latitudes for recorresponding MAG affiliation, delaminated list
Longitudes MAG Longitudes for recorresponding MAG affiliation, delaminated list
GRIDids GRIDids for recorresponding MAG affiliation, delaminated list

MAG affiliation table

  • /N/project/rcsc/raw_data/mag-2020-07-02/CORD-19-07-06-map/MAGInstTable.csv

affiliationID displayName normalizedName wiki paperTotal citationTotal Latitude Longitude GRIDid

It contains additioanl information for MAG insititutions, with the following columns:

Column Description
affiliationID MAG affiliationIDs, mapped to affiliationids of the CORD-MAG mapping table
displayName MAG affiliation displayName
normalizedName MAG affiliation normalizedName
wiki MAG affiliation wikipedia link
paperTotal MAG affiliation paper count in all of MAG
citationTotal MAG affiliation citation count in all of MAG
Latitude MAG Latitude for recorresponding MAG affiliation
Longitude MAG Longitude for recorresponding MAG affiliation
GRIDid GRIDids for recorresponding MAG affiliation