
ORCID and Publication Reports

Peter Mangiafico edited this page Nov 14, 2022 · 5 revisions

ORCID Report

Produce statistics about the number of Profiles users who have completed ORCID integration and the scopes they authorized:

RAILS_ENV=production bundle exec rake sul:orcid_integration_stats

Results: https://docs.google.com/spreadsheets/d/1kT-IXWprFeR5QsPa1x297oMrx7ZWhOLMg2E2H4GZkjA

Publication Report

Produce statistics about the number of publications imported, the number of unique authors, and the number of publications in each state for the specified time period:

RAILS_ENV=production bundle exec rake sul:publication_import_stats['1/1/2022','1/31/2022']
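For monthly reports, the date pair can be generated rather than typed by hand. The following is a minimal sketch, not part of sul_pub: `month_range` is a hypothetical helper name, it relies on GNU `date` (the `-d` option is not available in BSD/macOS `date`), and it assumes the task accepts zero-padded MM/DD/YYYY dates in the same way the export tasks' examples do.

```shell
# month_range YEAR MONTH: hypothetical helper that prints "start,end" for one
# calendar month as MM/DD/YYYY. MONTH must be two digits (e.g. 01).
# Relies on GNU date's -d option, so it will not work with BSD/macOS date.
month_range() (
  year=$1
  month=$2
  # Last day = first day of the month, plus one month, minus one day.
  end=$(date -u -d "$year-$month-01 +1 month -1 day" +%m/%d/%Y)
  echo "$month/01/$year,$end"
)

# Print the date pair for January 2022.
month_range 2022 01
```

The output can then be substituted into the task argument, for example `"sul:publication_import_stats[$(month_range 2022 01)]"`, quoted so that the shell passes it to rake as a single word.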

Results: https://docs.google.com/spreadsheets/d/17MBdJ3w7KJUCtwNs-DVa8q63atviE5aUaZQkXB2CbxI

Exporting Publications

For all authors

We sometimes get requests to export publications for all authors over a specific time period. This can be done via a rake task. Specify the output file name and the start and end dates of the period to export. Publications in every status (accepted, new, denied) are exported.

See the rake task in https://github.com/sul-dlss/sul_pub/blob/main/lib/tasks/sul.rake for more info.

RAILS_ENV=production bundle exec rake sul:export_pubs_csv['tmp/pubs.csv','01/01/2022','12/31/2022']

For specific authors

We sometimes get requests to export publications for a specific set of authors (e.g. all med school authors). This can be done given an input file that contains their SUNet IDs.

  1. Ensure the provided spreadsheet/csv has a column called "sunetid" (rename the column if necessary) and no extra blank rows
  2. With many thousands of authors the output file will be very large and the task will take a while to run, so run it in screen
  3. We are often asked to provide all publications for all time, which in practice means using a very early "from date" (e.g. 1950)
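The first check in the list above can be automated before starting a long export. This is a rough sketch only: `check_sunet_csv` is a hypothetical helper, not something in sul_pub, and it assumes a plain comma-separated file whose first line is the header and whose column name is spelled exactly "sunetid".

```shell
# check_sunet_csv FILE: hypothetical pre-flight check for a roster file.
# Succeeds only if the header row has a "sunetid" column and no row is blank.
check_sunet_csv() (
  file=$1
  # Split the header on commas, strip quotes and carriage returns,
  # then look for an exact "sunetid" field.
  if ! head -n 1 "$file" | tr -d '"\r' | tr ',' '\n' | grep -qx 'sunetid'; then
    echo "missing sunetid column in $file" >&2
    exit 1
  fi
  # Count rows that are empty or contain only commas and whitespace.
  blank=$(tr -d '\r' < "$file" | grep -c '^[[:space:],]*$' || true)
  if [ "$blank" -gt 0 ]; then
    echo "$blank blank row(s) in $file" >&2
    exit 1
  fi
)
```

Running `check_sunet_csv tmp/general_rosters_from_fy18_to_fy23_sunet.csv` returns a non-zero status and prints the reason to stderr if either check fails. It does not modify the file.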

Specify the input file name (the file containing the SUNet IDs), the output file name, and the start and end dates for which publications should be exported.

See the rake task in https://github.com/sul-dlss/sul_pub/blob/main/lib/tasks/sul.rake for more info.

RAILS_ENV=production bundle exec rake sul:export_pubs_for_authors_csv['tmp/general_rosters_from_fy18_to_fy23_sunet.csv','tmp/med_school_pubs.csv','01/01/1950','12/31/2022']
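Because the shell splits commands on whitespace, the task name and its bracketed arguments must reach rake as a single word, with no spaces between the arguments. One way to avoid mistakes is to build that word programmatically. The helper below is a hypothetical sketch (`rake_task_arg` is not part of sul_pub or rake); it only prints the command so it can be inspected before running.

```shell
# rake_task_arg TASK ARG...: hypothetical helper that joins a task name and its
# arguments into the single word rake expects, e.g. task[a,b,c].
rake_task_arg() (
  task=$1
  shift
  IFS=,   # "$*" joins the remaining arguments with the first IFS character
  printf '%s[%s]\n' "$task" "$*"
)

# Build the argument for the export shown above and print the full command.
task=$(rake_task_arg sul:export_pubs_for_authors_csv \
  tmp/general_rosters_from_fy18_to_fy23_sunet.csv \
  tmp/med_school_pubs.csv 01/01/1950 12/31/2022)
echo "RAILS_ENV=production bundle exec rake \"$task\""
```

When running the printed command, keep the double quotes around the task word. They also stop shells such as zsh from treating the square brackets as a glob pattern.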

Exporting Author Information

We sometimes get requests to export author information for a specific set of authors (e.g. all med school authors). This can be done given an input file that contains their SUNet IDs.

  1. Ensure the provided spreadsheet/csv has a column called "sunetid" (rename the column if necessary) and no extra blank rows

See the rake task in https://github.com/sul-dlss/sul_pub/blob/main/lib/tasks/sul.rake for more info.

RAILS_ENV=production bundle exec rake sul:author_export['tmp/general_rosters_from_fy18_to_fy23_sunet.csv','tmp/med_school_authors.csv','csv']