Skip to content

CADRE Fellowship proposal

XiaoranYan edited this page May 1, 2019 · 8 revisions

Project name

Project team and affiliations

We encourage applicants to form teams that span across discipline and institutional boundaries. If you plan to use WoS data, please verify your university affiliations by providing official working email address of each team member.

Project abstract

  1. We will select 6 teams from the application pool on June 14th (first round), but the call will remain open and new fellows will be selected if new openings are available.
  2. The choice will be based on merit as well as its potential value to the development of CADRE platform. Considerations include community services such as doing data quality analysis and data cleaning, feasibility under current development status of CADRE, and value as test cases of different CADRE components.

Data set requested

  1. To access Web of Science, you must be affiliated with one of the 14 member institutions of BTAA. The WoS data set is a proprietary data set own by Clarivate Analytics and subject to access policies.

  2. CADRE also provide technical support for accessing Microsoft Academic Graph and its associated patent data. The MAG data has a ODC Attribution License and as such can be used freely as long as the appropriate citation is included. To qualify as a CADRE Fellow, however, a commitment to the "GOTO" principle is required, which stands for Good and Open data with Transparent and Objective process and methodologies. We will open-source and publish all MAG related project materials.

Computing resources requested

  1. For better testing purposes, we encourage the applicant to use CADRE's cloud based offerings. This include a GUI query service and Jupyter notebook environments on AWS. The notebooks can support languages like Python, R, Spark and Tensor-flow and the applicant should let us know if special resources like GPU are required.
  2. CADRE also offers a HPC environment housed at Indiana University Carbonate cluster. It supports SQL database, R, SAS, Stata, SPSS, MatLab, Python and other software upon user's request.
  3. If you prefer to use your own computing environment, please describe your planned data pipeline. The CADRE team will work with you to ensure data delivery.

Training interests

The CADRE team is planning for training webinars that can help fellows and other interested users to learn about our data sets/tools. Topics include but not limited to data set schemas and versions, relational/graph database query and tools, data analytics programming languages (Spark, R, Python, etc.). Please name webinar topics that will interest you.

Travel Support

Please indicate if you are interested in our upcoming events at ISSI 2019 in Rome, Sept 2-5, 2019. The recipients will present their latest progress on the proposed work. Up to 2 team members can ask for travel support (flight, registration, and hotel). Participation is not a requirement for Fellowship status. You can check out our ISSI proposals here.

You can find out more about CADRE and related data sets at our website. For illustrative examples, you can also check out our current on-going collaborative project issues at our public GitHub repositories.