---
title: "How FAIR are your data?"
---
## FAIR principles
**We recommend using the [FAIR](https://www.go-fair.org/fair-principles/) & [CARE](https://www.gida-global.org/care) principles as guidance** to maximize the reusability of your data by you, your collaborators, other researchers, and future-you. Your plan should ensure that detailed documentation adopting existing standards is developed throughout your project (don't wait until the very end!) and that this documentation is archived along with your data and code in a publicly accessible data repository. Doing so will set you up for success.
![source: <https://www.gida-global.org/care>](img/be-FAIR-and-CARE.png){width="80%" fig-align="left"}
A good reference to get you started with FAIR principles: <https://perma.cc/CT8P-D5MK>{target="_blank"}
and on CARE principles: <https://perma.cc/3ZHR-6JAG>{target="_blank"}
## FAIR assessment
### **F**indable
#### 1. Does the dataset have any identifiers assigned?
a. No identifier
b. Local identifier
c. Web address (URL)
d. Globally unique identifier, citable and persistent (e.g. DOI, PURL, ARK or Handle)
#### 2. Is the dataset identifier included in all metadata records/files describing the data?
a. No
b. Yes
#### 3. How is the data described with metadata?
a. The data are not described
b. Brief title and description
c. Comprehensive, but in a text-based, non-standard format
d. Comprehensively using a recognized formal machine-readable metadata schema
#### 4. What type of repository or registry is the metadata record in?
a. The data are not described in any repository
b. Local institutional repository
c. Domain-specific repository
d. Generalist public repository
e. Data are in one place but discoverable through several registries
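As one illustration of questions 1 to 3, the sketch below builds a small machine-readable metadata record that carries the dataset identifier inside it. It assumes the schema.org `Dataset` vocabulary serialized as JSON-LD, which is only one of several recognized schemas; the DOI, title, and description are hypothetical placeholders, and the helper functions are invented for this example.

```python
import json


def build_metadata(identifier, name, description, license_url):
    """Return a schema.org Dataset record as a plain dictionary."""
    return {
        "@context": "https://schema.org",
        "@type": "Dataset",
        "identifier": identifier,  # question 2: identifier lives in the metadata
        "name": name,
        "description": description,
        "license": license_url,
    }


def has_doi_identifier(record):
    """Rough check only: does the identifier look like a DOI URL?"""
    return str(record.get("identifier", "")).startswith("https://doi.org/")


record = build_metadata(
    identifier="https://doi.org/10.1234/example-dataset",  # hypothetical DOI
    name="Example stream temperature survey",  # hypothetical title
    description="Hourly stream temperature readings from an example site.",
    license_url="https://creativecommons.org/licenses/by/4.0/",
)

# Serialize to JSON so the record can be archived next to the data files.
metadata_json = json.dumps(record, indent=2)
print(metadata_json)
```

Many repositories generate a record like this for you at deposit time; writing one by hand is mostly useful for checking that nothing essential is missing.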
### **A**ccessible
#### 5. How accessible is the data?
a. No access to data or metadata
b. Access to metadata only
c. Unspecified conditional access, e.g. contact the data custodian
d. Embargoed access after a specified date
e. A de-identified / modified subset of the data is publicly accessible
f. Fully accessible to persons who meet explicitly stated conditions, e.g. ethics approval for sensitive data
g. Publicly accessible
#### 6. Is the data available online without requiring specialized protocols or tools once access has been approved?
a. No access to data
b. By individual arrangement
c. File download from online location
d. Non-standard web service (e.g. OpenAPI, Swagger, Informal API)
e. Standard web service API (e.g. OGC)
#### 7. Will the metadata record be available even if the data are no longer available?
a. Unsure
b. No
c. Yes
### **I**nteroperable
#### 8. What (file) formats is the data available in?
a. Mostly in a proprietary format
b. In a structured, open standard, non-machine-readable format
c. In a structured, open standard, machine-readable format
#### 9. What best describes the types of vocabularies/ontologies/tagging schemas used to define the data elements?
a. Data elements not described
b. No standards have been applied in the description of data elements
c. Standardised vocabularies/ontologies/schemas without global identifiers
d. Standardised, open and universal, using resolvable global identifiers linking to explanations
#### 10. How is the metadata linked to other data and metadata (to enhance context and clearly indicate relationships)?
a. There are no links to other metadata
b. The metadata record includes URI links to related metadata, data and definitions
c. Metadata is represented in machine-readable format, e.g. in a linked format such as Resource Description Framework (RDF)
### **R**eusable
#### 11. Which of the following best describes the licence/usage rights attached to the data?
a. No licence
b. Non-standard text-based licence
c. Non-standard machine-readable licence (clearly indicating under what conditions the data may be reused)
d. Standard text-based licence
e. Standard machine-readable licence (e.g. Creative Commons)
#### 12. How much provenance information has been captured to facilitate data reuse?
a. No provenance information is recorded
b. Partially recorded
c. Fully recorded in a text format
d. Fully recorded in a machine-readable format
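To make the last option of question 12 more concrete, here is a minimal sketch of provenance captured in a machine-readable form. The field names (`action`, `source`, `tool`, `recorded_at`) and the steps themselves are hypothetical and ad hoc; a formal standard such as W3C PROV would be the stronger choice where your repository or community supports it.

```python
import json
from datetime import datetime, timezone


def provenance_step(action, source, tool):
    """Describe one processing step; the field names are ad hoc, not a standard."""
    return {
        "action": action,
        "source": source,
        "tool": tool,
        "recorded_at": datetime.now(timezone.utc).isoformat(),
    }


# Hypothetical history of a dataset, from collection to cleaning.
provenance = [
    provenance_step("collected", "field sensors", "data logger export"),
    provenance_step("cleaned", "raw_readings.csv", "clean_readings.py"),
]

# Store the log as JSON so it can be archived with the data and parsed later.
provenance_json = json.dumps(provenance, indent=2)
print(provenance_json)
```

Recording each step as it happens, rather than reconstructing the history at the end of the project, keeps the log complete.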
## Acknowledgements
The evaluation portion of this section has been adapted from the UK Data Service exercise [https://ukdataservice.ac.uk/app/uploads/exercise_fairdata.pdf](https://ukdataservice.ac.uk/app/uploads/exercise_fairdata.pdf){target="_blank"} and is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) License.