FeatureRequest: Anonymous (dataset) publication #5
PGijsbers
started this conversation in
OpenML Design and Feature Requests
Replies: 1 comment
-
Discussed at virtual hackathon 17-11-2022. Instead of using |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Current: All entities are publicly linked to a user. There is no way to hide which user uploaded the entity.
Request: Allow entities (datasets, flows, tasks, runs, ...) to be created without publicly showing their creator. Optionally making the relationship public at a later date.
Use case: Anonymously sharing datasets and/or experimental results for a paper under review.
Impacted Projects: This impacts all parts of OpenML. It requires additional data stored for each entity in the back-end and the front-end and connector packages to allow a user field on an entity being null/absent (depending on implementation).
Suggested Implementation: One way to do this with minimal overhead might be to introduce a second column in each entity, we keep the foreign key to the user but also add an extra column
visible_user
which indicates which user is publicly associated with the entity (for now, it'd be either the same user ornull
). The data sent to the front-end/connector would be ofvisible_user
only and it gets processed in the same way as the user id currently is (except it should allow fornull
). Initially, I would limit this to datasets and onlyanonymous
topublic
status and expand from there.Beta Was this translation helpful? Give feedback.
All reactions