Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Structure of the tsv #17

Open
YvonneKallberg opened this issue Dec 19, 2024 · 1 comment
Open

Structure of the tsv #17

YvonneKallberg opened this issue Dec 19, 2024 · 1 comment

Comments

@YvonneKallberg
Copy link
Collaborator

I don't know if mentioned and discussed earlier, but the way the .tsv file examples are structured when automatically generated is not the way it is structured when doing a submission. My question is, should they be? Is the purpose of creating .tsv templates to make it easier for a submitter/researcher to submit, or is the purpose to have templates for the data producers to fill?

  • Interactive .tsv has a first row with 3 columns: 'FileType', the actual file type value e.g. 'fastq' and 'Read submission file type'
  • Automated .tsv is missing this (currently at least), and the file_type is given in a column

I'm open for both, but think we need to make it clear.

@somathias
Copy link
Collaborator

Very good question (and important difference in the current state) to be raised.
So far we have been trying to cater to both researchers/submitters and data producers at the same time and I agree that however we chose to go forward, we need to make the purpose more clear.

Personally I think, a large motivation (although not the only one) for us to collect the technical metadata in this form is to facilitate data submission to end repositories. This motivates me to try to have tsv files be in a format easy for submitters to work with.

However, we are of course dependent on the data producers being able to supply the technical metadata to enable submission. This means that it is even more important that the template is in a format the data producers can easily work with. If that can be the same format as will be submitted (minus the orga metadata), perfect. If not, it would of course also be possible to reformat a "data producer suitable tsv" to a "submission suitable tsv" later on.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants