-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor volcano iii #361
Refactor volcano iii #361
Conversation
for k, v in dict( | ||
zip( | ||
self.rawinput[self._gene_names].tolist(), | ||
self.rawinput[self.index_column].tolist(), | ||
) | ||
).items() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is not necessarily unique, and should ideally raise an error in some of our test-data sets that contain isoform ids. E.g. TPM1 occurs 5 times in testfiles/maxquant/proteinGroups.txt with different base Protein IDs.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Anything regarding ids needs to be addressed in a separate PR.
move the gene to protein id mapping to DataSet, thus simplifying LLM part