Replies: 6 comments 2 replies
-
Or more comprehensively written:
|
Beta Was this translation helpful? Give feedback.
-
I feel its also a kind of measure how dispersed a neighborhood is growing. E.g. if I set |
Beta Was this translation helpful? Give feedback.
-
Hi @LukasHats, first of all, there is no better compliment than what you wrote, so thank you very much :) Yes, that's a very good point. In the past, I have measured something similar to the number of "components" for a niche/domain, but your approach is definitely better. If you are up for it, after the holidays we can write a pull request together and decide whether to keep it as a separate measure or to find ways to combine it with purity to have a more comprehensive score. |
Beta Was this translation helpful? Give feedback.
-
That sounds awesome @marcovarrone ! As you can guess, I would love to contribute something. I already have an enhanced version of the function above that takes the |
Beta Was this translation helpful? Give feedback.
-
How do you prefer to proceed? Would you open a new branch so we can first open the PR to the newly opened branch and work on this? I already started to implement the plotting function into your shape.py function (see here). But I don't have the overview about the whole package and if you want to implement test etc. So let me know where to open the draft PR. Happy holidays! |
Beta Was this translation helpful? Give feedback.
-
Hi @LukasHats, sorry I was on holiday and decided not to check work stuff during that time :) The best approach would be to create a pull request directly from the |
Beta Was this translation helpful? Give feedback.
-
Hey @marcovarrone,
as you already feel, I am obsessed with your method :D While playing around with the shape features, I realized that maybe the connected component could hold additional value for users. I will try to explain my thoughts:
When looking at the tissue with neighborhoods displayed, people usually then quantify the frequencies of cells belonging to specific neighborhoods and try to summarize and compare this across cohorts. But I feel this again is only a part of the bigger picture.
When calculating the connected components, if I understood it correctly, we basically get how many cells belong to one connected neighborhood component. So to speak a real connected neighborhood blob. As its automatically stored in
.obs
One could quantify the amount or frequencie of cells of a specific neighborhood that actually is inside such a connected component/neighborhood. This could somehow be a measure of tendencies to form actual connected structures than rather dispersed behaviour?I have already tried to calculate this with a first code draft (for my anndata format obviously):
Although this goes into the direction of purity, I get different results.
Happy to hear your thoughts about this and also maybe you have other ideas on how to use the connected component?
Beta Was this translation helpful? Give feedback.
All reactions