Add context-dependent spatial randomization #215

alex-l-kong · 2020-09-10T18:58:48Z

What is the purpose of this PR?

Addresses and closes #207. Now that we've built a foundation for context-dependent spatial randomization, we can begin work on the actual randomization process. Naive randomization assumes that we do not care about the cell types (aka the FlowSOM IDs) associated with the marker list we randomize over. We no longer make that assumption in a context-dependent environment.

How did you implement your changes?

We will be working primarily out of a new function: compute_close_cell_num_random_context. The good news is that compute_close_cell_num does not need to change, only the randomization process we're comparing against. For channel enrichment, we allow the user to specify a list of FlowSOM IDs they wish to facet over, with all non-specified IDs grouped into an 'else' category. Erin would also like the user to be able to specify parameters to tune the randomization process. I'm not exactly sure how this would be done yet, but it's something we'll look into once the basic logic is settled.
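As a rough illustration of the faceting and context-dependent draw described above, here is a minimal pure-Python sketch. It is not the code in this PR: the function names `facet_cell_types` and `context_random_positives`, the toy data, and the choice to keep the per-facet positive count fixed are all assumptions made for the example.

```python
import random
from collections import defaultdict


def facet_cell_types(cell_types, facets):
    """Group cell indices by facet; cell types not listed in `facets` share 'else'."""
    groups = defaultdict(list)
    for idx, cell_type in enumerate(cell_types):
        label = cell_type if cell_type in facets else "else"
        groups[label].append(idx)
    return dict(groups)


def context_random_positives(groups, positive, rng):
    """Redraw marker-positive cells while keeping the positive count per facet fixed."""
    drawn = set()
    for members in groups.values():
        n_pos = sum(1 for idx in members if idx in positive)
        # Sample only from cells of the same facet (the "context").
        drawn.update(rng.sample(members, n_pos))
    return drawn


# Hypothetical toy data: one FlowSOM ID per cell, plus the cells positive for one marker.
cell_types = [1, 1, 2, 3, 2, 4, 1, 3]
positive = {0, 2, 5, 6}

groups = facet_cell_types(cell_types, facets={1, 2})
shuffled = context_random_positives(groups, positive, random.Random(0))
```

A naive randomization would instead sample `len(positive)` cells from all cells at once, which ignores which facet each positive cell came from.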

Remaining issues

Non-optimal code is the biggest issue right now. To keep the logic clear, I've brute-forced my way through the initial implementation. It will need to be optimized.

Some code had to be duplicated from compute_close_cell_num to ensure markers were indexed properly by type. If we can remove some or all of that duplication, that'd be mighty fine.

alex-l-kong · 2020-09-10T20:10:59Z

Still need to add testing; this code review is mainly so you can see what's going on.

ngreenwald

Most of the logic makes sense. I'm sure there is more optimization to be had, but I noticed two main things. The first is that I believe the first two nested for loops can be combined; see my comment. The second is that I don't think the code actually runs as written; thresh isn't defined, for example.

ark/utils/spatial_analysis_utils.py

ngreenwald · 2020-09-10T23:25:36Z

The way you wrote it is how it's implemented in MATLAB; we changed it for the Python version.

On Thu, Sep 10, 2020 at 4:15 PM alex-l-kong wrote:

> In ark/utils/spatial_analysis_utils.py:
>
> > +
> > +    for j in range(num):
> > +        # we need to regenerate the positive inds per marker so we can compare them
> > +        # with the positive indices for each cell type, needed so we can bootstrap
> > +        # in a context-based environment properly
> > +        marker1posinds = current_fov_channel_data[current_fov_channel_data.columns[j]] > thresh
> > +
> > +        # generate the number of positive hits per cell type for a specific marker
> > +        # in cell_type_facets or else for all the non-cell_type_facets cell types
> > +        cell_type_nums_per_facet_1 = {}
> > +        for cell_type, cell_type_data in cell_type_data_per_facet.values():
> > +            cell_type_nums_per_facet_1[cell_type] = np.sum(
> > +                np.logical_and(marker1posinds, cell_type_data.index))
> > +
> > +        # new iteration, needed to properly generate a pair of 1 vs 2 analysis
> > +        # basically the same thing as generating cell_type_nums_per_facet_1
>
> I'll double check Erin's code, but yeah absolutely, I think we collapse this loop out.
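One possible reading of the suggestion to collapse the loop: count the positive cells per facet once for every marker, then look the counts up inside the pair loop instead of recomputing them for the second marker. The sketch below is pure Python with hypothetical names (`positive_counts_per_facet`, `marker_positive`, `groups`) and toy data; it is an assumption about the intent, not the change that was actually made.

```python
def positive_counts_per_facet(marker_positive, groups):
    """Count marker-positive cells per facet for every marker in a single pass."""
    return {
        marker: {
            label: sum(1 for idx in members if idx in pos_cells)
            for label, members in groups.items()
        }
        for marker, pos_cells in marker_positive.items()
    }


# Hypothetical toy data: facet -> cell indices, marker -> positive cell indices.
groups = {"A": [0, 1, 2], "else": [3, 4]}
marker_positive = {"m1": {0, 1, 3}, "m2": {2, 4}}

counts = positive_counts_per_facet(marker_positive, groups)

# The pair loop now only does dictionary lookups; nothing is recounted.
markers = list(marker_positive)
pairs = []
for i, first in enumerate(markers):
    for second in markers[i:]:
        pairs.append((first, second, counts[first], counts[second]))
```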

ackagel

Looks like a good start, but looking into optimizing the loops would probs be a good idea.

ark/utils/spatial_analysis_utils.py

alex-l-kong · 2020-09-14T19:29:48Z

Added very basic testing for context spatial analysis. The thing to look at now is the updated logic, which is now (more) optimized. I'm unsure about one aspect, which I've described in a long TODO: if anyone happens to know anything about that, please feel free to comment!

I will be working on integrating this with calculate_channel_spatial_enrichment, which will include a flag indicating whether we want context randomization, as well as another argument to specify the FlowSOM IDs we want to randomize over.

ackagel

Looking like it's getting there; I mostly have a few clarification questions

ark/utils/spatial_analysis_utils.py

ngreenwald

Agree with all of Adam's comments

alex-l-kong · 2020-10-01T00:37:55Z

The logic is sound now, at least when comparing results on Erin's data between her script and our optimized version. However, we will probably need to wait until next week to get full verification.

For the record, the context-randomized close_num_rand Erin provided me to test against is completely wrong. It doesn't maintain the symmetry that would be expected, which makes it unusable as a reference. I suspect Felix ran a much older version of the context-based randomization, which caused this. I don't want to bother Erin too much right now, but this will have to be double-checked.
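A symmetry check of this kind can be sketched as follows. The sketch assumes close_num_rand is laid out as marker x marker x bootstrap, so that entry [i][j] should match entry [j][i] for every bootstrap draw; that layout, the helper name `is_pairwise_symmetric`, and the toy arrays are assumptions for illustration only.

```python
def is_pairwise_symmetric(close_num_rand):
    """Return True if entry [i][j] equals entry [j][i] for every marker pair."""
    n = len(close_num_rand)
    return all(
        list(close_num_rand[i][j]) == list(close_num_rand[j][i])
        for i in range(n)
        for j in range(i + 1, n)
    )


# Hypothetical 2-marker, 3-bootstrap arrays written as nested lists.
symmetric = [[[4, 5, 6], [1, 2, 3]], [[1, 2, 3], [7, 8, 9]]]
broken = [[[4, 5, 6], [1, 2, 3]], [[1, 2, 0], [7, 8, 9]]]

symmetric_ok = is_pairwise_symmetric(symmetric)
broken_ok = is_pairwise_symmetric(broken)
```

With a NumPy array `a` of the same layout, the equivalent comparison would be `np.array_equal(a, a.transpose(1, 0, 2))`.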

I do know for a fact that when running Erin's non-optimized MATLAB context-randomization script on her data to produce the real close_num_rand, the ranges check out with our optimized version in spatial_analysis_utils. So I think we're pretty close to finishing this up; after verifying our process, we just need to add testing and notebook integration.

ngreenwald · 2022-08-26T05:00:27Z

Resolved by #451

alex-l-kong added 7 commits September 10, 2020 11:52

Brute force implementation of compute_close_cell_num_random_context added

a69566e

Fix return errors for close num

4cd2f09

Fix errors in spatial analysis

3ef65ce

Fix PYCODESTYLE errors

c942df3

Make small changes to metadata in conf.py, see if changing to a new env with angelo packages helps

24be2cd

Fix final code style error

b61f29a

Remove nbsphinx support

7070d04

alex-l-kong self-assigned this Sep 10, 2020

alex-l-kong requested review from ngreenwald and ackagel September 10, 2020 20:05

Merge branch 'master' into context_spatial

511666b

ngreenwald requested changes Sep 10, 2020

ackagel requested changes Sep 11, 2020

alex-l-kong added 7 commits September 10, 2020 19:49

Merge master, because my OCD...

134e448

Add more optimized version of spatial_analysis_utils

d970c78

Fix PYCODESTYLE

dde48a4

Fix backslash error

6ae2c4b

Add testing for compute_close_cell_num_random_context and handle edge cases

4061505

Fix PYCODESTYLE errors

8bbfb25

This is what happens when you run --pycodestyle on the wrong file...

8a9fc20

alex-l-kong requested review from ngreenwald and ackagel September 14, 2020 19:29

alex-l-kong added 2 commits September 14, 2020 12:39

Make comment for cell_type generation a bit clearer

c4da9db

Trailing whitespace, AAAAAARRRRRRRRGGGGGGGHHHHH

cafc677

ackagel requested changes Sep 14, 2020

ngreenwald requested changes Sep 14, 2020

Begin addressing code review comments for context spatial randomization

aa8aacb

alex-l-kong and others added 26 commits September 30, 2020 19:27

Merge branch 'master' into context_spatial

0c3005d

Merge branch 'master' into context_spatial

0b26ad1

Merge branch 'master' into context_spatial

6cce05f

Fix testing: add cell lineage column to test_utils and make sure every test_utils function called correctly

a4a8843

Merge branch 'master' into context_spatial

6d992b3

Merge branch 'master' into context_spatial

9ef46e8

Update dimension to address new lineage column

e05d885

Add cell_lineage to excluded_colnames for test_generate_cluster_matrix_results

3d08454

Merge branch 'master' into context_spatial

413b4a6

Merge branch 'master' into context_spatial

e5f65d2

Fix save path for calc_dist_matrix

381bb3f

Change FileNotFoundError to ValueError

e59da1f

Merge branch 'master' into context_spatial

d8d5240

Merge branch 'master' into context_spatial

7e87200

Merge branch 'master' into context_spatial

c41bf18

Merge branch 'master' into context_spatial

981ca72

Merge branch 'master' into context_spatial

33eb4a1

Merge branch 'master' into context_spatial

fc7add7

Merge branch 'master' into context_spatial

663876c

Merge branch 'master' into context_spatial

a89a831

Propagate settings.py parameters into context-spatial randomization

6105173

Remove unnecessary argument in compute_close_cell_num

5912551

Merge branch 'master' into context_spatial

bd9dda4

Merge branch 'master' into context_spatial

6dd3b99

Merge branch 'master' into context_spatial

65eb0c5

Merge branch 'master' into context_spatial

dd67093

ngreenwald closed this Aug 26, 2022

ngreenwald deleted the context_spatial branch August 26, 2022 05:00
