[GSOC] Add surrogate data generation for significant connectivity estimation #223

tsbinns · 2024-08-05T15:02:41Z

Follows from the changes in #220, so the diff will be quite smaller when that's merged.

In brief, adds a new function make_surrogate_connectivity in the datasets module (felt this was an appropriate place).

Since datasets is more populous now I thought it could make sense to clean things up and bring the tests from make_signals_in_freq_bands from the generic tests folder to one within the datasets folder (where the make_surrogate_connectivity tests also are).

Also added an example demonstrating the use of the new function.

Co-authored-by: Eric Larson <larson.eric.d@gmail.com>

examples/surrogate_connectivity.py

tsbinns · 2024-08-05T15:05:33Z

mne_connectivity/datasets/surrogate.py

+    Notes
+    -----
+    Surrogate data is generated by randomly shuffling the order of epochs, independently
+    for each channel. This destroys the covariance of the data, such that connectivity
+    estimates should reflect the null hypothesis of no genuine connectivity between
+    signals (e.g., only interactions due to background noise)
+    :footcite:`PellegriniEtAl2023`.
+
+    For the surrogate data to properly reflect a null hypothesis, the data which is
+    shuffled **must not** have a temporal structure that is consistent across epochs.
+    Examples of this data include evoked potentials, where a stimulus is presented or an
+    action performed at a set time during each epoch. Such data should not be used for
+    generating surrogates, as even after shuffling the epochs, it will still show a high
+    degree of residual connectivity between channels. As a result, connectivity
+    estimates from your surrogate data will capture genuine interactions, instead of the
+    desired background noise. Treating these estimates as a null hypothesis will
+    increase the likelihood of a type II (false negative) error, i.e., that there is no
+    significant connectivity in your data.
+
+    Appropriate data for generating surrogates includes data from a resting state,
+    inter-trial period, or similar. Here, a strong temporal consistency across epochs is
+    not assumed, reducing the chances that connectivity information of interest is
+    captured in your surrogate connectivity estimates.
+
+    In situations where you want to assess whether evoked data has significant
+    connectivity, you can generate your surrogate connectivity estimates from non-evoked
+    data (e.g., rest data, inter-trial data) and compare this to your true connectivity
+    estimates from the evoked data.
+
+    Regardless of whether you are working with evoked or non-evoked data, **you should
+    always compare true and surrogate connectivity estimates from epochs of the same
+    duration**. This will ensure that spectral information is captured with the same
+    accuracy in both sets of connectivity estimates. Ideally, **you should also compare
+    true and surrogate connectivity estimates from the same number of epochs** to avoid
+    biases from noise (fewer epochs gives noisier estimates) or finite sample sizes
+    (e.g., in coherency, phase-locking value, etc... :footcite:`VinckEtAl2010`).


Very long notes section, but like we discussed, there are important considerations as to what data surrogates should be generated from.

tsbinns · 2024-08-05T15:27:35Z

Hmm, building docs is failing here on the new example (fine for me locally), but doesn't give a reason beyond

make: *** [Makefile:56: html] Killed
Exited with code exit status 2

Could it be that it doesn't like multiprocessing? I'm using n_jobs=-1 to speed up connectivity computation on the multiple shuffles.

larsoner · 2024-08-05T19:43:36Z

Often that's a sign that you've used too much memory, and decreasing the number of jobs can help

tsbinns · 2024-08-06T12:58:35Z

Okay, even using 1/4 the CPU count ends up using too much memory (find it surprising with 36 cores and 70 GB). Just reverting to a single job and taking the hit on runtime.

tsbinns · 2024-08-07T10:32:14Z

Switching CI back to upstream since mne-tools/mne-python#12747 was merged.

examples/surrogate_connectivity.py

…onnectivity into add_surrogate_conn

tsbinns · 2024-08-20T12:29:37Z

Wanted to make sure everything was green before submitting final GSoC stuff. Newly failing tests addressed in #228

larsoner · 2024-08-20T17:38:54Z

@wmvanvliet want to look and merge if you're happy?

adam2392

This looks so great! thanks for all the hard work @tsbinns

https://output.circle-artifacts.com/output/job/bafe351e-3009-46ea-92ce-863de6768196/artifacts/0/dev/auto_examples/surrogate_connectivity.html#sphx-glr-auto-examples-surrogate-connectivity-py

Few comments:

would it be great to actually print the pvalue too potentially besides just writing that it's <0.05?
When comparing significance across the frequency band, the major difference appears in the alpha band, but you test the beta band. Perhaps it's worth mentioning that you test the beta band specifically since the alpha band is clearly significantly different (and due to simplicity for running the example), but in practice one can test both. You do allude to this point in multiple comparisons discussion, but that's one question I had when reading through that section

tsbinns · 2024-11-11T22:55:01Z

Thanks for the review @adam2392! Very good points, will have a look at implementing them by the end of the week. Cheers!

tsbinns and others added 9 commits August 1, 2024 15:59

Add EpochsSpectrum support

6debece

Fix CircleCI

b517e79

Add reminder to change version

dccecdc

Update Spectrum skips

45bf496

Co-authored-by: Eric Larson <larson.eric.d@gmail.com>

Fix broken tests and version checking

0b8790b

Be explicit with intersphinx roles

dac4153

Change empty weights contruction

121b40a

Merge branch 'main' into spec_conn_spectrum_support

00c381a

Add surrogate data generation

ffb256d

tsbinns commented Aug 5, 2024

View reviewed changes

examples/surrogate_connectivity.py Outdated Show resolved Hide resolved

tsbinns commented Aug 5, 2024

View reviewed changes

tsbinns added 5 commits August 6, 2024 12:59

Adjust n_jobs

30dc4e7

Adjust n_jobs

2a9c1e2

Try CPU count // 3

50ea4cd

Try CPU count // 4

9e4a11d

Use n_jobs=1

de52286

Reset tests to upstream-main

cece6a4

wmvanvliet reviewed Aug 7, 2024

View reviewed changes

examples/surrogate_connectivity.py Outdated Show resolved Hide resolved

tsbinns added 3 commits August 7, 2024 16:47

Expand test coverage

17f30cd

Merge branch 'main' into add_surrogate_conn

cabe170

Update example from review

a0ebe2a

tsbinns mentioned this pull request Aug 8, 2024

GSOC Cleanup: Add EpochsTFR support to spectral_connectivity_epochs() and spectral_connectivity_time() #225

Open

tsbinns added 3 commits August 19, 2024 20:28

Update surrogate example

ca6013d

Merge branch 'main' into add_surrogate_conn

6114bd5

Merge branch 'add_surrogate_conn' of https://github.com/tsbinns/mne-c…

7197a17

…onnectivity into add_surrogate_conn

Merge branch 'main' into add_surrogate_conn

bd180f0

tsbinns added 5 commits August 20, 2024 21:21

Merge branch 'main' into add_surrogate_conn

88e081b

Merge branch 'main' into add_surrogate_conn

6d1c9bb

Merge branch 'main' into add_surrogate_conn

abfc9ab

Merge branch 'main' into add_surrogate_conn

a710c43

Merge branch 'main' into add_surrogate_conn

f90a58a

tsbinns mentioned this pull request Oct 31, 2024

Within-epoch surrogate generation #251

Open

adam2392 reviewed Nov 7, 2024

View reviewed changes

Merge branch 'main' into add_surrogate_conn

9dbb10d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GSOC] Add surrogate data generation for significant connectivity estimation #223

[GSOC] Add surrogate data generation for significant connectivity estimation #223

tsbinns commented Aug 5, 2024

tsbinns Aug 5, 2024

tsbinns commented Aug 5, 2024

larsoner commented Aug 5, 2024

tsbinns commented Aug 6, 2024

tsbinns commented Aug 7, 2024

tsbinns commented Aug 20, 2024 •

edited

Loading

larsoner commented Aug 20, 2024

adam2392 left a comment

tsbinns commented Nov 11, 2024

[GSOC] Add surrogate data generation for significant connectivity estimation #223

Are you sure you want to change the base?

[GSOC] Add surrogate data generation for significant connectivity estimation #223

Conversation

tsbinns commented Aug 5, 2024

tsbinns Aug 5, 2024

Choose a reason for hiding this comment

tsbinns commented Aug 5, 2024

larsoner commented Aug 5, 2024

tsbinns commented Aug 6, 2024

tsbinns commented Aug 7, 2024

tsbinns commented Aug 20, 2024 • edited Loading

larsoner commented Aug 20, 2024

adam2392 left a comment

Choose a reason for hiding this comment

tsbinns commented Nov 11, 2024

tsbinns commented Aug 20, 2024 •

edited

Loading