The T-VICReg pipeline.
This codebase is the result of my work under the T-CAIREM Summer Studentship from the Temerty Faculty of Medicine, University of Toronto, conducting research at the intersection of AI and neuroscience at the Neural Systems & Brain Signal Processing Lab. The project paper can be found here.
As a result, we have developed T-VICReg, a novel self-supervised learning (SSL) method for time series, enabling learned representations to be partially invariant to translations in time. These representations aim to capture information relevant to the past and the future, which is hypothesized to produce representations capable of capturing state transitions, useful for a variety of classification and prediction tasks. We validate our method on the OpenNeuro dataset ds003029, containing iEEG signals from epilepsy patients, on the tasks of binary seizure classification (ictal, nonictal) and multiclass seizure detection (preictal, ictal, postictal). Fine-tuning the encoder from T-VICReg resulted in Top-1 accuracies of 92.92% and 89.26%, compared to the supervised baseline with Top-1 accuracies of 89.23% and 84.07%, for the binary and multiclass tasks respectively. T-VICReg is noncontrastive, augmentation-free, and compatible with continuous and discrete time series, allowing for flexible use in many contexts.
The GNN architecture comprises Edge-Conditioned Convolution (ECC) and Graph Attention Network (GAT) layers, using the PyTorch and PyTorch Geometric libraries for the standard deep learning and GNN implementations. For a more detailed description of our research, see projects/ssl-seizure-detection.
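As an illustration of how such an encoder might be assembled with PyTorch Geometric, here is a minimal sketch combining an NNConv layer (PyG's edge-conditioned convolution) with a GATConv layer. The layer sizes, the edge MLP, and the pooling are placeholder choices for illustration and do not reproduce the exact architectures in models.py.

```python
import torch
import torch.nn as nn
from torch_geometric.nn import NNConv, GATConv, global_mean_pool


class ECCGATEncoder(nn.Module):
    """Illustrative graph encoder: edge-conditioned convolution (NNConv/ECC)
    followed by graph attention (GATConv), then mean pooling per graph."""

    def __init__(self, node_dim, edge_dim, hidden_dim=64, out_dim=128, heads=4):
        super().__init__()
        # ECC: a small MLP maps each edge feature vector (dimension edge_dim)
        # to the weights of a node_dim x hidden_dim filter.
        edge_mlp = nn.Sequential(
            nn.Linear(edge_dim, 32),
            nn.ReLU(),
            nn.Linear(32, node_dim * hidden_dim),
        )
        self.ecc = NNConv(node_dim, hidden_dim, nn=edge_mlp, aggr="mean")
        self.gat = GATConv(hidden_dim, out_dim, heads=heads, concat=False)

    def forward(self, x, edge_index, edge_attr, batch):
        h = torch.relu(self.ecc(x, edge_index, edge_attr))
        h = torch.relu(self.gat(h, edge_index))
        return global_mean_pool(h, batch)  # one embedding per graph
```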
For more information on SSL and GNNs, please refer to the relevant papers:
- Variance-Invariance-Covariance Regularization (VICReg) (Bardes et al., 2022)
- A Path Towards Autonomous Machine Intelligence (LeCun, 2022)
- Edge-Conditioned Convolution (ECC): (Simonovsky & Komodakis, 2017)
- Graph Attention Layer (GAT): (Veličković et al., 2018)
The implementation of the T-VICReg loss described in the paper is contained in loss.py, using the VICRegT1Loss nn.Module.
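For orientation, the following is a minimal sketch of the standard VICReg objective from Bardes et al. (2022), with its variance, invariance, and covariance terms. The actual VICRegT1Loss adapts this to pairs of time windows (see loss.py and the paper for the exact definition), and the coefficient values shown are the defaults from the original VICReg paper, not this repository.

```python
import torch
import torch.nn.functional as F


def vicreg_terms(z_a, z_b, sim_coeff=25.0, std_coeff=25.0, cov_coeff=1.0, eps=1e-4):
    """Standard VICReg loss on two [batch, dim] embeddings.

    For T-VICReg, z_a and z_b would come from two windows offset in time
    rather than from two augmentations of the same sample.
    """
    # Invariance: mean-squared error between the two embeddings.
    inv_loss = F.mse_loss(z_a, z_b)

    # Variance: hinge loss keeping each dimension's std above 1.
    std_a = torch.sqrt(z_a.var(dim=0) + eps)
    std_b = torch.sqrt(z_b.var(dim=0) + eps)
    var_loss = torch.mean(F.relu(1.0 - std_a)) + torch.mean(F.relu(1.0 - std_b))

    # Covariance: penalize off-diagonal entries of each covariance matrix.
    n, d = z_a.shape
    z_a_c = z_a - z_a.mean(dim=0)
    z_b_c = z_b - z_b.mean(dim=0)
    cov_a = (z_a_c.T @ z_a_c) / (n - 1)
    cov_b = (z_b_c.T @ z_b_c) / (n - 1)
    off_diag = lambda m: m - torch.diag(torch.diag(m))
    cov_loss = off_diag(cov_a).pow(2).sum() / d + off_diag(cov_b).pow(2).sum() / d

    return sim_coeff * inv_loss + std_coeff * var_loss + cov_coeff * cov_loss
```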
For detailed setup instructions and environment configuration, please see our installation guide.
Processed data formatted as PyTorch Geometric Data objects is not yet available for release. Meanwhile, the initial intracranial electroencephalogram (iEEG) dataset used for this project is publicly accessible on OpenNeuro under Accession Number ds003029. From ds003029, we selected 26 of the 88 available patients. For each patient's iEEG signal, we divided the signal into time windows of equal length and constructed an initial graph representation for each window as follows. The initial graph representations are fully connected graphs, with nodes corresponding to individual electrodes. To construct the edge features, we computed the Pearson correlation, phase-lock value (PLV), and coherence for each electrode pair, giving edge features of dimension 3. To construct the node features, we used the average energy of the electrode and the average energies at various frequency bands; due to variability in the iEEG data format, more or fewer frequency bands were available for certain patients, so node feature dimensions varied between patients. Each processed example is stored as a PyTorch Geometric Data object with fields [edge_index, x, edge_attr, y], where edge_index is a tensor defining the graph structure (analogous to a binary adjacency matrix), x is the node feature tensor, edge_attr is the edge feature tensor, and y is the target label tensor taking on values corresponding to the binary (ictal, nonictal) or multiclass (preictal, ictal, postictal) labels.
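A hedged sketch of how one such window could be turned into a PyG Data object is shown below. The sampling rate, the omission of the band energies from the node features, and the helper name window_to_graph are assumptions made for illustration; the actual procedure lives in preprocess.ipynb.

```python
import numpy as np
import torch
from scipy.signal import hilbert, coherence
from torch_geometric.data import Data


def window_to_graph(window, y, fs=1000):
    """Build a fully connected graph for one iEEG window.

    window: [n_channels, n_samples] array; y: integer class label.
    Edge features per electrode pair: Pearson correlation, PLV, mean coherence.
    Node features here are only the average channel energy (band energies omitted).
    """
    n = window.shape[0]
    phases = np.angle(hilbert(window, axis=1))  # instantaneous phase per channel

    src, dst, edge_feats = [], [], []
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            pearson = np.corrcoef(window[i], window[j])[0, 1]
            plv = np.abs(np.mean(np.exp(1j * (phases[i] - phases[j]))))
            _, cxy = coherence(window[i], window[j], fs=fs)
            src.append(i)
            dst.append(j)
            edge_feats.append([pearson, plv, cxy.mean()])

    node_feats = np.mean(window ** 2, axis=1, keepdims=True)  # average energy

    return Data(
        edge_index=torch.tensor([src, dst], dtype=torch.long),
        x=torch.tensor(node_feats, dtype=torch.float),
        edge_attr=torch.tensor(edge_feats, dtype=torch.float),
        y=torch.tensor([y], dtype=torch.long),
    )
```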
To run the entire pipeline, please refer to the train() function in train.py and its docstring. Please see the notebook train.ipynb for guidance on how to train each model. The main.py script is optimized for HPC on the Cedar cluster (Digital Research Alliance of Canada) and is not recommended for general use. For a tutorial on PyTorch Geometric and customized GNN models, please refer to tutorial.ipynb. To see how the initial graph representations are created, please refer to preprocess.ipynb. For information on the transfer learning process (implemented in train.py), please refer to transfer.ipynb.
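For readers who only want to see the shape of a supervised run, the snippet below is a generic PyTorch Geometric training loop over graph batches, not the repository's train() function (whose signature is documented in its docstring). Here, graphs is a hypothetical list of the Data objects described above, and the encoder and classification head reuse the illustrative ECCGATEncoder sketch from earlier.

```python
import torch
from torch.nn import CrossEntropyLoss, Linear
from torch_geometric.loader import DataLoader

# `graphs`: hypothetical list of Data objects produced as sketched above.
loader = DataLoader(graphs, batch_size=32, shuffle=True)

encoder = ECCGATEncoder(node_dim=graphs[0].num_node_features, edge_dim=3)
head = Linear(128, 2)  # 2 classes for the binary (ictal, nonictal) task
optimizer = torch.optim.Adam(list(encoder.parameters()) + list(head.parameters()), lr=1e-3)
criterion = CrossEntropyLoss()

for epoch in range(10):
    for batch in loader:
        optimizer.zero_grad()
        z = encoder(batch.x, batch.edge_index, batch.edge_attr, batch.batch)
        loss = criterion(head(z), batch.y)
        loss.backward()
        optimizer.step()
```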
- models.py: Contains self-supervised models: relative_positioning, temporal_shuffling, CPC (to be added), and VICReg (to be added); and supervised models: supervised (base model), downstream1, and downstream2.
- train.py: Implements the training loop for both self-supervised and supervised models. Includes logging with Weights and Biases (highly recommended). For a guide on how to train each model, please see train.ipynb.
- main.py: The primary script to run the training pipeline in parallel on multiple patients, optimized for Cedar cluster resources.
- preprocess.py: Includes helper functions for all preprocessing tasks, such as converting initial graph representations to PyG-compatible structures.
- patch.py: Patches pre-existing NumPy data from our lab to the PyG-compatible format; not recommended for general use unless your existing data fits the specifications outlined in preprocess.ipynb.
I would like to sincerely thank both Dr. Alan A. Díaz-Montiel and Dr. Milad Lankarany for their continued support throughout this research project and for their expert guidance. I am also extremely grateful for the support from the Temerty Faculty of Medicine, University of Toronto through the T-CAIREM Summer Studentship, which allowed me to focus on this work.
I'd also like to extend my gratitude to the researchers and institutions for their generosity in sharing their iEEG data through OpenNeuro ds003029. Special thanks to:
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, United States
- Epilepsy Center, Cleveland Clinic, Cleveland, United States
- Department of Neurosurgery, University of Miami Miller School of Medicine, Miami, United States
- Department of Neurology, University of Miami Miller School of Medicine, Miami, United States
- Neurology, University of Maryland Medical Center, Baltimore, United States
- Neurology, Johns Hopkins Hospital, Baltimore, United States
- Surgical Neurology Branch, NINDS, NIH, Bethesda MD
- Neurosurgery, and Epilepsy Center, University of Pittsburgh Medical Center, Pittsburgh, United States
- Institute for Computational Medicine, Johns Hopkins University, Baltimore, United States
For additional details on the dataset, please refer to its foundational paper.
For any queries, please contact xmootoo at gmail dot com.