Skip to content

Self-supervised learning with graph neural network backbones for seizure detection in iEEG data.

Notifications You must be signed in to change notification settings

xmootoo/ssl-seizure-detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

T-VICReg: Self-Supervised Learning for Seizure Detection

#gh-dark-mode-only #gh-light-mode-only

The T-VICReg pipeline.

Introduction

This codebase is the result of my work with T-CAIREM Summer Studentship from the Temerty Faculty of Medicine, University of Toronto to conduct research at the intersection of AI and neuroscience at the Neural Systems & Brain Signal Processing Lab. Project paper can be found here.

As the result, we have developed T-VICReg, a novel self-supervised learning (SSL) method for time series, enabling learned representations to be partially invariant to translations in time. These representations aim to capture information relevant to the past and the future, which is hypothesized to produce representations capable of capturing state transitions, useful for a variety of classification and prediction tasks. We validate our method on the OpenNeuro dataset ds003029 containing iEEG signals from epilepsy patients on the task of binary seizure classification (ictal, nonictal) and multiclass seizure detection (preictal, ictal, postictal). Fine-tuning the encoderfrom T-VICReg resulted in Top-1 accuracies of92.92% and 89.26%, compared to the supervised baseline with Top-1 accuracies of 89.23% and 84.07%, for the binary and multiclass tasks respectively. T-VICReg is noncontrastive, augmentation-free, and is compatible with continuous and discrete time series, allowing for flexible use in many contexts.

The GNN architecture comprises of Edge-Conditioned Convolution (ECC) and Graph Attention Network (GAT) layers, utilizing PyTorch and PyTorch Geometric libraries for standard deep learning and GNN implementation. For a more detailed description of our research, see projects/ssl-seizure-detection.

For more information on SSL and GNNs, please refer to the relevant papers:

Table of Contents

  1. T-VICReg
  2. Installation
  3. Data
  4. Usage
  5. File Descriptions
  6. License
  7. Contact

T-VICReg

The implementation of the T-VICReg loss found in the paper, is contained in loss.py, using the VICRegT1Loss nn.Module.

Installation

For detailed setup instructions and environment configuration, please see our installation guide.

Data

Processed data formatted as PyTorch Geometric Data objects is not yet available for release. Meanwhile, the initial intracranial electroencephalogram (iEEG) dataset utilized for this project is publicly accessible on OpenNeuro, identified by Accession Number ds003029. From ds003029, we selected 26 patients out of the 88 available in the dataset. For each patient iEEG signal, we divided the signal into time windows of equal length, and for each window an initial graph representation was constructed with the following method. The initial graph representations are fully connected graphs, with the nodes corresponding to individual electrodes. To construct the edge features, for each electrode pair, we computed the Pearson correlation, phase-lock value (PLV), and the coherence, giving us edge features of dimension 3. To construct the node features, we used the average energy of the electrode, and the average energies at various frequency bands; due to variability in the iEEG data format, there may have been more or less frequency bands available for certain patients, thus node features dimensions varied between $7-9$ (dependent on the patient). From this, we converted the data to the standard PyTorch Geometric Data format of [edge_index, x, edge_attr, y] where edge_index is a tensor defining the graph structure (similar to a binary adjacency matrix), x is the node feature tensor, edge_attr is the edge feature tensor, and y is the target label tensor taking on values $0$ or $1$ for binary classification of ictal (seizure) or nonictal (no seizure); and took on values $0,1,$ or $2$ for preictal (before seizure), ictal (seizure), and postictal (after seizure) respectively for multiclass classification. Due of the heterogeneity of the iEEG data from patient to patient (e.g., different number of electrodes, different placement of electrodes) model training and evaluation was conducted intra-patient, i.e., a separate model was created for each patient.

Usage

To run the entire pipeline, please refer to the train() function in train.py and the relevant docstring. Please see the notebook train.ipynb for guidance on how to train each model. The main.py script is optimized for HPC on the Cledar cluster (Digital Research Alliance of Canada), and is not recommended for general use. For a tutorial on PyTorch Geometric and customized GNN models, please refer to tutorial.ipynb. To see how the initial graph representations are created, please refer to preprocess.ipynb. For a information on the transfer learning process (implemented in train.py), please refer to transfer.ipynb.

File Descriptions

  • models.py: Contains self-supervised models: relative_positioning, temporal_shuffling, CPC (to be added), and VICReg (to be added); and supervised models: supervised (base model), downstream1, and downstream2.
  • train.py: Implements the training loop for both self-supervised and supervised models. Includes logging with Weights and Biases (highly recomended). For a guide on how to train each model please see train.ipynb.
  • main.py: The primary script to run the training pipeline in parallel on multiple patients, optimized for Cedar cluster resources.
  • preprocess.py: Includes helper functions for all preprocessing tasks, such as converting initial graph representations to PyG-compatible structures.
  • patch.py: Patches pre-existing Numpy data from our lab to the PyG-compatible format, not recommended for general use unless your existing data fits the specifications as outlined in preprocess.ipynb.

Acknowledgements

I would like to sincerely thank both Dr. Alan A. Díaz-Montiel and Dr. Milad Lankarany for their continued support throughout this research project and for their expert guidance. I am also extremely grateful for the support from the Temerty Faculty of Medicine, University of Toronto through the T-CAIREM Summer Studentship, which allowed me to focus on this work.

I'd also like to extend my gratitude to the researchers and institutions for their generosity in sharing their iEEG data through OpenNeuro ds003029. Special thanks to:

  • Department of Biomedical Engineering, Johns Hopkins University, Baltimore, United States
  • Epilepsy Center, Cleveland Clinic, Cleveland, United States
  • Department of Neurosurgery, University of Miami Miller School of Medicine, Miami, United States
  • Department of Neurology, University of Miami Miller School of Medicine, Miami, United States
  • Neurology, University of Maryland Medical Center, Baltimore, United States
  • Neurology, Johns Hopkins Hospital, Baltimore, United States
  • Surgical Neurology Branch, NINDS, NIH, Bethesda MD
  • Neurosurgery, and Epilepsy Center, University of Pittsburgh Medical Center, Pittsburgh, United States
  • Institute for Computational Medicine, Johns Hopkins University, Baltimore, United States

For additional details on the dataset, the foundational paper is available at with the following link:

Contact

For any queries, please contact xmootoo at gmail dot com.

About

Self-supervised learning with graph neural network backbones for seizure detection in iEEG data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published