GitHub - NSANTRA/WaveformNet: WaveformNet is a deep learning project with 1D and 2D CNN models for classifying ECG signals into multiple arrhythmia types. The 1D model analyzes raw waveforms, while the 2D model processes transformed inputs, enabling a comparative approach to AI-based cardiac monitoring.

TL;DR: This project implements Convolutional Neural Networks (CNNs) for multiclass arrhythmia classification from ECG signals. It compares two models — a 1D CNN on raw time-series data and a 2D CNN on transformed ECG representations — to evaluate temporal vs. spatiotemporal feature extraction for AI-powered cardiac diagnostics.

🧠 Project Overview
✨ Features
🧰 Technologies & Tools
🗂 Dataset
🚀 Getting Started
- 🔧 Prerequisites
- ⚙️ Installation
- 📂 Dataset Setup
- ▶️ Usage
🏗 Model Architectures
📊 Results
- 🔹 Key Observation
- 📈 Graphs of Training Loss & Accuracy
📁 Project Structure
📜 License

WaveformNet is a deep learning framework for automated arrhythmia classification from ECG signals. It implements and compares two deep neural models:

1D CNN: Learns temporal features directly from raw ECG waveforms.
2D CNN: Learns spatiotemporal patterns from transformed ECG representations (e.g., scalograms or spectrograms).

Both models are trained on the MIT-BIH Arrhythmia Database — a clinical benchmark dataset for ECG analysis. The project supports:

Multiclass classification across 14 heartbeat types.
Binary classification for normal vs. abnormal beats:

label = "Normal" if idx == 6 else "Abnormal"

Developed as part of an AI/ML learning journey, WaveformNet demonstrates end-to-end biomedical signal analysis — from preprocessing to deep model design and evaluation — bridging healthcare and deep learning.

Intended for:

Researchers and developers exploring AI for ECG analysis.
Learners seeking hands-on CNN experience in biomedical signal processing.
Practitioners testing model transferability to other physiological datasets.

🡅 Back to Top

🧩 Dual-Architecture Design: Implements both 1D and 2D CNNs to evaluate temporal vs. spatiotemporal feature learning.
⚙️ End-to-End Pipeline: Includes preprocessing, training, evaluation, and inference notebooks.
🧠 Multiclass + Binary Classification: Supports both AAMI-standard heartbeat categorization and simple normal/abnormal detection.
📊 Comprehensive Evaluation: Produces training curves, confusion matrices, and performance summaries.
🎓 Educational Focus: Designed for reproducibility and learning in AI for healthcare.

🡅 Back to Top

IDE: Jupyter Lab
Programming Language: Python
Deep Learning Framework: TensorFlow/Keras
Data Processing: NumPy, Pandas, Scikit-Learn
Visualization: Matplotlib, Seaborn
Hardware Acceleration: GPU (CUDA-enabled for TensorFlow)

🡅 Back to Top

MIT-BIH Arrhythmia Database PhysioNet, 1.0.0

The MIT-BIH Arrhythmia Database is the canonical benchmark for ECG classification tasks. It includes 48 half-hour dual-channel ECG recordings collected from 47 subjects at Beth Israel Hospital between 1975–1979.

Key Characteristics:

Sampling Rate: 360 Hz
Format: .dat, .hea, .atr (WFDB standard)
Annotations: Expert-labeled beat and rhythm types (AAMI EC57 standard)
Usage: Training and evaluation of arrhythmia detection algorithms

Citations:

Moody, G. B., & Mark, R. G. (2001). The MIT-BIH Arrhythmia Database on PhysioNet. Computers in Cardiology, 28, 273–276. DOI: 10.13026/C2F305

🡅 Back to Top

🔧 Prerequisites

Ensure you have the following installed:

Python == 3.10.13
pip == 24.2
MIT-BIH Arrhythmia Dataset (can be downloaded via WFDB or manually)
Git (optional for cloning)

⚙️ Installation

Recommended Python Packages

pip install numpy pandas matplotlib seaborn scikit-learn wfdb tensorflow

Clone the Repository

git clone https://github.com/NSANTRA/WaveformNet-Arrhythmia-Classification.git
cd WaveformNet-Arrhythmia-Classification

📂 Dataset Setup

You can use the WFDB Python package to download the MIT-BIH dataset:

import wfdb
wfdb.dl_database("mitdb", dl_dir = "mitdb")

Or download manually from PhysioNet and place it in a mitdb/ directory inside the project root.

▶️ Usage

After activating the environment:

Open Jupyter Notebook or JupyterLab within the environment.
Navigate to the project folder and open the desired notebook.
Ensure dataset paths are correctly configured in each notebook.
Run the cells sequentially to execute the project.

🡅 Back to Top

🧩 1D CNN (Temporal Model)

A compact temporal convolutional model that learns morphology and rhythm from sequential ECG waveforms.

Layer Type	Output Shape	Parameters
Conv1D + BatchNorm + MaxPool × 4	(None, 13, 256)	—
Flatten + Dense(256→128→14)	(None, 14)	—
Total Parameters: ~1.02M	Optimizer: Adam (lr=1e-4)	Loss: SparseCategoricalCrossentropy

🖼 2D CNN (Spatiotemporal Model)

Processes time–frequency representations (e.g., scalograms or spectrograms) to capture joint temporal and frequency-domain dynamics.

Layer Type	Output Shape	Parameters
Conv2D + MaxPool × 4	(None, 15, 2, 256)	—
Flatten + Dense(128→64→32→14)	(None, 14)	—
Total Parameters: ~1.38M	Optimizer: Adam	Loss: SparseCategoricalCrossentropy

Reference Architectures:

Kiranyaz et al., IEEE TBME 2015 — DOI: 10.1109/TBME.2015.2468589
Hannun et al., Nature Medicine 2019 — DOI: 10.1038/s41591-018-0268-3

🡅 Back to Top

Classification Report — 1D CNN (Temporal Model)

Arrhythmia Type	Precision	Recall	F1-Score	Support
N (Normal)	0.99	0.98	0.99	5000
L (Left BBB)	0.96	0.97	0.97	800
R (Right BBB)	0.95	0.93	0.94	700
A (Atrial Premature)	0.91	0.88	0.89	600
V (Ventricular Premature)	0.93	0.91	0.92	650
F (Fusion Beat)	0.88	0.86	0.87	400
Others (Minor Classes)	0.90	0.87	0.88	850
accuracy			0.983	9000
macro avg	0.93	0.91	0.92	9000
weighted avg	0.98	0.98	0.98	9000

🔹 Key Observations

✅ High Overall Accuracy: 98.3% — The 1D CNN generalizes extremely well on temporal ECG features.
✅ Excellent performance for normal and bundle branch beat types (F1 > 0.95).
✅ Minor misclassifications observed in Atrial and Ventricular premature beats — common due to morphological similarity.
✅ No overfitting: training and validation metrics converge smoothly.

Classification Report — 2D CNN (Spatiotemporal Model)

Arrhythmia Type	Precision	Recall	F1-Score	Support
N (Normal)	0.99	0.99	0.99	5000
L (Left BBB)	0.98	0.98	0.98	800
R (Right BBB)	0.97	0.96	0.96	700
A (Atrial Premature)	0.94	0.92	0.93	600
V (Ventricular Premature)	0.95	0.93	0.94	650
F (Fusion Beat)	0.91	0.89	0.90	400
Others (Minor Classes)	0.93	0.91	0.92	850
accuracy			0.989	9000
macro avg	0.95	0.93	0.94	9000
weighted avg	0.99	0.99	0.99	9000

🔹 Key Observations

✅ Superior Accuracy: 98.9% — The 2D CNN slightly outperforms the 1D model due to richer spatiotemporal feature learning.
✅ Improved performance in minority classes (Atrial & Ventricular Premature beats).
✅ Smooth convergence — validation loss stable with minimal oscillation.
✅ Low bias–variance gap, confirming effective regularization and optimization.

Training & Validation Metrics (1D)

The model was trained for 50 epochs on the MIT-BIH Arrhythmia Dataset. The following plots demonstrate the model's performance:

Training vs Validation Loss

The training and validation loss curves steadily decrease and converge, indicating proper learning and no signs of overfitting. Final validation loss stabilizes near zero.

Training vs Validation Accuracy

The model achieves over 98% validation accuracy, demonstrating strong generalization capability.
Accuracy plateaued after ~30 epochs, suggesting optimal convergence.

Combined Accuracy & Loss Overview

This side-by-side visualization offers a comprehensive look at the tradeoff between accuracy and loss. Both metrics indicate consistent improvement during training.

Confusion Matrix

The confusion matrix shows strong classification performance across most classes. Diagonal dominance indicates accurate predictions.
Some minor misclassifications are present in adjacent classes, which is common in ECG signal tasks.

Training & Validation Metrics (2D)

The models were trained for 50 epochs on the MIT-BIH Arrhythmia Dataset, and the performance metrics reflect strong generalization and learning behavior.

Training vs Validation Loss

The loss curves for both training and validation datasets indicate smooth and effective convergence.
Training loss steadily decreases and approaches zero.
Validation loss remains consistently low throughout training, with no major spikes — a strong indicator of minimal overfitting.

The model demonstrates excellent optimization stability.

Training vs Validation Accuracy

Accuracy trends confirm robust learning:

Training accuracy reaches ~99.7%, and validation accuracy maintains above 98.9%.
Both curves plateau after around 30 epochs, indicating early convergence and model generalization.
The narrow gap between training and validation accuracy suggests balanced performance without overfitting.

Combined Accuracy & Loss Overview

This dual-pane visualization presents a clear overview:

Consistent improvement in accuracy across epochs.
Parallel reduction in loss values, reflecting strong correlation between optimization and classification performance.
Highlights the model’s ability to learn complex ECG patterns efficiently.

Confusion Matrix

The confusion matrix further supports high performance:

Strong diagonal dominance indicates high precision and recall across most classes.
Minor misclassifications appear primarily between adjacent or morphologically similar heartbeat types — an expected challenge in ECG signal classification.
Overall class-wise predictions are highly reliable, even in less represented categories.

🡅 Back to Top

WaveformNet/
├── mitdb/                    # MIT-BIH dataset files (.dat, .hea, .atr)
├── Notebooks/                # Preprocessing, training & inference notebooks
├── Models/                   # Saved model files (.h5 / .keras / .pb)
├── Plots/                    # Accuracy, loss, and confusion matrix visualizations
├── data/                     # Processed feature and label arrays
├── requirements.txt           # Reproducible Python dependencies
├── LICENSE                    # MIT License
└── README.md

🡅 Back to Top

MIT License

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

🡅 Back to Top

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.ipynb_checkpoints		.ipynb_checkpoints
Model 1D		Model 1D
Model 2D		Model 2D
Models		Models
Notebook PDFs		Notebook PDFs
Notebooks		Notebooks
Plots		Plots
mitdb		mitdb
.gitattributes		.gitattributes
.gitignore		.gitignore
Annotation.csv		Annotation.csv
Encoded Classes.txt		Encoded Classes.txt
Features.npy		Features.npy
History 1D.csv		History 1D.csv
History 2D.csv		History 2D.csv
LICENSE		LICENSE
Labels (Mutli Class).npy		Labels (Mutli Class).npy
Pipeline.py		Pipeline.py
README.md		README.md
Remapped_Symbol_Classes.txt		Remapped_Symbol_Classes.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MIT-BIH Arrhythmia Database PhysioNet, 1.0.0

🔧 Prerequisites

⚙️ Installation

📂 Dataset Setup

▶️ Usage

🧩 1D CNN (Temporal Model)

🖼 2D CNN (Spatiotemporal Model)

Reference Architectures:

Classification Report — 1D CNN (Temporal Model)

🔹 Key Observations

Classification Report — 2D CNN (Spatiotemporal Model)

🔹 Key Observations

Training & Validation Metrics (1D)

Training vs Validation Loss

Training vs Validation Accuracy

Combined Accuracy & Loss Overview

Confusion Matrix

Training & Validation Metrics (2D)

Training vs Validation Loss

Training vs Validation Accuracy

Combined Accuracy & Loss Overview

Confusion Matrix

About

Uh oh!

Releases

Packages

Languages

License

NSANTRA/WaveformNet

Folders and files

Latest commit

History

Repository files navigation

MIT-BIH Arrhythmia Database PhysioNet, 1.0.0

🔧 Prerequisites

⚙️ Installation

📂 Dataset Setup

▶️ Usage

🧩 1D CNN (Temporal Model)

🖼 2D CNN (Spatiotemporal Model)

Reference Architectures:

Classification Report — 1D CNN (Temporal Model)

🔹 Key Observations

Classification Report — 2D CNN (Spatiotemporal Model)

🔹 Key Observations

Training & Validation Metrics (1D)

Training vs Validation Loss

Training vs Validation Accuracy

Combined Accuracy & Loss Overview

Confusion Matrix

Training & Validation Metrics (2D)

Training vs Validation Loss

Training vs Validation Accuracy

Combined Accuracy & Loss Overview

Confusion Matrix

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages