Weight Predictor Networks with Feature Selection (WPFS)

Official code for the paper "Weight Predictor Network with Feature Selection for Small Sample Tabular Biomedical Data", accepted at the AAAI Conference on Artificial Intelligence 2023

by Andrei Margeloiu, Nikola Simidjievski, Pietro Lio, Mateja Jamnik

TL;DR: WPFS is a general framework for learning neural networks from high-dimensional, small-sample data by reducing the number of learnable parameters and performing global feature selection. In addition to the predictor network, WPFS uses two small auxiliary networks: a weight predictor network that outputs the weight matrix of the first layer, and a feature-selection network that serves as an additional mechanism for regularisation.

Paper abstract: Tabular biomedical data is often high-dimensional but with a very small number of samples. Although recent work showed that well-regularised simple neural networks could outperform more sophisticated architectures on tabular data, they are still prone to overfitting on tiny datasets with many potentially irrelevant features. To combat these issues, we propose Weight Predictor Network with Feature Selection (WPFS) for learning neural networks from high-dimensional and small sample data by reducing the number of learnable parameters and simultaneously performing feature selection. In addition to the classification network, WPFS uses two small auxiliary networks that together output the weights of the first layer of the classification model. We evaluate on nine real-world biomedical datasets and demonstrate that WPFS outperforms other standard as well as more recent methods typically applied to tabular data. Furthermore, we investigate the proposed feature selection mechanism and show that it improves performance while providing useful insights into the learning task.
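The mechanism described above can be illustrated with a minimal sketch. This is not the repository's implementation. It assumes that each input feature has a small fixed embedding, that the weight predictor maps an embedding to one column of the first-layer weight matrix, and that the feature-selection network maps the same embedding to a score in (0, 1) that scales that column. Both auxiliary networks are reduced to single linear maps for brevity, and all names and sizes (`wpn`, `spn`, `first_layer_weights`, `D`, `M`, `K`) are hypothetical.

```python
import math
import random

random.seed(0)

# Illustrative sizes (assumptions): input features, embedding size, hidden units.
D, M, K = 1000, 8, 16

def rand_matrix(rows, cols):
    """Small random matrix as a list of rows."""
    return [[random.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

def matvec(mat, vec):
    """Matrix-vector product for list-of-rows matrices."""
    return [sum(w * v for w, v in zip(row, vec)) for row in mat]

# One fixed embedding per input feature (assumed to be given, not learned here).
embeddings = rand_matrix(D, M)

# Parameters of the two auxiliary networks, each a single linear map in this sketch.
wpn = rand_matrix(K, M)  # weight predictor: embedding -> one column of W (K values)
spn = rand_matrix(1, M)  # feature-selection network: embedding -> one raw score

def first_layer_weights():
    """Build the K x D first-layer matrix column by column."""
    columns, scores = [], []
    for e in embeddings:
        column = matvec(wpn, e)
        score = 1.0 / (1.0 + math.exp(-matvec(spn, e)[0]))  # sigmoid, in (0, 1)
        columns.append([score * w for w in column])
        scores.append(score)
    # Transpose the D columns into K rows.
    W = [list(row) for row in zip(*columns)]
    return W, scores

W, scores = first_layer_weights()
x = [random.random() for _ in range(D)]       # one input sample
hidden = [max(0.0, h) for h in matvec(W, x)]  # ReLU on the first hidden layer

direct_params = K * D         # weights of an ordinary first layer
auxiliary_params = K * M + M  # parameters of the two auxiliary maps above
print(direct_params, auxiliary_params)  # 16000 136
```

In this toy setting the first layer is produced from 136 learnable values instead of 16000, and the per-feature scores give a global ranking of the features. The real auxiliary networks are deeper, so the exact counts differ.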

Citation

For attribution in academic contexts, please cite this work as:

@inproceedings{margeloiu2023weights,
  title={Weight Predictor Network with Feature Selection for Small Sample Tabular Biomedical Data},
  author={Margeloiu, Andrei and Simidjievski, Nikola and Lio, Pietro and Jamnik, Mateja},
  booktitle={37th AAAI Conference on Artificial Intelligence},
  year={2023}
}

Code structure

src
- main.py: code for parsing arguments and starting an experiment
  - def parse_arguments - defines all command-line arguments
  - def train - starts training the model
  - important command-line arguments
    - dataset
    - model
    - feature_extractor_dims - the sizes of the hidden layers in the DNN
    - max_steps - maximum training iterations
    - batchnorm, dropout_rate
    - lr, batch_size, patience_early_stopping
    - lr_scheduler - learning rate scheduler
- dataset.py: loading the datasets
- models.py: neural network architectures (WPFS, DietNetworks, FsNet and Concrete Autoencoders)
- weights_predictor_network.py: defines the Weight Predictor Network (WPN)
- sparsity_network.py: defines the Sparsity Network (SPN)
data
- cll, lung, prostate, smk, toxicity
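As a rough illustration of the important command-line arguments listed under main.py, the sketch below builds an argparse parser with the same flag names. Only the flag names come from this README. The types, defaults and the `"wpfs"` model value are placeholder assumptions, so check `parse_arguments` in src/main.py for the actual definitions.

```python
import argparse

def parse_arguments(argv=None):
    """Illustrative parser; types and defaults are assumptions, not the repo's values."""
    parser = argparse.ArgumentParser(description="Illustrative WPFS-style arguments")
    parser.add_argument("--dataset", type=str, default="lung")
    parser.add_argument("--model", type=str, default="wpfs")
    # Sizes of the hidden layers in the DNN, e.g. --feature_extractor_dims 64 32
    parser.add_argument("--feature_extractor_dims", type=int, nargs="+",
                        default=[100, 100, 10])
    parser.add_argument("--max_steps", type=int, default=10000)
    parser.add_argument("--batchnorm", type=int, default=1)
    parser.add_argument("--dropout_rate", type=float, default=0.2)
    parser.add_argument("--lr", type=float, default=3e-3)
    parser.add_argument("--batch_size", type=int, default=8)
    parser.add_argument("--patience_early_stopping", type=int, default=200)
    parser.add_argument("--lr_scheduler", type=str, default="none")
    return parser.parse_args(argv)

# Passing an explicit list keeps the example independent of sys.argv.
args = parse_arguments(["--dataset", "prostate", "--feature_extractor_dims", "64", "32"])
print(args.dataset, args.feature_extractor_dims)  # prostate [64, 32]
```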

Installation

Requirement: All project dependencies are included in requirements.txt. We assume you have conda installed.

Installing WPFS

conda create python=3.7.9 --name WPFS
conda activate WPFS
pip install -r requirements.txt

Optional: Change BASE_DIR in /src/_config.py to point to the project directory on your machine.

Running an experiment

Step 1: Run the script run_experiment.sh

Step 2: Analyze the results in the notebook analyze_experiments.ipynb

Adding a new dataset is straightforward: search for your_custom_dataset in the codebase and replace it with your dataset.
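A custom dataset needs a loader that returns the feature matrix and the labels. The sketch below is a hypothetical example, assuming a CSV file with a header row, one sample per row, numeric features and an integer label in the last column. The function name `load_your_custom_dataset` and the returned `(X, y)` format are assumptions, so adapt them to whatever the your_custom_dataset placeholder in src/dataset.py expects.

```python
import csv
import io

def load_your_custom_dataset(csv_file):
    """Read rows of 'f1,...,fD,label' and return (X, y). Hypothetical helper."""
    reader = csv.reader(csv_file)
    next(reader)  # skip the header row with column names
    X, y = [], []
    for row in reader:
        if not row:  # ignore blank lines
            continue
        X.append([float(v) for v in row[:-1]])  # all columns but the last are features
        y.append(int(row[-1]))                  # last column is the class label
    return X, y

# In-memory demo file so the example runs without touching the disk.
demo = io.StringIO("gene_a,gene_b,label\n0.5,1.2,0\n0.1,3.4,1\n")
X, y = load_your_custom_dataset(demo)
print(len(X), len(X[0]), y)  # 2 2 [0, 1]
```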
