Feature Encapsulation for Neural Circuit Explainability (FENCE)

About

Updated Nov 2024

This repo contains training and testing code for Feature Encapsulation for Neural Circuit Explainability (FENCE).

FENCE diagram

Initial Setup

  1. Acquire a server with a CUDA-compatible GPU (e.g., by renting from a cloud GPU service like Vast.ai or Runpod). You need an H100 for training; an A6000 will work fine for inference.
  2. Clone this repo with git clone https://github.com/bongohead/fence.git.
  3. Set your git credentials for this repo if you don't have them configured globally already.
    git config user.email "{some_email_connected_to_your_github@email.com}"
    git config user.name "{your_name}"
    
  4. Set up a new virtual env if you need to (for most cloud GPU services there is no need; just use the base venv).
  5. To install necessary packages, run sh install_packages.sh.

Folder Structure

  • The create_phi3_from_scratch.ipynb and create_phi3_from_scratch_and_edit_model.ipynb notebooks contain exploratory code where we recreate Phi3 layer by layer and then test the effects of editing the model structure.
  • Python helpers are stored in py_helpers. py_helpers.fence.forwardpass contains code to recreate Phi3 and run a forward pass with manual layer-by-layer control, so that you can store or modify intermediate hidden state outputs (see the sketch after this list). py_helpers.fence.dataset contains code to convert ChatML-formatted dictionaries into a Phi3 instruct-format string, as well as a torch dataset object which carries the position information FENCE needs.
  • The notebook train_fence_v3.ipynb lets you train FENCE interactively; alternatively, run python3 train-v3.py to train via the CLI.
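
For orientation, the sketch below illustrates both ideas with stock transformers: rendering ChatML-style messages into Phi3's instruct format via the tokenizer's chat template, and capturing each decoder layer's output hidden state with forward hooks. This is only an approximation: the actual helpers in py_helpers.fence re-create the forward pass explicitly rather than using hooks, and also track FENCE position information; the checkpoint name below is an assumption.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed checkpoint; any Phi3 instruct checkpoint should behave similarly
model_name = 'microsoft/Phi-3-mini-4k-instruct'
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16).to('cuda').eval()

# ChatML-style dicts -> Phi3 instruct-format string
# (py_helpers.fence.dataset does this plus FENCE position tracking)
messages = [
    {'role': 'user', 'content': 'Name one reason interpretability is hard.'},
    {'role': 'assistant', 'content': 'Features are distributed across many neurons.'}
]
text = tok.apply_chat_template(messages, tokenize=False)

# Capture each decoder layer's output hidden state via forward hooks
captured = {}

def make_hook(ix):
    def hook(module, args, output):
        # Decoder layers may return a tuple whose first element is the hidden state
        hs = output[0] if isinstance(output, tuple) else output
        captured[ix] = hs.detach()
    return hook

handles = [layer.register_forward_hook(make_hook(ix)) for ix, layer in enumerate(model.model.layers)]

with torch.no_grad():
    inputs = tok(text, return_tensors='pt').to('cuda')
    model(**inputs)

for h in handles:
    h.remove()

print({ix: tuple(h.shape) for ix, h in captured.items()})  # {0: (1, seq_len, hidden_size), ...}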

Training

  • Create a secrets.env file containing WANDB_API_KEY for logging; a sketch of loading it appears below.
  • Run python3 train-v3.py to train FENCE via CLI. Models will be saved every epoch in models/.
  • Note that the training code currently uses flash attention and bfloat16 (except for the self-attention modules); edit these settings if needed.
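
For reference, a minimal sketch of how the WANDB_API_KEY in secrets.env can be loaded before training (assuming python-dotenv is installed; train-v3.py may load it differently):

import os
import wandb
from dotenv import load_dotenv

load_dotenv('secrets.env')  # exports WANDB_API_KEY into the process environment
wandb.login(key=os.environ['WANDB_API_KEY'])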

Remote SSH Development

The instructions below cover setting up the correct environment on a cloud GPU server. You can disregard them if you are developing locally, though either way you need access to a CUDA GPU with enough VRAM.

It's advisable to use VSCode or Positron to run the code over SSH, instead of the default Jupyter install provided by most GPU rental providers.

Most GPU rental platforms support connecting remotely via an SSH key.

  1. First, generate an SSH key pair on your local machine: ssh-keygen -t ed25519.
  2. Copy the resulting public key (use cat ~/.ssh/id_ed25519.pub) into the GPU provider's corresponding settings page (on Runpod, go to Settings -> SSH Public Keys).
  3. Test whether you can connect via ssh [username]@[ip] -p [port] -i ~/.ssh/id_ed25519.

Next, to connect via VSCode/Positron:

  1. Install the remote-ssh extension.
  2. Enter the command palette and open Remote-SSH: Open SSH Configuration File.
  3. Add the lines below to the file, substituting in the correct Unix user, port, and hostname provided by the rental service.
    Host gpu
        User [user]
        HostName [ip]
        IdentityFile ~/.ssh/id_ed25519
        Port [port]
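
Once the config file is saved, you can test the alias from a terminal with ssh gpu, or pick the gpu host directly from the Remote-SSH extension's host list.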
    

Monitoring

You can use the following function to monitor GPU memory:

import torch

def check_memory():
    # Report memory usage on GPU 0, in GB
    gb = 1024 ** 3
    print(f"Allocated: {torch.cuda.memory_allocated(0) / gb:.2f}GB")
    print(f"Reserved: {torch.cuda.memory_reserved(0) / gb:.2f}GB")
    print(f"Total: {torch.cuda.get_device_properties(0).total_memory / gb:.2f}GB")

check_memory()

Disk space can be monitored with the command du -hs $HOME /workspace/*.
