GAN Toolkit

A personal toolkit for Generative Adversarial Networks (GANs) in PyTorch.

Features

Modular Architecture: Easily extendable codebase with separate modules for datasets, models, trainers, and utilities.
Flexible Data Loading: Supports various datasets with customizable transformations.
Docker Support: Containerized environment for consistent deployment.
Configuration Management: YAML configuration files for easy hyperparameter tuning and experiment management.
Unit Testing: Comprehensive tests for reliability and robustness.
Logging and Visualization: Integrated logging with TensorBoard support for monitoring training progress.

Getting Started

Prerequisites

Operating System: Linux, macOS, or Windows
Python: Version 3.6 or higher
PyTorch: Version 1.9.0 or higher
CUDA: For GPU support (optional)
Git: Version control system
Docker: For containerization (optional)

Installation

Clone the Repository

git clone https://github.com/yourusername/gan-toolkit.git
cd gan-toolkit

Setting Up the Virtual Environment

It's recommended to use a virtual environment to manage project dependencies.

Setting Up the Virtual Environment

It's recommended to use a virtual environment to manage project dependencies and prevent conflicts with other projects on your system.

Using `venv`

Create a Virtual Environment

# For Linux/macOS
python3 -m venv venv

# For Windows
python -m venv venv

Upgrade `pip`

pip install --upgrade pip

Install Dependencies

pip install -r requirements.txt

Verify Installation

pip list

This should display all the packages listed in requirements.txt.

Data Preparation

Downloading the Dataset

We provide scripts to download datasets. By default, the project uses the Cervical Cancer (Kaggle) dataset.

Steps:

Set Up Kaggle API Credentials
- Sign in to Kaggle.
- Go to your account settings and select "Create New API Token". This will download a kaggle.json file.
Place the kaggle.json file in the appropriate directory:
- Linux/macOS: ~/.kaggle/kaggle.json
- Windows: C:\Users\<YourUsername>\.kaggle\kaggle.json
Ensure the file has the correct permissions:
```
chmod 600 ~/.kaggle/kaggle.json
```
Download the Dataset: Run the data download script:
```
python scripts/download_data.py
```

Note: The dataset will be downloaded to the data/ directory by default.

Using Your Own Dataset

If you prefer to use your own dataset:

Organize Your Dataset

Place your dataset in the data/ directory with the following structure:

data/
└── your_dataset_name/
    ├── class1/
    │   ├── image1.jpg
    │   ├── image2.jpg
    │   └── ...
    └── class2/
        ├── image3.jpg
        ├── image4.jpg
        └── ...

Update the Configuration

Edit the configs/config.yaml file:

data:
  dataset_path: './data/your_dataset_name'
  # ... other configurations

Configuration

All configurable parameters are stored in configs/config.yaml. Adjust hyperparameters, paths, and other settings as needed.

Example `config.yaml`:

data:
  dataset_path: './data/cervical-cancer-dataset'
  image_size: 64
  batch_size: 128
  num_workers: 4
  image_channels: 3
  mean: [0.5, 0.5, 0.5]
  std: [0.5, 0.5, 0.5]

training:
  num_epochs: 25
  learning_rate: 0.0002
  batch_size: 128
  latent_dim: 100

model:
  name: 'dcgan'
  latent_dim: 100
  feature_d: 64
  feature_g: 64

sample_dir: 'samples'
model_dir: 'models'

Usage

Training DCGAN

Run the training script with the desired configuration:

python src/main.py --config configs/config.yaml

Running the Jupyter Notebook

For an interactive demonstration, use the provided Jupyter notebook:

Start Jupyter Notebook

jupyter notebook

Open the Notebook

Navigate to notebooks/gan_example.ipynb and open it.

Adjust Paths

Ensure that the notebook can access the project modules by adjusting the system path if necessary:

import sys
sys.path.append('../')

Run Cells

Execute the cells sequentially to see the model in action.

Docker

Installing Docker

Follow the official Docker installation guide for your operating system:

Windows and macOS: Docker Desktop
Linux: Docker Engine

Building the Docker Image

Navigate to the project root:
```
cd /path/to/gan-toolkit
```
Build the Docker image:
```
docker build -t gan-toolkit .
```

Running the Docker Container

docker run --gpus all -v $(pwd):/app gan-toolkit

Notes:

--gpus all: Enables GPU access inside the container (requires NVIDIA Docker support).
-v $(pwd):/app: Mounts the current directory into the container.

NVIDIA Docker Support (For GPU Access):

Install the NVIDIA Container Toolkit:

Set up the repository:

distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \
    && curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add - \
    && curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list

Install the NVIDIA Container Toolkit:

sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit
sudo systemctl restart docker

Running the Container with NVIDIA Runtime:

docker run --runtime=nvidia -v $(pwd):/app gan-toolkit

Testing

We use pytest for unit testing.

Running Tests

pytest tests/

Adding Tests

Tests are located in the tests/ directory.

Follow the existing examples to add new tests.

Ensure your tests cover:

Data loading
Model initialization
Training steps
Utility functions

Contributing

We welcome contributions! Please read CONTRIBUTING.md for details on our code of conduct and the process for submitting pull requests.

Contribution Guidelines

Fork the repository.
Create a branch for your feature (git checkout -b feature/new-feature).
Commit your changes (git commit -am 'Add new feature').
Push to the branch (git push origin feature/new-feature).
Open a Pull Request.

Code Style

Follow PEP 8 guidelines.
Use docstrings to document modules, classes, and methods.
Add comments to explain complex code segments.
Write unit tests for new features.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contact

Your Name

Email: georgiospathanasiou@yahoo.com
GitHub: Georgios Athanasiou

Feel free to reach out for any questions or suggestions!

Acknowledgments

Inspired by the PyTorch DCGAN tutorial.
Datasets provided by Kaggle.

Disclaimer: This project is intended for educational purposes. Use it responsibly and adhere to the licensing terms of any datasets or external resources used.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
configs		configs
notebooks		notebooks
src		src
tests		tests
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

License

ga83wuw/GANkit

Folders and files

Latest commit

History

Repository files navigation

GAN Toolkit

Table of Contents

Features

Getting Started

Prerequisites

Installation

Setting Up the Virtual Environment

Setting Up the Virtual Environment

Using venv

Create a Virtual Environment

Upgrade pip

Install Dependencies

Verify Installation

Data Preparation

Downloading the Dataset

Steps:

Using Your Own Dataset

Organize Your Dataset

Update the Configuration

Configuration

Example config.yaml:

Usage

Training DCGAN

Running the Jupyter Notebook

Start Jupyter Notebook

Open the Notebook

Adjust Paths

Run Cells

Docker

Installing Docker

Building the Docker Image

Running the Docker Container

Notes:

NVIDIA Docker Support (For GPU Access):

Install the NVIDIA Container Toolkit:

Running the Container with NVIDIA Runtime:

Testing

Running Tests

Adding Tests

Contributing

Contribution Guidelines

Code Style

License

Contact

Acknowledgments

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Using `venv`

Upgrade `pip`

Example `config.yaml`:

Packages