Skip to content

Latest commit

 

History

History
152 lines (123 loc) · 9.17 KB

README.md

File metadata and controls

152 lines (123 loc) · 9.17 KB

FEDn Ultralytics Tutorial

Introduction

This tutorial will guide you through the process of implementing a federated learning setup using the [FEDn] framework in combination with [Ultralytics] YOLOv8 models. Federated learning allows multiple clients to collaboratively train a global model without sharing their local datasets, ensuring data privacy and security. This tutorial is designed for users familiar with machine learning and federated learning concepts and will provide step-by-step instructions on configuring and running a complete federated learning workflow.

By the end of this tutorial, you will have built a distributed training environment where clients independently train local models, and a global model is aggregated on the server. You will also learn how to use Ultralytics YOLOv8 models for object detection tasks in this federated setting.

  1. Starting the server in FEDn – Learn how to initiate the central server that coordinates federated learning activities.
  2. Cloning the repository – Set up the project by cloning the necessary repository for configuration and deployment.
  3. Installing prerequisites – Install all required dependencies for the client environments.
  4. Setting up the dataset – Properly structure and configure your dataset to be used with Ultralytics models.
  5. Setting up configurations – Configure the model, such as defining the number of classes for the YOLOv8 model.
  6. Building the compute package – Build the compute package to prepare for the training process.
  7. Initializing the seed model – Generate the initial model to start the training process.
  8. Initializing the server-side – Set up the server-side of the federated learning system.
  9. Connecting and starting the clients – Connect the clients that will participate in the federated training.
  10. Training the global model – Observe how the global model improves through the aggregation of client updates and monitor training progress.

By following these steps, you will not only gain hands-on experience with the FEDn platform but also learn how to integrate object detection tasks with YOLOv8 in a federated learning environment.

Prerequisites

Step 1: Starting the server in FEDn

Firstly, create an account on the FEDn studio platform https://fedn.scaleoutsystems.com/signup. Once you are logged in, you need to start a new project by clicking on the New project button. This initializes the server which later will be used to run the federated learning process.

Step 2: Cloning the repository

Next, you need to clone the repository:

git clone https://github.com/scaleoutsystems/fedn-ultralytics-tutorial

Then navigate into the repository:

cd fedn-ultralytics-tutorial

This repository contains all the necessary files and configurations for the federated learning setup.

Step 3: Installing prerequisites

Next, you need to install the prerequisites. You can install everything using pip:

pip3 install -r requirements.txt

This is recommended to be done in a virtual environment to avoid conflicts with other packages.

Step 4: Setting up the dataset

Start setting up the dataset by creating a directory named 'datasets' inside the repository:

mkdir datasets

Then copy the path to the 'datasets' folder by running the following command:

echo "$(pwd)/datasets" | pbcopy

Ultralytics uses a specific directory to store datasets, which you can configure using the datasets_dir option in YOLO settings. To set this up to the 'datasets' directory you previously created, run the following command, replacing <path_to_dataset> with the path you just copied:

yolo settings datasets_dir=<path_to_dataset>

After setting the dataset directory, you’ll need to organize your data into a folder named 'fed_dataset' inside the datasets directory. Your final folder structure should look like this:

datasets/
  fed_dataset/
    train/
      images/
        image1.jpg
        image2.jpg
        ...
      labels/
        image1.txt
        image2.txt
        ...
    valid/
      images/
        image1.jpg
        image2.jpg
        ...
      labels/
        image1.txt
        image2.txt
        ...

Each label file should correspond to an image file, and the format of each label should be:

<class> <x_center> <y_center> <width> <height>
<class> <x_center> <y_center> <width> <height>
...

Each line corresponds to one bounding box in the image.

For further details on how to prepare your dataset, you can visit https://docs.ultralytics.com/datasets/.

For getting started quickly, you can navigate into the examples repository to download and partition a sample dataset.

Step 5: Setting up configurations

Global configurations

To set up your YOLOv8 model, you need to configure the global_config.yaml inside the 'client' folder. Here you choose the number of classes for the YOLOv8 model by setting the num_classes parameter, and respective class names. You also choose which YOLOv8 model to use by setting the model_size parameter.

Local client configurations

Each client can set different training configurations in the client_config.yaml file. This file contains the configurations for the client environments, such as the number of local epochs, and batch size. You can adjust these configurations to suit each client's hardware and training requirements.

Step 6: Building the compute package

Once you’ve completed all the configurations, you can build the compute package by running the following command:

python3 client/setup.py
fedn package create -p client

This creates he compute package package.tgz which contains all the necessary files and configurations for the client environments. If you make any changes to the global_config.yaml, you’ll need to rebuild (Step 6) and reupload (Step 8) the compute package to apply the updates. For changes in the client_config.yaml, you don't need to rebuild the compute package.

Step 7: Initializing the seed model

To initialize the seed model, run the following command:

fedn run build -p client

This command will generate the seed model that will be used as the starting point for the federated learning process.

Step 8: Initializing the server-side

The next step is to initialize the server side. This is done by uploading the compute package and the seed model to the FEDn studio platform. This is done by pressing the Sessions button and then the "New session" button. Here you can upload the compute package package.tgz and the seed model seed.npz. Once the compute package and seed model are uploaded, you can create the session by pressing the "Create session" button. Here you also configure the total number of rounds and aggregator function for the federated learning process.

Before starting the training process, you need to connect the clients to the server which is done in the next step.

Step 9: Connecting and starting the clients

To connect a client to the server, you need to hand each client a client.yaml file. This file contains the necessary configurations for the client to connect to the server and gain access to the compute package. Connect clients by pressing the "Clients" button on the left hand side. Here you can download a client.yaml file for each client. Place the client.yaml file in the repository and start the client by running following command:

fedn client start -in client.yaml

This starts the client and connects it to the server. Repeat this process for each client you want to connect to the server.

Step 10: Training the global model

Once the clients are running, you can start the global training by pressing the Start session button in FEDn studio. This will initiate the federated learning process, where the global model is trained by aggregating the updates from the clients. Now you will see things happening on both the server and client side. You can monitor the training progress on the FEDn studio platform where metrics such as recall, precision, and mAP scores are shown.

Once training is completed, you can download any model from a certain round in the session. The model can be used for inference on new data.

Conclusion

In this tutorial, you have learned how to implement Ultralytics YOLOv8 models in a federated learning setting using the FEDn platform. By following the steps outlined in this tutorial, you have successfully set up a distributed training environment where clients independently train local models, and a global model is aggregated on the server. You have also learned how to configure the dataset, set up the model configurations, build the compute package, and start the federated learning process. By completing this tutorial, you have gained hands-on experience with federated learning and object detection tasks using Ultralytics models.

Note

Steps 1, 2, 5, 6, 7 and 8 only need to be done once to set up the federated learning environment in FEDn.

To connect a new client, the only steps that needs to be followed are step 2, 3, 4 and 9. Each client can have different configurations in the client_config.yaml file to account for different hardware and training requirements.