We propose a novel method to compute a new ground truth, focusing on areas of high uncertainty among experts. This approach allows us to better understand the individual characteristics, extract their behavior, and use this insight to create a more sophisticated ground truth using maximum-likelihood. These findings contribute to the ongoing discussion of leveraging machine learning algorithms for medical image segmentation, particularly in scenarios involving multiple human annotators.
The surge of supervised learning methods for segmentation lately has underscored the critical role of label quality in predicting performance. This issue is prevalent in the domain of medical imaging, where high annotation costs and inter-observer variability pose significant challenges. Acquiring labels commonly involves multiple experts providing their interpretations of the "true" segmentation labels, each influenced by their individual biases. The blind acceptance of these noisy labels as the ground truth restricts the potential effectiveness of segmentation algorithms. Hereby, we apply coupled convolutional neural network approaches, previously tested on artificial data, to a small-sized real-world dataset of bovine cumulus oocyte complexes for the first time. This dataset is crucial for healthy embryo development. The application revealed an important challenge: the inability to effectively learn distinct confusion matrices for each expert due to large areas of agreement. In response, we propose a novel method that focuses on areas of high uncertainty. This approach allows us to understand the individual characteristics better, extract their behavior, and use this insight to create a more sophisticated ground truth using maximum likelihood. These findings contribute to the ongoing discussion of leveraging machine learning algorithms for medical image segmentation, particularly in scenarios involving multiple human annotators.
Firstly, this is an insight into a sample of the data for cumulus oocyte complexes.

and the areas of uncertainty in another sample:

The architecture of the model using the proposed local confusion matrices can be found below:

The architecture of the model using the proposed global confusion matrices can be found below:

Visualisation of the confusion matrices for each annotator, focusing on the areas of high uncertainty. This representation shows a clear behavior of each expert on the most difficult areas to identify.

The architecture of the model to optimize the ground truth, using the confusion matrices obtained on the uncertain areas and the maximum likelihood theorem:

To set up the project environment, follow these steps:
-
Clone the repository:
git clone <repository-url>
-
Create a new virtual environment:
python -m venv maximum_likelihood_gt_env
-
Activate the virtual environment (Unix/Linux):
source maximum_likelihood_gt_env/bin/activate -
Navigate to the project directory:
cd project-directory -
Install project dependencies:
pip install -r requirements.txt
We recreate the proposed approaches for disentangling the ground truth from human experts' biases...
python main.py config/config_skin.jsonpython main.py config/config_coc.jsonpython main.py config/config_coc3.jsonWe propose a novel method to compute a new ground truth, focusing on areas of high uncertainty among experts...
python bin/maximum_likelihood_gt.py config/config_mlgt.jsonThank you for considering contributing to this project! Contributions are welcome and encouraged. To contribute, please follow these steps:
-
Fork the repository and clone it to your local machine.
-
Create a new branch for your changes:
git checkout -b feature/your-feature-name
-
Make your desired changes and ensure that the code passes all tests.
-
Commit your changes:
git commit -m "Add your commit message" -
Push your branch to the remote repository:
git push origin feature/your-feature-name
-
Open a pull request on the main repository.
-
Provide a detailed description of your changes and why they are valuable.
-
Collaborate with the maintainers to review and refine your changes.
-
Once your changes are approved, they will be merged into the main branch.
The project is licensed under the ***.
For any questions or inquiries, please contact:
- Georgios Athanasiou
- Email: gathanasiou@iiia.csic.es
- Project Website: GitHub Repository
- Issue Tracker: GitHub Issues