Official implementation of "What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models"
DICE (DIfference Coherence Estimator) is a novel framework designed to detect and evaluate instruction-guided image edits. It identifies differences between original and edited images and assesses their coherence with the editing prompt using Multimodal Large Language Models (MLLMs).
The framework consists of two main components:
- Difference Detector: Identifies localized differences between the original and edited images
- Coherence Estimator: Assesses the relevance of detected changes with respect to the editing prompt
- Object-level difference detection between image pairs
- Semantic coherence evaluation of edits
- Structured text generation for edit analysis
- High correlation with human judgment
If you find this work useful for your research, please cite our paper:
@inproceedings{baraldi2025changed,
title={What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models},
author={Baraldi, Lorenzo and Bucciarelli, Davide and Betti, Federico and Cornia, Marcella and Sebe, Nicu and Cucchiara, Rita and others},
booktitle={Proceedings of the 2025 IEEE/CVF International Conference on Computer Vision},
year={2025}
}- Lorenzo Baraldi - University of Pisa, Italy
- Davide Bucciarelli - University of Modena and Reggio Emilia, Italy
- Federico Betti - University of Trento, Italy
- Marcella Cornia - University of Modena and Reggio Emilia, Italy
- Lorenzo Baraldi - University of Modena and Reggio Emilia, Italy
- Nicu Sebe - University of Trento, Italy
- Rita Cucchiara - University of Modena and Reggio Emilia, Italy
This folder contains a self-contained copy of the assets required to run simple_evaluation_example.py
simple_evaluation_example.py: evaluation entry point.original.jpg,edited.jpg: sample original/edited imagesediting_evaluation/: evaluation package.dataset/: dataset utilities (includes coherence dataset loader).requirements.txt,README.md: reference documentation and dependencies.
Download the required model weights from: https://huggingface.co/collections/aimagelab/dice
Run the script from within this directory to keep relative paths valid:
cd simple_evaluation_bundle
python simple_evaluation_example.pyFor questions or issues, please open an issue in the GitHub repository.