Towards semantic consistency: Dirichlet energy driven robust multi-modal entity alignment

This repository contains the code for DESAlign, the method proposed in our paper "Towards Semantic Consistency: Dirichlet Energy Driven Robust Multi-Modal Entity Alignment", which has been accepted by ICDE 2024.

Introduction

Multi-Modal Entity Alignment (MMEA) is a pivotal task in Multi-Modal Knowledge Graphs (MMKGs), seeking to identify identical entities by leveraging associated modal attributes. However, real-world MMKGs confront the challenge of semantic inconsistency arising from diverse and incomplete data sources. This inconsistency is predominantly caused by the absence of specific modal attributes, manifesting in two distinct forms: disparities in attribute counts or the absence of certain modalities. Current methods address these issues through attribute interpolation, but their reliance on predefined distributions introduces modality noise, compromising the original semantic information. Furthermore, the absence of a generalizable theoretical principle hampers progress towards achieving semantic consistency. In this work, we propose a generalizable theoretical principle by examining semantic consistency from the perspective of Dirichlet energy. Our research reveals that, in the presence of semantic inconsistency, models tend to overfit to modality noise, leading to over-smoothing and performance oscillations or declines, particularly in scenarios with a high rate of missing modalities. To overcome these challenges, we propose DESAlign, a robust method that addresses the over-smoothing caused by semantic inconsistency and interpolates missing semantics using existing modalities. Specifically, we devise a training strategy for multi-modal knowledge graph learning based on our proposed principle. Then, we introduce a propagation strategy that utilizes existing features to provide interpolation solutions for missing semantic features. DESAlign outperforms existing approaches across 60 benchmark splits, encompassing both monolingual and bilingual scenarios, achieving state-of-the-art performance. Experiments on splits with a high proportion of missing modal attributes demonstrate its effectiveness, providing a robust MMEA solution to semantic inconsistency in real-world MMKGs.
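
To make the two ideas above concrete, here is a minimal NumPy sketch of Dirichlet energy and of filling in missing node features by propagation from known ones. It is an illustration under stated assumptions, not the DESAlign implementation: the function names (`normalized_adjacency`, `dirichlet_energy`, `propagate_features`), the toy path graph, and the choice of the symmetrically normalized Laplacian are all hypothetical and chosen for clarity. The propagation step is the generic "diffuse, then reset known rows" scheme, which may differ from the strategy used in the paper.

```python
import numpy as np


def normalized_adjacency(adj):
    """Return D^{-1/2} A D^{-1/2} for a symmetric adjacency matrix."""
    adj = np.asarray(adj, dtype=float)
    deg = adj.sum(axis=1)
    inv_sqrt = np.zeros_like(deg)
    nonzero = deg > 0
    inv_sqrt[nonzero] = deg[nonzero] ** -0.5
    return adj * inv_sqrt[:, None] * inv_sqrt[None, :]


def dirichlet_energy(x, adj):
    """Compute tr(X^T L X) with L = I - D^{-1/2} A D^{-1/2}."""
    lap = np.eye(len(adj)) - normalized_adjacency(adj)
    return float(np.trace(x.T @ lap @ x))


def propagate_features(x, known_mask, adj, num_iters=50):
    """Fill rows where known_mask is False by repeated neighbor diffusion.

    Rows where known_mask is True are reset to their original values
    after every step, so observed features are never overwritten.
    """
    a_norm = normalized_adjacency(adj)
    out = np.where(known_mask[:, None], x, 0.0)
    for _ in range(num_iters):
        out = a_norm @ out
        out[known_mask] = x[known_mask]
    return out


# Toy example (assumed data): a 4-node path graph 0-1-2-3 where
# only the two end nodes have observed features.
adj = np.array([
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
])
features = np.array([[1.0, 2.0], [0.0, 0.0], [0.0, 0.0], [1.0, 2.0]])
known = np.array([True, False, False, True])

filled = propagate_features(features, known, adj)
energy_zero_filled = dirichlet_energy(features, adj)
energy_propagated = dirichlet_energy(filled, adj)
print(energy_zero_filled, energy_propagated)
```

In this toy setting, the propagated features give a lower Dirichlet energy than zero-filling the missing rows, while the observed rows stay exactly as given. That is the sense in which propagation from existing features yields a smoother, more consistent completion than substituting a fixed placeholder.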

Framework

Dataset

The processed dataset can be downloaded from Google Drive.

Environment

Python = 3.7
PyTorch = 1.6.0
numpy = 1.19.2
Transformers = 4.21.3
easydict = 1.10
unidecode = 1.3.6
tensorboard = 2.11.0

Cite

Please consider citing this paper if you find the code or data useful. Thanks a lot ~

@inproceedings{wang2024towards,
  title={Towards semantic consistency: Dirichlet energy driven robust multi-modal entity alignment},
  author={Wang, Yuanyi and Sun, Haifeng and Wang, Jiabo and Wang, Jingyu and Tang, Wei and Qi, Qi and Sun, Shaoling and Liao, Jianxin},
  booktitle={2024 IEEE 40th International Conference on Data Engineering (ICDE)},
  pages={3559--3572},
  year={2024},
  organization={IEEE}
}
