HYU-NLP/MAGIC


MAGIC: A Multi-Hop and Graph-Based Benchmark for Inter-Context Conflicts in Retrieval-Augmented Generation

Official repository for "MAGIC: A Multi-Hop and Graph-Based Benchmark for Inter-Context Conflicts in Retrieval-Augmented Generation". [📄 Paper (arXiv)]

Jungyeon Lee, Kangmin Lee and Taeuk Kim. Accepted to EMNLP 2025 Findings.


Overview

MAGIC is a large-scale benchmark designed to evaluate knowledge conflict detection and localization in Retrieval-Augmented Generation (RAG) systems. It focuses on multi-hop reasoning and graph-structured contexts, where inter-context knowledge conflicts emerge between retrieved passages.

Repository Structure

.
├── dataset
│   ├── multi-hop/
│   │   ├── 1-multi-hop_conflict.json/
│   │   ├── 2-multi-hop_conflict.json/
│   │   ├── 3-multi-hop_conflict.json/
│   │   ├── 4-multi-hop_conflict.json/
│   ├── single-hop/
│   │   ├── 1-single-hop_conflict.json/
│   │   ├── 2-single-hop_conflict.json/
│   │   ├── 3-single-hop_conflict.json/
│   │   ├── 4-single-hop_conflict.json/
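
The layout above follows a regular naming pattern, so the eight dataset paths can be generated rather than typed out. The following is a minimal sketch: the helper name `expected_dataset_files` is hypothetical and not part of this repository, and it assumes the entries ending in `.json` are plain files located under a `dataset` directory at the repository root.

```python
from pathlib import PurePosixPath


def expected_dataset_files(root="dataset"):
    """Build the dataset paths listed in the repository tree.

    Hypothetical helper: it only mirrors the naming pattern shown in the
    tree (<n>-<hop>_conflict.json for n = 1..4) and does not touch the
    file system.
    """
    base = PurePosixPath(root)
    return [
        base / hop / f"{n}-{hop}_conflict.json"
        for hop in ("multi-hop", "single-hop")
        for n in range(1, 5)
    ]


if __name__ == "__main__":
    for path in expected_dataset_files():
        print(path)
```

Pure paths are used so the sketch behaves the same on any operating system; swap in `pathlib.Path` to check that the files exist in a local clone.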

Dataset Structure

  • ID: Unique identifier for each sample.
  • rel_id: Relation ID corresponding to the target knowledge relation (e.g., P150 from Wikidata).
  • subgraph: A set of surrounding triplets retrieved via DFS traversal from the source knowledge graph around the original_triplet.
  • original_triplet: Randomly sampled target triplet from the source graph; serves as the anchor for conflict formation.
  • perturb_triplet: Modified triplet(s) intentionally constructed to introduce a knowledge conflict with the original_triplet.
  • context1, context2: Textual representations of the original_triplet and perturb_triplet, respectively.
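
The fields above can be checked when loading a file. The sketch below makes assumptions the README does not confirm: that each dataset file holds a JSON array of sample objects, and that the keys are spelled exactly as listed. `load_samples` and `missing_fields` are hypothetical helper names, not part of the repository.

```python
import json

# Field names as documented in the list above.
DOCUMENTED_FIELDS = (
    "ID",
    "rel_id",
    "subgraph",
    "original_triplet",
    "perturb_triplet",
    "context1",
    "context2",
)


def missing_fields(sample):
    """Return the documented field names that are absent from one sample."""
    return [name for name in DOCUMENTED_FIELDS if name not in sample]


def load_samples(path):
    """Load one dataset file.

    Assumption: the file contains a JSON array of sample objects.
    Adjust this function if the actual top-level structure differs.
    """
    with open(path, encoding="utf-8") as f:
        data = json.load(f)
    if not isinstance(data, list):
        raise ValueError(f"expected a JSON array in {path}, got {type(data).__name__}")
    return data


if __name__ == "__main__":
    # Placeholder record with empty values, used only to exercise the check.
    placeholder = {name: None for name in DOCUMENTED_FIELDS}
    print(missing_fields(placeholder))
```

Running `missing_fields` over every sample returned by `load_samples` is a quick way to confirm that a downloaded file matches the documented schema before using `context1` and `context2` as conflicting passages.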
