TL;DR: SePer is an accurate / fast / API-free metric to measure retrieval utility via information gain.
This repository contains the official implementation of the ICLR 2025 Spotlight paper:
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction
Authors: Lu Dai, Yijie Xu, Jinhui Ye, Hao Liu, Hui Xiong
- Paper (arXiv): https://arxiv.org/abs/2503.01478
- Paper (OpenReview): https://openreview.net/forum?id=ixMBnOhFGd
- Project page: https://sepermetric.github.io/
- Code: https://github.com/sepermetric/seper
SePer evaluates retrieval utility by measuring how much retrieved evidence reduces a model's semantic perplexity about the answer. This yields a finer-grained utility signal than relying on ranking metrics or downstream answer quality alone.
Below is an illustration of SePer's fine-grained evaluation ability:
SePer is especially useful when:
- Two retrievers have similar ranking metrics, but downstream quality differs.
- You want to measure whether retrieved evidence truly reduces model uncertainty.
- You need a utility-centric signal to complement framework-level RAG evaluation.
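The core idea above can be sketched numerically. The toy code below is **not** the repository's API: the actual method aggregates over semantically clustered answers, whereas this simplified sketch treats utility as the drop in answer perplexity once retrieved context is added, using hypothetical per-token log-probabilities.

```python
import math

def perplexity(token_logprobs):
    """Perplexity = exp(-mean log-probability) over the answer tokens."""
    return math.exp(-sum(token_logprobs) / len(token_logprobs))

def seper_gain(logprobs_no_ctx, logprobs_with_ctx):
    """Retrieval utility as perplexity reduction: positive when the
    retrieved context makes the gold answer more likely."""
    return perplexity(logprobs_no_ctx) - perplexity(logprobs_with_ctx)

# Hypothetical per-token log-probs of the gold answer under an LLM.
no_ctx = [-2.3, -1.9, -2.1]    # answer is uncertain without evidence
with_ctx = [-0.4, -0.2, -0.3]  # evidence sharpens the distribution
gain = seper_gain(no_ctx, with_ctx)
```

A positive `gain` indicates the retrieved passage reduced the model's uncertainty; a near-zero or negative value flags retrieval that did not help, even if its ranking score looked good.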
```bash
conda create -n seper python=3.11
conda activate seper
pip install torch
pip install -r requirements.txt
```

A minimal walkthrough is provided in `example.ipynb`.
The retriever benchmark is available at: https://sepermetric.github.io/
If you find our work useful, please cite:
```bibtex
@inproceedings{dai2025seper,
  title={SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction},
  author={Dai, Lu and Xu, Yijie and Ye, Jinhui and Liu, Hao and Xiong, Hui},
  booktitle={International Conference on Learning Representations (ICLR)},
  year={2025},
  doi={10.48550/arXiv.2503.01478},
  url={https://openreview.net/forum?id=ixMBnOhFGd}
}
```

This repo includes machine-readable metadata for discoverability:
- `CITATION.cff` (GitHub citation support)
- `codemeta.json` (CodeMeta metadata)
- `llms.txt` and `docs/llms-full.txt` (LLM-oriented summaries)
