BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

This repository contains the dataset and source code for "BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models".

Installation

Set up the environment by first downloading this repository and then running:

pip install -r requirements.txt

Data

The datasets used for evaluation in the paper are available in the data/ directory:

Probabilistic estimation: common2sense_human_annotation.csv (for evaluation) and common2sense_human_annotation.json (the same data in the format of the decision-making datasets, to make inference easier).
Decision making: common2sense.json, plasma.json, and today.json. Each JSON dataset contains the following fields:
- scenario
- statement
- opposite_statement
- additional_sentence_label (indicates which statement each additional condition supports)
- In common2sense.json, the additional conditions are provided as added_information and oppo_added_information.
- In plasma.json and today.json, the additional conditions are listed under additional_sentences.
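Because the additional conditions are stored under different keys depending on the dataset, a small helper can read them uniformly. The following is a minimal sketch, not part of the repository code. It assumes each JSON file holds a list of record objects and that a condition field may contain either a single string or a list of strings; neither is confirmed here. The names load_records, as_list, and additional_conditions are hypothetical.

```python
import json
from pathlib import Path

# Keys under which additional conditions may appear, per the field list above.
CONDITION_KEYS = (
    "additional_sentences",    # plasma.json and today.json
    "added_information",       # common2sense.json
    "oppo_added_information",  # common2sense.json
)


def load_records(path):
    """Load a dataset file. Assumption: the file is a JSON list of objects."""
    with Path(path).open(encoding="utf-8") as f:
        return json.load(f)


def as_list(value):
    """Map None to [], pass lists and tuples through, wrap anything else."""
    if value is None:
        return []
    if isinstance(value, (list, tuple)):
        return list(value)
    return [value]


def additional_conditions(record):
    """Return (source_key, text) pairs for every condition in one record."""
    pairs = []
    for key in CONDITION_KEYS:
        for text in as_list(record.get(key)):
            pairs.append((key, text))
    return pairs
```

For example, calling additional_conditions on each record returned by load_records("data/plasma.json") yields pairs tagged with additional_sentences, while records from data/common2sense.json yield pairs tagged with added_information or oppo_added_information. Keeping the source key avoids guessing how the conditions line up with additional_sentence_label.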

Run

Configuration files for running the pipeline are in the scripts/ directory.

To run the entire BIRD pipeline:

bash scripts/run_bird.sh

To run the baselines:

bash scripts/baseline.sh

To run the evaluation:

bash scripts/eval.sh
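The three scripts can also be chained from Python so that a failing stage stops the run. This is a minimal sketch, not part of the repository. It assumes the scripts are launched from the repository root and that running BIRD, then the baselines, then the evaluation is a sensible order, which this README does not state. The names STAGES, build_command, and run_stages are hypothetical.

```python
import subprocess

# Script paths as listed in this README; the order is an assumption.
STAGES = {
    "bird": "scripts/run_bird.sh",
    "baseline": "scripts/baseline.sh",
    "eval": "scripts/eval.sh",
}


def build_command(stage):
    """Return the argument list used to launch one stage with bash."""
    return ["bash", STAGES[stage]]


def run_stages(stages=("bird", "baseline", "eval")):
    """Run each stage in order; check=True raises on a non-zero exit code."""
    for stage in stages:
        subprocess.run(build_command(stage), check=True)
```

Calling run_stages(("bird", "eval")) would skip the baselines; a subprocess.CalledProcessError indicates which script exited with an error.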

Citation and acknowledgement

If you find the project helpful, please cite:

@inproceedings{feng2025bird,
title={{BIRD}: A Trustworthy Bayesian Inference Framework for Large Language Models},
author={Yu Feng and Ben Zhou and Weidong Lin and Dan Roth},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
url={https://openreview.net/forum?id=fAAaT826Vv}
}
