Numeral-Aware Headline Generation

Overview

Large Language Models (LLMs) demonstrate strong text generation abilities but often struggle with numerical reasoning and numeral-aware text generation. This limitation is particularly evident in tasks like generating news headlines containing numerical values, where both semantic fidelity and numerical accuracy are required.

As part of the SemEval 2024 NumEval shared task, we investigate approaches for numeral-aware headline generation (English). Our work systematically evaluates zero-shot prompting, few-shot prompting, and fine-tuning methods, providing insights into the current capabilities and limitations of LLMs in handling numerically intensive generation tasks.

📄 Refer to the PDF for more details: Numeval_Report.pdf

Approaches

We explore multiple paradigms for numeral-aware headline generation:

Zero-Shot Prompting
- Applied to Llama 2–7B.
- Task-specific prompts guide the model to generate concise, number-aware headlines.
Few-Shot Prompting
- Uses in-context examples to guide headline generation.
- Two-shot prompting improves contextual and numerical accuracy.
Fine-Tuning
- T5-large and T5-3B are fine-tuned on the headline generation dataset.
- The "summarize:" prefix prompts the models to generate short, precise headlines.
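The three input formats above can be sketched as follows. This is a minimal illustration, not the prompts used in this repository: the instruction wording, the `News:`/`Headline:` labels, and all function names are assumptions. Only the "summarize:" prefix comes from the description above.

```python
# Hypothetical templates: the wording is an assumption, not the repo's actual prompts.
INSTRUCTION = (
    "Write a concise news headline for the article below. "
    "Keep every number in the headline consistent with the article."
)


def zero_shot_prompt(article: str) -> str:
    """Instruction plus the article, with no worked examples."""
    return f"{INSTRUCTION}\n\nNews: {article.strip()}\nHeadline:"


def few_shot_prompt(article: str, examples: list[tuple[str, str]]) -> str:
    """Prepend (article, headline) pairs as in-context examples."""
    shots = "\n\n".join(
        f"News: {news.strip()}\nHeadline: {headline.strip()}"
        for news, headline in examples
    )
    return f"{INSTRUCTION}\n\n{shots}\n\nNews: {article.strip()}\nHeadline:"


def t5_input(article: str) -> str:
    """Input format for the fine-tuned T5 models: the "summarize:" prefix."""
    return f"summarize: {article.strip()}"
```

Passing two (article, headline) pairs to `few_shot_prompt` gives the two-shot setting. The prompt ends with an open `Headline:` label so that the model's continuation is the headline itself.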

We also experiment with numerical reasoning tasks using XLM-R and masked fine-tuning setups.

Code Usage

Install Dependencies

pip install -r requirements.txt

Model Running

T5

Use run_t5.py to run the T5-large model. Adjust the data and model save paths before running.

Similarly, use run_t5_3b.py to run the T5-3B model.

Llama 2 - 7B

Use zero_shot_llama2.py to evaluate Llama 2 in the zero-shot and few-shot settings.

Adjust the data and prediction save paths before running.

Evaluating the Predictions

Use numhg_eval.py to evaluate the predictions:

python numhg_eval.py --tgt_path "path_to_labels.txt" --pre_path "path_to_predictions.txt" --num_gt_path "path_to_numerical_gt.txt"
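The `--num_gt_path` argument suggests that predicted headlines are checked against ground-truth numerals. The sketch below shows one simple way such a check could work. It is not the implementation in numhg_eval.py: the regular expression, the function names, and the "headline contains the gold numeral" criterion are all assumptions made for illustration.

```python
import re

# Matches integers and decimals, with optional thousands separators (e.g. 1,200 or 3.5).
NUMBER_RE = re.compile(r"\d+(?:,\d{3})*(?:\.\d+)?")


def extract_numbers(text: str) -> list[str]:
    """Return the numerals found in text, with thousands separators removed."""
    return [m.replace(",", "") for m in NUMBER_RE.findall(text)]


def numeral_accuracy(predictions: list[str], gold_numbers: list[str]) -> float:
    """Fraction of predicted headlines that contain their ground-truth numeral."""
    if len(predictions) != len(gold_numbers):
        raise ValueError("predictions and gold_numbers must have the same length")
    if not predictions:
        return 0.0
    hits = sum(
        gold.replace(",", "") in extract_numbers(pred)
        for pred, gold in zip(predictions, gold_numbers)
    )
    return hits / len(predictions)
```

Numerals are compared as strings here, so "5" and "5.0" would not match. A stricter or more lenient comparison may be appropriate depending on how the ground-truth file is formatted.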

Notebooks

numerical_generation_mlm_fine_tune.ipynb

This notebook implements the proposed masked fine-tuning approach for numerical value generation.
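Masked fine-tuning can be pictured as hiding the numeral in a headline and training the model to recover it. The sketch below builds one such (masked text, target) pair. The function name and regular expression are hypothetical, and the notebook's actual preprocessing may differ. `<mask>` is the mask token used by the RoBERTa and XLM-R tokenizers.

```python
import re

# Matches integers and decimals, with optional thousands separators.
NUMBER_RE = re.compile(r"\d+(?:,\d{3})*(?:\.\d+)?")


def mask_first_number(headline: str, mask_token: str = "<mask>"):
    """Replace the first numeral with mask_token.

    Returns (masked_text, target_numeral), or None if the headline has no numeral.
    """
    match = NUMBER_RE.search(headline)
    if match is None:
        return None  # nothing to mask in this headline
    masked = headline[: match.start()] + mask_token + headline[match.end():]
    return masked, match.group()
```

A numeral may be split into several subword tokens by the tokenizer, so a real setup may need more than one mask token per numeral.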

numerical_generation_zero_shot.ipynb

This notebook applies XLM-RoBERTa in a zero-shot setting for numerical generation.
Note: The notebooks are independent of each other. Update the data directories accordingly.

📊 Results

Fine-tuning (T5-3B) achieved the best performance among the models we evaluated, coming close to but not surpassing the BRIO baseline in headline generation (42.90 vs. 44.12 ROUGE-L).
Zero-shot and few-shot Llama 2 produced reasonable headlines but lagged behind fine-tuned models.
Numerical reasoning remains a challenge, with RoBERTa and XLM-R showing promise in masked fine-tuning.

Model	Headline Generation (ROUGE-L)	Numerical Reasoning (Accuracy)
BRIO (baseline)	44.12	66.56
Llama 2–7B (zero-shot)	30.63	40.13
Llama 2–7B (few-shot)	32.78	41.08
T5-large	41.64	62.18
T5-3B	42.90	63.65
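For reference, ROUGE-L scores a prediction by the longest common subsequence (LCS) it shares with the reference headline. The sketch below computes the F1 form on lowercased whitespace tokens. It is a simplified illustration (no stemming or special tokenization) and is not necessarily how the scores in the table were computed.

```python
def lcs_length(a: list[str], b: list[str]) -> int:
    """Length of the longest common subsequence, via dynamic programming."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            # Extend the diagonal on a match, otherwise carry the best neighbour.
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(prediction: str, reference: str) -> float:
    """ROUGE-L F1 between two strings, in the range [0, 1]."""
    pred, ref = prediction.lower().split(), reference.lower().split()
    lcs = lcs_length(pred, ref)
    if lcs == 0:
        return 0.0
    precision, recall = lcs / len(pred), lcs / len(ref)
    return 2 * precision * recall / (precision + recall)
```

For example, the prediction "police kill 3 gunmen" against the reference "police kill 3" has an LCS of 3 tokens, giving precision 3/4, recall 1, and F1 of 6/7.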

abhilash-neog/LLMs-for-Numerical-Reasoning
