Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer

[TMLR 2025]

Duke University

Overview

Official implementation of the paper "Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer".

Installation

Create the environment with the following steps:

conda create --name lpdt python=3.8.5
conda activate lpdt
pip install -r requirements.txt
./install_envs.sh

Experiments

First, download the dataset and place it in ./dataset.
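Before launching a run, it can help to confirm that the data is where the commands expect it. The following is a minimal sketch, not part of the repository: the helper name check_dataset is hypothetical, and only the ./dataset path comes from this README. It makes no assumption about the file names inside the directory.

```shell
# Hypothetical pre-flight check; only the ./dataset path comes from the README.
check_dataset() {
    dir="${1:-./dataset}"
    # The directory must exist.
    if [ ! -d "$dir" ]; then
        echo "missing: $dir"
        return 1
    fi
    # The directory must contain at least one entry.
    if [ -z "$(ls -A "$dir")" ]; then
        echo "empty: $dir"
        return 1
    fi
    echo "ok: $dir"
}
```

Calling check_dataset with no argument checks ./dataset relative to the current directory, so run it from the repository root.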

Fine-tune the pre-trained language model with classifier regularization, using the Decision Transformer (--model_type dt):

python experiment.py \
    --env ant_dir \
    --dataset_mode expert \
    --test_dataset_mode expert \
    --seed 0 \
    --K 20 \
    -lr 1e-4 \
    -lmlr 1e-5 \
    --warmup_steps 10000 \
    --pretrained_lm gpt2 \
    --model_type dt \
    --adapt_mode \
    --adapt_embed \
    --lora \
    --mlp_embedding \
    --outdir test/ \
    --dropout 0.1 \
    --description "test_ratio_1.0" \
    --batch_size 6 \
    -w \
    --load_path "" \
    --ratio 1.0 \
    --classifier \
    --classifier_lambda 0.1 \
    --num_class 50

Fine-tune the pre-trained language model with classifier regularization, using the Reinformer (--model_type reinformer):

python experiment.py \
    --env ant_dir \
    --dataset_mode expert \
    --test_dataset_mode expert \
    --seed 0 \
    --K 20 \
    -lr 1e-4 \
    -lmlr 1e-5 \
    --warmup_steps 10000 \
    --pretrained_lm gpt2 \
    --model_type reinformer \
    --adapt_mode \
    --adapt_embed \
    --lora \
    --mlp_embedding \
    --outdir test/ \
    --dropout 0.1 \
    --description "test_ratio_1.0" \
    --batch_size 6 \
    -w \
    --load_path "" \
    --ratio 1.0 \
    --classifier \
    --classifier_lambda 0.1 \
    --num_class 50
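The two commands above differ only in the value passed to --model_type. As a rough sketch, the hypothetical helper below (lpdt_cmd is not part of the repository) prints the full command for a given model type and seed without executing anything. Every flag is copied from the commands in this README; the seed values in the loop are illustrative only.

```shell
# Hypothetical helper: print the fine-tuning command for a given model type
# and seed. All flags are copied from the README commands; nothing is executed.
lpdt_cmd() {
    model_type="$1"
    seed="${2:-0}"
    printf '%s ' python experiment.py \
        --env ant_dir --dataset_mode expert --test_dataset_mode expert \
        --seed "$seed" --K 20 -lr 1e-4 -lmlr 1e-5 --warmup_steps 10000 \
        --pretrained_lm gpt2 --model_type "$model_type" \
        --adapt_mode --adapt_embed --lora --mlp_embedding \
        --outdir test/ --dropout 0.1 --description test_ratio_1.0 \
        --batch_size 6 -w --load_path '""' --ratio 1.0 \
        --classifier --classifier_lambda 0.1 --num_class 50
    printf '\n'
}

# Preview the commands for both model types across a few illustrative seeds.
for m in dt reinformer; do
    for s in 0 1 2; do
        lpdt_cmd "$m" "$s"
    done
done
```

Each printed line can be reviewed and then pasted into the shell to start the corresponding run.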

Citation

@article{yang2025pretrained,
  title={Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer},
  author={Yu Yang and Pan Xu},
  journal={Transactions on Machine Learning Research},
  issn={2835-8856},
  year={2025},
  url={https://openreview.net/forum?id=k520i3XEMK},
  note={}
}
