# PonderTTT

Paper | Project Page | [Setup](#setup) | [Usage](#usage) | [Citation](#citation)
PonderTTT applies selective TTT updates based on input difficulty, using the reconstruction loss as a training-free gating signal. A single scalar threshold, calibrated on unlabeled data and adapted during inference, governs the update frequency. On GPT-2 models from 124M to 1.5B parameters, the method achieves 82–89% Oracle Recovery while remaining fully training-free.
| Model | SKIP | Oracle | Ours | Recovery |
|---|---|---|---|---|
| Small (124M) | 2.324 | 1.935 | 1.977 | 89.2% |
| Medium (355M) | 1.909 | 1.653 | 1.697 | 82.8% |
| Large (774M) | 2.005 | 1.580 | 1.656 | 82.1% |
| XL (1.5B) | 1.875 | 1.518 | 1.576 | 83.8% |
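Oracle Recovery is not spelled out in this excerpt, but the table (whose SKIP/Oracle/Ours columns appear to be evaluation losses, lower is better) is consistent with the natural reading: the fraction of the Oracle's loss reduction over always skipping that the gated method achieves.

$$
\text{Recovery} = \frac{\text{SKIP} - \text{Ours}}{\text{SKIP} - \text{Oracle}},
\qquad \text{e.g. Small: } \frac{2.324 - 1.977}{2.324 - 1.935} \approx 89.2\%.
$$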
## Setup

This codebase is implemented in JAX and has been tested on both GPUs and Cloud TPU VMs.

```bash
# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh
# Install the project
uv pip install -e . # CPU
uv pip install -e . --group gpu # CUDA 13
uv pip install -e . --group tpu  # TPU
```

## Usage

A forward pass with `use_ttt=True` exposes the TTT reconstruction loss, which gates the update decision:

```python
output = model(input_ids, use_ttt=True)
recon_loss = output["ttt_stats"]["ttt_loss_step_0"]
if recon_loss > threshold:
    # UPDATE: re-forward with updated weights
    pass
else:
    # SKIP: use current weights
    pass
```
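The snippet above assumes a calibrated `threshold`. The exact calibration and adaptation procedure is not shown in this README, so the following is a minimal, self-contained sketch under assumed mechanics: a quantile rule on reconstruction losses from unlabeled calibration data, plus a simple online control update at inference time. `calibrate_threshold` and `adapt_threshold` are illustrative names, not the repo's API.

```python
import numpy as np

def calibrate_threshold(recon_losses, update_rate=0.5):
    """Set the threshold to the (1 - update_rate)-quantile of reconstruction
    losses on unlabeled data, so roughly `update_rate` of inputs exceed it
    and trigger an UPDATE (assumed quantile rule)."""
    return float(np.quantile(np.asarray(recon_losses), 1.0 - update_rate))

def adapt_threshold(threshold, did_update, update_rate=0.5, step=1e-2):
    """Online adaptation: nudge the threshold so the realized UPDATE
    frequency tracks the target rate. Updating too often raises the
    threshold; skipping lowers it (assumed control rule)."""
    return threshold + step * (float(did_update) - update_rate)

# Demo with synthetic reconstruction losses standing in for model outputs.
rng = np.random.default_rng(0)
threshold = calibrate_threshold(rng.normal(2.0, 0.3, size=512))
for recon_loss in rng.normal(2.0, 0.3, size=100):
    did_update = recon_loss > threshold   # UPDATE vs. SKIP decision
    threshold = adapt_threshold(threshold, did_update)
```

Under these assumptions, the quantile rule ties the threshold directly to a compute budget (the target fraction of UPDATE steps), which is one plausible way to realize calibration on unlabeled data.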
Run the experiment scripts:

```bash
./scripts/run_all_experiments.sh          # All models
./scripts/run_all_experiments.sh --small  # Small (124M)
./scripts/run_all_experiments.sh --xl     # XL (1.5B)
```

## Citation

```bibtex
@article{sim2025ponderttt,
title={When to Ponder: Adaptive Compute Allocation for Code Generation via Test-Time Training},
author={Sim, Gihyeon},
journal={arXiv preprint arXiv:2601.00894},
year={2025}
}
```

## License

This project is licensed under the MIT License.