
Refactor ATOM for top-k top-p sampling support#227

Open
aryaman-gupta wants to merge 6 commits into main from aryaman/topk-topp-support

Conversation


@aryaman-gupta aryaman-gupta commented Feb 20, 2026

Summary

Adds top-k and top-p sampling support to ATOM, complementing existing temperature-based sampling.

Changes

| File | Changes |
| --- | --- |
| `sampling_params.py` | Added `top_k: int = -1` and `top_p: float = 1.0` fields with validation |
| `sequence.py` | Store `top_k` and `top_p` from sampling params |
| `scheduler.py` | Added `top_ks` and `top_ps` lists to `ScheduledBatch` |
| `model_runner.py` | Added GPU buffers; updated `prepare_sample` with a CPU-side uniformity optimization |
| `sampler.py` | Added top-k/top-p filtering with aiter integration. Default temperature-based Gumbel-Max path when filtering is disabled (no overhead). Native PyTorch fallback when `aiter.ops.sampling` is unavailable (marked experimental). |
| `openai_server.py` | Wired up `top_k` and `top_p` parameters |
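For readers unfamiliar with the filtering semantics, here is a minimal NumPy sketch of combined top-k/top-p (nucleus) masking. The function name and details are illustrative only, not the PR's actual `sampler.py` code, which uses aiter/PyTorch on GPU:

```python
import numpy as np

def top_k_top_p_filter(logits, top_k=-1, top_p=1.0):
    """Mask logits outside the top-k / top-p set with -inf.

    top_k = -1 and top_p = 1.0 disable the respective filter,
    matching the defaults described in sampling_params.py.
    """
    logits = np.asarray(logits, dtype=np.float64).copy()
    if top_k > 0:
        # Keep only the k largest logits.
        kth_largest = np.sort(logits)[-top_k]
        logits[logits < kth_largest] = -np.inf
    if top_p < 1.0:
        # Keep the smallest set of tokens whose cumulative
        # probability reaches top_p (always at least one token).
        order = np.argsort(logits)[::-1]
        probs = np.exp(logits[order] - np.max(logits))
        probs /= probs.sum()
        cutoff = np.searchsorted(np.cumsum(probs), top_p) + 1
        logits[order[cutoff:]] = -np.inf
    return logits
```

Masked positions get probability zero after softmax, so sampling proceeds only over the surviving tokens.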

Basic Testing

Run a model:

python3 -m atom.entrypoints.openai_server --model <model_path> <additional_params> --host 0.0.0.0 --port 8000 

Query with top-k and top-p parameters:

import requests

response = requests.post(
    "http://localhost:8000/v1/completions",
    json={
        "model": "/it-share/gpt-oss-120b/",
        "prompt": "The capital city of France is",
        "max_tokens": 32,
        "temperature": 0.8,
        "top_k": 10,
        "top_p": 0.8,
        "stream": True,
    },
    stream=True,
)
for line in response.iter_lines():
    print(line)
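As background on the default path mentioned above: Gumbel-Max sampling works because `argmax(logits / T + g)`, with `g` drawn i.i.d. from Gumbel(0, 1), is an exact categorical sample from `softmax(logits / T)`, so no explicit softmax or CDF inversion is needed. A minimal NumPy sketch (illustrative only, not the PR's implementation):

```python
import numpy as np

rng = np.random.default_rng(0)

def gumbel_max_sample(logits, temperature=1.0):
    """Sample a token index from softmax(logits / temperature)
    via the Gumbel-Max trick."""
    scaled = np.asarray(logits, dtype=np.float64) / temperature
    # Gumbel(0, 1) noise from inverse-CDF of uniform samples.
    gumbel = -np.log(-np.log(rng.uniform(size=scaled.shape)))
    return int(np.argmax(scaled + gumbel))
```

When top-k/top-p filtering is disabled, this path adds no masking work, which is the "no overhead" property noted in the change summary.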

@aryaman-gupta aryaman-gupta marked this pull request as ready for review February 26, 2026 22:16