Skip to content

Commit

Permalink
test_populate_grads for nanogpt - use smaller model for avoiding OOMs…
Browse files Browse the repository at this point in the history
… in CIs
  • Loading branch information
nikitaved committed Apr 3, 2024
1 parent c686597 commit d618bac
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion thunder/tests/test_grad.py
Original file line number Diff line number Diff line change
Expand Up @@ -1395,7 +1395,7 @@ def test_populate_grads_nanogpt(executor, device, dtype):
from thunder.benchmarks import NanoGPTBenchmark, NanoGPTConfig

# NOTE Currently setting dropout to zero for reproducibility, other settings taken from gpt2 config
config = NanoGPTConfig(dropout=0, n_layer=12, n_head=12, n_embd=768)
config = NanoGPTConfig(dropout=0, n_layer=12, n_head=12, n_embd=384)

bench = NanoGPTBenchmark(config=config, requires_grad=True, device=device, dtype=dtype)
model = bench.fn()
Expand Down

0 comments on commit d618bac

Please sign in to comment.