Actions: deepspeedai/DeepSpeed

nv-lightning-v100

4,701 workflow runs

nv-lightning-v100
nv-lightning-v100 #14429: Scheduled
February 20, 2025 00:21 Queued master
Training multiple models
nv-lightning-v100 #14428: Pull request #7018 synchronize by loadams
February 19, 2025 23:37 19m 58s olruwase/zero_multi_models
nv-lightning-v100
nv-lightning-v100 #14427: Merge group checks requested
February 19, 2025 22:15 24m 38s
nv-lightning-v100
nv-lightning-v100 #14426: Merge group checks requested
February 19, 2025 21:39 48m 24s
Rename aio_thread_count to intra_op_parallelism
nv-lightning-v100 #14425: Pull request #7056 opened by tjruwase
February 19, 2025 19:09 57m 17s olruwase/aio_thread_count_rename
add autoTP training zero2 tests
nv-lightning-v100 #14424: Pull request #7049 synchronize by tjruwase
February 19, 2025 18:50 1h 12m 49s inkcherry:minor_fix_version2
Fix, bf16 optimizer remove dup loop
nv-lightning-v100 #14423: Pull request #7054 synchronize by tjruwase
February 19, 2025 18:44 55m 18s wukong1992:fix-bf16-moe-refresh-params
Add pyproject.toml with legacy build backend to keep most logic in setup.py
nv-lightning-v100 #14422: Pull request #7033 synchronize by loadams
February 19, 2025 18:17 24m 32s loadams/pyproject-toml
Add pyproject.toml with legacy build backend to keep most logic in setup.py
nv-lightning-v100 #14421: Pull request #7033 synchronize by loadams
February 19, 2025 18:05 10m 55s loadams/pyproject-toml
Add pyproject.toml with legacy build backend to keep most logic in setup.py
nv-lightning-v100 #14420: Pull request #7033 synchronize by loadams
February 19, 2025 18:04 1m 11s loadams/pyproject-toml
Add pyproject.toml with legacy build backend to keep most logic in setup.py
nv-lightning-v100 #14419: Pull request #7033 synchronize by loadams
February 19, 2025 17:56 6m 55s loadams/pyproject-toml
Enable ZeRO set/get APIs for NVMe offload
nv-lightning-v100 #14418: Pull request #7046 synchronize by loadams
February 19, 2025 17:47 4m 27s olruwase/update_nvme_offload_states
Bug Fix for offload_states API
nv-lightning-v100 #14417: Pull request #7050 synchronize by U-rara
February 19, 2025 17:37 Action required U-rara:bugfix_reload_states
nv-lightning-v100
nv-lightning-v100 #14416: Merge group checks requested
February 19, 2025 15:48 11m 22s
Variable batch size and LR scheduler
nv-lightning-v100 #14415: Pull request #7020 synchronize by bm-synth
February 19, 2025 15:44 Action required bm-synth:variable_batch_size_and_lr
Fix, pipeline model with moe cause error when send grad
nv-lightning-v100 #14414: Pull request #7055 opened by wukong1992
February 19, 2025 11:53 Action required wukong1992:fix-pipe-act-grad-comm
Fix, bf16 optimizer remove dup loop
nv-lightning-v100 #14413: Pull request #7054 opened by wukong1992
February 19, 2025 10:37 26s wukong1992:fix-bf16-moe-refresh-params
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-lightning-v100 #14412: Pull request #6553 synchronize by delock
February 19, 2025 07:27 Action required gyou2021:configurable_autoTP
Add DeepseekV3 AutoTP.
nv-lightning-v100 #14411: Pull request #7045 synchronize by Yejing-Lai
February 19, 2025 02:05 Action required Yejing-Lai:lyj/deepseekv3
nv-ds-chat breaks with latest transformers
nv-lightning-v100 #14410: Pull request #7052 opened by loadams
February 19, 2025 01:01 1h 52m 29s loadams/transformers-ds-chat
Enable python 3.11 and 3.12 tests
nv-lightning-v100 #14409: Pull request #7007 synchronize by loadams
February 19, 2025 00:54 1h 54m 46s loadams/reenable-py311-312
Add pyproject.toml with legacy build backend to keep most logic in setup.py
nv-lightning-v100 #14408: Pull request #7033 synchronize by loadams
February 19, 2025 00:52 1h 5m 15s loadams/pyproject-toml
Enable ZeRO set/get APIs for NVMe offload
nv-lightning-v100 #14407: Pull request #7046 synchronize by loadams
February 19, 2025 00:52 32m 39s olruwase/update_nvme_offload_states
nv-lightning-v100
nv-lightning-v100 #14406: Scheduled
February 19, 2025 00:21 12m 8s master
Update setup.py handling of ROCm cupy
nv-lightning-v100 #14405: Pull request #7051 synchronize by loadams
February 18, 2025 22:21 4m 50s loadams/rocm-cupy-5