0.17.15
Changelog
- 9b74e54 chore: bump version: 0.17.15-rc4 -> 0.17.15
- 931983a chore: bump version: 0.17.15-rc3 -> 0.17.15-rc4
- 3bd2b4a fix: handle infinite metrics in searcher snapshots [DET-7122] (#3999)
- 153221e chore: bump version: 0.17.15-rc2 -> 0.17.15-rc3
- 1e9f5c4 chore: handle rank id in log entries (#3995)
- d12c892 fix: handle infinite validation metric values in more cases (#3992)
- 59b180d docs: add release notes for 0.17.15 (#3986)
- 9e3e541 chore: bump version: 0.17.15-rc1 -> 0.17.15-rc2
- 1c97bc7 chore: apply filtering to task logs (#3963)
- 8dd01c8 fix: task log level parsing (#3973)
- 9c24121 docs: add release notes for PR #3914 (#3962)
- 507cfe7 fix: update HPE logo sizes (#3953)
- e1a3de6 chore: bump version: 0.17.15-rc0 -> 0.17.15-rc1
- dc3ef47 chore: bump version: 0.17.15-dev0 -> 0.17.15-rc0
- 6c38aad chore: lock api state for backward compatibility check
- f206919 fix: take out job summary caching [DET-6695] (#3849)
- c9941ac chore: add missing icons to mobile navbar [DET-7009]
- 927ef94 fix: parse different time format in compare stats script [DET-7039] (#3909)
- 55fcf65 fix: checkpoint gc job should close (#3943)
- b8f1073 feat: track task stats [DET-6872, DET-6926, DET-6927] (#3852)
- b1a470d chore: add key attribute to avatar ActionCard action (#3939)
- 7c7375c chore: give det job a default command to run (#3934)
- 9028fe0 chore: mask registry auth password in harness [DET-6279] (#3867)
- 7d0234b chore: add filtering by userIds to API endpoints (DET-7019) (#3898)
- 180a79f chore: mask registry creds in webui [DET-7013] (#3881)
- 2d44564 replace username with userId for user API (#3914)
- b6b2c71 test: add interaction tests for spinner [DET-6665] (#3826)
- d86d09b chore: prevent InteractiveTable scroll from moving pagination or other controls [DET-7037] (#3923)
- c9a2458 fix: crash on upgrade to InteractiveTable [DET-7036] (#3922)
- 4c0ef95 hide archived experiments unless using --all (#3918)
- 586bba4 docs: tweak docs for socket activation (#3926)
- 64ae7ca perf: add infiniband-related libraries to environment (#3832)
- bcc954a chore: disallow getting metadata from dummy checkpoint context (#3920)
- 575cc21 chore: update dep requirements for react (#3901)
- e359843 chore: demote _get_last_validation to internal (#3919)
- 5c93cb6 chore: log checkpoint uuids in core, not in wlsq (#3924)
- 7086df4 chore: add missing docstrings in core api (#3911)
- ea0db53 feat: add slurm rendezvous (#3777)
- d5e793b chore: make store_path auto-create the directory (#3916)
- 1f3c4b2 feat: add core.DownloadMode (#3910)
- c334a53 docs: fix install cli typo (#3917)
- 069eb6f fix: run scheduling on agent connection/enable events, reconnectBacklog replay. (#3906)
- b75afe6 chore: bump version: 0.17.15-dev0-dev0 -> 0.17.15-dev0
- d06dc94 refactor: remove dependency of settings in the updateSettings call (#3894)
- 1b4c8e9 chore: remove pr preview cluster address [DET-7040] (#3907)
- e5723c7 docs: Release notes for 0.17.14 (#3912)
- 43b9a7c chore: bump version: 0.17.14 -> 0.17.15-dev0
- 5c45544 fix: RM crashes when setting cmd priority (#3908)
- 0c52367 chore: fix deepspeed nightly tests (#3897)
- 3e6267f chore: update StorageManager and extend CheckpointContext (#3829)
- f76cc45 docs: fix description of scheduling_unit behavior (#3890)
- 1815ee3 chore: sweeping rename of Core API components (#3896)
- 0b7141b feat: deepspeed DCGAN example (#3758)
- 015a8e0 feat: use enums instead of chief_only bool in Core API (#3888)
- 129d841 fix: use otel only if enabled (#3893)
- 0ab3288 hide sizeChanger on RoutePagination (#3892)
- ae50d3c fix: match reported rp name for k8 across endpoints [DET-7006] (#3870)
- c49561c docs: various fixes for master configuration and k8s docs (#3889)
- c716759 chore: bump version: 0.17.13-dev0 -> 0.17.14-dev0
- 2fdb2ef docs: add release notes for 0.17.13 (#3879)
- a349622 chore: Clean up tests with fewer Optional types and asserts (#3872)
- b0e0c96 feat: make core.Searcher multiworker-safe (#3871)
- 50c9f66 feat: show agent version in /agents and det agent list [DET-6847] (#3873)
- 954cf97 fix: avoid double-timestamps in logs (#3876)
- b00124a feat: Drag to Reorder and Resize Experiment List columns [DET-6438] [DET-6809] (#3765)
- be58485 chore: use better NCCL SOCKET setting for gpt-neox (#3874)
- 65e1a97 chore: remove internal flag from det.launch.horovod --help (#3875)
- 9b1fa3b feat: Search models by name and description substring [DET-6939] (#3869)
- 78ba4a3 feat: add opentel to determined master [DET-6775] (#3851)
- 789f16d chore: cleanup stray changes (#3868)
Docker images
docker pull determinedai/determined-master:0.17.15
docker pull determinedai/determined-master:9b74e5444
docker pull determinedai/determined-master:9b74e54448d64009ce574e1a68b52149c1d00fe7
docker pull determinedai/determined-dev:determined-master-9b74e5444
docker pull determinedai/determined-dev:determined-master-9b74e54448d64009ce574e1a68b52149c1d00fe7
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.17.15
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:9b74e5444
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:9b74e54448d64009ce574e1a68b52149c1d00fe7