Releases: determined-ai/determined
Releases · determined-ai/determined
0.23.3
Release Notes
Changelog
- f8a9d45 chore: bump version: 0.23.3-rc2 -> 0.23.3
- e713e3b docs: add release notes for 0.23.3 (#7413)
- e3fbe53 chore: add release notes for notebook tls (#7408)
- f102fef fix: refactor checkpoint modal so it closes correctly [WEB-1441] (#7398)
- 1f502eb fix: install sigusr1 on main thread only. (#7350)
- c2ed933 chore: bump version: 0.23.3-rc1 -> 0.23.3-rc2
- 7b3f913 fix: memoize settings in model detail and experiment detail pages (#7390)
- 289c2a8 fix: rbac filter on columns API (#7386)
- af821f4 fix: make
checkpoint_count
explicit (#7374) - bc8147c chore: bump version: 0.23.3-rc0 -> 0.23.3-rc1
- 7d73601 fix: use DEFAULT_COLUMNS in GroupManagement table (#7366)
- 6bbeb9c fix: Fix issue in user settings store (#7363)
- aa25670 fix: reverting not-found-errs change in python user-groups (#7361)
- 4749d06 chore: bump version: 0.23.3-dev0 -> 0.23.3-rc0
- 275da69 chore: lock published urls to preserve redirects
- 4965100 ci: try rebasing main before checking PR (INFENG-178) (#7357)
- 54f5bcf fix: Cleared date field removed from filters [WEB-1408] (#7359)
- 0266d5d fix: SDK methods don't update local on failure (#7270)
- 2f9d924 feat: add tls to notebook connections [DET-9002] (#6735)
- 6afe3e9 docs: Batch Processing API doc (#7273)
- 240e41d ci: tweak PR title matching in release tracking (#7251)
- e0bec3f ci: track unreleased cherry-picked PRs during release (#7324)
- 87fab84 chore: correct order of err check for master config (#7351)
- 881a6e8 chore: Remove/archive trial collection code [WEB-938] [WEB-1289] (#7297)
- 34bdc97 chore: adjusted cli to not nest master config under config (#7344)
- badb925 feat: Add the InlineForm component (#7199)
- 0a93bb2 refactor: remove unused
Trial.JobID
property. (#7314) - ecf27f1 feat: create bindings tab in resource pool detail page (#7339)
- b19c295 feat: added rbac to grpc master config (#7303)
- ad5ddc7 chore: removed echo-backed /config and updated cli [DET-9580] (#7307)
- cede2e9 docs: fix version switcher (#7327)
- acec97e docs/Edit homepage title (#7342)
- 67c0977 fix: ensure design kit landing page is typechecked (#7336)
- 211d8b7 chore: fix ray proxy pydantic testing issue (#7340)
- 1e6377d chore: fix flaky TestReceiveContainerLog (#7338)
- bf1a9e7 chore: Add proper Json type to the code (#7335)
- af59d02 docs: correct release date (#7333)
- 9ff4ef7 chore: temporarily disable python-coverage on test-gpu (#7331)
- 49bf9a4 fix: Download to existing directory from shared_fs source [MLG-672] (#7328)
- d614f6b fix: harmonize python & golang GET NotFound errs (#7252)
- 070865a fix: Fix type mismatch on design page (#7334)
- 497b72e chore: Add new UserSettings store [WEB-1353] (#7322)
- 0991540 chore: fix golangci-lint pre-commit (#7330)
- 6f6f568 chore: golangci check the package instead (#7226)
- b518504 fix: Table header context menu enabled [WEB-1382] (#7321)
- 13d2a0b feat: manage resource pool bindings (#7300)
- 6bcf997 fix: navbarCollapsed shortcut key (#7323)
- ccb3745 feat: launch.deepspeed passes (almost) all envvars (#7295)
- ff3cc84 chore: validate ROCM support (#7313)
- a234895 chore: Remove TimeSeries log scale, SummarizeTrial [WEB-1406] (#7278)
- f9ec91f chore: prepare for strict rbac jq control (#7305)
- 3d74226 chore: bump version: 0.23.2-dev0 -> 0.23.3-dev0
- f195adb docs: add release notes for 0.23.2 (#7285)
- 65d5918 fix: set user in model CLI client [DET-9612] (#7220)
- ecf7d36 test: pin pydantic in e2e_tests (#7302)
- 65bcf19 docs: Rename image assets (#7283)
- 9f441a4 feat: list resource pool binding (#7286)
- f659c71 fix: Link omnibar shortcut to new Space convention [WEB-1418] (#7293)
- 2f2864a chore: update pillow version for test requirements (#7291)
- 2329801 chore: GCP operation tracker refactor (#7262)
- 4e512fc docs: Provide additional Slurm configuration guidance [FE-88] (#7288)
- 6bbe184 feat: keyboard shortcut for JupyterLabModal (#7184)
- 4cd31ab chore: use settings value for omnibar shortcut (#7198)
- 4d7c849 docs: Make correction on workspaces page (#7254)
- a782ee5 chore: remove admin flag check in delete_model & delete_model_version [DET-9596] (#7175)
- 5dcfe20 fix: ManageGroupsModal on UserManagement page (#7268)
- e1fdb26 feat: Add keyboard shortcut input to UI Kit [WEB-1362] (#7192)
- 7fbb363 feat: implement web resource pool binding api [WEB-1402] (#7272)
- 3323a4b test: Pin Pydantic (#7282)
- 5fa9d85 fix: adding credential check to gcp list clusters function (#7258)
- a22af67 fix: hp seach launch issue (#7271)
- 227ead4 fix: no data to plot is shown sometimes incorrectly [WEB-1413] (#7279)
- 6b7f7f2 fix: LogViewer request handling to avoid filter mismatch [WEB-1412] (#7277)
- 5bcc2b3 fix: summary metrics decode (#7253)
- c010993 chore: revert make_url changes (#7261)
- 29c0ec5 fix: experiment list comparison view (#7250)
- c72ec84 fix: Race condition loading parsed query settings into all settings [WEB-1376] (#7170)
- 761245b ci: handle cherry-picking EE PRs into release branch (#7266)
- a37ef89 feat: rbac for templates supporting changes (#7224)
- 00a7749 fix: refetch groups on settings change (#7263)
- e6502e2 fix: use user id for new explist user filter (#7248)
- 10c7def feat: Batch Inference (Processing) API (#6807)
- 49435a8 chore(actors): refactor allgather (#7195)
- 9fc8ae9 feat: add num of experiments in quick search modal (#7223)
- 74a8122 Docs: Spell out acronyms in a release note (#7247)
0.23.2
Release Notes
Changelog
- 70503d9 chore: bump version: 0.23.2-rc3 -> 0.23.2
- 5301a55 chore: bump version: 0.23.2-rc2 -> 0.23.2-rc3
- d90e364 docs: add release notes for 0.23.2 (#7285)
- 9913b5b chore: update pillow version for test requirements (#7291)
- fdcca52 test: pin pydantic in e2e_tests (#7302)
- 680c13b chore: bump version: 0.23.2-rc1 -> 0.23.2-rc2
- e447321 fix: ManageGroupsModal on UserManagement page (#7268)
- 8a3d4fd test: Pin Pydantic (#7282)
- 964e765 fix: adding credential check to gcp list clusters function (#7258)
- 4f2e489 fix: hp seach launch issue (#7271)
- e557208 fix: no data to plot is shown sometimes incorrectly [WEB-1413] (#7279)
- 2349c89 fix: LogViewer request handling to avoid filter mismatch [WEB-1412] (#7277)
- 48b8208 fix: summary metrics decode (#7253)
- 0739a28 chore: revert make_url changes (#7261)
- 8bfe93e fix: experiment list comparison view (#7250)
- fbadc8b chore: bump version: 0.23.2-rc0 -> 0.23.2-rc1
- da0464c fix: refetch groups on settings change (#7263)
- 1f9673d fix: use user id for new explist user filter (#7248)
- 7323e2b chore: bump version: 0.23.2-dev0 -> 0.23.2-rc0
- 67a1ea1 chore: lock published urls to preserve redirects
- 593b8ad chore: lock api state for backward compatibility check
- 6357915 fix: sort of
GetWorkspaceProjects
API (#7214) - 5a0beab fix: text overflow in code view (#7205)
- b035cca fix: Windows file permissions (#7215)
- 46d18e7 test: Add a unit test for direct downloading from S3. (#7174)
- 8167314 tara: Add the approved dtrain diagram (#7218)
- e1c141c chore: indirect job package imports to avoid cycle in EE (#7219)
- 870b678 fix: add missing searcher types (#7212)
- a6ea8bc chore: account for paths passed in for DET_MASTER (#7097)
- 092634f chore: add NewProxyHandler test to proxy_intg_test [DET-9555] (#7196)
- 8552a26 fix: replace dsat cli underscores with dashes (#7187)
- d5552ab feat: add trust_remote_code to hf_trainer_api examples (#7209)
- a830e51 chore(actors): remove AllocationRef's from tasklist.TaskList (#7208)
- 4dccb5a feat: add glide table filter field add shortcut (#7127)
- 53ad788 fix: pinned column resizing in compare view [WEB-1395] (#7193)
- 104d24f fix: revert #7100 (master returns rbacEnabled = false...) #7216
- cd5d0e6 fix: delete templates on workspace delete (#7204)
- 1d564a3 test: sort metrics by type in multiTrialSample (#7210)
- 33d43ed ci: notify via Slack on cherry-pick conflict during release (#7211)
- 4e8ae39 chore(go): add a generic queue as a drop-in actor inbox replacement (#6962)
- 76dbfec ci(circle/test-unit): split off gpu unit tests (#7207)
- fd48c32 feat: add new SVG icons to UI kit (#7142)
- a1d0776 fix: TLS enabled leads to zombie tasksd (#7197)
- cc8351f fix: Modify the Determined master dialer to honor the proxy environment variables [FE-69] (#7203)
- f623404 chore: request queue refactor (#7006)
- a12d20b chore: refactor job as global (#7178)
- f548a4e fix: update
hermes-parallel-coordinates
(#7190) - 3e76f63 fix: master returns rbacEnabled = false; change filter for rbac tests (#7100)
- 15fe41e docs: Improve the distributed training guide (#7038)
- 0dc8c0a ci: copy PR bodies into release party tracking issues (#7194)
- 1c7e008 feat: add selection menu to glide table (#6808)
- dabe00c chore: Helm file removed static imgs refs, added log config (#7032)
- 942a46f ci: torch.distributed parallel unit tests (#7156)
- 06cc665 fix: Make sure that docs can build properly again [DET-9607] (#7189)
- 0434fa2 chore: bump version: 0.23.1-dev0 -> 0.23.2-dev0
- 5d2621f docs: add release notes for 0.23.1 (#7186)
- c62b865 feat: added new list fuction, new delete subcommand and added the use of default gcs bucket if local tf state if not present in det deploy gcp (#7146)
- 50cafa7 chore: properly handle external logout (#7181)
- 35907a9 docs: Update readme with version switcher info (#7158)
- 61625dd chore(actors): refactor task idle timeout service (#7072)
- d47fadb chore(actors): replace BuildTaskSpec message (#7161)
- f3f7f82 chore: update license for web packages (#7185)
- 147014c ci(circleci): use custom GPU runners for harness GPU unit tests [INFENG-192] (#7120)
- 688d254 fix: Don't draw rows after end of data; compare panel width [WEB-1339] [WEB-1365] (#7180)
- 3d045b5 fix: set protocol scheme in kubernetes resource manager (#7165)
- 90fc15a chore: add go pre-commit checks (#7056)
- deb22ff chore: fix query parameter parsing in tls proxy (#7171)
- 64f4ea8 feat: more welcoming home page in doc (#7169)
- 5b2a256 fix: log viwer height (#7179)
- afe9b9c chore: support registry auth with Singularity (#7177)
- 0c5c1c3 feat: Create keyboard shortcut to toggle sidebar collapse (#7147)
- 0fe8609 fix: charts should show is loading instead of no data [WEB-1367] (#7167)
- c44a61e docs: Add version dropdown to sidebar (#7081)
- f816b45 feat: allow column pinning for glide table (#7093)
- 3870943 chore: default retry in sdk (#7063)
- cf6ee24 feat: Drawer component added to UI Kit [WEB-1349] (#7151)
- 58d0e17 fix: deepspeed autotune user guide clarification (#7140)
- 6b2e1bc chore: backport agent info permission definition. (#7143)
- a944f1a feat: trials comparison table in experiment comparison [WEB-993] (#7111)
- 77181f4 docs: Minor edits to master config reference (#7164)
- a4814ed fix: correct scale positional argument in compareTrials API call (#7168)
- e31f50e chore: sanitize metrictype [DET-9585] (#7155)
- 29dc6d8 ci: cherry-pick all appropriately labeled PRs, regardless of type (#7157)
- 041fb1a chore: deprecation warnings for
determined.common.experimental
imports (#7103) - ba99a69 fix: correct the fetch page index for paged view (#7153)
- e09ea0c chore: fix a metric name comment. (#7154)
- b5467a1 fix: Display chart if filters.batch is a number (#7152)
- ff39e50 fix: menu overlapping (#7148)
- 467e176 fix: use
MetricBadgeTag
for chart grid title (#7139) - 3c8a5a4 fix: restore experiments with no provider or capacity (#7113)
- 08fc285 fix: remember experiment list sorts (#7150)
- 2f32351 feat: feature switch can be turned off (#7144)
- a40cdc0 fix: mask checkpoint storage secrets in gc logs (#7135)
- 245ed0f docs: Make minor edits to username release note (#7149)
- d3aff5b docs: Fix typos in Intro to Determined (#7138)
- 260fd7a fix(api): complete trial to task log request mapping (#7022)
- 843b012 feat: add/update read apis for generic metrics (#7065)
- ee1e2b1 fix: show filter count while filter is open (#7145)
- afff53e chore: onboarding doc clarification (#7109)
- 6f98aac chore: sort generated binding parameters based on required status (#7137)
- 8592397 feat: Experiment-search API supports summary metrics from training [WEB-1302] (#7115)
- 1aff89c docs: Add TLS Certs Setup Guide (#7110)
0.23.1
Release Notes
Changelog
- 56e604f chore: bump version: 0.23.1-rc2 -> 0.23.1
- 02d04af docs: add release notes for 0.23.1 (#7186)
- 074daa7 chore: bump version: 0.23.1-rc1 -> 0.23.1-rc2
- f243677 fix: correct the fetch page index for paged view (#7153)
- cae371e fix: remember experiment list sorts (#7150)
- fbbab7b docs: Make minor edits to username release note (#7149)
- 56c5d55 chore: bump version: 0.23.1-rc0 -> 0.23.1-rc1
- fd87099 chore: bump version: 0.23.1-dev0 -> 0.23.1-rc0
- ca23f12 chore: lock published urls to preserve redirects
- d9598e0 chore: lock api state for backward compatibility check
- 609bbb6 feat: rp-workspace mapping proto (#7125)
- 588e640 chore: await_first_trial ensures first trial is returned (#7079)
- eb38615 ci: handle cherry-picking PRs for release (#7035)
- d7e66dc test: New test -- Model.get_versions reads paginated responses. (#7087)
- d99a4d4 chore: change import path (#7134)
- 6f7472c fix: spinner import error (#7132)
- e5ac7e2 fix: Glide Table comparison view no data error; loading saved filters issue (#7122)
- 6ed34e1 fix: allow creating a user with original username of a renamed user (#7121)
- b583b4e fix: persist Experiment List sorts (#7128)
- 95fdc45 chore: conditional debugging output (#7124)
- dfc7d10 chore: make printed py bindings more readable (#7018)
- 3c3112d refactor: remove shared web folder (#7112)
- 80504f1 chore: assume missing experiment project as deleted experiment (#7078)
- f888564 fix:
/experiments-search
endpoint best trial state (#7123) - f11bcfb chore: deprecate
LightningAdapter
. (#6989) - eb1868d chore: ensure golang gcp defaults for disk are valid [MLG-425] (#7116)
- 0f1c538 fix: Use correct theme colors for new table (#7114)
- f343651 docs: Fix a typo in a release note (#7107)
- 2f7098b chore: update gpt-neox example (#7106)
- b716404 fix: correct infinite scrolling behavior (#7096)
- 65efd7a fix: tr rendering bug (#7077)
- b607026 docs: provide user docs for manage-enroot-cache (#7108)
- 2121ed3 test: kill child experiments before workspace deletion (#7075)
- a43edc3 feat: Experiment List comparison view should show hyperparameters chart [WEB-991] (#7084)
- d477769 fix: unexpected icon movement in admin page (#7099)
- ed2223a feat: added rbac for agent endpoints [DET-9211] (#6991)
- d8b8f15 chore: add slot type to container client config (#7094)
- 160854d feat: show column sort [WEB-1219] (#7085)
- 97895ce feat: API for project metrics range [WEB-1000] (#7009)
- cc58653 docs: give prominence to Apptainer instead of Singularity (#7071)
- 29e6985 feat: New default glide table columns [WEB-1197] (#7073)
- 774a386 fix: scroll chaining in doc sidebar (#7062)
- ed329d6 fix: Chart regression in detecting single point cluster chart (#7091)
- 90bba17 fix: hide non-static columns when comparison view open (#7014)
- 56f95b1 feat: add row highlight on hover (#7055)
- c069e14 chore: fix allocations swagger generated urls (#7020)
- 0d33d57 fix: disable hightlight after clicking column menu (#7092)
- e249a58 chore: errata from onboarding doc (#7083)
- 0b4b3d0 fix:
det a
under kubernetes should reflect the output ofkubectl get nodes
again [DET-9450] (#6839) - 4acc527 chore: fix the lint script on package to resolve (#7080)
- 7dddd50 fix: det shell start fails on grenoble with 0.21.2 (reading HTTP_PROXY, but ignoring NO_PROXY) [DET-9364] (#7024)
- c5a339b feat: allow row height setting in glide table (#6952)
- 93095ee fix: prevent re-renders on glide table (#7076)
- e13b20a chore: capabilities options for singularity (#7074)
- 2d943b4 fix: Breadcrumb should not have extra margin (#7069)
- 11013bc fix: show experiment count and change
start time
unit (#7004) - 5a9fc61 fix: Bottom of code editor, find tool both visible (#7050)
- 4ce93aa feat: add summary metric support for generic metrics (#7012)
- d8ba9cf chore(actors): refactor task preemption (#7045)
- 7cd0694 fix: Button with icon styling (#7047)
- 728078b chore: bump version: 0.23.0-dev0 -> 0.23.1-dev0
- 82ab84e chore: bump version: 0.22.3-dev0 -> 0.23.0-dev0
- 6e23beb docs: add release notes for 0.23.0 (#7058)
- 0d21873 test: Add fixtures for mocking the REST API (#7064)
- 977b387 chore: move inline sql off to master/static/srv (#7061)
- e810990 test: new SDK test wait for experiments that are "PAUSED" (#6987)
- 7957064 chore: bind mount options for singularity (#7054)
- 3fdfcf8 fix: handleError is required in ChartGrid (#7057)
- 379b69e fix: missing prop (#7059)
- c7d6f0c chore: Remove pagination from get_versions. (#7030)
- 4da5441 chore: New chart component on LearningCurve and Profiler (#7052)
- c89a773 chore: support displaying limited jobs in cli and web (#7001)
- 7ba65b3 feat: Add Chart grid to experiment comparison view [WEB-990] (#7005)
- c4a73b1 chore: treat k8s like slurm in cluster info page (#7051)
- da66bca chore: actor refactor pod log (#6941)
- fa18f9b fix: Fixes to UserSettings loading state, experiment lists [WEB-1271] (#7026)
- be207c2 fix: add missing workspace_id column to FilterExperimentsQuery calls (#7049)
- 124794d chore: clean up message handling in main (#7040)
- 3ee0900 fix: change taskType from string to enum [DET-8847] (#6927)
- 18da973 chore: actor refactor proxy (#6944)
- c2c9eee fix: omit projects outside of users permissions [DET-9557] (#7046)
- d7bdaa4 chore: added release note for DET-9035 (#7042)
- 100bda7 fix: load config file into the code editor (#7033)
- 6db91ab chore: enrich agent-generated logs (#7029)
- 08c7210 chore(db): backfill tasks tables to always have entries for trials (#6711)
- ebcc63b fix: add 'check permissions' message to RBAC NotFound errors (#6937)
- f6793dd fix: fix k8s slots count growing for commands [DET-9550] (#7025)
- 2b3c3a8 chore: onboarding doc updates after test-drive (#7028)
- b7d2819 chore: Pass in the allocation ID to releasedResources() (#7034)
- 7e359cf feat: Add pagination to experiment list [WEB-995] (#6971)
- 5a1614d chore: Increase size of Algolia search modal (#7023)
- 9883a67 refactor: make UI kit self-contained (WEB-1243) (#6918)
- f4f9fda fix: ensure contiguous before gathering in stable diffusion example (#7021)
- f4bae0e chore: expose workspace for jobs (#6996)
- 52e095a feat: When modifying glide table filters, keep columns visible [WEB-1232] (#7010)
- e6413e1 chore(actors): replace tasklogger (#6979)
- fe22f81 fix: WorkspaceList tableOffset bug (#7017)
- b9e8b94 chore: upgrade to typescript v5 (#6977)
- ef85382 fix: In new breadcrumbs, uncategorized links to /projects/1 [WEB-1319] (#7013)
- 695df59 fix: fixing error that 'det w delete ' throws when the project(s) in it has no experiments (#6986)
- 67e40c6 ci: fix unit-test-react flake [WEB-1262] (#6995)
- 4a854a9 docs: Improve links to distributed training guide (#6983)
- d822536 chore: improve partial checkpoint CLI in progress message (#7015)
- d8ffd91 chore: ml-sys onboarding exercises (#6516)
- 1793f47 style: remove unnecessary wrapper around directory tree (#7008)
- 4b3c126 fix: seconds come from epoch duration, not timestamp piece (#7007)
0.23.0
Release Notes
Changelog
- ac107b7 chore: bump version: 0.23.0-rc4 -> 0.23.0
- fb0383b docs: add release notes for 0.23.0 (#7058)
- d10e02c chore: bump version: 0.23.0-rc3 -> 0.23.0-rc4
- 83670e0 fix: add missing workspace_id column to FilterExperimentsQuery calls (#7049)
- afb7772 chore: bump version: 0.23.0-rc2 -> 0.23.0-rc3
- d961a39 fix: remove CodeEditor onError prop
- 269685b chore: bump version: 0.23.0-rc1 -> 0.23.0-rc2
- 1dafc8b fix: omit projects outside of users permissions [DET-9557] (#7046)
- 33d6fc7 fix: load config file into the code editor (#7033)
- 7464932 fix: fix k8s slots count growing for commands [DET-9550] (#7025)
- ee20409 chore: bump version: 0.23.0-rc0 -> 0.23.0-rc1
- 125c0b4 fix: WorkspaceList tableOffset bug (#7017)
- 93bf322 fix: In new breadcrumbs, uncategorized links to /projects/1 [WEB-1319] (#7013)
- 6b0f905 chore: bump version: 0.23.0-dev0 -> 0.23.0-rc0
- b99b722 chore: bump version: 0.22.3-dev0 -> 0.23.0-dev0
- 55962b2 chore: lock published urls to preserve redirects
- 1659465 chore: lock api state for backward compatibility check
- 6e1384c fix: add canDoActionOnCheckpointThroughModel to core_checkpoint (#6889)
- f5060e3 feat: DeepSpeed Autotune [MLG-201] (#6924)
- 2e98832 feat: partial checkpoint delete [DET-9491] (#6901)
- 1e0bf34 ci: fix latest-main/preview deployments.
- 847928a chore: remove support for hdfs (#6967)
- 750f82c chore: Consolidate CodeMirror code files [WEB-1306] (#6999)
- 6984945 ci:
det deploy aws
flavor for a normal RDS postgres. (#7000) - fefca32 refactor: full trial summary metrics recompute take metric type as param (#6997)
- 6545423 feat: add provisioning timeouts (#6447)
- 82a706b chore: adjust end_time threshold for allocation tensorboard test [DET-9542] (#6992)
- 8a8f4cb fix: k8s tasklist forever growing (#6957)
- 6ff5a99 fix: Stabilize return order for experiments (#6968)
- 318adf9 chore: Remove unused ExperimentConfiguration (#6959)
- 38f7273 feat: strategic merge pod spec with task_container_defaults [DET-7227] (#5728)
- ee99f56 fix: sticky nav bar in UIKit page (#6980)
- d65f072 chore: consolidate and simplify isort config files (#6990)
- c0971bb ci: Rng tests refactor (#6946)
- 46379a6 feat: switch from Monaco to CodeMirror (#6926)
- 9ff5e31 feat: add comparison toggle to glide experiment list [WEB-989] (#6976)
- 9ed4888 fix: breadcrumb warnings (#6965)
- cb52a19 chore: install sigusr1 on main thread only [MLG-585] (#6993)
- 5600fe8 chore: allow tokens to be provided via secure cookie instead (#6862)
- 8acc879 docs: update Core API UG w literal includes (#6501)
- daab280 chore: remove old Python packages (#6985)
- c8be6af test: fix
test_experiment_proxy_ray_tunnel
. (#6972) - 89c535b chore: Sort and filter on description, duration, and searcherType [WEB-1199] (#6913)
- 0dbd6d7 chore: simplify metrics handler syntax (#6969)
- faba534 fix: parameterize agent tmp by agent id [DET-9111] (#6960)
- 3a4a247 test: Removes several multiprocessing executions of test_enqueuer. (#6949)
- 6ebf476 feat: added
gpu_hours
to historical allocation CSV [DET-9506] (#6948) - ee2b8f0 chore: More use of workspace store [WEB-1240] (#6880)
- 114e425 chore: add generic metrics support part 1 (#6641)
- 9388f1a chore: bump version: 0.22.2-dev0 -> 0.22.3-dev0
- 53abbb7 docs: Add release notes for 0.22.2 (#6954)
- d4445a4 perf: render full experiment list [WEB-1234] (#6947)
- df4d55a fix: only select new pages during selectAll mode (#6955)
- 0ee01af style: update glide table styles to design specs [WEB-1227, WEB-1281] (#6811)
- 5a7a7f8 chore: fix bugs in redirects.py (#6958)
- 77e08fa fix: Test runs dirs (#6916)
- 1f9e6a8 chore: simplify det generated enum's str representation (#6942)
- 1aea75f docs: fix a typo in the proxy ports guide. (#6953)
- 6f57431 docs: remove an obsolete section about
Checkpoint.load_from_path
. (#6909) - 9edc69e chore: check if a.restored to send task log [DET-9515] (#6956)
- 271013a ci: skip non-fix/feat PRs in release automation (#6939)
- 26372ef docs: Apply consistency to setup guides (#6930)
- 273e145 docs: Add HPC Launcher Security Considerations [FOUNDENG-617] (#6945)
- e232d2f chore: enable flake8 pre-commit check on e2e_tests (#6933)
- 29abca2 chore: Add Breadcrumb to every Page [WEB-335] (#6798)
- 7f5ce01 fix: out of bounds when page is bigger than actual (#6931)
- d4b76d3 fix: Workspace selection on JupyterLab modal [DET-9503] (#6938)
- 47110dd chore: vary generated training and validation metric counts (#6915)
- 88f5a43 feat: add theme (fonts, colors, etc) to design kit (#6847)
- e61264f feat: Hide experiments from archived projects in
GetExperiments
DET-9381 (#6832) - 7d78dbc chore: remove old events infrastructure from allocation (#6771)
- c03ec45 ci: increase resource classes for some tests (#6932)
- 2cdb946 ci: handle cherry-pick marker label in release automation (#6911)
- 8666415 fix: Experiment search filter API parentheses (#6893)
- eeecef8 refactor: make staticcheck linter use correct go version. (#6914)
- 411d49f feat: add json output format to det :ntsc logs commands (#6887)
- 47ed9de docs: automatically put the current year in the copyright (#6928)
- 1274927 fix: avoid having user edit modal reset to original state every 5~10 seconds [WEB-1273, WEB-1261, WEB-1203] (#6912)
- 653ed42 fix: turn off AutoPause for AWS RDS Aurora. (#6925)
- eda1385 feat: parallel coordinates chart on trial hyperparameters tab [WEB-1196] (#6872)
- de5e845 fix: make AWS template postgres version-agnostic. (#6923)
- c60a424 chore: add isort pre-commit hook (#6897)
- cbc564b fix: bump AWS CFN
SecondsUntilAutoPause
to see if it helps with timeouts. (#6921) - 628613b fix: replace
crypto.randomUUID
withuuid
(#6908) - be9f545 fix: remove oss admin flag from 'det user list' with RBAC (#6920)
- 0b59d79 fix: patch workspacelist context menu (#6919)
- 6b4615b refactor: remove /agents endpoint [DET-9479] (#6907)
- 77b0e94 chore: update polling intervals for stores (#6892)
- 9985f57 chore: rm ntsc events manager, endpoint, and client usage (#6840)
- 482481d fix: icon position (#6904)
- 6f12aca refactor: explicitly support partial settings and compare changes in full settings (as opposed to comparing settings to updates) (#6894)
- 15ad730 fix: Full height on code editor [WEB-1258] (#6906)
- 4538fe7 chore: enable exhaustiveness checks in switch statements (#6903)
- 14ee7d9 feat: new experiment table filter (#6502)
- fda0f1c chore: Remove unused rbac feature flags [WEB-920] (#6845)
- e7e293d chore: update Button component (#6841)
- 36dac4b fix: case-insensitive contains query within experiment (not metrics or hparams) (#6858)
- eb58396 test: add concurrent trial metric updates (#6884)
- bf84d0a ci(gha/link-docs-preview): strip md5sum output (#6881)
- 48648a2 ci: track merged PRs in a GitHub project (#6867)
- 0e39f19 feat: detached mode v0. (#6519)
- fcc4b70 fix: adjust pinned position calculation (#6890)
- bb76e55 docs: Indent content under 4th level headings (#6888)
- 771f664 refactor: bunify a chunk of experiment-related db queries. (#6854)
- 89a5e5e fix: handled when allocation end_time < start_time in updating aggregate-resources (#6878)
- bc76c1a fix: fix an issue with running det :ntsc logs (#6886)
- 2e0e490 fix: trials created by paused experiments should be paused [DET-9493] (#6882)
- 95f27d2 chore: Add ordered map (orderedmapx) to allow for queuing of job cancelations [DET-9465] (#6874)
0.22.2
Release Notes
Changelog
- cc16e0a chore: bump version: 0.22.2-rc1 -> 0.22.2
- 5c76b16 docs: Add release notes for 0.22.2 (#6954)
- 9d4eeda chore: bump version: 0.22.2-rc0 -> 0.22.2-rc1
- 9f0ca2e fix: avoid having user edit modal reset to original state every 5~10 seconds [WEB-1273, WEB-1261, WEB-1203] (#6912)
- 41952af fix: patch workspacelist context menu (#6919)
- b5d4016 fix: icon position (#6904)
- fbe740d fix: Full height on code editor [WEB-1258] (#6906)
- 0b0660a fix: fix an issue with running det :ntsc logs (#6886)
- c74d60c fix: adjust pinned position calculation (#6890)
- a45b790 chore: bump version: 0.22.2-dev0 -> 0.22.2-rc0
- 91df9e4 chore: lock published urls to preserve redirects
- ace853b chore: bump version: 0.22.1-dev0 -> 0.22.2-dev0
- 85e23c6 docs: add release notes for 0.22.1 (#6885)
- b81417b fix: remove ptl images from circleci (#6870)
- a0aba90 chore: Replace
new page
withnew note
in NoteCards (#6873) - 6dacc85 ci(gha/link-docs-preview): fix variable name in output (#6879)
- 7a1cb51 ci(gha/link-docs-preview): fix url base for preview links (#6876)
- ad07143 ci(gha/link-docs-preview): use md5sum for branch hashing (#6868)
- 2a06b3a ci(circle): skip jobs if only .github files were changed (#6865)
- 3bd01e2 ci(circle): add HOME to react cache key (#6812)
- 63ab1ad chore(gha/link-docs-preview): comment out yet unworking steps (#6863)
- ca900b6 docs: Add documentation for installing and running Determined AI on WSL (#6826)
- 08893c5 feat: Add CodeEditor to UI Kit [WEB-1049] (#6522)
- 27179ae test: fix nightly
test_convergence
. (#6842) - 6a0defd chore(gha): add debug info to help iterate on link preview workflow (#6821)
- 06b6fd3 fix: total batches update on rollbacks. (#6859)
- d995abb ci: install
build
when publishing Python packages (#6861) - 6648710 chore: lock api state for backward compatibility check
- 83f6bbd docs: reorganize the model dev guide directory (#6848)
- 827c561 test: fix TestNonNumericEpochMetric test (#6855)
- 7e4fd9b chore: increase det deploy gcp server size to 200gb (#6850)
- aaaff87 chore: devcontainer node version and ssh bindings (#6849)
- 03d7d62 fix: on dashboard show workspace name and link for tasks (#6844)
- 884bca5 chore: error when non numeric epoch values are reported (#6768)
- a1a357e fix: 'det e list' and 'det e list -a' behavior was switched DET-9480 (#6829)
- deccdd3 chore: support non numeric metrics in CompareTrials, SummarizeTrials, and TrialsSample [DET-9384] (#6734)
- afb8bda refactor: Add notes to UI kit (#6791)
- 239a559 perf: speed up synthetic metric generation (#6843)
- 8bba67d chore: do not set trial state on PyTorchTrial until load (#6716)
- 007d3d4 chore: print messages while
det deploy aws (up|down)
waits (#6838) - 6c289c2 chore: update hermes [WEB-1186] (#6818)
- 11274ee docs: Update launcher dependency to 3.2.9 [DET-9474] (#6836)
- eb37c14 ci: bump GKE Kubernetes version (#6835)
- fda7675 feat: Add metrics-stream/metric-names API for multiple experiments [WEB-1107] (#6827)
- 6adf59f chore: remove bun log messages from master logs [DET-9460] (#6816)
- 8d44d62 chore(CLI): set command bind mounts from --config arg (#6769)
- 8e13ab4 fix: Fix designkit screenshots being affected by subpixel lengths (#6777)
- 03dc214 docs: Reorganize setup directory (#6819)
- a24bca9 refactor: add 16px bottom padding to Page [WEB-1052] (#6770)
- a7a6aaf fix: doc url fix (#6823)
- df18289 chore: add some prom utils before they are duped everywhere (#6729)
- 8036666 fix: Core api pytorch mnist update2 (#6775)
- 5f0a2b8 feat: mask token det auth login [MLG-540] (#6817)
- 8c975e3 docs: Update language about upgrading/uninstalling CLI (#6822)
- bb53fe8 feat: associate templates with workspaces (#6605)
- 680e492 ci: increase
det deploy aws up
wait time to 20m. (#6820) - 979346f chore(deps): bump github.com/docker/distribution in /agent (#6814)
- 969c7d0 chore(gha): fix invalid syntax for docsite linker (#6810)
- 7b294d1 chore: update jupyter-lab URL to be consistent with CLI generated URL (#6809)
- a9b6b80 docs: Reorganize reference directory (#6790)
- 6c919d3 chore: add names for experiment test fixture (#6760)
- 02961b8 chore: backport workspace creation role assignment config
- 7ff3b46 feat: replace profiler empty page (#6799)
- 9afa0b3 fix: det e download auth issue [MLG-526] (#6802)
- 948e6e7 fix: agent forwards any https_proxy setting (#6794)
- 7af9a30 refactor: add clipboard button to uikit [WEB-1122, WEB-355] (#6767)
- 6e67058 ci: remove broken gh actions test-cli. (#6782)
- f42ce80 feat: replace spinner cells with loading cells in glide table [WEB-1185] (#6795)
- 9353647 ci: Enable deepspeed unit tests (#6772)
- eb025cb fix: typo in singularity docs (#6797)
- 7d94460 docs: Clarify pip install determined (#6755)
- 542a0a7 chore: replace missing workspace id panic with an api error (#6758)
- 1fb0e78 fix: "Select All" behavior on new Experiment List (#6779)
- c0edbe4 refactor: remove legacy echo experiment post/patch, trial and trial metrics get endpoints. (#6763)
- 132382f chore(circleci): limit doc preview generation to upstream branches (#6784)
- be01075 docs: correct the cloud shared fs information. (#6780)
- 7ef5a99 docs: add rbac example setup diagram (#6757)
- bd9cb5c chore: Quarantine select React test(s) (#6765)
- 12dc9cd chore: backport rbac ntsc entity id logging (#6762)
- 22eeec6 chore: add initial vscode devcontainer files (#6736)
- 80f83cf refactor: add dropdown to uikit and update dropdown usage to use uikit version [WEB-1152] (#6703)
- 7bc1e32 ci: update to node v18 (#6759)
- 4a66b31 fix: Exp list supports selection with shift/cmd (#6745)
- edae25a ci: Schedule search indexing with release workflow (#6776)
- ebf755c chore: use icon component from designkit (#6774)
- 5d83c2f docs: added docstring in _checkpoint.py (#6741)
- fbc9417 fix: add border after last frozen column in glide [WEB-1194] (#6764)
- 4007457 refactor: add Icon to UI Kit [WEB-1115] (#6635)
- 14c7fff refactor: Columns component (#6684)
- aa8519e ci: exit github comment script when no pr is found (#6766)
- d115ff0 feat: Support sorting in glide table (#6604)
- 2316bc8 chore: porting EE PermissionDenied changes back to OSS [DET-9468] (#6751)
- b7677e9 fix: add correct experiment filter grouping (#6752)
- 83612df chore: cleanup our setup.py (#6602)
- 00fda76 feat: enable double click auto column adjust [WEB-1193] (#6727)
- 71572b7 fix: Use position: fixed to replace sticky TrialLogPreview (#6753)
- 9779486 fix: allow 0 metrics on old and new charts (#6748)
- b0c2fbc test: Add visual diff test to designkit (#6707)
- 91237bb ci: add automation script for redirects (#6747)
- 96d7d26 chore: test RBAC transfers permission from group to users [WEB-1172] (#6750)
- b6d6f03 fix: Limit filters queries to experiments in relevant applicable state [WEB-1160] (#6568)
- edfd1f8 fix: Remove canCreateExperiment experiment param (use project to get workspace context) (#6712)
- 0dcc3f0 chore: chart tooltips always using new tooltip [WEB-828] (#6713)
- 99df12d docs: add renaming tool (#6715)
- fb676f6 ci: refactor single-gpu e2e_tests (#6661)
- 3632e2c docs: update min python version for development (#6587)
- 8b05942 chore: bump version: 0.22.0-dev0 -> 0.22.1-dev0
- 0afef54 chore: bump version: 0.21.3-dev0 -> 0.22.0-dev0
- 7020730 docs: add release notes for 0.22.0 (#6739)
- 28e40ba fix: cell out of bounds (#6728)
- 0a82f41 fix: Cluster stats chart displays with single day result [WEB-1206] (#6737)
- 5c7bccd ci: make custom searcher dump failed trial logs (#6709)
- 9be051e fix: spinner layout and typo in workspace success message (#6725)
- a643f28 fix: Remove redundant scroll bar for new experiment list (#6732)
- 64168c2 chore: fix
make -C docs live
for real (#6731) - f28a606 feat: agent support for podman (#6657)
- 168181b fix: Delete multiple experiments with new call [WEB-1135] (#6571)
- 7d89b18 perf: speed up model versions query (#6726)
- a6df988 chore(docs): setup docsite previews [INFENG-184] (#6704)
- 54f17bc chore: tweak proto comment (#6723)
- 75bfa69 chore: update new experiment list column widths (#6722)
- 8c05a3d fix: centralize cluster polling and reduce workspace calls [WEB-1190, WEB-1201] (#6706)
- dbc0853 docs: edit various minor things, mostly capitalization (#6690)
- 0262b67 fix: update convert to just cast string instead of encoding (#6719)
- b3d59d9 docs: k8s deployment docs [DET-9449] (#6696)
- b47247a docs: deprecate
logging.additional_fluent_outputs
. (#6718) - a2fb82a fix: Use throttle for log viewer filters (#6662)
- 999ffec feat: /api/v1/experiments-search should support filters [WEB-984] (#6556)
- e131cf9 chore: refactor PermissionDenied RBAC errors [DET-8946] (#6618)
- bdd727f fix: GC jobs do not inherit sbatch_args from custom resource_pool [DET-9363] (#6679)
- 18e14e8 chore: deprecate tf1. (#6710)
- c9efa08 fix: dataloader skip on resume (#6708)
- 61d651f chore: add flake8 pre-commit check for harness (#6695)
- fbfefee chore: Remove portable fetch fix (#6700)
- 6586490 chore: deprecate EstimatorTrial (#6701)
- fc0f389 fix: pass auth with incorrect password [DET-5623] (#6591)
- 67fbac0 fix: summary metrics migration earlier than last release latest (#6705)
- ad70138 feat: add column picker to new exp list [WEB-978] (#6683)
- 5527364 docs: summary metric release note reword (#6702)
- 95d1b16 chore: bumpenvs 6eceaca (#6692)
- 5953049 docs: summary metric migration downtime (#6688)
0.22.1
Release Notes
Changelog
- 35123bb chore: bump version: 0.22.1-rc1 -> 0.22.1
- 350f1b7 docs: add release notes for 0.22.1 (#6885)
- 6ebcc45 Revert "docs: add release notes for 0.22.1"
- 7c942a1 docs: add release notes for 0.22.1
- 7ff42a8 chore: bump version: 0.22.1-rc0 -> 0.22.1-rc1
- b1fa4b5 fix: total batches update on rollbacks. (#6859)
- c2243d9 chore: bump version: 0.22.1-dev0 -> 0.22.1-rc0
- 2f46765 chore: bump version: 0.22.0 -> 0.22.1-dev0
0.22.0
Release Notes
Changelog
- 9ec3468 chore: bump version: 0.22.0-rc2 -> 0.22.0
- 6e38c80 docs: add release notes for 0.22.0 (#6739)
- 8dee396 docs: deprecate
logging.additional_fluent_outputs
. (#6718) - a4d513f chore: bump version: 0.22.0-rc1 -> 0.22.0-rc2
- f3dd110 perf: speed up model versions query (#6726)
- c927198 fix: fix last cherry-pick
- 188c83b fix: centralize cluster polling and reduce workspace calls [WEB-1190, WEB-1201] (#6706)
- 35cd432 fix: update convert to just cast string instead of encoding (#6719)
- 47eaa50 chore: bump version: 0.22.0-rc0 -> 0.22.0-rc1
- f064543 chore: deprecate tf1. (#6710)
- 7048e94 chore: deprecate EstimatorTrial (#6701)
- 8165cb8 fix: summary metrics migration earlier than last release latest (#6705)
- 059ff0b chore: bumpenvs 6eceaca (#6692)
- 40e6c50 docs: summary metric release note reword (#6702)
- a0a2799 docs: summary metric migration downtime (#6688)
- 4e8e2e0 chore: bump version: 0.22.0-dev0 -> 0.22.0-rc0
- c950ad7 chore: bump version: 0.21.3-dev0 -> 0.22.0-dev0
- dea331a chore: unwind the lazy write of trials to the database (#6650) [DET-9405]
- 72a218f chore: Introduce local-only designkit route (#6438)
- 6eb5fbd feat: add summary metrics to trial obj [DET-9402] (#6671)
- 74410b7 docs: Help users find exp conf ref (#6691)
- 428e0a9 chore: release note on breaking TF changes to Keras optimizers (#6693)
- f29c262 fix: close context menu after click for new experiment list table (#6632)
- 9fd1982 chore: replace querystring and query-string with URLSearchParams (#6670)
- 4267ff4 ci: fix bug in docs makefile (#6689)
- abfe659 chore: add warning to det auth login flow (#6655)
- f3701d0 fix: metricName ident escaping (#6686)
- 1d8e9a5 perf: optimize experiment search [WEB-1163] (#6646)
- fb29d6c chore: Add circleci param for default-pt-gpu-image (#6680)
- 0278cf2 perf: add latest_validation_id to trials (#6672)
- 080ef5f refactor: clean up a bit of logic in
test_core.py
(#6639) - aeaea82 feat: logging message for tb upload completion. (#6682)
- d998e7a fix: update user management table to use the correct total from pagination info [WEB-1187] (#6675)
- 171c3c1 docs: add meta descriptions (#6654)
- 2634da2 chore: disable flag (#6674)
- 40d65a9 chore: update license for web pacakges (#6638)
- 342c039 refactor: simplify config handling in
AddExperiment
. (#6667) - 3cc0c51 chore: bump version: 0.21.2-dev0 -> 0.21.3-dev0
- 0eca7c3 docs: add release notes for 0.21.2 (#6673)
- 3f76344 feat: add
Open Link in New Tab
andOpen Link in New Window
options to experiment table link cells[WEB-1168] (#6637) - dc2d86e chore: update UI kit Tooltip component (#6608)
- c1dc2ae chore!: remove old template endpoints [DET-9337] (#6572)
- 5456d5e fix: update column order in ExperimentTrials table (#6640)
- aef04d3 fix: task proxies should recover across restarts (#6660) [DET-9325]
- 10e28da chore: run
test-e2e-react
only on upstream branches (#6658) - feea576 docs: add missing docs for DownloadMode (#6648)
- 05e90b3 fix: Fix invalid token logout loop (#6653)
- 11c4154 perf: summary metrics perf improvements
MetricNames
andTopTrialsByMetric
(#6634) - dcd0a8c fix: Metadata card can view and preserve JSON values [WEB-1183] (#6642)
- 97dedf9 chore: fix det deploy local tests (#6612)
- 435d31a fix: cache k8s summarize for performance (#6647)
- 427fc07 ci: fix autoscrape ci job (#6629)
- ec3cc98 fix: Investigate "directory does not exist" tensorboard warning messages [DET-9166] (#6594)
- fb98f2b test: playwright e2e test setup (#6562)
- f0c8ef1 feat: Support "cancelled" and "failed" Shutdown operations [MLG-468] (#6627)
- d0fcfe3 ci: reduce overly long CloudFormation stack name (#6626)
- 23c7013 docs: update contributors guide (#6616)
- 308972f chore: refactor columns api (#6563)
- ee8f39c test: fix test_experiment_api_determined_disabled flake (#6633)
- bf5377f fix: remove TODOs from examples (#6636)
- 459d4bb fix: Fix cluster store polling (#6611)
- 61a804a feat: summary metrics (#6477)
- 2cb7730 feat: singularity support (#5933) [DET-9388]
- acb5084 feat: multitrial improve efficiency [DET-9329] (#6545)
- fbc375d refactor: remove dialogApi modal dependency [WEB-1132, WEB-1133] (#6589)
- f43e5e3 ci: actually auto-scrape algolia records (#6623)
- 04e021f chore: fix key error in workspace and project list cli (#6625)
- cc22a78 fix: fix validation before logic (#6624)
- 7131b48 perf: remove unnecessary full trial.total_batches recomputes (#6621)
- 56d7f65 chore: lazy load imports to speed up load time [MLG-455] (#6590)
- c51c026 ci: auto-scrape algolia records (#6620)
- 5658c0c chore: bumpenvs a8b9c02, remove some selective tensorflow imports [MLG-358] (#6595)
- b4deb6d fix: include global TaskContainerDefaults when no resource pool (#6609)
- 2317b20 ci: fully quarantine
test-det-deploy-local
andtest-stress
(#6606) - 5b292b0 fix: jiggle cli.tunnel & cli.proxy code to avoid an import warning. (#6599)
- c315505 chore: add cli render method for json (#6548)
- 701651c fix: Workspace member modal includes users and groups (#6596)
- 84a9194 docs: Add structure to master config ref (#6567)
- 7a4d7d5 fix: widen the Searcher Metric Values column (#6600)
- a660db8 fix: set searcher_metric_values properly (#6603)
- 8eff0bc ci(circleci): split-off quarantined e2e tests (#6601)
- fd7aac6 fix: launch error has no effect for k8s and slurm (#6597)
- 87810a8 feat: k8s cluster info using taints/tolerations incl. global ones (#6518)
- af65660 docs: fix searchbar result paths (#6592)
- 1c8bf2a fix: Moving model to new workspace - workspaces sort (#6593)
- ac3cb4b feat: storage manager shortcut dictionaries [MLG-407] (#6488)
- 1a0f9ff ci: quarantine some flaky tests (#6577)
0.21.2
Release Notes
Changelog
- fa2d99f chore: bump version: 0.21.2-rc5 -> 0.21.2
- 187e6ba docs: add release notes for 0.21.2 (#6673)
- a3007c2 chore: bump version: 0.21.2-rc4 -> 0.21.2-rc5
- e251cb4 fix: task proxies should recover across restarts (#6660) [DET-9325]
- d4f9105 chore: bump version: 0.21.2-rc3 -> 0.21.2-rc4
- bc21216 fix: cache k8s summarize for performance (#6647)
- e8e0466 fix: Investigate "directory does not exist" tensorboard warning messages [DET-9166] (#6594)
- a6731bb chore: bump version: 0.21.2-rc2 -> 0.21.2-rc3
- e7597fc perf: remove unnecessary full trial.total_batches recomputes (#6621)
- 98239e6 chore: fix key error in workspace and project list cli (#6625)
- ef2178a fix: fix validation before logic (#6624)
- ff6b44a fix: include global TaskContainerDefaults when no resource pool (#6609)
- c52e2bc feat: k8s cluster info using taints/tolerations incl. global ones (#6518)
- 34a330f fix: jiggle cli.tunnel & cli.proxy code to avoid an import warning. (#6599)
- e240489 chore: bump version: 0.21.2-rc1 -> 0.21.2-rc2
- 45bb0df fix: set searcher_metric_values properly (#6603)
- 53aaa71 chore: bump version: 0.21.2-rc0 -> 0.21.2-rc1
- 546c330 fix: Workspace member modal includes users and groups (#6596)
- 8f4850b fix: launch error has no effect for k8s and slurm (#6597)
- c464708 fix: Moving model to new workspace - workspaces sort (#6593)
- 5d04752 docs: fix searchbar result paths (#6592)
- aa9fb17 chore: bump version: 0.21.2-dev0 -> 0.21.2-rc0
- c104cce chore: lock api state for backward compatibility check
- ba87bb5 ci: update remaining references to master branch (#6588)
- e1d7497 chore: upgrade Black to latest version (#6564)
- 6fa5d18 fix: update test to reflect change in CompareTrialsRequest (#6585)
- 1847e49 fix: close modal properly (#6574)
- 66aee59 ci: master main rename (#6570)
- b112479 docs: stop relying on algolia crawler (#6584)
- 57077d0 docs: Improve the WebUI page (#6579)
- 23a3f8b chore: Update Trial Time Series API endpoint [WEB-1116] (#6566)
- 0c3961a docs: enable algolia search by adding structure (#6541)
- eca4d17 fix: metrics sampling for metrics with epochs. (#6578)
- c03e8fc chore: remove yaml from manifest (#6582)
- b646dac build: take mockery out of master build path (#6546)
- faa5eed docs: Add structure to agent configuration ref (#6573)
- 402da7c chore: tensorflow 2.12 [MLG-358] (#6527)
- e87a2a3 feat: Support order for experiment search API (#6479)
- 3ad532f ci: kill GKE GPU node pool creation if it's taking too long (#6565)
- 70c3fe3 chore: update launch_error docs and add release notes (#6561)
- a453cfc chore: add --json to det e|task|trial logs [DET-8982] (#6544)
- f1f6175 chore: bump default python version to 3.8 in CircleCI (#6560)
- 8f1fd54 fix: move return in glide table cell
- b57b9ef fix: Link clickability restrictions after rebase (#6557)
- 042512f feat: explist v2 style tweaks [WEB-1108] (#6521)
- f92bbed feat: add bulk actions to glide table [WEB-981] (#6493)
- 70c2d11 build(deps): bump github.com/docker/docker in /master (#6465)
- bc18aba chore: simplify logic and improve naming in an RBAC test (#6552)
- e99c241 fix: Append instead of replace sbatch args [DET-9263] (#6547)
- 0c979bb ci: retry Go download even more (#6550)
- 788b01a chore: bumpenvs with the
pynvml
-nvidia-ml-py
switch [DET-9319] (#6520) - a3005cf chore: address mypy lint warnings [DET-9334] (#6549)
- b221032 test: add a tool to replicate exps, trials, and metrics [DET-9043] (#6211)
- 4705845 feat: change launch warning to launch error (#6412)
- fecb5f8 chore: fix a lint issue in e2e test code (#6543)
- 468b689 chore: better error messages for logging test [DET-9280] (#6535)
- cdda88d docs: Minor updates for HPC Launcher details. (#6542)
- c25f6ce feat: Right-click for context menu in Glide table [WEB-977] (#6532)
- 66f82ab chore: fix the migration order for adding trials.total_batches (#6533)
- 5798ba7 chore: remove docker compose dependency [MLG-37] (#6499)
- 380e93b fix: install codecov uploader in CI (#6538)
- 1350bb2 docs: minor edit pod security (#6536)
- c13e13a ci: fix incorrect secret in helm workflow (#6537)
- 169729a chore: code refactor and minor updates [DET-9237] (#6515)
- d61fe43 ci: github actions consistently use major version (#6529)
- be10217 chore: bump version: 0.21.1-dev0 -> 0.21.2-dev0
- d15c6ca docs: add release notes for 0.21.1 (#6530)
- a4fea4a docs: fix links (#6528)
- 41a1c8b build: Some CircleCI tests begin with a python 3.8.16. (#6525)
- 835788c test: fix race checking for trial logs in missing_docker_container [DET-8933] (#6514)
- d375159 chore: [FIX] replace helm pw-changer image with generic det env image [DET-9110] (#6523)
- 41b01cb ci: retry Docker Hub publish steps (#6512)
- 5a680b9 build: Updates Python used in CircleCI to 3.8.16 (#6391)
- 0c6bfad chore: replace helm pw-changer image with generic det env image (#6483)
- 881645a docs: fix broken links, formatting, and typos [MLG-404] (#6510)
- 1b5ae81 chore: bumpenvs to 6218891517b070835640bb5ed3b2e2dd414c56d0 (#6504)
- 72aa32a feat: steps -> pre-compute metrics and allow non-aligned training and validations [DET-9154] (#6372)
- 69547e1 fix: remove hp importance [DET-9181] (#6446)
- 940c580 build(deps): bump github.com/docker/docker in /agent (#6466)
- 170c960 style: add inter font and apply (#6328)
- ee4cffd fix: preserve model registry label order [DET-9303] (#6507)
- 7dc905d fix: wrap shutil.rmtree in a loop (#6506)
- 7587c35 chore: use UI kit modal for Experiment actions (#6478)
- e541217 fix: use workspaces store in ProjectMoveModal (#6505)
- 588306f style: shorter generated py enums names (#6496)
- 3893035 fix: exp actions with workspace-specific user perm [DET-9300] (#6503)
- 6601085 fix: update settings observables to detect changes (#6489)
- a6cd26a fix: add unique port for local agent tests (#6492)
- baf6ee7 refactor: standardize state stores (#6326)
- 8ac2430 fix: task list race caused incorrect 404s to be returned [DET-9293] (#6500)
- bfb5349 fix: fix mmdetection checkout in unit tests to last 2.x tag to avoid test failures due to schema naming changes. (#6498)
- 008ace5 ci: more retries for
det deploy
(#6494) - 5b7fbd8 docs: remove leftover K80 mentions. (#6495)
- 1fb2137 ci: skip a test that's currently broken (#6497)
- 899f1c9 test: fix ray tests [DET-9267] (#6480)
- 5dc80c5 feat: add Core API pytorch mnist tutorial update (#6243)
- cb03407 test: cleanup should kill hung experiments, not cancel (#6484) [DET-9284]
- 0e156e4 docs: modified docs for clarity and k8s pod security (#6471)
- dbdc53c feat: Bulk delete experiments, experiment filters in LaunchTensorboard [WEB-1127] (#6468)
- 5a5a3ce docs: release note edits. (#6487)
- 89dedb2 docs: Edit Page Title (#6485)
- d3d1c48 fix: avoid duplicate labels (#6445)
- 8299a88 chore: tweak Go, Python versions in dev setup instructions (#6481)
- bbbdbc1 chore: set up authz interfaces for master logs and /resources (#6454)
- 2852410 fix: add remote prop to Python SDK user.User (#6476)
- 61847ad fix: podspec requests/limits for zero slot workloads [DET-9257] (#6469)
- 7ee018d chore: create a gRPC endpoint for tasks [DET-8803] (#6343)
- 49c93f7 chore: remove example bert_squad_pytorch [MLG-396] (#6467)
- e41d934 feat: treat instance startup script as a secret #2 [DET-9214] (#6450)
- 557dde2 docs: add a known issue and its fix for PBS - Display valid value for Accelerator field in the Resource Pool Configuration (#6379)
- fa1c043 revert: feat: k8s cluster info using taints/tolerations (#6474)
- 64b16e7 Revert "chore: remove docker-compose from det deploy local [MLG-37] (#6386)" (#6472)
- d33b220 chore: account for missing aws stack outputs [DET-9255] (#6455)
- 48df2bc docs: do some copy editing (#6462)
0.21.1
Release Notes
Changelog
- 42ea168 chore: bump version: 0.21.1-rc4 -> 0.21.1
- e10942a docs: add release notes for 0.21.1 (#6530)
- ce228f3 chore: bump version: 0.21.1-rc3 -> 0.21.1-rc4
- 69e6135 docs: fix links (#6528)
- eab3142 chore: bump version: 0.21.1-rc2 -> 0.21.1-rc3
- 5afa655 chore: bump version: 0.21.1-rc1 -> 0.21.1-rc2
- d214339 fix: wrap shutil.rmtree in a loop (#6506)
- 3aceec7 fix: exp actions with workspace-specific user perm [DET-9300] (#6503)
- 4e4646b fix: update settings observables to detect changes (#6489)
- 0a4e609 docs: release note edits. (#6487)
- d2edfd2 chore: bump version: 0.21.1-rc0 -> 0.21.1-rc1
- 213f426 fix: add remote prop to Python SDK user.User (#6476)
- 69bba45 revert: feat: k8s cluster info using taints/tolerations (#6474)
- 7e1c019 Revert "chore: remove docker-compose from det deploy local [MLG-37] (#6386)" (#6472)
- bf9f94a fix: podspec requests/limits for zero slot workloads [DET-9257] (#6469)
- 87195b9 feat: treat instance startup script as a secret #2 [DET-9214] (#6450)
- bd99cf9 chore: bump version: 0.21.1-dev0 -> 0.21.1-rc0
- e6ee9a4 chore: lock api state for backward compatibility check
- ae5b933 chore: remove docker-compose from det deploy local [MLG-37] (#6386)
- 049a8cd feat: storage manager shortcut strings [MLG-221] (#6408)
- 20da2b4 fix: Show page not found on experiment details (#6459)
- 6568ca4 chore: deprecate HDFS storage support (#6460)
- 713d2a3 fix: do not skip graceful preemption for tasks using alternate rendezvous path (#6463) [DET-9258]
- 0ff7f0a chore: use UI kit modal for UserSettings modals (#6376)
- a4266bd chore: add go workspace for dev use (#6461)
- 8cc8954 feat: k8s cluster info using taints/tolerations (#6425)
- c05b3a1 ci: enable debug mode for
det deploy aws
commands (#6458) - 5d9266a chore: remove harness schema gen (#6457)
- d84b8aa fix: do not error on nullable validation metrics [MLG-403] (#6453)
- e6f965a revert: returned removed message field from TrialLog [DET-8983] (#6253) (#6456)
- e7d7f58 docs: fix
public_ip
misinformation [DET-7014] (#6437) - e62c117 fix: add a whitespace in model version createdBy (#6424)
- 358f873 ci: check
det-deploy
exit status intest_local.py
(#6448) - 31ab9c1 docs: Style and organize content around CLI, Commands and Shells (#6347)
- 64916c6 revert: feat: treat instance startup script as a secret [DET-9214] (#6387) (#6449)
- 527ce2a build: fix examples makefile (#6404)
- 6bb0bfb refactor: removed message field from TrialLog [DET-8983] (#6253)
- 164f9ff chore: Use experiment-search API for new experiment list table (#6428)
- e209f37 test: examples test requirements.txt maintenance. (#6429)
- 60bbb0b feat: treat instance startup script as a secret [DET-9214] (#6387)
- 7a5d63a chore: remove storybook leftover (#6426)
- 266d0f1 docs: slot command limitations for HPC clusters (#6421)
- f0c87af fix: show states dropdown on ExperimentTrials (#6435)
- 82e70e7 chore: show error message that links to git rebaseability page (#6422)
- 482fb5d test: set up ci for reporting go tests using junit (#6418)
- c4446d5 feat: Add quota limits for helm (#6432)
- 347a0b1 fix: dont hardcode explist flag to true (#6430)
- be20cba fix: remote users cannot login with or change password (#6413) [DET-9246]
- 7ce560c chore: use Modal from UI kit (#6350)
- 58f8611 test: Make
test_launch
more portable. (#6419) - c6ca31d docs: fix make live (#6411)
- b4d20cd feat: initial explist v2 setup (#6382)
- 5b0a400 feat: Generate metrics using a determined command. (#6222)
- 2e4c61e chore: allocate less address space in e2e gke cluster tests [DET-9231] (#6405)
- ce8aa20 fix: set default value for create model modal (#6409)
- f5deb53 docs: add docs for torch profiler support [MLG-179] (#6400)
- a2fa442 fix: accurately count slots from scheduled jobs by allocationID (#6346)
- 0b1b87c fix: respect default pool task container defaults [DET-9241] (#6410)
- 9e08ff4 fix: allow
det checkpoint download
for completed checkpoints only [DET-3786] (#6377) - 49bef9e docs: add a simpler proxy configs example, fix typo in proxy ports guide. (#6374)
- 172fe17 fix: slot enable/disable for HPC cluster (#6396)
- c30922c chore: implement experiment details header section using InfoBox [WEB-1064] (#6399)
- fda950f chore: replace modal hooks with UIKit modals in model and model version pages (#6378)
- 3405aa7 chore: remove deprecated
det-deploy
executable [DET-5171] (#6401) - 27d57f3 ci: retry deploying and deleting AWS clusters (#6392)
- 6841dad Revert "chore: allocate less address space in e2e gke cluster tests [DET-9231] (#6393)" (#6398)
- aad581e fix: gently handle
not found page
in model and model version pages (#6394) - a436afc chore: allocate less address space in e2e gke cluster tests [DET-9231] (#6393)
- 3f8bfee docs: Add resource_manager.launcher_jvm_args [DET-9230] (#6395)
- 161bbdf feat: Add API for experiment list columns [WEB-979] (#6274)
- 775d062 fix: checkpoint insertion slow performance on big databases [DET-9219] (#6390)
- b47a411 feat: specify disk size and type for dynamic agents in gcp deployment [MLG-224] (#6384)
- 8960cc7 feat: add typography component (#6385)
- 13ca474 feat: refactor compareTrials to support time-series [WEB-999] (#6317)
- efbbb92 fix: Pin version of pydata-sphinx-theme (#6389)
- 7aabd87 fix: Increase timeout to improve situation re: [DET-9125] (#6369)
- 7d56180 chore: Fix router/app circular dependency (#6370)
- d322a90 chore: add better sync/map pkg for Go (#6299)
- cdf651f feat: Add filters options to multi-experiment actions [WEB-982] (#6351)
- d2b43de fix: get 'slot list' going for dispatcher (#6348)
- b4a21a3 fix: replace unnecessary main tags (#6362)
- 161ffdb feat: add support for loading state for the dropdown (#6371)
- fa92f8d fix: set unique port reqs on trial's shallow copied task spec (#6381)
- 6a2a16a feat: propogate gcp cluster labels to dynamic agents [MLG-365] (#6297)
- 4d69dc7 chore: backport wildcard actor lookup (#6375)
- 86882ce chore: bumpenvs and bump master amis (#6373)
- 5fa1ad1 feat: added RBAC for DeleteModel/Version [DET-9048] (#6339)
- 356286f fix: handle duplicate workspace name DB errors [DET-8808] (#6361)
- 12e36c1 fix: handles race on duplicate model names (#6364)
- c115758 fix: update ubuntu ami tables (#6363)
- 33d7708 ci: skip package-and-push-system-dev for web and docs PRs (#6332)
- 258c02d chore: change docs theme to sphinx-book-theme (#6323)
- 807068d fix: isEqual utility function (#6367)
- 8027a00 fix: log and skip invalid request(trial) id, don't kill experiment (#6349)
- 2f57cd6 fix: GC task spec's ResourcesConfig should be valid (#6366)
- fc923bb chore: bump version: 0.21.0-dev0 -> 0.21.1-dev0
- 3be67d9 docs: add release notes for 0.21.0 (#6365)
- 26150c4 fix: experiment table offset (#6353)
- 8a201a5 feat: metrics streaming API (#6267)
- 21793dd fix: dataBounds was never updated, use unzoomedBounds for chartgrid sync [WEB-1039] (#6359)
- 9725ed3 chore: tweak task_container_defaults merging logic to append sbatch_args (#6360)
- a187669 chore: Switch useSettings to observable [WEB-801] (#6202)
- 0e89282 chore: Add Accordion component to UI Kit (#6352)
- aeee6b8 fix: pytorch loading accepts trial_class kwarg (#6356)
- 4c63fc8 style: fix button arrangement (#6357)
- 7159f3f feat: Some experiment actions process multiple experiment IDs [WEB-982] (#6194)
- 986f5e7 fix: use new router store for login redirect (#6322)
- edd7b6a fix: workspace modal filter (#6341)
- 96dffbe fix: remove flaky test in test_convergence.py (#6336)
- 2efefab fix: avoid unnecessary refresh in model version node (#6344)
- 181ea2b fix: better debug message for invalid request id (#6340)
- 51edc7f fix: fix patchSlotState util (#6345)
- c920267 docs: add algolia search [MLG-367] (#6137)
- a9e9deb fix: remove extra markResourcesStarted call (#6331)
- 226745f fix: close open allocations terminates, too (#6283)
- f2ab6ef chore: reduce GPU count for GCP deployments (#6337)
- 2b6abe3 chore: remove old slots patch endpoint (#6330)
- 3e925bc fix: double scroll bars in chart grid (#6316)
- 6556c2f fix: wording in pytorch trainer guide (#6342)
- 18fa596 fix: skip op assert in test mode (#6338)
- e81987a chore: add experiment search endpoint skeleton (#6275)
- c45bf55 fix: Workspace members, isFiltered / reset tableOffset (#6335)
- 1a4d897 fix: Indicate and clear user search filter [WEB-1074] (#6309)
- 3bcb21f fix: Learning curve on validation-only experiment (#6334)
- a1392af fix: font and layout fix in project card (#6252)
- 1c80094 docs: update index pages with minor edits (#6315)
- 0ceae73 docs: fix release note (#6329)
- 45f4718 fix: Fix user store (#6327)
- 7033ece feat: show log viewer agent filter options even when there is one option (#6324)
- 49a4ed2 docs: Add usage examples for Loadable (#6325)
- 61513e4 fix: workspace card issue (#6320)
- acee4e3 feat: global port registry for unique port offset [DET-8954] (#6148)
- f15aa54 fix: Prevent closest point pluging from focusing hidden points (#6314)
- 87f72c5 chore: fix allocation req params annotation (#6305)
- a8f1357 chore: add modal to UI kit (#6188)
- 0effee9 test: attempt to make TestIdleTimeoutWatcher more reliable [DET-8974] (#6319)
- 29ddce4 fix: expand torch + distutils workaround (#6313)
- 8944890 chore: remove old master logs endpoint (#6265)
- b88b378 chore: avoid usage of k8s NamespaceAll easing k8s permission requirements [DET-9123] (#6248)
- cac99bc docs: Improve task_container_defaults docs [DET-9120] (#6306)
- 7654622 chore: prepare for RBACed agents/slots enable/disable [DET-9156] (#6310)
0.21.0
Release Notes
Changelog
- b0e47da chore: bump version: 0.21.0-rc4 -> 0.21.0
- 7b09cc9 docs: add release notes for 0.21.0 (#6365)
- 0073a16 chore: bump version: 0.21.0-rc3 -> 0.21.0-rc4
- 1837993 chore: remove old slots patch endpoint (#6330)
- b61169c fix: pytorch loading accepts trial_class kwarg (#6356)
- c45cf5c chore: bump version: 0.21.0-rc2 -> 0.21.0-rc3
- 38d7a88 fix: use new router store for login redirect (#6322)
- 47cf4c0 fix: workspace modal filter (#6341)
- 87c0c66 fix: fix patchSlotState util (#6345)
- bedc738 fix: Workspace members, isFiltered / reset tableOffset (#6335)
- e0d30bb fix: skip op assert in test mode (#6338)
- 0fdee1e chore: bump version: 0.21.0-rc1 -> 0.21.0-rc2
- 98b342e chore: prepare for RBACed agents/slots enable/disable [DET-9156] (#6310)
- d3b9c85 fix: remove extra markResourcesStarted call (#6331)
- 403967f chore: reduce GPU count for GCP deployments (#6337)
- 4d54710 fix: Indicate and clear user search filter [WEB-1074] (#6309)
- a70452f fix: Learning curve on validation-only experiment (#6334)
- b5a1e88 chore: bump version: 0.21.0-rc0 -> 0.21.0-rc1
- 69dbc40 fix: Fix user store (#6327)
- aad5947 feat: show log viewer agent filter options even when there is one option (#6324)
- f85ab20 fix: workspace card issue (#6320)
- eef0be0 fix: expand torch + distutils workaround (#6313)
- 003bd74 chore: avoid usage of k8s NamespaceAll easing k8s permission requirements [DET-9123] (#6248)
- 93d9e81 chore: bump version: 0.21.0-dev0 -> 0.21.0-rc0
- eef6069 chore: bump version: 0.20.2-dev0 -> 0.21.0-dev0
- f0dd082 fix: tb_callback not found (#6311)
- 9e7a636 fix: fix disable slot unexpected msg [DET-9155] (#6308)
- 6b5eabf feat: cli expect bindings.APIHttpError (#6278)
- 0841794 ci: retry large pip installs (#6260)
- a3ab995 chore: lock api state for backward compatibility check
- fe966c1 feat: remove static agents from gcp deployments [MLG-366] (#6281)
- fb2f515 chore: deprecated sdk methods (#6268)
- fc885e7 docs: PyTorch Trainer (#6177)
- fec6cf2 feat: load PyTorchTrials with kwargs (#6286)
- 9d62826 feat: checkpoint delete in python SDK [MLG-377] (#6230)
- 640af40 feat: log gpu information in task logs [MLG-234] [DET-5274] (#6303)
- f006a4a fix: report_training_metrics on parallel (#6304)
- 09df35c docs: remove autodoc test snippet (#6301)
- b7b07f8 fix: allocation rekill shouldn't use time.UTC in fine time checks (#6298) [DET-9149]
- 1016033 fix: do not restore stopping_killed experiments (#6300) [DET-9153]
- fcdb8e0 feat: Make Torchwriter behave like Pytorch SummaryWriter (#5916)
- 0e024ca feat: show TrialWorkloads table when f_chart is on [WEB-1025] (#6269)
- 8504b2e ci: bump gke version to 1.23 (#6292)
- b331c58 fix: fix err status code for ResourceAllocationRaw (#6294)
- 4473ef3 fix: update webui static gzip path regex (#6285)
- ed6e0d0 fix: Run environment bump scripts after environments fix (#6287)
- 6e8d238 docs: add instructions on how to use glasbey colors (#6266)
- 6aa04b9 chore: document dropping K80 support [MLG-385] (#6272)
- 51cb2a7 build(deps): bump actions/setup-python from 3 to 4 (#6084)
- cce1237 build(deps): bump actions/setup-go from 3 to 4 (#6244)
- 75662c2 chore(deps): bump mellium.im/sasl from 0.3.0 to 0.3.1 in /master (#5668)
- 2f2390f fix: login validation (#6284)
- 3d56934 fix: avoid explosion of prom series per actor with better labelling (#6280)
- 073e6ac chore: update code for new flake8 upgrade (#6276)
- 1acb9fd fix: reflect data immediately in model page and model version page (#6277)
- 447a00d fix: remove perpetual spinner (#6279)
- 87cf260 fix: Load multi-trial learning curve with only training metrics [WEB-1026] (#6220)
- 9bb9caa test: initialize certs properly in an auth utility [DET-9104] (#6271)
- 1888c33 chore: remove deprecated validation callbacks [MLG-21] (#6215)
- 3e5c863 chore: do not error on shared_fs delete for nonexistent paths [MLG-378] (#6247)
- 44dcfc7 fix: browser warnings (#6258)
- 51528d8 fix: handle results of polling updates in allocation actor (#6264) [DET-9126]
- e444f0e test: Small refactor of CLI experiment tests (#6237)
- 37794fa chore: take advantage of CheckpointContext handling sharded data [MLG-13] (#6200)
- ad02a93 chore: update requirement for Launcher version to 3.2.4 (#6261)
- c6b7dde docs: retitle api reference docs (#6257)
- 935ec57 chore: minor cleanup in useValueMemoizedObservable (#6256)
- 6227782 docs: Slurm details updates [FOUNDENG-512] (#6263)
- d89d98a fix: Restore proper SIGTERM handling for Slurm/PBS [DET-9124] (#6250)
- 36ad652 feat: use Loadable in ChartGrid and LineChart (#6239)
- cf4d47d fix: update User Management table on new user [WEB-1011] (#6231)
- fd770ff fix: Nameplate icon width bug (#6251)
- eb954b9 fix: correct cluster slots card size [WEB-1028] (#6249)
- 672223c chore: update npm libraries (#6241)
- 555f791 chore: add merge func for task container defaults (#6236)
- d808168 chore: move from K80 to T4 GPU type defaults (#6224)
- 08b0444 chore: bump version: 0.20.1-dev0 -> 0.20.2-dev0
- 0a69a8c docs: add release notes for 0.20.1 (#6235)
- 9950726 fix: Chart focus maintained as series changes (#6240)
- 352a08b fix: permission issue in member tab in workspace (#6208)
- 821994c docs: use theme templates better (#6226)
- ca330f5 fix: ensures docker container is cleaned up if a container is killed after start, but before run [DET-8988] (#6223)
- 7ddffdb ci: update ESLint to v8 (#6238)
- e3b6071 chore: preparing to integrate UserByExternalToken with SCIM support (#6201)
- 045d3e8 fix: CSS inside router (#6234)
- 2138c0c docs: update script for new core api user guide (#6228)
- 6f3463c docs: add HPC Launcher job exit code known issue (#6233)
- 0b9698a feat: remove TODOS from users loadable "loading" state (#6104)
- 6ec8f55 feat: hide non-global roles on user/group modal (#6189)
- e9c2259 chore: Update max_concurrent_trials to have a default value of 16 (#6221)
- a74ccd0 feat: disable user management if external sessions are enabled [DET-8791] (#6123)
- b6f7163 ci: update
Stylelint
to v15 (#6218) - d5665e1 chore: Remove deprecated react-router [WEB-580] (#6157)
- b54eda8 docs: minor edits to commands and shells (#6227)
- 69c36b3 feat: show unique HPs for Custom Searcher trials on experiment table [MLG-343] (#6192)
- e7b04c7 feat: collect timing data for actor system internals (#6216)
- 7dd63d0 feat: Enable username and display name search on cluster "user" list [WEB-1009] (#6198)
- fea9aa4 fix: username case sensitive issue (#6196)
- 1c8c473 fix: log level filter width (#6217)
- 5a348d6 chore: create Nameplate component (#6153)
- 2b3da4d chore: remove type/interface member ordering rule from eslint (#5896)
- 10f3914 fix:
det user change-password
for user's own passwords. (#6214) - 35056ef docs: add current rbac limitations to docs (#6212)
- b083ad1 fix: stop ws/prj failed notifications (#6210)
- a1c3d93 fix: flip undefined check logic for url params in ts bindings (#6213)
- b429b3f fix: project card issue (#6185)
- ac586f9 feat: update the regex to match the new build output format (#6163)
- e6813e1 fix: default tag in AWS stack updates (#6207)
- 11fbfb7 feat: PyTorch Trainer (#6059)
- efdba0b docs: clarified rbac and k8s docs (#6169)
- 87b3cfb chore: move assigned event to correct location (#6197) [DET-9012]
- 2269d78 perf: improve checkpoint size performance [DET-8960] (#6171)
- 9240dfc build: Eliminate CRACO, use Vitest for testing (#6021)
- c52e317 perf: remove
Trialv1.latest_training
field. (#6155) - 5b39a94 fix: adding exit reason to tensorboard allocation signal (#6150)
- 1571089 fix: support mixed CPU/GPU k8s quotas (#6195)
- b4a1081 fix: modified historical task allocation endpoint to instead aggregate allocations (#6184)
- d13169c refactor: use generic Go set type in more places (#6179)
- 77f8ec4 chore: bumpenvs reflecting tf27 not having pytorch (#6182)
- 744bd1b fix: remove get-deps-bindings target from top level makefile (#6191)
- a3e7b89 fix: add
archived
param and simplify query (#6175) - 9f43ad1 fix: exp move modal (#6183)
- 3bc5524 fix: don't print ':' when err msg is empty (#6190)
- 47c139d fix: pass workspace ID when creating tensor board from WebUI [WEB-1019] (#6186)
- 94d37ab fix: Continue Trial flow does not take the new
max_length
(#6168) - 5c3d6ae fix: Close expiriment fork/ continue trial modal properly (#6174)
- e8d4dcb build: eliminate java dependency for typescript swagger bindings (#6139)
- ba87f94 revert: reflag new chart experience (#6181)
- dcf2586 feat: add labels to GCP instances created with det deploy gcp [MLG-170] (#6147)
- e8858a6 fix: hide Foldable menu options when button is visible (#6178)
- 0849631 refactor: replace user store with observables [WEB-799] (#6140)
- b3e7ecc chore: pass Labels/project/workspace to TaskSpec (#6172)
- 455f536 fix: useMemo does not depend on trial having been loaded (#6173)
- 7d8b711 fix: separate Router and authCheck (#6170)