TensorHive 1.1.0

@roscisz roscisz released this 02 Feb 11:55
5b50245

What's Changed

Main changes:

  • Extended resource protection mechanism (#347)
    • configurable protection levels, including a new mode (level 2) that disallows using GPUs without a reservation, even when no other user's reservation is violated
    • violations are detected for all available resources, not only for current reservations
    • configurable process killing levels (0 - no killing, 1 - try to kill as the process owner, 2 - kill using sudo)
    • multiple violated resources are gathered into one message in both the mailer and the message sending handlers
    • violation messages are sent to user TTYs across multiple hosts
  • Reverse proxy support (#329)
  • Persisting selected resources in calendar view (#353)
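
The process killing levels listed above can be illustrated with a small sketch. This is not TensorHive's implementation: the function name `build_kill_command` and the use of plain `kill` commands are assumptions made purely to show how the three levels differ.

```python
from typing import List, Optional


def build_kill_command(pid: int, kill_level: int) -> Optional[List[str]]:
    """Hypothetical helper: map a killing level to the command to run.

    Levels follow the release notes:
      0 - no killing
      1 - try to kill as the process owner
      2 - kill using sudo
    """
    if kill_level == 0:
        # Killing disabled: the violation is only reported.
        return None
    if kill_level == 1:
        # Assumed to be executed in a session belonging to the process owner.
        return ["kill", str(pid)]
    if kill_level == 2:
        # Assumed to be executed by an account with sudo rights.
        return ["sudo", "kill", str(pid)]
    raise ValueError(f"unsupported killing level: {kill_level}")
```

For example, `build_kill_command(1234, 2)` returns `["sudo", "kill", "1234"]`, while level 0 returns `None`.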

Minor changes:

  • dependency version upgrades
  • bugfixes

Configuration Changes

The standard upgrade process using pip or git pull will work. However, take the following changes in main_config.ini into account:

  • the enabled option under [protection_service] has been removed and is no longer relevant
  • the level option has been introduced under [protection_service] (see description)
  • the kill_processes option has been introduced under [protection_service] (see description)
  • the url_schema and url_port options have been introduced under [api] (see description)
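
When upgrading, it may help to check an existing main_config.ini against the changes listed above. The following is a minimal sketch using Python's standard configparser module. The helper name `check_config` and the sample values (`true`, `1`, `http`) are illustrative assumptions, not TensorHive defaults; only the section and option names come from these release notes.

```python
import configparser

# Options introduced in 1.1.0, as listed in the release notes.
NEW_OPTIONS = [
    ("protection_service", "level"),
    ("protection_service", "kill_processes"),
    ("api", "url_schema"),
    ("api", "url_port"),
]

# Sample pre-upgrade config; the values are made up for illustration.
SAMPLE = """
[protection_service]
enabled = true
level = 1

[api]
url_schema = http
"""


def check_config(text: str) -> list:
    """Return human-readable notes about options to remove or add."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    notes = []
    # The old on/off switch no longer has any effect.
    if parser.has_option("protection_service", "enabled"):
        notes.append("remove obsolete [protection_service] enabled")
    # Report each new option that is not set yet.
    for section, option in NEW_OPTIONS:
        if not parser.has_option(section, option):
            notes.append(f"consider setting [{section}] {option}")
    return notes


if __name__ == "__main__":
    for note in check_config(SAMPLE):
        print(note)
```

To check a real file, read its contents and pass the text to `check_config`; an empty list means none of the listed changes apply.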

Full Changelog: 1.0.0...1.1.0