You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
provider incorrectly defaults to the last (dict sorted) GPU model in the SDL model list when forming order request before handing it to the bid price script
#139
Open
andy108369 opened this issue
Nov 3, 2023
· 1 comment
The provider, despite supporting the correct GPU model and bidding accordingly, erroneously sets an unsupported GPU model when forming the order request. This error occurs because the provider defaults to the last (dict sorted) GPU model listed in the SDL, which may not be supported or may even be non-existent.
This leads to the bid price script calculating bids based on this incorrect GPU model, resulting in either inaccurate bids or a failure to bid if the provider has not set pricing for this model.
Steps to Reproduce:
Have a provider with some GPU (e.g., a100).
Create an SDL file listing multiple GPU models, placing a non-existent or random models (e.g., - model: akgjkajgksag) and the supported model (a100) further down the list.
Broadcast the SDL to initiate bidding from the provider.
Review the order request and observe that it incorrectly specifies the GPU model from the SDL found last (after dict sorting), e.g., "model": "akgjkajgksag", not the supported a100.
Notice that the bid price script fails to calculate a price due to the absence of pricing for the non-existent model akgjkajgksag.
Expected Behavior:
The provider should identify and select the GPU model it actually supports when forming the order request. This correct model should then be used by the bid price script for price calculation, ignoring any models that are not supported.
Actual Behavior:
The provider incorrectly selects the last (dict sorted) GPU model listed in the SDL for the order request. This misstep leads to the bid price script either not calculating a price or calculating an incorrect price, as it encounters an unsupported or non-existent GPU model.
Example
Provider attributes: supported GPU - a100
$ provider-services query provider get akash1c6rsz4f59nkus3s5qauxxh969j2mtkkn2clk2e -o text
attributes:
...
- key: capabilities/gpu/vendor/nvidia/model/a100
value: "true"
SDL Contents:
Notice, v100 model here would be the last model when dict (alphabetically) sorted. And a100 is also part of the list so that provider with a100 bids on it.
The deployment order Provider forms (before passing it to the bid price script):
As demonstrated, the received order request incorrectly specifies the v100 model (which would be the last when dict sorted from the SDL models list) instead of the a100 model that the provider supports.
* feat(bid-script): use highest price when model detection fails
Partially addresses akash-network/support#139
* chore(bid-script): add more debug logs
Developed a partial workaround for the bid price script that sets the GPU price to the highest (out of all set by the provider owner via price_target_gpu_mappings) when GPU model detection method fails due to issue-139.
Follow these steps to upgrade your bid price script:
andy108369
changed the title
provider incorrectly defaults to first SDL-listed GPU model when forming order request before handing it to the bid price script
provider incorrectly defaults to the last (dict sorted) GPU model in the SDL model list when forming order request before handing it to the bid price script
Nov 3, 2023
Environment:
0.4.6
0.26.1
Issue Summary:
The provider, despite supporting the correct GPU model and bidding accordingly, erroneously sets an unsupported GPU model when forming the order request. This error occurs because the provider defaults to the last (dict sorted) GPU model listed in the SDL, which may not be supported or may even be non-existent.
This leads to the bid price script calculating bids based on this incorrect GPU model, resulting in either inaccurate bids or a failure to bid if the provider has not set pricing for this model.
Steps to Reproduce:
a100
).- model: akgjkajgksag
) and the supported model (a100
) further down the list."model": "akgjkajgksag"
, not the supporteda100
.akgjkajgksag
.Expected Behavior:
The provider should identify and select the GPU model it actually supports when forming the order request. This correct model should then be used by the bid price script for price calculation, ignoring any models that are not supported.
Actual Behavior:
The provider incorrectly selects the last (dict sorted) GPU model listed in the SDL for the order request. This misstep leads to the bid price script either not calculating a price or calculating an incorrect price, as it encounters an unsupported or non-existent GPU model.
Example
Provider attributes: supported GPU -
a100
SDL Contents:
The deployment order Provider forms (before passing it to the bid price script):
Additional information
The model provider picks is the last model after dict (alphabetically) sorted.
dict (alphabetical) sorting:
The text was updated successfully, but these errors were encountered: