When I run distributed training and the model is not already downloaded to local disk, the ranks compete for the download and crash.
I am looking for a fix such that:
- If the model is not yet downloaded on disk, only one rank downloads it. The other ranks wait until the download is complete.
- If the model is already on disk, all ranks load it simultaneously without waiting for each other.
- The solution is universal. In other words, I can still instantiate the model via AutoModel rather than through a wrapper function, and I don't have to write a bunch of if-else statements every time I create a model.
I wasn't able to find anything that achieves this right now. A very simple solution might be to add lock files when downloading a model, so that other ranks wait until the download completes and then use the downloaded files directly.
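To make the lock-file idea concrete, here is a minimal sketch using only the Python standard library. It is not how any particular library implements its cache: `download_lock`, `ensure_downloaded`, and the `download_fn` callback are hypothetical names introduced for illustration. It assumes a POSIX system (`fcntl.flock`) and that all ranks share the same filesystem; advisory locks may not behave reliably on some network filesystems.

```python
import contextlib
import fcntl
import os
from pathlib import Path


@contextlib.contextmanager
def download_lock(lock_path):
    """Hold an exclusive advisory lock on lock_path (POSIX only)."""
    Path(lock_path).parent.mkdir(parents=True, exist_ok=True)
    with open(lock_path, "w") as lock_file:
        # Blocks until whichever process currently holds the lock releases it.
        fcntl.flock(lock_file, fcntl.LOCK_EX)
        try:
            yield
        finally:
            fcntl.flock(lock_file, fcntl.LOCK_UN)


def ensure_downloaded(target_path, download_fn):
    """Return target_path, calling download_fn at most once across ranks.

    download_fn is a hypothetical callback that writes the model file to
    the path it is given.
    """
    target = Path(target_path)
    # Fast path: the file is already on disk, so no rank takes the lock.
    if target.exists():
        return target
    with download_lock(str(target) + ".lock"):
        # Re-check after acquiring the lock: another rank may have
        # finished the download while this one was waiting.
        if not target.exists():
            tmp = target.with_name(target.name + ".tmp")
            download_fn(tmp)
            # Publish atomically so readers never see a partial file.
            os.replace(tmp, target)
    return target
```

If logic like this lived inside the library's download path rather than in user code, every rank could keep calling AutoModel as usual: the first rank to take the lock would download, the others would block on the lock and then find the file present, and once the file exists no rank would wait at all.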
import random
lower = int(input("Enter the lower bound: "))
upper = int(input("\nEnter the upper bound: "))
if lower >= upper:
    print('\nUpper bound must be greater than lower bound')
    exit()
num = random.randint(lower, upper)
chances = 3
print(f"\nYou have {chances} chances to guess the number!\n")
count = 0
guessed = False
while count < chances:
    count += 1
    guess = int(input("Guess a number: "))
    if num == guess:
        print("Congratulations you did it in ", count, " try")
        guessed = True
        break
    elif num > guess:
        print("You guessed too small.")
    elif num < guess:
        print("You Guessed too high.")
if not guessed:
    print("\nThe number is %d" % num)
    print("\nBetter Luck Next time!")