We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
作者,你好! 我自己在使用RWKV模块的时候,使用torch.optim.lr_scheduler.StepLR(optimizer, step_size=args.n_steps, gamma=args.gamma),在学习率更新后提升很小。我看您论文中是使用CosineAnnealingLR(optimizer_G, total_iteration, eta_min=1.0e-6),我想问一下total_iteration和总样本数有关吗?在你的任务中总样本数是多少呢? 谢谢!!!
The text was updated successfully, but these errors were encountered:
total iteration是模型更新的次数,与样本数有关,我是通过观察验证集效果来决定的。
Sorry, something went wrong.
No branches or pull requests
作者,你好!
我自己在使用RWKV模块的时候,使用torch.optim.lr_scheduler.StepLR(optimizer, step_size=args.n_steps, gamma=args.gamma),在学习率更新后提升很小。我看您论文中是使用CosineAnnealingLR(optimizer_G, total_iteration, eta_min=1.0e-6),我想问一下total_iteration和总样本数有关吗?在你的任务中总样本数是多少呢?
谢谢!!!
The text was updated successfully, but these errors were encountered: