Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Evaluation 时使用并行可能不会完整地遍历一遍数据 #119

Open
KYLN24 opened this issue Sep 14, 2023 · 1 comment
Open
Assignees

Comments

@KYLN24
Copy link
Collaborator

KYLN24 commented Sep 14, 2023

猜测是并行 size 或 batch size 设置不当(无法整除数据量),可能会有数据被重复计算。

@KYLN24 KYLN24 self-assigned this Sep 14, 2023
@KYLN24
Copy link
Collaborator Author

KYLN24 commented Oct 17, 2023

发现问题了,在这里记一下,是torch的DistributedSampler在drop_last=False的时候会将最后一个batch pad 到 batch_size,因此 CollieBatchSampler 后面做的处理没有生效

不对,不是这个问题

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant