Skip to content

[RnD] Investigate Early Stopping in AA Tests #145

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
5 tasks
tikhomirovd opened this issue Feb 18, 2025 · 0 comments
Open
5 tasks

[RnD] Investigate Early Stopping in AA Tests #145

tikhomirovd opened this issue Feb 18, 2025 · 0 comments
Assignees
Labels
Milestone

Comments

@tikhomirovd
Copy link
Collaborator

🚀 Feature Proposal

RnD: Investigate Early Stopping in AA Tests

Motivation

Currently, the AA test in HypEx does not include an early stopping mechanism. Introducing early stopping can help optimize experiment duration by identifying stable convergence points. However, it is essential to research when early stopping is valid, how to detect convergence effectively, and determine the best iteration for stopping without introducing bias.

Feature Description

  • Conduct research on when early stopping can be applied in iterative AA tests.
  • Analyze different stopping criteria (e.g., confidence interval stabilization, sequential testing methods, etc.).
  • Determine the best iteration to stop the test based on statistical evidence.
  • Document findings with recommendations for practical implementation.

Potential Impacts

  • Reduced experiment runtime while maintaining statistical integrity.
  • More efficient resource utilization for AA testing.
  • Improved guidelines for early stopping in HypEx experiments.

Alternatives

  • Continue running all iterations without early stopping (longer but stable testing).
  • Use heuristics instead of rigorous statistical stopping criteria.

Additional Context

  • Explore statistical techniques such as sequential analysis, Bayesian monitoring, and adaptive stopping.
  • Compare results across different datasets to identify generalizable stopping conditions.
  • Ensure findings align with best practices in experiment design.

Checklist

  • Conduct research on early stopping criteria.
  • Analyze when early stopping is statistically valid.
  • Identify the optimal stopping iteration.
  • Document findings and recommendations.
  • Review and refine conclusions before implementation.
@tikhomirovd tikhomirovd added this to the 1.0.3 milestone Feb 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants