Group sequential tests tend to have higher power compared to always-valid
tests. They require you to specify an expected sample size up front. Your
estimate of the expected sample size doesn’t need to be exact. If you can
give a reasonable estimate, then you should use a group sequential test.
Group Sequential Tests
If you give an expected sample size when setting up the experiment, Confidence uses group sequential tests to calculate valid statistical results while an experiment is running. The group sequential test optimally exploits the dependence between the tests at different points in time. It allocates the overall false positive rate that the experiment can spend across the multiple tests performed over time. How much of the false positive rate that each analysis spends depends on the amount of information that’s available at that time point relative to the expected amount of information at the end of the experiment.Always-Valid Inference
If you don’t give an expected sample size, Confidence can’t use the group sequential test and instead resorts to an always-valid approach. These tests guarantee that the false positive rate doesn’t exceed the intended level, but usually have lower power than the group sequential tests. This means that it is harder to find effects.Analyze Results Sequentially
- App
- API
Sequential tests are always enabled for rollouts.To use a sequential testing strategy for an A/B test or an analysis workflow:
- Go to Confidence and select A/B Tests or Analysis Workflows on the left sidebar.
- Select the experiment that you want to analyze sequentially.
- On the right sidebar, click Results > Results Settings.
- Select Continuously.
- Optional. On the right sidebar, click Results > Configure Statistics. Enter the Expected sample size.
If the expected sample size is present, deterioration checks use the group sequential test. The same applies even if you choose to view the results after the experiment ends.
References
- C. Jennison and B. W. Turnbull (2000) “Group Sequential Methods with Applications to Clinical Trials,” Chapman & Hall/CRC.
- M. Schultzberg and S. Ankargren (2023) “Choosing a Sequential Testing Framework—Comparisons and Discussions,” Spotify Engineering Blog, https://engineering.atspotify.com/2023/03/choosing-sequential-testing-framework-comparisons-and-discussions/.
- G. Y. Zou, A. Donner, and N. Klar (2005) “Group sequential methods for cluster randomization trials with binary outcomes.” Clinical Trials.
- I. Waudby-Smith, D. Arbour, R. Sinha, E. H. Kennedy, and A. Ramdas (2023) “Time-uniform central limit theory and asymptotic confidence sequences,” https://arxiv.org/pdf/2103.06476v8.pdf.

