Preprint
Article

Stratied Finite Empirical Bernstein Sampling

This version is not peer-reviewed.

Submitted:

17 January 2019

Posted:

21 January 2019

Read the latest preprint version here

Abstract
We derive a concentration inequality for the uncertainty in strati ed random sampling. Minimising this inequality leads to an iterated online method for choosing samples from the strata. The inequality is versatile and considers a range of factors including: the data ranges, weights, sizes of the strata, as well as the number of samples taken, the estimated sample variances and whether strata are sampled with or without replacement. We evaluate the improvement this method reliably offers against other methods over sets of synthetic data, and also in approximating the Shapley value of cooperative games. The method is seen to be competitive with the performance of perfect Neyman sampling, even without prior information on strata variances. We supply a multidimensional extension of our inequality and discuss some future applications.
Keywords: 
;  ;  ;  
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Downloads

828

Views

879

Comments

0

Subscription

Notify me about updates to this article or when a peer-reviewed version is published.

Email

Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2025 MDPI (Basel, Switzerland) unless otherwise stated