Preprint
Article

Stratified Finite Empirical Bernstein Sampling

This version is not peer-reviewed

Submitted:

17 January 2019

Posted:

21 January 2019

Abstract
We derive a concentration inequality for the uncertainty in stratified random sampling. Minimising this inequality leads to an iterated online method for choosing samples from the strata. The inequality is versatile and accounts for a range of factors: the data ranges, weights and sizes of the strata, the number of samples taken, the estimated sample variances, and whether strata are sampled with or without replacement. We evaluate the improvement this method offers over other methods on sets of synthetic data, and in approximating the Shapley value of cooperative games. The method is seen to be competitive with perfect Neyman sampling, even without prior information on strata variances. We supply a multidimensional extension of our inequality and discuss some future applications.
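
For readers unfamiliar with the Neyman sampling baseline mentioned in the abstract, the following is a minimal sketch of classical Neyman allocation, which assumes the strata standard deviations are known in advance. It is background only and is not the method derived in the preprint; the function name `neyman_allocation` and the example numbers are hypothetical, chosen purely for illustration.

```python
def neyman_allocation(weights, std_devs, total_samples):
    """Split a sample budget across strata in proportion to weight * std dev.

    weights: relative stratum weights (for example, stratum size / population size).
    std_devs: true stratum standard deviations, assumed known (the "perfect" case).
    Returns fractional allocations; rounding to integers is left to the caller.
    """
    scores = [w * s for w, s in zip(weights, std_devs)]
    total = sum(scores)
    if total == 0:
        # Every stratum has zero spread: fall back to proportional allocation.
        scores = list(weights)
        total = sum(scores)
    return [total_samples * score / total for score in scores]


# Hypothetical example: three strata, with the smallest stratum the most variable.
allocation = neyman_allocation([0.5, 0.3, 0.2], [1.0, 2.0, 4.0], 100)
```

In this example the most variable stratum receives the largest share of the budget despite having the smallest weight. The preprint's setting differs in that the variances are not known beforehand and must be estimated as samples arrive.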
Subject: Computer Science and Mathematics - Probability and Statistics
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

© 2024 MDPI (Basel, Switzerland) unless otherwise stated