Active Measurement: Efficient Estimation at Scale

Max Hamilton*
UMass Amherst
Jinlin Lai*
UMass Amherst
Wenlong Zhao
UMass Amherst
Subhransu Maji
UMass Amherst
Daniel Sheldon
UMass Amherst
* denotes equal contribution
[Paper] [Code]

Abstract

AI has the potential to transform scientific discovery by analyzing vast datasets with little human effort. However, current workflows often do not provide the accuracy or statistical guarantees that are needed. We introduce active measurement, a human-in-the-loop AI framework for scientific measurement. An AI model is used to predict measurements for individual units, which are then sampled for human labeling using importance sampling. With each new set of human labels, the AI model is improved and an unbiased Monte Carlo estimate of the total measurement is refined. Active measurement can provide precise estimates even with an imperfect AI model, and requires little human effort when the AI model is very accurate. We derive novel estimators, weighting schemes, and confidence intervals, and show that active measurement reduces estimation error compared to alternatives in several measurement tasks.

Citation

    
    @inproceedings{hamilton2025active,
        title={Active Measurement: Efficient Estimation at Scale},
        author={Max Hamilton and Jinlin Lai and Wenlong Zhao and Subhransu Maji and Daniel Sheldon},
        booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
        year={2025},
        url={https://openreview.net/forum?id=nFc38gSYze}
    }