Can we use sn.mean() without providing the bounds? #161
In sn.analysis, is it possible to run the analysis without providing bounds, either when calling the query or when pre-processing the data (clamp, resize, impute)?
Replies: 1 comment 3 replies
There is no feature for automatic bounds calculation in SmartNoise. However, if you clamp, the bounds will be propagated up through the computation.
Bounds are required because the noise scale is calibrated to the maximum influence that one user can have on the non-private statistic. For instance, if you are computing the sum, adding or removing one user can change the sum by at most max(|lower_bound|, |upper_bound|). In other words, what is the most the query result can change if we change one user? This is called the "sensitivity" of the query. If the data is unbounded, then the sensitivity is also unbounded, and the necessary noise scale to make the …
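To make the sensitivity argument concrete, here is a minimal sketch in plain Python. It does not use the SmartNoise API: the names `sum_sensitivity`, `laplace_noise`, and `noisy_sum` are hypothetical and exist only for illustration, and the sketch assumes the Laplace mechanism with noise scale sensitivity / epsilon. Naive floating-point sampling like this is not safe for real releases, so treat it as an explanation of why bounds are needed, not as an implementation.

```python
import random


def sum_sensitivity(lower, upper):
    """Largest change in a clamped sum when one user is added or removed."""
    if lower > upper:
        raise ValueError("lower must not exceed upper")
    return max(abs(lower), abs(upper))


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponentials."""
    if scale == 0:
        return 0.0
    # Two i.i.d. exponentials with mean `scale`; their difference is Laplace(0, scale).
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_sum(values, lower, upper, epsilon, rng=None):
    """Clamp each value to [lower, upper], sum, then add calibrated noise."""
    rng = rng or random.Random()
    # Clamping is what makes the sensitivity finite.
    clamped = [min(max(v, lower), upper) for v in values]
    scale = sum_sensitivity(lower, upper) / epsilon
    return sum(clamped) + laplace_noise(scale, rng)


ages = [23, 35, 47, 61, 130]          # 130 is clamped to 100
print(sum_sensitivity(0, 100))         # 100
print(noisy_sum(ages, 0, 100, epsilon=1.0))
```

Without the clamp there is no finite value to pass as the sensitivity, so there is no finite noise scale to calibrate against, which is why the bounds cannot simply be omitted.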