Continuity Correction - Random Notes Go Brrrrrrr

> [!tldr] Continuity Correction > A **continuity correction** is a technique applied when approximating discrete distributions (e.g. binomial) with continuous ones (e.g. Gaussians). > > This is because the mass of discrete distributions at a point $x$ corresponds to a an area $(x \pm \Delta x / 2)$ when integrating the continuous distribution, where $\Delta x$ is the distance between atoms in the discrete distribution. > ![[ContinuityCorrectionExample.png#invert|center|w60]] > In this example, the binomial mass at $x=12$ corresponds to the Gaussian's mass over $(11.5, 12.5)$, and $\Delta x=1$. - Besides estimating point masses, discrete CDF can also be estimated as $\mathbb{P}[X \le k] \approx F(k+\Delta x / 2),$where $F$ is the CDF of the continuous approximation. In the case of Gaussian approximations, this is simply $\Phi(k+\Delta x / 2)$. - While omitting the correction usually still gives asymptotically correct results, the correction improves the accuracy of finite-sample approximations. > [!examples] Example: Gaussian approximation of Binomials > Suppose $X \sim \mathrm{Binom}(n, p)$ for some large $n$. Then $\mu_{X}=np$, $\mathrm{Var}(X)=np(1-p)$, so $X \overset{D}{\approx}N(np, np(1-p))$by the CLT. Therefore the naive approximation of the cdf of $X$ gives $\mathbb{P}[X \le k]\approx \Phi\left( \frac{k-np}{\sqrt{ np(1-p) }} \right),$while the continuity correction gives $\mathbb{P}[X \le k] \approx \Phi\left( \frac{k+0.5-np}{\sqrt{ np(1-p) }} \right).$ > > Its pmf estimate does not rely on the continuity correction, which is simply $\mathbb{P}[X=k] \approx \Delta x \cdot\phi(k;np, np(1-p)),$where $\phi$ is the Gaussian density.