Wasserman's AoS, Chapter 7, Question 5

We show that the covariance Cov(ˆFn(x),ˆFn(y)) is 1nF(x)(1+F(y)),

for xy. For xy, we have 1nF(y)(1+F(x)).

Using linearity of expectation and the fact that EˆFn(y)=F(y), we obtain

E(ˆFn(x)F(x))(ˆFn(y)F(y))=E(ˆFn(x)ˆFn(y))F(x)F(y).

By definition of the empirical cumulative distribution function, the first term is equal to

1n2i,jE(I(Xix)I(Xjy))=1n2(i=jE(I(Xix)I(Xiy))+ijE(I(Xix)I(Xjy))).

We now simplify the two summations:

  1. Without loss of generality, we may assume that xy. With this assumption, I(Xix)I(Xiy)=I(Xix).

  2. Note that the Bernoulli variables I(Xix) and I(Xjy) are independent for ij since Xi and Xj are independent. This implies that the expectation of the product is the product of the expectations.

These two facts allow us to rewrite the first term as

1n2(i=jE(I(Xix))+ijE(I(Xix))E(I(Xjy)))=1n2(i=jF(x)+ijF(x)F(y))=1nF(x)(1+(n1)F(y)).

Putting everything together, we see that the covariance is

1nF(x)(1+(n1)F(y))F(x)F(y)=1nF(x)(1F(y))

for xy.