Convergence of random variables
- In probability (convergence in probability): P(∣Xn−X∣≤ϵ)→1
- In mean square: E(Xn−X)2→0
- With probability 1 (almost surely convergence): P(limn→∞Xn=X)=1
- (2)→(1), (3)→(1)
- Law of Large Numbers: X1,⋯,Xn∼p(x)
- Strong: P(limn→∞Xn=E(X1))=1
- Weak: Xn→E(X1) in probability
Asymptotic Equipartition Property
- AEP: X1,X2,⋯Xn are i.i.d. ∼p(x), then −n1logp(X1,X2,⋯,Xn)→H(X)
- 2−nH(X)+ϵ≤p(X1,X2,⋯,Xn)≤2−n(H(X)−ϵ)
- typical set Aϵ(n) is the set of sequences (x1,⋯,xn)∈Xn with property 2−nH(X)+ϵ≤p(X1,X2,⋯,Xn)≤2−n(H(X)−ϵ)
- P(Aϵ(n))≥1−ϵ for n sufficiently large (The typical set has probability nearly 1)
- ∣Aϵ(n)∣≤2n(H(X)+ϵ) (All elements are nearly qeuiprobable)
- ∣Aϵ(n)∣≥(1−ϵ)2n(H(X)−ϵ) (elements nearly 2nH)
- High Probability Set: Bδ(n)⊂Xn be the smallest set with P(Bδ(n))≥1−δ
- if P(Bδ(n))≥1−δ, then n1log∣Bδ(n)∣>H−δ′ for n sufficiently large
- Data Compression
- Encode: E(n1l(Xn))≤H(X)+ϵ
- nlog∣X∣+1+1 for (Aϵ(n))c
- n(H+ϵ)+1+1 for Aϵ(n)
- For any scheme with rate r<H(X),Pe→1
Joint Typicality
Jointly typical sequences Aϵ(n)
- ∣−n1logp(xn)−H(X)∣<ϵ
- ∣−n1logp(yn)−H(X)∣<ϵ
- ∣−n1logp(xn,yn)−H(X,Y)∣<ϵ
Xn∈Aϵ(n),Yn∈Aϵ(n) cannot imply (Xn,Yn)∈Aϵ(n)
Three Properties:
- Pr((Xn,Yn)∈Aϵ(n))→1 as n→∞
- ∣Aϵ(n)∣≤2n(H(X,Y)+ϵ)
- If (Xn,Yn)∼p(xn)p(yn), then (1−ϵ)2−n(I(X,Y)+3ϵ)≤Pr((Xn,Yn)∈Aϵ(n))≤2n(I(X,Y)−3ϵ)
The probability of joint AEP: 2−nI(X;Y)=2nH(X)2nH(Y)2nH(X,Y)