UnsolvedMajor Solve Problem

Capacity of the Binary Deletion Channel

§ Problem Statement

Setup

Fix a deletion probability d[0,1)d\in[0,1). The binary deletion channel BDC(d)\mathrm{BDC}(d) is defined as follows: for input blocklength nn, the transmitter sends Xn=(X1,,Xn){0,1}nX^n=(X_1,\dots,X_n)\in\{0,1\}^n. Independently for each i{1,,n}i\in\{1,\dots,n\}, a deletion indicator DiBernoulli(d)D_i\sim\mathrm{Bernoulli}(d) is drawn, with P(Di=1)=d\mathbb P(D_i=1)=d and P(Di=0)=1d\mathbb P(D_i=0)=1-d, and (Di)(D_i) is independent of XnX^n. The output is the random subsequence

Y=(Xi:Di=0){0,1}Ln,Y=(X_i:\,D_i=0)\in\{0,1\}^{L_n},

where Ln=i=1n(1Di)L_n=\sum_{i=1}^n(1-D_i) is random. Thus deleted symbols are removed, undeleted symbols keep their original order, and the receiver is not told the deleted positions.

A block code of length nn and size MnM_n consists of an encoder fn:{1,,Mn}{0,1}nf_n:\{1,\dots,M_n\}\to\{0,1\}^n and a decoder gn:{0,1}{1,,Mn}g_n:\{0,1\}^*\to\{1,\dots,M_n\}, where {0,1}=0{0,1}\{0,1\}^*=\bigcup_{\ell\ge 0}\{0,1\}^\ell. With uniformly distributed message W{1,,Mn}W\in\{1,\dots,M_n\}, average error probability is

Pe(n)=P ⁣[gn(Y)W].P_e^{(n)}=\mathbb P\!\big[g_n(Y)\neq W\big].

A rate R0R\ge 0 is achievable if there exists a sequence of codes with limnPe(n)=0\lim_{n\to\infty}P_e^{(n)}=0 and lim infn1nlog2MnR\liminf_{n\to\infty}\frac{1}{n}\log_2 M_n\ge R. The Shannon capacity is

C(d)=sup{R:R is achievable}.C(d)=\sup\{R:\,R\ \text{is achievable}\}.

Unsolved Problem

Determine C(d)C(d) exactly as a function of dd for the binary deletion channel. In particular, determine the exact asymptotic behavior of C(d)C(d) as d1d\to 1, i.e., find an explicit asymptotic equivalent a(d)a(d) such that C(d)/a(d)1C(d)/a(d)\to 1 as d1d\to 1 (and, ideally, further terms of the asymptotic expansion).

§ Discussion

Loading discussion…

§ Significance & Implications

The deletion channel is one of the simplest channels whose capacity remains unknown, despite being introduced in the 1960s. Unlike the binary symmetric or binary erasure channels (whose capacities have clean formulas), the deletion channel's capacity is notoriously difficult because the receiver does not know which bits were deleted. This makes synchronization a fundamental challenge (Mitzenmacher (2009)). Kanoria & Montanari (2013) established small-dd asymptotics, and Cheraghchi (2019) proved improved capacity upper bounds for deletion-type channels.

§ Known Partial Results

  • Kanoria et al. (2013): Global (all d[0,1)d\in[0,1)): C(d)1dC(d) \le 1-d (standard genie-aided erasure-channel upper bound).

  • Kanoria et al. (2013): Asymptotic as d0d\to 0: Kanoria & Montanari (2013) derive a rigorous small-dd expansion (including the dlogdd\log d correction term).

  • Cheraghchi (2019): Global upper bounds beyond the trivial 1d1-d bound: Cheraghchi (2019) gives improved upper bounds for deletion-type channels, including the binary deletion channel.

  • Cheraghchi (2019): As of 2019 (Cheraghchi) and subsequent survey context, the exact asymptotic equivalent of C(d)C(d) as d1d\to 1 is still open.

§ References

[1]

A survey of results for deletion channels and related synchronization channels

Michael Mitzenmacher (2009)

Probability Surveys

📍 Section 7 (upper bounds) and Open Questions (asymptotics as deletion probability goes to 0 and to 1), p. 22.

[2]

Optimal coding for the binary deletion channel with small deletion probability

Yashodhan Kanoria, Andrea Montanari (2013)

IEEE Transactions on Information Theory

📍 Introduction and Theorem 1: asymptotic expansion of capacity for small deletion probability ($d\to 0$).

[3]

Capacity Upper Bounds for Deletion-type Channels

Mahdi Cheraghchi (2019)

Journal of the ACM

📍 Main upper-bound theorems and numerical upper bounds for the binary deletion channel (global in $d$).

§ Tags