Unsolved

Lower-bound the spanning-tree polynomial to sharpen Gaussian \(\chi^2\) bounds

Sourced from the work of Yanjun Han, Jonathan Niles-Weed

§ Problem Statement

Setup

Fix integers n2n \ge 2 and d1d \ge 1, a positive-definite covariance matrix ΣRd×d\Sigma \in \mathbb R^{d \times d}, and mean vectors μ1,,μnRd\mu_1,\dots,\mu_n \in \mathbb R^d. For each i[n]:={1,,n}i \in [n]:=\{1,\dots,n\}, let Pi=N(μi,Σ)P_i=\mathcal N(\mu_i,\Sigma) with density pip_i on Rd\mathbb R^d, and define the average measure Pˉ=1ni=1nPi\bar P=\frac1n\sum_{i=1}^n P_i with density pˉ=1ni=1npi\bar p=\frac1n\sum_{i=1}^n p_i.

Define two distributions on (Rd)n(\mathbb R^d)^n. The permutation mixture Pn\mathbb P_n is

Pn=1n!πSnk=1nPπ(k),\mathbb P_n=\frac1{n!}\sum_{\pi\in S_n}\bigotimes_{k=1}^n P_{\pi(k)},

equivalently with density pperm(x1,,xn)=1n!πSnk=1npπ(k)(xk)p_{\mathrm{perm}}(x_1,\dots,x_n)=\frac1{n!}\sum_{\pi\in S_n}\prod_{k=1}^n p_{\pi(k)}(x_k). Its i.i.d. counterpart is

Qn=Pˉn,\mathbb Q_n=\bar P^{\otimes n},

with density q(x1,,xn)=k=1npˉ(xk)q(x_1,\dots,x_n)=\prod_{k=1}^n \bar p(x_k).

Let χ2(PnQn):= ⁣(dPndQn1)2dQn\chi^2(\mathbb P_n\|\mathbb Q_n):=\int\!\left(\frac{d\mathbb P_n}{d\mathbb Q_n}-1\right)^2d\mathbb Q_n. Define ARn×nA\in\mathbb R^{n\times n} by

Aij:=1nRdpi(x)pj(x)pˉ(x)dx,i,j[n].A_{ij}:=\frac1n\int_{\mathbb R^d}\frac{p_i(x)p_j(x)}{\bar p(x)}\,dx,\qquad i,j\in[n].

For this construction, AA is symmetric, positive semidefinite, and doubly stochastic. Let Tn\mathcal T_n be the set of spanning trees on vertex set [n][n], and define the weighted spanning-tree polynomial

τ(A):=TTn (u,v)E(T)Auv.\tau(A):=\sum_{T\in\mathcal T_n}\ \prod_{(u,v)\in E(T)} A_{uv}.

Unsolved Problem

Prove a nontrivial Gaussian-structure-dependent lower bound on τ(A)\tau(A) (for matrices AA arising from the above Gaussian family) that could improve the currently available eigenvalue-only upper bound on χ2(PnQn)\chi^2(\mathbb P_n\|\mathbb Q_n),

χ2(PnQn)+1k=2n(1λk(A))1,\chi^2(\mathbb P_n\|\mathbb Q_n)+1\le \prod_{k=2}^n(1-\lambda_k(A))^{-1},

where 1=λ1(A)λ2(A)λn(A)1=\lambda_1(A)\ge \lambda_2(A)\ge\cdots\ge\lambda_n(A) are the eigenvalues of AA. In particular, the goal is to exploit constraints specific to Gaussian-generated AA beyond trace/spectral-gap information.

§ Discussion

Loading discussion…

§ Significance & Implications

This is a proposed route in the paper for potentially sharpening approximate-independence guarantees in Gaussian permutation-mixture settings. If stronger (\chi^2) control is obtained, it would typically imply tighter downstream total-variation/testing/decision bounds via standard inequalities; these downstream improvements are inferential unless separately proved. See Han & Niles-Weed (2024) for details.

§ Known Partial Results

  • Han et al. (2024): The paper proves permanent-based upper bounds for (\chi^2(\mathbb P_n|\mathbb Q_n)), including a bound of the form (\chi^2(\mathbb P_n|\mathbb Q_n)+1\le \prod_{i=2}^n(1-\lambda_i)^{-1}) (with (\lambda_i) eigenvalues of (A)), and identifies this quantity with a spanning-tree polynomial expression; the authors describe exploiting extra structure of (A) as an open direction that could improve Gaussian bounds. This direction appears open in the cited source.

§ References

[1]

Approximate independence of permutation mixtures

Yanjun Han, Jonathan Niles-Weed (2024)

Annals of Statistics (to appear)

📍 Section 6.1 (Discussion: Tightness of upper bounds), paragraph immediately after Lemma 6.2 ("for specific $P$, further properties of the matrix $A$...") on p. 26 (arXiv v3; Section 5.1 in earlier numbering)

Source paper where this problem appears.

§ Tags