Unsolved

Average-Case Hardness of Hypergraphic Planted Clique Detection

Posed by Yuetian Luo et al. (2020)

§ Problem Statement

Setup

Fix an integer d3d\ge 3 and let [N]={1,,N}[N]=\{1,\dots,N\}. A (simple) dd-uniform hypergraph GG on [N][N] is specified by an edge set E([N]d)E\subseteq \binom{[N]}{d}. Let Gd(N,1/2)\mathcal{G}_d(N,1/2) denote the Erdos-Renyi dd-uniform hypergraph distribution in which each dd-set e([N]d)e\in\binom{[N]}{d} is included in EE independently with probability 1/21/2 (Luo & Zhang (2020)).

For an integer κ{1,,N}\kappa\in\{1,\dots,N\}, define the hypergraphic planted clique (HPC) distribution Gd(N,1/2,κ)\mathcal{G}_d(N,1/2,\kappa) by: draw GGd(N,1/2)G\sim\mathcal{G}_d(N,1/2); sample a uniformly random subset K[N]K\subseteq [N] with K=κ|K|=\kappa; then set every hyperedge eKe\subseteq K with e=d|e|=d to be present (so the induced dd-uniform subhypergraph on KK is complete).

The d-HPCd\text{-HPC} detection problem is the average-case hypothesis test

H0:GGd(N,1/2)vs.H1:GGd(N,1/2,κ).H_0: G\sim \mathcal{G}_d(N,1/2) \quad\text{vs.}\quad H_1: G\sim \mathcal{G}_d(N,1/2,\kappa).

A (randomized) polynomial-time test is a family of algorithms {ϕN}\{\phi_N\} running in time poly(N)\mathrm{poly}(N) and outputting ϕN(G){0,1}\phi_N(G)\in\{0,1\}. The test succeeds if its total error PrH0[ϕN(G)=1]+PrH1[ϕN(G)=0]0\Pr_{H_0}[\phi_N(G)=1]+\Pr_{H_1}[\phi_N(G)=0]\to 0 as NN\to\infty. For d=2d=2, this reduces to planted clique (PC) detection on graphs (Luo & Zhang (2020)). In this case, G2(N,1/2,κ)\mathcal{G}_2(N,1/2,\kappa) is generated by first drawing an Erdos-Renyi graph from G2(N,1/2)\mathcal{G}_2(N,1/2) and then forcing all edges inside a uniformly random size-κ\kappa vertex subset to be present.

The phrase "computationally equivalent" here means average-case randomized polynomial-time reductions in both directions (PC \leftrightarrow d-HPCd\text{-HPC}) that preserve null and planted distributions up to vanishing total-variation error and keep planted sizes asymptotically comparable (for example, κ=κNo(1)\kappa'=\kappa\cdot N^{o(1)}) (Brennan et al. (2018)).

Unsolved Problem

In the regime κ=o(N)\kappa=o(\sqrt{N}) (e.g. κ=O(N1/2τ)\kappa=O(N^{1/2-\tau}) for any fixed τ>0\tau>0), prove or refute such an average-case reduction from PC detection to d-HPCd\text{-HPC} detection.

Concretely, determine whether there exist Npoly(N)N'\le\mathrm{poly}(N), κ=κNo(1)\kappa'=\kappa\cdot N^{o(1)}, and a randomized map RR computable in poly(N)\mathrm{poly}(N) time such that (1) if GG2(N,1/2)G\sim\mathcal{G}_2(N,1/2), then R(G)R(G) is distributed as Gd(N,1/2)\mathcal{G}_d(N',1/2) up to total variation error o(1)o(1), and (2) if GG2(N,1/2,κ)G\sim\mathcal{G}_2(N,1/2,\kappa), then R(G)R(G) is distributed as Gd(N,1/2,κ)\mathcal{G}_d(N',1/2,\kappa') up to total variation error o(1)o(1).

Equivalently, would any polynomial-time test for d-HPCd\text{-HPC} at planted size κ\kappa' imply, via RR, a polynomial-time test for PC at planted size κ\kappa in the corresponding regime?

§ Discussion

Loading discussion…

§ Significance & Implications

HPC detection is a common average-case starting point for reductions to tensor-structured inference tasks because a dd-uniform hypergraph can be represented by an order-dd adjacency tensor. Establishing a precise average-case reduction from PC to dd-HPC (or proving such a reduction cannot exist with comparable κ\kappa) would clarify whether hardness assumptions stated for dd-HPC in the regime κ=o(N)\kappa=o(\sqrt{N}) are genuinely stronger/different than the standard planted clique conjecture, which directly affects the interpretability of hardness evidence for tensor problems built on dd-HPC as a base distribution.

§ Known Partial Results

  • Luo et al. (2020): Luo and Zhang (COLT 2020) formulate the dd-HPC detection problem and motivate it as an average-case hardness source for tensor problems.

  • Luo et al. (2020): Luo and Zhang (COLT 2020) ask for further evidence of average-case computational hardness of dd-HPC detection and, in particular, for evidence toward (or against) computational equivalence between dd-HPC detection and d=2d=2 planted clique detection.

  • Luo et al. (2020): As summarized in Luo and Zhang (COLT 2020), spectral methods based on unfolding/matricizing the order-dd adjacency tensor provide polynomial-time detection in a high-signal regime (with threshold behavior on the order of κ\kappa around N\sqrt{N} for fixed dd), and are not expected to succeed deep in the regime κ=O(N1/2τ)\kappa=O(N^{1/2-\tau}).

  • Luo et al. (2020): Luo and Zhang (COLT 2020) cite evidence for hardness of dd-HPC detection in restricted algorithmic frameworks (e.g. certain Markov chain/Metropolis-type approaches and the low-degree polynomial framework) in the regime κ=O(N1/2τ)\kappa=O(N^{1/2-\tau}).

  • Luo et al. (2020): There is a simple polynomial-time mapping from a dd-uniform hypergraph to a graph by fixing a (d2)(d-2)-subset S[N]S\subseteq[N] and connecting i,ji,j when S{i,j}S\cup\{i,j\} is a hyperedge; under H0H_0 this yields an Erdos-Renyi graph with edge probability 1/21/2, while under H1H_1 it yields a planted-clique graph conditional on the event SKS\subseteq K (and is otherwise null-like). The open challenge is a reduction in the opposite direction (PC to dd-HPC) that preserves κ\kappa up to asymptotic comparability without relying on such conditioning.

§ References

[1]

Open Problem: Average-Case Hardness of Hypergraphic Planted Clique Detection

Yuetian Luo, Anru R Zhang (2020)

Conference on Learning Theory (COLT), PMLR 125

📍 Open-problem note in COLT proceedings.

[2]

Open Problem: Average-Case Hardness of Hypergraphic Planted Clique Detection (PDF)

Yuetian Luo, Anru R Zhang (2020)

Conference on Learning Theory (COLT), PMLR 125

📍 Proceedings PDF.

[3]

Reducibility and Computational Lower Bounds for Problems with Planted Sparse Structure

Matthew Brennan, Guy Bresler, Wassim Huleihel (2018)

Conference on Learning Theory (COLT), PMLR 75

📍 Average-case reduction framework via randomized polynomial-time mappings preserving null/planted distributions up to small total-variation error.

§ Tags