Semi-Direct Sum Theorem and Nearest Neighbor under l infinity

Published in APPROX/RANDOM 2018, 2018

Download paper here

We introduce semi-direct sum theorem as a framework for proving asymmetric communication lower bounds for the functions of the form ni=1f(x,yi). Utilizing tools developed in proving direct sum theorem for information complexity, we show that if the function is of the form ni=1f(x,yi) where Alice is given x and Bob is given yi’s, it suffices to prove a lower bound for a single f(x,yi). This opens a new avenue of attack other than the conventional combinatorial technique (i.e. richness lemma’’ from [Miltersen1995]) for proving randomized lower bounds for asymmetric communication for functions of such form.

As the main technical result and an application of semi-direct sum framework, we prove an information lower bound on c-approximate Nearest Neighbor (ANN) under which implies that the algorithm of [Indyk2001] for c-approximate Nearest Neighbor under is optimal even under randomization for both decision tree and cell probe data structure model (under certain parameter assumption for the latter). In particular, this shows that randomization cannot improve [Indyk2001] under decision tree model. Previously only a deterministic lower bound was known by \cite{andoni2008hardness} and randomized lower bound for cell probe model by [Kapralov2012nns]. We suspect further applications of our framework in exhibiting randomized asymmetric communication lower bounds for big data applications.