Approximation Algorithms for Maximum Cut

Improved Approximation Algorithms for Maximum Cut and Satisfiability Problems Using Semidefinite Programming (Michel X. Goemans and David P. Williamson, 1995) reading note.

Introduction

Defn (Maximum Cut Problem): Let $G=(V,E,w)$ be an undirected graph with weight function $w:E\to \mathbb R$. One wants a subset $S$ of the vertex set $V$ such that the sum of the weights of the edges between $S$ and the complementary subset $\overline{S}$ is as large as possible.

The maximum cut problem can be written in the following integer programming form:

$$\begin{aligned} \max\ & \dfrac 12 \sum_{i<j}(1-x_ix_j)w_{ij} \\ \text{s.t. } & x_i\in \{-1,1\},\ i=1,2,\ldots,n \end{aligned} \tag{1}$$
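As a sanity check, objective $(1)$ can be evaluated by brute force on a toy instance (the graph and weights below are made up for illustration; enumerating all $2^n$ assignments is only feasible for tiny $n$):

```python
import itertools

# Made-up weighted graph on 4 vertices; w[(i, j)] is the weight of edge {i, j}.
w = {(0, 1): 1.0, (1, 2): 2.0, (2, 3): 1.0, (0, 3): 3.0, (0, 2): 0.5}
n = 4

def cut_value(x):
    # Objective of (1): (1/2) * sum_{i<j} (1 - x_i x_j) * w_ij.
    # Each cut edge contributes (1 - (-1))/2 * w = w; uncut edges contribute 0.
    return 0.5 * sum((1 - x[i] * x[j]) * wij for (i, j), wij in w.items())

# Enumerate all x in {-1, 1}^n and take a maximizer.
best = max(itertools.product((-1, 1), repeat=n), key=cut_value)
S = {i for i in range(n) if best[i] == 1}
print(S, cut_value(best))
```

On this instance the optimum cut separates vertices $\{0,2\}$ from $\{1,3\}$ with total weight $7$.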

However, the maximum cut problem is $\textbf{NP}$-hard (its decision version is $\textbf{NP}$-complete), which means that it probably has no polynomial-time exact algorithm.

Thus we try to find an approximate solution instead.

Relaxation

Since solving $(1)$ exactly is $\textbf{NP}$-hard, we consider a semidefinite relaxation of $(1)$.

$$x_i\in \{-1,1\}=\mathbb S^1\Longrightarrow v_i\in \mathbb S^n,$$

where $\mathbb S^k$ denotes the unit sphere in $\mathbb R^k$.

Then we can write down the semidefinite relaxation of (1)(1):

$$\begin{aligned} \max\ & \dfrac 12 \sum_{i<j}(1-X_{ij})w_{ij} \\ \text{s.t. } & v_i\in \mathbb S^n,\ i=1,2,\ldots,n \\ & X_{ij}=\langle v_i,v_j \rangle,\ i,j=1,2,\ldots,n \end{aligned} \tag{2}$$

Thm: $X$ is a positive semidefinite matrix. Thus problem $(2)$ is a semidefinite programming problem.

Proof: For every $y\in \mathbb R^n$, we have:

$$\begin{aligned} y^TXy & =\sum_{i,j} y_i y_j\langle v_i,v_j\rangle \\ & = \sum_{i,j} \langle y_i v_i,y_j v_j\rangle \\ & = \left\langle \sum_i y_iv_i,\sum_i y_iv_i\right\rangle\ge 0, \end{aligned}$$

as desired. $\Box$

Algorithm

This gives a simple randomized approximation algorithm for the maximum cut problem:

  • Solve $(2)$, obtaining an optimal set of vectors $v_i$.
  • Let $r$ be a vector uniformly distributed on the unit sphere $\mathbb S^n$.
  • Set $S = \{i\mid \langle v_i,r\rangle \ge 0 \}$.
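A minimal sketch of the rounding steps only (the unit vectors below are hand-picked stand-ins for an SDP solution in $\mathbb R^3$, not the output of an actual solver):

```python
import math
import random

random.seed(0)

# Hand-picked unit vectors standing in for an optimal solution of (2).
v = [(1.0, 0.0, 0.0),
     (0.0, 1.0, 0.0),
     (-1.0, 0.0, 0.0),
     (0.0, -1.0, 0.0)]

def random_unit_vector(dim):
    # A standard Gaussian vector, normalized, is uniform on the unit sphere.
    g = [random.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(x * x for x in g))
    return [x / norm for x in g]

def round_by_hyperplane(vectors):
    r = random_unit_vector(len(vectors[0]))
    # S collects the vertices whose vector lies on the nonnegative side of r.
    return {i for i, vi in enumerate(vectors)
            if sum(a * b for a, b in zip(vi, r)) >= 0}

S = round_by_hyperplane(v)
print(S)
```

Since $v_0, v_2$ (and $v_1, v_3$) are antipodal, a generic hyperplane puts exactly one vector of each pair into $S$.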

Remark: We can obtain $\{v_1,\ldots,v_n\}$ from $X$ by Cholesky decomposition, i.e. $X=L^TL$, where each column of $L$ is a $v_i$.
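A sketch of this recovery with a textbook Cholesky factorization in pure Python (the matrix $X$ below is a made-up Gram matrix; a production implementation should use a pivoted factorization, since an optimal $X$ may be only positive *semi*definite):

```python
import math

# Made-up positive definite Gram matrix with unit diagonal (X_ii = 1),
# standing in for an optimal solution X of (2).
X = [[1.0, 0.5, 0.0],
     [0.5, 1.0, 0.5],
     [0.0, 0.5, 1.0]]

def cholesky(A):
    # Lower-triangular C with A = C C^T; taking L = C^T gives A = L^T L,
    # so the columns of L (i.e. the rows of C) are the vectors v_i.
    n = len(A)
    C = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(C[i][k] * C[j][k] for k in range(j))
            if i == j:
                C[i][j] = math.sqrt(A[i][i] - s)  # fails if A is not positive definite
            else:
                C[i][j] = (A[i][j] - s) / C[j][j]
    return C

v = cholesky(X)  # v[i] is the vector v_i in R^3 (padded with trailing zeros)
print(v)
```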

Analysis

We analyze the approximation quality of the algorithm:

$$\begin{aligned} \mathbb E[w(S,\overline{S})]\triangleq\mathbb E[\Delta] & =\sum_{i<j} w_{ij}\Pr[\operatorname{sgn}(\langle v_i,r\rangle)\ne \operatorname{sgn}(\langle v_j,r\rangle)] \\ & =\sum_{i<j} w_{ij}\dfrac{2\arccos(\langle v_i,v_j\rangle)}{2\pi} \end{aligned}$$
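The probability in the first step can be checked empirically: two unit vectors at angle $\theta$ are separated by a uniformly random hyperplane with probability $\theta/\pi$. A Monte Carlo sketch with hand-picked vectors at angle $\pi/3$:

```python
import math
import random

random.seed(1)

# Two hand-picked unit vectors in the plane at angle pi/3; the claimed
# separation probability is arccos(<v1, v2>) / pi = (pi/3) / pi = 1/3.
theta = math.pi / 3
v1 = (1.0, 0.0)
v2 = (math.cos(theta), math.sin(theta))

def separated():
    # Only the signs of the inner products matter, so r need not be
    # normalized: an unnormalized Gaussian vector picks the same hyperplane.
    r = (random.gauss(0.0, 1.0), random.gauss(0.0, 1.0))
    return (v1[0] * r[0] + v1[1] * r[1] >= 0) != (v2[0] * r[0] + v2[1] * r[1] >= 0)

trials = 200_000
estimate = sum(separated() for _ in range(trials)) / trials
print(estimate)  # close to 1/3
```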

Then let $p^\ast$ be the optimal value of problem $(1)$, and $q^\ast$ be the optimal value of problem $(2)$.

Since $(2)$ is a relaxation of $(1)$, we have $q^\ast\ge p^\ast$. Thus the approximation ratio satisfies $\dfrac{\mathbb E[\Delta]}{p^\ast}\ge \dfrac{\mathbb E[\Delta]}{q^\ast}$.

If $w_{ij}\ge 0$ for every $i,j$, then we have:

$$\begin{aligned} \dfrac{\mathbb E[\Delta]}{q^\ast} & =\dfrac 2\pi\cdot \dfrac{\sum_{i<j}w_{ij}\arccos(\langle v_i,v_j\rangle)}{\sum_{i<j}w_{ij}(1-\langle v_i,v_j\rangle)} \\ & \ge \dfrac 2\pi \min_{i,j}\dfrac{\arccos(\langle v_i,v_j\rangle)}{1-\langle v_i,v_j\rangle} \\ & \ge \dfrac 2\pi \min_{0<\theta\le \pi}\dfrac{\theta}{1-\cos(\theta)}\triangleq \alpha\approx 0.878\ldots \end{aligned}$$
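The constant $\alpha$ can be reproduced numerically by a simple grid search over $\theta\in(0,\pi]$ (a sketch for intuition, not a proof of the bound):

```python
import math

# alpha = (2/pi) * min over 0 < theta <= pi of theta / (1 - cos(theta)).
# The ratio blows up as theta -> 0, so the minimum is attained in the interior
# (around theta ~ 2.33).
steps = 100_000
best = min(
    (math.pi * k / steps) / (1.0 - math.cos(math.pi * k / steps))
    for k in range(1, steps + 1)
)
alpha = 2.0 / math.pi * best
print(alpha)  # about 0.87856
```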

Remark: Since $(p^\ast-\Delta)/p^\ast$ is nonnegative and $\mathbb E[(p^\ast-\Delta)/p^\ast]<0.122$, Markov's inequality gives, for any $\epsilon>0$:

$$\Pr\left[(p^\ast-\Delta)/p^\ast>0.122(1+\epsilon)\right]<\dfrac{\mathbb E[(p^\ast-\Delta)/p^\ast]}{0.122(1+\epsilon)}<\dfrac 1{1+\epsilon}<1$$

We can repeat this algorithm many times to reduce the error probability.
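A sketch of this repetition on a made-up instance (vectors and weights are hand-picked; a real run would take the $v_i$ from an SDP solver):

```python
import random

random.seed(2)

# Made-up instance: unit vectors (stand-ins for an SDP solution) and weights.
v = [(1.0, 0.0), (0.0, 1.0), (-1.0, 0.0)]
w = {(0, 1): 1.0, (1, 2): 1.0, (0, 2): 2.0}

def cut_weight(S):
    # Total weight of edges with exactly one endpoint in S.
    return sum(wij for (i, j), wij in w.items() if (i in S) != (j in S))

def round_once():
    # Hyperplane rounding; an unnormalized Gaussian r gives the same cut.
    g = (random.gauss(0.0, 1.0), random.gauss(0.0, 1.0))
    return {i for i, vi in enumerate(v) if vi[0] * g[0] + vi[1] * g[1] >= 0}

# Keeping the best of many independent roundings drives the probability of
# landing far below the expectation down geometrically.
best = max((round_once() for _ in range(50)), key=cut_weight)
print(best, cut_weight(best))
```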

Quality of the Relaxation

The following picture shows that the bound proved above is almost tight.

*(figure omitted)*

Analysis for Negative Edge Weights

For negative edge weights, we have:

Thm: Let $W^-=\sum_{i<j}\min\{w_{ij},0\}$. Then:

$$\mathbb E[\Delta]-W^-\ge \alpha\left(q^\ast-W^- \right)$$

The proof of this theorem is almost the same as before.
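A sketch of the modification, using the pointwise bound $\theta/\pi \ge \frac{\alpha}{2}(1-\cos\theta)$ for $\theta\in[0,\pi]$ together with its reflection $\theta\mapsto\pi-\theta$, which gives $1-\theta/\pi \ge \frac{\alpha}{2}(1+\cos\theta)$. Writing $\theta_{ij}=\arccos\langle v_i,v_j\rangle$ and summing over pairs $i<j$:

$$\begin{aligned} \mathbb E[\Delta]-W^- & =\sum_{w_{ij}>0} w_{ij}\dfrac{\theta_{ij}}{\pi}+\sum_{w_{ij}<0} |w_{ij}|\left(1-\dfrac{\theta_{ij}}{\pi}\right) \\ & \ge \dfrac{\alpha}{2}\left[\sum_{w_{ij}>0} w_{ij}(1-\cos\theta_{ij})+\sum_{w_{ij}<0} |w_{ij}|(1+\cos\theta_{ij})\right] \\ & = \alpha\left(\dfrac 12\sum_{i<j} w_{ij}(1-\cos\theta_{ij})-W^-\right)=\alpha\left(q^\ast-W^-\right). \end{aligned}$$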

Supplements

  • The maximum cut problem is $\textbf{APX}$-hard, meaning that it admits no polynomial-time approximation scheme (no family of algorithms with ratio arbitrarily close to $1$), unless $\textbf P = \textbf{NP}$.
  • If the unique games conjecture is true, then $\alpha$ is the best possible approximation ratio for maximum cut. (Subhash Khot, Guy Kindler, Elchanan Mossel and Ryan O'Donnell, 2004)

Conjecture (Unique Games Conjecture): Given any $\epsilon, \delta > 0$, there exists some $k>0$ depending on $\epsilon$ and $\delta$, such that for the unique games problem with a universe of size $k$, it is $\textbf{NP}$-hard to distinguish between instances in which at least a $1-\epsilon$ fraction of the constraints can be satisfied, and instances in which at most a $\delta$ fraction of the constraints can be satisfied.

Further discussion of the UGC can be found at https://www.zhihu.com/question/264803851

References

[1] Michel X. Goemans and David P. Williamson. 1995. “Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming.” J. ACM 42, 6 (Nov. 1995), 1115–1145.

[2] Lecture 3 of the course Algorithms for Big Data, 2024 spring, lecturer: Hu Ding and Qi Song.

[3] Subhash Khot, Guy Kindler, Elchanan Mossel and Ryan O’Donnell. “Optimal inapproximability results for MAX-CUT and other 2-variable CSPs?.” SIAM Journal on Computing 37.1 (2007): 319-357.