6-难问题求解

Definition

Alphabet

$\Sigma$ $Σ$ : alphabet
- $\Sigma^*=\mathcal{P}(\Sigma)$
- $\Sigma^+=\Sigma^*-\{\lambda\}$
$s\in\Sigma$ $s \in Σ$ : symbol
- $s\in w$
$w\subseteq\Sigma, \omega\in\Sigma^*$ $w \subseteq Σ, ω \in Σ^{*}$ : word over $\Sigma$ $Σ$
- $w\in L$
- $\lambda$ : empty word
- $|w|$ $∣ w ∣$ : length of word
  - $\Sigma^n=\{w\in\Sigma^*|\vert w\vert=n\}$
- # $_a(w)$ : number of occurence of $a$ in $w$
- $vw$ $v w$ : concatenation of $v$ $v$ and $w$ $w$
  - $w^{n+1}=ww^n,w^0=\lambda$
- prefix, suffix, subword
$L\subseteq\Sigma^*$ $L \subseteq Σ^{*}$ : language over $\Sigma$ $Σ$
- $L^C =\Sigma^*-L$
- $L_1L_2=\{uv\in(\Sigma_1\cup\Sigma_2)^*|u\in L_1,v\in L_2\}$ : concatenation
canonical ordering: let $s_1<...<s_m$ $s_{1} < ... < s_{m}$ be a linear ordering, $u<v$ $u < v$
- if $\vert u\vert<\vert v\vert$
- or $\vert u\vert=\vert v\vert,u=xs_iu',v=xs_jv',i<j$

Algorithmic Problems

polynomially related: codings $e_1\in L_1,e_2\in L_2,\exists f:L_1\rightarrow L_2,f(e_1)$ is polynomial
decision Problem: $(L,U,\Sigma),L\subseteq U\subseteq \Sigma^*$ $(L, U, Σ), L \subseteq U \subseteq Σ^{*}$
- $A$ $A$ solves/decides $U$ $U$
  - $A(x)=1,x\in L$
  - $A(x)=0,x\in U-L$
- $A$ $A$ accepts $U$ $U$ : $A(x)=1,x\in L$ $A (x) = 1, x \in L$
  - Halting Problem: Undecidable but acceptable
optimization Problem: $U=(\Sigma_I,\Sigma_O,L,L_I,M,cost,goal)$ $U = (Σ_{I}, Σ_{O}, L, L_{I}, M, cos t, g o a l)$
- briefly: $U=(L,$ constraints, costs, goal $)$
- $\Sigma_I$ : input alphabet
- $\Sigma_O$ : output alphabet
- $L\subset\Sigma_I^*$ : language of feasible Problem instances
- $L_I\subset L$ : language of the actual Problem instance
- $M:L\rightarrow \mathcal{P}(\Sigma_O^*)$ , $\mathcal{M}(x)$ is the set of feasible solutions for $x$
- cost $:\mathcal{M}(x)\times L\rightarrow \mathbb{R}$
- goal $\in\{\min,\max\}$
- optimal solution $Opt_U(x)=cost(y,x)=goal_{x\in L_I}\{cost(z,x)|z\in \mathcal{M}(x)\}$
- $Output_U(x)\subseteq \mathcal{M}(x)$ : all optimal solutions for instance $x\in U$
Algorithm $A$ is consistent for $U$ : $\forall x\in L_I,A(x)\in \mathcal{M}(x)$
Algorithm $B$ $B$ solves $U$ $U$
- consistent
- $\forall x\in L_I,B(x)=Opt_U(x)$
$U_1$ is a subProblem of $U_2$ if $L_{I,1}\subseteq L_{I,2}$ (Others are same)

Turing Machine

Turing Machine: $M=(Q,\Sigma,\Gamma,\delta,q_0,q_{ac},q_{rej})$ $M = (Q, Σ, Γ, δ, q_{0}, q_{a c}, q_{r e j})$
- $Q$ : state set
- $\Sigma$ : Input alphabet
- $\Gamma\subseteq\Sigma$ : alphabet on tape
- $\delta:Q\times\Gamma\rightarrow Q\times\Gamma\times\{L,R,-\}$
- $q_0\in Q$ : initial satate
nondeterministic TM: $\delta:Q\times\Gamma\rightarrow \mathcal{P}(Q\times\Gamma\times\{L,R\})$
nondeterministic TM $M$ $M$ accept $L=L(M)$ $L = L (M)$
- $\forall x\in L,\exists$ computation of $M$ accepts $x$
- $\forall y\not\in L$ , all computations of $M$ rejects $y$
time complexity of nondeterministic TM $M$ $M$
- $T_M(\omega)$ : shortest accepting computation of $M$ on $\omega$
- $T_M(n)=\max\{T_M(x)|x\in L(M)\cap\Sigma^n\}$
Church-Turing thesis: Problem $U$ can be solved by an algorithm iff $\exists$ Turing machine solving $U$
Theorem: for every increasing function $f:\mathbb{N}\rightarrow\mathbb{R}^+$ $f : N \to R^{+}$
- $\exists$ decision Problem such that every TM solving it has the time complexity in $\Omega(f(n))$
- but $\exists$ TM solving it in $O(f(n)\log f(n))$
$L(M)$ : language decided by $M$

Examples

Decision

PRIM: test if a number is a prime
EQ-POL: $p_1\equiv p_2$ in $\mathbb{Z}_p$
EQ-1BP: equivalence of one-time-only branching programs
C-SAT: whether a formula with AND, NOT, OR gate is satisfiable
SAT (kSAT): whether a CNF can be satisfied
Clique: whether a graph contain $K_k$
VCP: whether graph contains a vertex cover of size $k$
HC: whether graph contains a Hamiltonian cycle
SOL-IP: existence of a solution of linear integer programming
- SOL-0/1-IP
- SOL-IP $_p$
PM: whether a bipartite graph has a perfect matching
SUBSET-SUM: exists a subset $S'\subseteq S$ sum up to $t$

Optimization

TSP: find a Hamiltonian cycle of the minial cost in a complete weighted graph
- $\Delta$ -TSP: metric traveling salespaerson Problem (satisfying triangle inequality)
- Euclidean TSP: geometrical, can be embedded in the two-dimensional Euclidean space
MSP: Makespan Scheduling Problem
MIN-VCP: find minimum vertex cover
- WEIGHT-VCP
SCP: Set Cover Problem
MAX-CL: Maximum Clique Problem
MAX/MIN-CUT
KP: Knapsack Problem
- SKP: Simple Knapsack Problem
- BIN-P: Bin-Packing Problem
MAX-SAT: maximize the number of stisfied clauses
- MAX-kSAT
- MAX-EkSAT: exactly
LP: Linear Programming
- IP: Integer Linear Programming
- 0/1-Linear Programming
MAX-LinModk: Maximum Linear Equation Problem Mod k
- MAX-EmLinModk: k is prime, m is positive integer
MAX-CSP: $\max_{S,T}|E(S,T)|$

Complexity Theory

main objective of the complexity theory is:
- find a formal specification of the class of practically solvable Problems
- to develop methodes enabling the classification of algorithmic Problemcs accoording to their membershiop in this class
uniform cost
- all numbers bounded
- basic operation: $O(1)$
logarithmic cost
- numbers $k$ takes $O(\lg k)$ bits
- addition, subtraction, assignment: $O(n)$
- multiplication, division: $O(n\log n)$
pseudopolynomial time complexity:
- $T(n)$ is polynomial in the numeric value of the input
- $T(n)$ is not polynomial in the number of bits required to repensent it of the input
bound
- $T/S_A(x)$ : time/space complexity on $x\in\Sigma_I$
- $T/S_A(n)=\max\{T/S_A(x)|x\in\Sigma_I^n\}$ : worst case analysis
- upper bound on the time complexity of $U$ : $\exists A$ solving $U$ with $T_A(n)\in O(g(n))$
- lower bound on the time complexity of $U$ : $\forall B$ solving $U$ has $T_B(n)\in\Omega(f(n))$
There is a decision Problem such that $\forall A$ deciding $L$ , $\exists B$ deciding $L$ : $T_A(n)=\log_2T_B(n)$
optimal algorithm: $\text{Time}_C(n)\in O(g(n))$ and $\Omega(g(n))$ is a lower bound

Complexity Class

Complexity Zoo
$\text{P}=\{L(M)|M$ $P = {L (M) ∣ M$ is a TM, $\exists c>0,T_M(n)\in O(n^c)\}$ $\exists c > 0, T_{M} (n) \in O (n^{c})}$
- tractable (solvable): $L\in P$ , $L$ is accepted/decided by a polynomial-time algorithm
- intractable: $L\not\in P$
$\text{NP}=\{L(M)|M$ $NP = {L (M) ∣ M$ is a polynomial-time nondeterministic TM $\}$ $}$
- verifier for $L$ : $A$ works on $\Sigma^*\times\{0,1\}^*$ , $L=V(A)=\{\omega\in\Sigma^*|\exists c\in\{0,1\}^*,A$ accepts $(\omega,c)\}$
- $\text{NP}=\{V(A)|T_A(\omega,c)\in O(|\omega|^d)\}$
- closed under $\cap,\cup,\cdot,\star$
polynomial-time reduction (Karp, many-one) $L_1\leq_p L_2$ $L_{1} \leq_{p} L_{2}$ : $\exists$ $\exists$ poly. time $f$ $f$ that $\forall x: x\in L_1\iff f(x)\in L_2$ $\forall x : x \in L_{1} ⟺ f (x) \in L_{2}$
- Cook/Turing reduction: an algorithm that solves Problem $A$ using a polynomial number of calls to a subroutine for Problem $B$ , and polynomial time outside of those subroutine calls
$\text{NP}$ -hard $L$ : $\forall U\in \text{NP},U\leq_p L$
$\text{NP}$ -complete $L$ : $L\in \text{NP}$ and $L$ is $\text{NP}$ -hard
co- $\text{NP}$ : $\overline{L}\in \text{NP}$ , $x\not\in L$ can be poly. time verified with $c$
strongly $\text{NP}$ : $\text{NP}$ when all of its numerical parameters are bounded by a polynomial in the length of the input
$\text{L}\subseteq\text{NL}\subseteq\text{P}\subseteq\text{ZPP}\subseteq\text{NP}\subseteq\text{PH}\subseteq\text{PSPACE}=\text{NPSPACE}\subseteq\text{EXP}\subseteq\text{NEXP}\subseteq\text{EXPSPACE}\subseteq\text{ELEMENTARY}\subseteq\text{PR}\subseteq\text{R}\subseteq\text{RE}\subseteq\text{ALL}$ $L \subseteq NL \subseteq P \subseteq ZPP \subseteq NP \subseteq PH \subseteq PSPACE = NPSPACE \subseteq EXP \subseteq NEXP \subseteq EXPSPACE \subseteq ELEMENTARY \subseteq PR \subseteq R \subseteq RE \subseteq ALL$
- $\text{P}\neq\text{EXP}$
- $\text{NP}\neq\text{NEXP}$
$\text{P}\subseteq\text{ZPP}\subseteq\text{RP}\subseteq\text{BPP}\subseteq\text{PP}$
$\text{P}\subseteq\text{BQP}\subseteq\text{PSPACE}$
Conjecture: $\text{P}\neq \text{NP}$ : no hope for a polynomial-time algorithm
Conjecture: $\text{BPP}=\text{P}$ : randomization alone not help

Optimization Complexity

$\text{NPO}$ $NPO$ :
- $L_I\in \text{P}$
- exists a polynomial $p_U$ $p_{U}$ such that
  - $\forall x\in L_I,y\in\mathcal{x},|y|\leq p_U(|x|)$
  - exists a polynomial-time algorithm that $\forall y\in\Sigma_O^*,x\in L_I$ such that $|y|\leq p_U(|x|)$ , decides wheter $y\in M(x)$
- cost is computable in polynomial time
$\text{PO}$ $PO$ :
- $U\in \text{NPO}$
- $\forall x\in L_I,\exists$ polynomial-time algorithm computes an optimal solution
threshold language of $U$ (minimum): $Lang_U=\{(x,a)\in L_I\times\Sigma^*_{bool}|Opt_U(x)\leq Number(a)\}$
$U$ $U$ is $\text{NP}$ $NP$ -hard if $Lang_U$ $L an g_{U}$ is $\text{NP}$ $NP$ -hard
- $U\in \text{PO},Lang_U\in \text{P}$

Reduction

Cook's Theorem: C-SAT is $\text{NP}$ -complete
$\text{NPC}$ $NPC$
- C-SAT $\leq_p$ SAT
- SAT $\leq_p$ 3SAT
- 3SAT $\leq_p$ SOL-0/1-ILP: $x_1\vee\overline{x}_2\vee\overline{x}_3\iff x_1+(1-x_2)+(1-x_2)\geq1$
- 3SAT $\leq_p$ SUBSET-SUM
- 3SAT $\leq_p$ Clique: $K_{3,3,\cdots,3}$
- Clique $\leq_p$ VC: $(G,k)\in$ Clique $\iff(\overline{G},|V|-k)\in$ VC
- VC $\leq_p$ HAM-CYCLE
- VC $\leq_p$ SCP
- HAM-CYCLE $\leq_p$ HAM-PATH
- HAM-CYCLE $\leq_p$ TSP
- MAX-CUT
strongly $\text{NPC}$ : 3-Partition
PM $\in\text{NP}\cap$ co- $\text{NP}$
$\text{EXP}$ -complete: Go
$\text{NEXP}$ -complete: equivalence of regular expressions with squaring, concatenating and union
$\text{EXPSPACE}$ -complete: equivalence of regular expressions with squaring, concatenating, union and Kleene

Approximation

Error

relative error $\epsilon_A(x)=\frac{|cost(A(x))-Out_U(x)|}{Opt_U(x)}$ $ϵ_{A} (x) = \frac{∣ cos t ( A ( x )) - O u t _{U} ( x ) ∣}{O p t _{U} ( x )}$
- $\epsilon_A(n)=\max\{\epsilon_A(x)|x\in L_I\cap(\Sigma_I)^n\}$
approximation ratio $R_A(x)=\max\{\frac{cost(A(x))}{Opt_U(x)},\frac{Opt_U(x)}{cost(A(x))}\}$ $R_{A} (x) = max {\frac{cos t ( A ( x ))}{O p t _{U} ( x )}, \frac{O p t _{U} ( x )}{cos t ( A ( x ))}}$
- $R_A(n)=\max\{R_A(x)|x\in L_I\cap(\Sigma_I)^n\}$
- (minimization) $R_A(x)=1+\epsilon_A(x)$

NPO	Name	Description	Examples
$\text{FPTAS}$	fully polynomial-time approximation scheme	$\text{Time}_A(x,\epsilon^{-1})$ bounded by a function that is polynomial in both $\lvert x\rvert$ and $\epsilon^{-1}$	knapsack
$\text{PTAS}$	polynomial-time approximation scheme	$\forall (x,\epsilon)\in L_I\times\mathbb{R}^+$ , $A$ computes a feasible solution $A(x)$ with $\epsilon_A(x)<\epsilon$ and $\text{Time}_A(x,\epsilon^{-1})$ can be bounded by a function that is polynomial in $\vert x\vert$	MSP
$\text{APX}$	$\delta$ -approximation algorithm	$\forall x\in L_I,R_A(x)\leq \delta$	MIN-VCP, MAX-SAT, $\delta$ -TSP
$\log\text{-APX}$	$f(n)$ -approximation algorithm	$R_A(n)\leq f(n),f(n)$ is bounded by a polylogarithmic function $\sum_{i}a_i\log^i(n)$	SCP
$f(n)\text{-APX}$	$f(n)$ -approximation algorithm	$R_A(n)\leq f(n),f(n)$ is not bounded by any polylogarithmic function	TSP, MAX-CL

Distance

distance function from $\overline{U}$ $\overline{U}$ according to $L_I$ $L_{I}$ : $h_L:L\rightarrow\mathbb{R}^+$ $h_{L} : L \to R^{+}$
- $\forall x\in L_I, h_L(x)=0$
- $h_L$ is polynomial-time computable
$Ball_{r,h}(L_I)=\{w\in L|h(w)\leq r\}$
$U_r=(\Sigma_I,\Sigma_O,L,Ball_{r,h}(L_I),M,cost,goal)$
property of infinite jumps: If $Ball_{q,h'}(L_I)\subset Ball_{r,h'}(L_I)$ for some $q<r$ , then $|Ball_{r,h'}(L_I)|-|Ball_{q,h'}(L_I)|$ is infinite

for $\delta$ -approximation $A$

Stability according to $h$	Description
$p$ -stable	$\forall r\leq p,\exists \delta_{r,\epsilon}\in\mathbb{R}^{>1},A$ is $\delta_{r,\epsilon}$ -approximation algorihtm for $U_r$
stable	$A$ is $p$ -stable according to $h$ for all $p\in R^+$
$(r,f_r(n))$ -quasistable	$A$ is an $f_r(n)$ -approxiamtion algorithm for $U_r$

for PTAS $A$ :

Stability according to $h$	Description
stable	$\forall r>0,\forall\epsilon>0,A_\epsilon$ is a $\delta_{r,\epsilon}$ -approximation algorithm for $U_r$
superstable	$\delta_{r,\epsilon}\leq f(\epsilon)g(r)$ , $f,g$ are some functions from $\mathbb{R}^{\geq0}$ to $\mathbb{R}^+$ and $\lim_{\epsilon\rightarrow0}f(\epsilon)=0$

constraint distance function for $u$ $u$ is $h:L_I\times\Sigma^*_O\rightarrow\mathbb{R}^{\geq0}$ $h : L_{I} \times Σ_{O}^{*} \to R^{\geq 0}$ , $\forall S\in M(x),h(x,S)=0$ $\forall S \in M (x), h (x, S) = 0$ , $\forall S\not\in M(x),h(x,S)>0$ $\forall S \neq \in M (x), h (x, S) > 0$ and $h$ $h$ is polynomial-time computable.
- $\epsilon$ -ball of $M(x)$ according to $h$ : $M_\epsilon^h(x)=\{S\in\Sigma^*_O|h(x,S)\leq\epsilon\}$

$h$ -dual	Description
PTAS	$\forall (x,\epsilon)\in L_I\times\mathbb{R}^+,A(x,\epsilon)\in M_\epsilon^h(x)$ and $cost(A(x,\epsilon))\geq Opt_U(x)$ if goal=max and $\text{Time}_A(x,\epsilon^{-1})$ is bounded by a function that is polynomial in $\lvert x\rvert$
FPTAS	$\text{Time}_A(x,\epsilon^{-1})$ can be bounded by a function that is polynoimal in both $\lvert x\rvert$ and $\epsilon^{-1}$

Randomization

$\text{Random}_A(x)$ $Random_{A} (x)$ : the maximum number of random bits used
- $\text{Random}_A(n)$ : $\max\{\text{Random}_A(x)||x|=n\}$
- derandomization: $\text{Random}_A(n)\leq\log n$
$\text{Prob}_{A,x}(C)$ $Prob_{A, x} (C)$ : Probability of the executaion $C$ $C$ on $x$ $x$
- $\text{Prob}(A(x)=y) = \sum_{C\text{ outputs }y}\text{Prob}_{A,x}(C)$
$\text{Exp-Time}_A(x)=\sum_C\text{Prob}_{A,x}(C)*Time(C)$ $Exp-Time_{A} (x) = \sum_{C} Prob_{A, x} (C) * T im e (C)$
- $\text{Exp-Time}_A(n)=\max\{\text{Exp-Time}_A(x)||x|=n\}$
$\text{Time}_A(x)=\max\{\text{Time}(C)|C\text{ runs on }x\}$ $Time_{A} (x) = max {Time (C) ∣ C runs on x}$
- $\text{Time}_A(n)=\max\{\text{Time}_A(x)||x|=n\}$

Decision Problem

Classification	Name	Description	Repeat k times
$\text{ZPP}$	Las Vegas algorithm	$\text{Prob}(A(x)=F(x))\geq\frac{1}{2}$ $\text{Prob}(A(x)=?)<\frac{1}{2}$	$L\in \text{ZPP}_{1-(1-\delta)^k}$
$\text{RP}$	One-sided-error Monte Carlo algorithm	$\forall x\in L,\text{Prob}(A(x)=F(x)=1)\geq\frac{1}{2}$ $\forall x\not\in L,\text{Prob}(A(x)=F(x)=0)=1$	$L\in \text{RP}_{1-(1-\delta)^k}$
$\text{BPP}$	Two-sided-error Monte Carlo algorithm	$\text{Prob}(A(x)=F(x))\geq\frac{1}{2}+\epsilon,0<\epsilon\leq\frac{1}{2}$	$k\geq\frac{2\ln 2\delta}{\ln(1-4\epsilon^2)},L\in \text{BPP}_{1-\delta}$
$\text{PP}$	Unbounded-error Monte Carlo algorithm	$\text{Prob}(A(x)=F(x))>\frac{1}{2}$

Optimization Problem

Algorithm	Description
$\text{RFPTAS}$	$p(\vert x\vert,\delta^{-1})$ is polynomial in both $\vert x\vert$ and $\delta^{-1}$
$\text{RPTAS}$ (randomized polynomial-time approximation scheme)	$\text{Prob}(A(x)\in M(x))=1$ and $\text{Prob}(\epsilon_A(x,\delta)\leq\delta)\geq\frac{1}{2}$ and $Time_A(x,\delta^{-1})\leq p(\vert x\vert,\delta^{-1})$ and $p$ is a polynomial in $\vert x\vert$
randomized $f(n)$ -approximation algorithm	$\text{Prob}(A(x)\in M(x))=1$ and $\text{Prob}(R_A(x)\leq f(\vert x\vert))\geq\frac{1}{2}$
randomized $\delta$ -approximation	$\text{Prob}(A(x)\in M(x))=1$ and $\text{Prob}(R_A(x)\leq\delta)\geq\frac{1}{2}$
randomized $\delta$ -expected approximation	$\text{Prob}(A(x)\in M(x))=1$ and $E(R_A(x))\leq\delta$

w.h.p (with high probility): $p_c=O(1-\frac{1}{n})$
Median Trick
- $\forall\epsilon$ , return a $\hat Z$ in time poly( $|\phi|,\frac{1}{\epsilon}$ ), $P((1-\epsilon)Z\leq\hat Z\leq(1+\epsilon)Z)\geq\frac{2}{3}$
- Repeat $O(\log\frac{1}{\delta})$ and choose median number (Chernoff Bound)
- FPRAS: $\forall\epsilon,\delta$ , return a $\hat Z$ in time Poly( $|\phi|,\frac{1}{\epsilon},\log\frac{1}{\delta}$ ), $P((1-\epsilon)Z\leq\hat Z\leq(1+\epsilon)Z)\geq1-\delta$

Paradigms of Design of Randomized Algorithm

Foiling an adversary
Abundance of witness: decision Problem
- Fingerprinting: equivalence Problem
random sampling
- relexation and random rounding

Table of Contents

Table of Contents

6-难问题求解

Definition

Alphabet

Algorithmic Problems

Turing Machine

Examples

Decision

Optimization

Complexity Theory

Complexity Class

Optimization Complexity

Reduction

Approximation

Error

Distance

Randomization

Decision Problem

Optimization Problem

Paradigms of Design of Randomized Algorithm