A Wishlist for Optimization Algorithms
- Nonlinear, non-convex objectives
- Powerful enough to tackle hard problems in a systematic way, and meanwhile is still practical
- Becoming more accurate as we're paying more
- A generic framework that can be applied obviously to various problems.
- Methods
- sum-of-squares (SoS) SDP
- Lasserre hierarchy
- Lovász-Schrijver hierarchy
Quadratic programming
min21xTQx+cTx
Semidefinite Program
- A≽0: symmetric matrix A∈Rn×n is positive semidefinite (∀x∈Rn,xTAx≥0)
- A≽0⟺∀λ(A)≥0⟺∃B∈Rn×n,A=BTB
- Semidefinite Programing(SDP): C,A1,⋯,Ak∈Rn×n,b1,b2,⋯,bk∈R
max s.t. tr(CTY)=i=1∑ni=1∑ncijyijtr(ArTY)≤brY≽0Y=YT∈Rn×n∀1≤r≤k
- SDP (vector program form, LP for inner products)
maxs.t.i=1∑ni=1∑ncij⟨vi,vj⟩i=1∑nj=1∑naij(r)⟨vi,vj⟩≤brv1,⋯,vn∈Rn∀1≤r≤k
- LP ⊂ SDP ⊂ convex programs
- ellipsoid method: find OPT±ϵ in time poly(n,ϵ1)
- SDP-Relaxation
- Present the original quadratic program
- SDP relaxation: xuxv→⟨xv,xu⟩
- Solve SDP relaxtion
- Round optimal solution xv∗ to x^v
Max-Cut
- Some Algorithm
- Greedy: 21-approximate
- Random: 21-approximate
- Local Search: 21-approximate
- LP for Max-Cut
maxs.t. uv∈E∑yuvyuv≤∣xu−xv∣xv∈{0,1}∀uv∈E∀v∈V
- Linearization: Integrality gap >2
maxs.t. uv∈E∑yuvyuv≤yuw+ywvyuv+yuw+ywv≤2yuv∈{0,1},∀u,v∈V∀u,v,w∈V∀u,v,w∈V
maxs.t. uv∈E∑yuvyuv≤21(1−xuxv)xv∈{−1,1}∀uv∈E∀v∈V
- Strictly Quadratic Program (nonlinear,non-convex)
maxs.t. uv∈E∑21(1−xuxv)xv2=1∀v∈V
maxs.t. uv∈E∑21(1−⟨xu,xv⟩)∥xv∥2=1xv∈R∣V∣∀v∈V
- Rounding: x^v=sgn(⟨xv∗,u⟩),u∈Rn,∥u∥2=1 is uniform random unit vector
- Generate u: u=∥r∥2r,r=(r1,⋯,rn)∈Rn,ri∼N(0,1) i.i.d
- E(cut)=∑uv∈Eπθuv=∑uv∈Eπarccos⟨xu∗,xv∗⟩≥α∑uv∈E21(1−⟨xu∗,xv∗⟩),≥αOPTIP≥αOPT
- Assuming the unique games conjecture: no poly-time algorithm with approx. ratio <α=infx∈[−1,1]π(1−x)2arccos(x)=0.87856⋯
SoS
- Given a n-variate polynomial f, determine whether f(x)≥0 for all x∈{0,1}n
- multilinear: f(x)=∑d=(d1,⋯,dn)∈{0,1}nx1d1⋯xndn
- Given a n-variate polynomial f, find
- either an x∈{0,1}n such that f(x)<0
- or a "proof" of f(x)≥0 over all x∈{0,1}n
- SoS proof: f(x)=∑i=1rg1(x)2
- For nonnegatvie polynomial
- f:{0,1}n→R, degree-2n SoS proof always exists
- degree-d SoS proof needs at most r=nO(d) length
- if f has degree=d SoS proff, then it can be found in nO(d) time (by SDP)