Linear programming involves solving optimization problems with linear objective functions and linear constraints. The typical LP problem is formed as follows:

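$$
\begin{aligned}
\underset{x}{\text{minimize}} \quad & c^{\top} x \\
\text{subject to} \quad & w_{LE}^{(i)\top} x \leq b_{i} \quad \text{for } i \in \{1, 2, \dots\} \\
& w_{GE}^{(j)\top} x \geq b_{j} \quad \text{for } j \in \{1, 2, \dots\} \\
& w_{EQ}^{(k)\top} x = b_{k} \quad \text{for } k \in \{1, 2, \dots\}
\end{aligned}
$$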

We typically use matrices to represent linear programs in general form:

$$
\begin{aligned}
\underset{x}{\text{minimize}} \quad & c^{\top} x \\
\text{subject to} \quad & A_{LE} x \leq b_{LE} \\
& A_{GE} x \geq b_{GE} \\
& A_{EQ} x = b_{EQ}
\end{aligned}
$$

We can convert general form linear programs to standard form:

$$
\begin{aligned}
\underset{x}{\text{minimize}} \quad & c^{\top} x \\
\text{subject to} \quad & A x \leq b \\
& x \geq 0
\end{aligned}
$$

We convert $A_{GE} x \geq b_{GE}$ into $-A_{GE} x \leq -b_{GE}$. We then split $A_{EQ} x = b_{EQ}$ into two constraints: $A_{EQ} x \leq b_{EQ}$ and $-A_{EQ} x \leq -b_{EQ}$. Stacking all of the resulting rows gives a single system $A x \leq b$. Next, to ensure all entries of the design vector are nonnegative, we replace $x$ with $x^{+} - x^{-}$ and constrain $x^{+} \geq 0$ and $x^{-} \geq 0$, giving us

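$$
\begin{aligned}
\underset{x^{+}, x^{-}}{\text{minimize}} \quad & \begin{bmatrix} c^{\top} & -c^{\top} \end{bmatrix} \begin{bmatrix} x^{+} \\ x^{-} \end{bmatrix} \\
\text{subject to} \quad & \begin{bmatrix} A & -A \end{bmatrix} \begin{bmatrix} x^{+} \\ x^{-} \end{bmatrix} \leq b \\
& \begin{bmatrix} x^{+} \\ x^{-} \end{bmatrix} \geq 0
\end{aligned}
$$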
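The conversion from general form to standard form can be sketched in a few lines of NumPy. This is a minimal illustration rather than a reference implementation: the function name `to_standard_form` and the small two-variable problem are assumptions made up for this example.

```python
import numpy as np

def to_standard_form(c, A_le, b_le, A_ge, b_ge, A_eq, b_eq):
    """Return (c_std, A_std, b_std) for
    minimize c_std @ y  subject to  A_std @ y <= b_std, y >= 0,
    where y = [x_plus, x_minus] and x = x_plus - x_minus."""
    # Flip the >= rows and split each equality into a pair of <= rows.
    A = np.vstack([A_le, -A_ge, A_eq, -A_eq])
    b = np.concatenate([b_le, -b_ge, b_eq, -b_eq])
    # Substitute x = x_plus - x_minus.
    A_std = np.hstack([A, -A])
    c_std = np.concatenate([c, -c])
    return c_std, A_std, b

# Hypothetical problem with one constraint of each kind.
c = np.array([1.0, 2.0])
A_le, b_le = np.array([[1.0, 1.0]]), np.array([4.0])
A_ge, b_ge = np.array([[1.0, 0.0]]), np.array([-1.0])
A_eq, b_eq = np.array([[1.0, -1.0]]), np.array([0.5])

c_std, A_std, b_std = to_standard_form(c, A_le, b_le, A_ge, b_ge, A_eq, b_eq)

# A point that is feasible in general form, mapped to the split variables.
x = np.array([-0.5, -1.0])
y = np.concatenate([np.maximum(x, 0), np.maximum(-x, 0)])  # [x_plus, x_minus]
```

The mapped point `y` is nonnegative, satisfies `A_std @ y <= b_std`, and has the same objective value as `x`, which is what the substitution is meant to preserve.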

Each inequality $w^{\top} x \leq b$ defines a half-space. The feasible set is the intersection of these half-spaces, which is a convex set.


As a result, any local feasible minimum is also a global feasible minimum.
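Convexity of the feasible set can be checked numerically on a small case. The sketch below uses a hypothetical set of three half-spaces and two hypothetical feasible points, and tests that every point on the segment between them is still feasible.

```python
import numpy as np

# Hypothetical feasible set {x : A x <= b} built from three half-spaces.
A = np.array([[1.0, 1.0],
              [-1.0, 0.0],
              [0.0, -1.0]])
b = np.array([4.0, 0.0, 0.0])

def is_feasible(x, tol=1e-12):
    """True if x satisfies every inequality (up to a small tolerance)."""
    return bool(np.all(A @ x <= b + tol))

x1 = np.array([1.0, 2.0])
x2 = np.array([3.0, 0.5])

# Every convex combination of two feasible points should stay feasible.
combos = [t * x1 + (1 - t) * x2 for t in np.linspace(0.0, 1.0, 11)]
all_feasible = all(is_feasible(x) for x in combos)
```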

Equality Form

We often represent linear programs in equality form:

$$
\begin{aligned}
\underset{x}{\text{minimize}} \quad & c^{\top} x \\
\text{subject to} \quad & A x = b \\
& x \geq 0
\end{aligned}
$$

where $x$ and $c$ have $n$ components, $A$ is an $m \times n$ matrix, and $b$ has $m$ components, i.e., there are $n$ nonnegative design variables and a system of $m$ equations.

Any linear program in standard form can be transformed to equality form by changing the constraints:

$$
A x \leq b \quad \to \quad A x + s = b, \quad s \geq 0
$$

where $s$ is a vector of slack variables, one per constraint, that takes up the difference between $A x$ and $b$ so that equality holds. For example, consider the following linear program:

$$
\begin{aligned}
\underset{x}{\text{minimize}} \quad & 5 x_{1} + 4 x_{2} \\
\text{subject to} \quad & 2 x_{1} + 3 x_{2} \leq 5 \\
& 4 x_{1} + x_{2} \leq 11
\end{aligned}
$$

by introducing two slack variables (one for each constraint) and splitting $x = x^{+} - x^{-}$ , we get

$$
\begin{aligned}
\underset{x^{+}, x^{-}, s}{\text{minimize}} \quad & 5 (x_{1}^{+} - x_{1}^{-}) + 4 (x_{2}^{+} - x_{2}^{-}) \\
\text{subject to} \quad & 2 (x_{1}^{+} - x_{1}^{-}) + 3 (x_{2}^{+} - x_{2}^{-}) + s_{1} = 5 \\
& 4 (x_{1}^{+} - x_{1}^{-}) + (x_{2}^{+} - x_{2}^{-}) + s_{2} = 11 \\
& x_{1}^{+}, x_{1}^{-}, x_{2}^{+}, x_{2}^{-}, s_{1}, s_{2} \geq 0
\end{aligned}
$$

which is in equality form.
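The same transformation can be written as a short NumPy sketch. The helper name `to_equality_form` and the test point `x` are assumptions for illustration; the coefficients are those of the example above, where $x$ is not sign-constrained and is therefore split.

```python
import numpy as np

def to_equality_form(c, A, b):
    """Convert  minimize c @ x  s.t. A @ x <= b  (x free)  into
    minimize c_eq @ y  s.t. A_eq @ y = b, y >= 0,
    with y = [x_plus, x_minus, s]."""
    m, n = A.shape
    A_eq = np.hstack([A, -A, np.eye(m)])        # one slack column per constraint
    c_eq = np.concatenate([c, -c, np.zeros(m)])  # slacks have zero cost
    return c_eq, A_eq, b

c = np.array([5.0, 4.0])
A = np.array([[2.0, 3.0],
              [4.0, 1.0]])
b = np.array([5.0, 11.0])
c_eq, A_eq, b_eq = to_equality_form(c, A, b)

# Map a point that satisfies A @ x <= b to the new variables.
x = np.array([1.0, -2.0])
s = b - A @ x
y = np.concatenate([np.maximum(x, 0), np.maximum(-x, 0), s])
```

The vector `y` is nonnegative, satisfies `A_eq @ y = b` exactly, and keeps the objective value of `x`.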

Simplex Algorithm

The simplex algorithm solves linear programs in equality form by moving between vertices of the feasible set. We assume the rows of $A$ are linearly independent and the number of equality constraints is at most the number of design variables ($m \leq n$), i.e., the problem is not overconstrained.

Linear programs in equality form have feasible sets that form a convex polytope. Points on the interior of the feasible set are never optimal since we can improve them by moving in the $- c$ direction. Moreover, points on the faces of the polytope can only be optimal if the face is perpendicular to $c$ . Finally, vertices have the potential to be optimal.

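The simplex algorithm searches over the feasible set’s vertices for the optimal vertex. Every vertex can be uniquely defined by $n - m$ components of $x$ that equal zero. For example, if $A \in \mathbb{R}^{3 \times 5}$, then setting $x_{2} = x_{5} = 0$ leaves the system

$$
A \begin{bmatrix} x_{1} \\ 0 \\ x_{3} \\ x_{4} \\ 0 \end{bmatrix} = A_{\{1, 3, 4\}} \begin{bmatrix} x_{1} \\ x_{3} \\ x_{4} \end{bmatrix} = \begin{bmatrix} b_{1} \\ b_{2} \\ b_{3} \end{bmatrix}
$$

which uniquely defines a point when the $3 \times 3$ matrix $A_{\{1, 3, 4\}}$, formed from columns 1, 3, and 4 of $A$, is invertible.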

We partition the component indices, $\{1, \dots, n\}$, into two sets, $B$ and $V$, such that

The design values associated with $V$ are zero ($i \in V \Rightarrow x_{i} = 0$).
The design values associated with $B$ are nonnegative and may be zero ($i \in B \Rightarrow x_{i} \geq 0$).
$B$ has exactly $m$ elements and $V$ has exactly $n - m$ elements.

We define $x_{B}$ to be the vector consisting of the components of $x$ that are in $B$. Likewise, $x_{V}$ is the vector consisting of the components of $x$ that are in $V$ (note $x_{V} = 0$).

We then find the vertex associated with partition $(B, V)$ by using the $m \times m$ matrix $A_{B}$ formed by taking the $m$ columns of $A$ selected by $B$ . We then get $x_{B} = A_{B}^{- 1} b$ .

Example: Consider the constraints

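$$
\begin{bmatrix} 1 & 1 & 1 & 1 \\ 0 & -1 & 2 & 3 \\ 2 & 1 & 2 & -1 \end{bmatrix} x = \begin{bmatrix} 2 \\ -1 \\ 3 \end{bmatrix}, \quad x \geq 0.
$$

We want to verify that $x = [1, 1, 0, 0]$ is feasible and that it has no more than three nonzero components. Notice that

$$
A_{\{1, 2, 3\}} = \begin{bmatrix} 1 & 1 & 1 \\ 0 & -1 & 2 \\ 2 & 1 & 2 \end{bmatrix}
$$

is invertible. Moreover,

$$
A x = A_{\{1, 2, 3\}} x_{\{1, 2, 3\}} = A_{\{1, 2, 3\}} \begin{bmatrix} 1 & 1 & 0 \end{bmatrix}^{\top} = b .
$$

Therefore, $x$ is a vertex of the feasible set polytope. Note we could also have used $B = \{1, 2, 4\}$.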

While every vertex has an associated partition $(B, V)$, not every partition corresponds to a vertex. A partition corresponds to a vertex only if $A_{B}$ is nonsingular and $x_{B} = A_{B}^{-1} b$ is feasible, i.e., $A_{B}^{-1} b \geq 0$.
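The two conditions can be tested directly. The sketch below is an illustration under stated assumptions: the function name `vertex_from_partition` is made up here, indices are 0-based (so the text's $B = \{1, 2, 3\}$ becomes `[0, 1, 2]`), and the matrix is the one from the example constraints.

```python
import numpy as np

def vertex_from_partition(A, b, B, tol=1e-9):
    """Return the vertex x for basic indices B (0-based),
    or None if the partition does not correspond to a vertex."""
    B = sorted(B)
    A_B = A[:, B]
    if np.linalg.matrix_rank(A_B) < A.shape[0]:
        return None                      # A_B is singular
    x_B = np.linalg.solve(A_B, b)
    if np.any(x_B < -tol):
        return None                      # violates x >= 0
    x = np.zeros(A.shape[1])
    x[B] = x_B                           # components in V stay at zero
    return x

A = np.array([[1.0, 1.0, 1.0, 1.0],
              [0.0, -1.0, 2.0, 3.0],
              [2.0, 1.0, 2.0, -1.0]])
b = np.array([2.0, -1.0, 3.0])

x_123 = vertex_from_partition(A, b, [0, 1, 2])  # B = {1, 2, 3} in 1-based numbering
x_124 = vertex_from_partition(A, b, [0, 1, 3])  # B = {1, 2, 4}
x_134 = vertex_from_partition(A, b, [0, 2, 3])  # B = {1, 3, 4}
```

Both `x_123` and `x_124` recover the same vertex, while `x_134` is `None`: that $A_{B}$ is invertible, but solving for $x_{B}$ gives a negative component, so the partition is not a vertex.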

The simplex algorithm has two phases:

An initialization phase which identifies a vertex partition
An optimization phase which transitions between vertex partitions toward a partition corresponding to an optimal vertex.

First-Order Necessary Conditions

Using the Lagrangian for the equality form gives us

$$
\mathcal{L}(x, \mu, \lambda) = c^{\top} x - \mu^{\top} x - \lambda^{\top} (A x - b)
$$

with the following FONCs:

feasibility: $A x = b$ , $x \geq 0$
dual feasibility: $μ \geq 0$
complementary slackness: $μ ⊙ x = 0$ (element-wise product)
stationary: $A^{⊤} λ + μ = c$

For linear programs, the FONCs are also sufficient conditions for optimality.

We can decompose the stationary condition into $B$ and $V$ components:

$$
A_{B}^{\top} \lambda + \mu_{B} = c_{B} \quad \text{and} \quad A_{V}^{\top} \lambda + \mu_{V} = c_{V} .
$$

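Choosing $\mu_{B} = 0$ satisfies complementary slackness for the components in $B$, and $x_{V} = 0$ satisfies it for the components in $V$. The $B$ equation then reduces to $A_{B}^{\top} \lambda = c_{B}$, so

$$
\lambda = (A_{B}^{-1})^{\top} c_{B} .
$$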

Plugging this into the $V$ stationary equation gives

$$
\mu_{V} = c_{V} - (A_{B}^{-1} A_{V})^{\top} c_{B} .
$$

If $μ_{V}$ contains negative components, then dual feasibility is not satisfied and the vertex is sub-optimal.
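As a rough sketch of this optimality test, the function below computes $\lambda$ and $\mu_{V}$ for a given partition. The name `multipliers`, the 0-based indexing, and the small $2 \times 4$ problem are assumptions chosen for illustration. The code uses the equivalent form $\mu_{V} = c_{V} - A_{V}^{\top} \lambda$ and solves linear systems instead of forming $A_{B}^{-1}$.

```python
import numpy as np

def multipliers(A, c, B):
    """Return (lam, mu_V, V) for the partition with basic indices B (0-based)."""
    n = A.shape[1]
    B = sorted(B)
    V = [i for i in range(n) if i not in B]
    lam = np.linalg.solve(A[:, B].T, c[B])   # solves A_B^T lam = c_B
    mu_V = c[V] - A[:, V].T @ lam            # mu_V = c_V - A_V^T lam
    return lam, mu_V, V

# Illustrative data (an assumption for this sketch).
A = np.array([[1.0, 1.0, 1.0, 0.0],
              [-4.0, 2.0, 0.0, 1.0]])
c = np.array([3.0, -1.0, 0.0, 0.0])

lam_a, mu_a, V_a = multipliers(A, c, [2, 3])
lam_b, mu_b, V_b = multipliers(A, c, [1, 2])

# A vertex passes the test when mu_V has no negative component.
optimal_a = bool(np.all(mu_a >= 0))
optimal_b = bool(np.all(mu_b >= 0))
```

For the first partition $\mu_{V}$ has a negative entry, so that vertex is sub-optimal; for the second partition all entries are nonnegative.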

Optimization Phase

At this point of the algorithm we have a partition $(B, V)$ that corresponds to a vertex of the feasible set polytope. We can update the partition by swapping indices between $B$ and $V$ .

A transition $x \to x'$ between vertices must satisfy $A x' = b$. We choose an entering index $q \in V$, whose component $x_{q}'$ is allowed to become nonzero while the other components in $V$ stay at zero. The new vertex $x'$ must then satisfy

$$
A x' = A_{B} x_{B}' + A_{\{q\}} x_{q}' = A_{B} x_{B} = A x = b .
$$

The index $q$ replaces a leaving index $p \in B$, whose component becomes zero in $x_{B}'$. We call such a swap pivoting. We can then solve for the new design point:

$$
x_{B}' = x_{B} - A_{B}^{-1} A_{\{q\}} x_{q}' .
$$

In particular, we want the value of $x_{q}'$ at which the leaving component reaches zero, i.e., $(x_{B}')_{p} = 0$. Solving for $x_{q}'$ gives us

$$
x_{q}' = \frac{(x_{B})_{p}}{(A_{B}^{-1} A_{\{q\}})_{p}} .
$$

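We pick the leaving index through the minimum ratio test: among the indices $p \in B$ for which $(A_{B}^{-1} A_{\{q\}})_{p} > 0$, select the one that minimizes $x_{q}'$. Taking the smallest ratio keeps every component of $x_{B}'$ nonnegative. With the leaving index selected, we swap $p$ and $q$ between $B$ and $V$.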

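import numpy as np

def edge_transition(A, c, b, B, q):
    # Assumes B holds 0-based basic column indices and q is a position in
    # the sorted list of non-basic indices. c is unused but kept in the signature.
    n = A.shape[1]
    b_inds = sorted(B)
    n_inds = sorted(set(range(n)) - set(B))  # indices not in B
    AB = A[:, b_inds]
    ABinv = np.linalg.inv(AB)
    d = np.matmul(ABinv, A[:, n_inds[q]])
    xB = np.matmul(ABinv, b)

    # Pick the leaving index p that minimizes x'_q (minimum ratio test)
    p, xq = 0, np.inf
    for i in range(len(d)):
        if d[i] > 0:
            v = xB[i] / d[i]
            if v < xq:
                p, xq = i, v

    # p is a position in b_inds; xq stays np.inf if no entry of d is positive
    return (p, xq)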

Different heuristics can be used to select an entering index $q$ :

Greedy: choose the $q$ that maximally reduces $c^{\top} x$.
Dantzig’s rule: choose the $q$ with the most negative entry in $\mu$.
Bland’s rule: choose the first $q$ with a negative entry in $\mu$.
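The three rules can be compared on a small vector of multipliers. This is a hedged sketch: the function names, the sample `mu_V`, and the sample step sizes are hypothetical, and each function returns a position within $V$ rather than an index into $x$. The greedy rule relies on the fact that entering index $q$ with step $x_{q}'$ changes the objective by $\mu_{q} x_{q}'$, and it assumes the step sizes are finite.

```python
import numpy as np

def dantzig_rule(mu_V):
    """Position in V of the most negative multiplier, or None if none is negative."""
    q = int(np.argmin(mu_V))
    return q if mu_V[q] < 0 else None

def bland_rule(mu_V):
    """Position in V of the first negative multiplier, or None if none is negative."""
    negative = np.flatnonzero(mu_V < 0)
    return int(negative[0]) if negative.size else None

def greedy_rule(mu_V, steps):
    """Position in V giving the largest decrease mu_q * x'_q of the objective.
    steps[q] is the step x'_q found by the minimum ratio test (assumed finite)."""
    change = np.where(mu_V < 0, mu_V * steps, 0.0)
    q = int(np.argmin(change))
    return q if change[q] < 0 else None

# Hypothetical multipliers and step sizes for four non-basic indices.
mu_V = np.array([-1.0, 0.5, -3.0, -2.0])
steps = np.array([2.0, 1.0, 1.0, 4.0])

q_dantzig = dantzig_rule(mu_V)
q_bland = bland_rule(mu_V)
q_greedy = greedy_rule(mu_V, steps)
```

On this data the three rules pick three different entering positions, which shows that the choice of heuristic can change the path the algorithm takes through the vertices.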

Example: Consider the equality-form linear program with

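$$
A = \begin{bmatrix} 1 & 1 & 1 & 0 \\ -4 & 2 & 0 & 1 \end{bmatrix}, \quad b = \begin{bmatrix} 9 \\ 2 \end{bmatrix}, \quad c = \begin{bmatrix} 3 \\ -1 \\ 0 \\ 0 \end{bmatrix}
$$

and the initial vertex defined by $B = \{3, 4\}$. We first extract $x_{B}$:

$$
x_{B} = A_{B}^{-1} b = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}^{-1} \begin{bmatrix} 9 \\ 2 \end{bmatrix} = \begin{bmatrix} 9 \\ 2 \end{bmatrix}
$$

and compute $\lambda$:

$$
\lambda = (A_{B}^{-1})^{\top} c_{B} = 0
$$

and $\mu_{V}$:

$$
\mu_{V} = c_{V} - (A_{B}^{-1} A_{V})^{\top} c_{B} = \begin{bmatrix} 3 \\ -1 \end{bmatrix} .
$$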

Notice that $\mu_{V}$ has a negative element, so $B$ is sub-optimal. We pivot on the index of the negative element, $q = 2$. Here $A_{B}^{-1} A_{\{q\}} = [1, 2]^{\top}$, so the candidate ratios are $9 / 1 = 9$ for index 3 and $2 / 2 = 1$ for index 4. Using the edge transition, we get that $p = 4$ gives the minimal $x_{q}' = 1$. Thus, we update our set of indices to $B = \{2, 3\}$, ending the first iteration.

In the second iteration, we find that $B = \{2, 3\}$ is optimal, as $\mu_{V} = [1, 0.5]^{\top}$ has no negative entries.
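The iterations of this example can be reproduced with a compact loop. The following is a minimal sketch, not a production solver: the function name `simplex_optimize` is an assumption, indices are 0-based (so the text's $B = \{3, 4\}$ is `[2, 3]`), Dantzig's rule is used for the entering index, and no anti-cycling safeguard is included.

```python
import numpy as np

def simplex_optimize(A, b, c, B, max_iters=100):
    """Optimization-phase sketch starting from a vertex partition B (0-based).
    Returns (x, B) at an optimal vertex; raises ValueError if unbounded."""
    m, n = A.shape
    B = sorted(B)
    for _ in range(max_iters):
        V = [i for i in range(n) if i not in B]
        A_B = A[:, B]
        x_B = np.linalg.solve(A_B, b)
        lam = np.linalg.solve(A_B.T, c[B])
        mu_V = c[V] - A[:, V].T @ lam
        if np.all(mu_V >= -1e-12):           # dual feasible: optimal vertex
            x = np.zeros(n)
            x[B] = x_B
            return x, B
        q = V[int(np.argmin(mu_V))]          # Dantzig's rule
        d = np.linalg.solve(A_B, A[:, q])
        # Minimum ratio test over the positive entries of d.
        ratios = [x_B[i] / d[i] if d[i] > 1e-12 else np.inf for i in range(m)]
        p = int(np.argmin(ratios))
        if not np.isfinite(ratios[p]):
            raise ValueError("linear program is unbounded")
        B[p] = q                             # swap the leaving and entering indices
        B.sort()
    raise RuntimeError("iteration limit reached")

A = np.array([[1.0, 1.0, 1.0, 0.0],
              [-4.0, 2.0, 0.0, 1.0]])
b = np.array([9.0, 2.0])
c = np.array([3.0, -1.0, 0.0, 0.0])

x_opt, B_opt = simplex_optimize(A, b, c, [2, 3])
```

Starting from `[2, 3]`, the loop performs one pivot and stops at `B_opt = [1, 2]`, which is $B = \{2, 3\}$ in the text's 1-based numbering.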

Initialization Phase

Before performing the optimization phase, we need an initial partition corresponding to a vertex. We can find such a partition by solving an auxiliary linear program:

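$$
\begin{aligned}
\underset{x, z}{\text{minimize}} \quad & \begin{bmatrix} 0^{\top} & 1^{\top} \end{bmatrix} \begin{bmatrix} x \\ z \end{bmatrix} \\
\text{subject to} \quad & \begin{bmatrix} A & Z \end{bmatrix} \begin{bmatrix} x \\ z \end{bmatrix} = b \\
& \begin{bmatrix} x \\ z \end{bmatrix} \geq 0
\end{aligned}
$$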

where $Z$ is the diagonal matrix whose diagonal entries are given by

$$
Z_{ii} = \begin{cases} +1 & \text{if } b_{i} \geq 0 \\ -1 & \text{otherwise.} \end{cases}
$$

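We start solving the auxiliary linear program from the partition whose $B$ selects only the $z$-values. The corresponding vertex has $x = 0$ and each $z$-element equal to the absolute value of the corresponding $b$-value. If the optimal value of the auxiliary program is zero, then $z = 0$ and the resulting $x$ is feasible for the original problem; otherwise, the original problem is infeasible.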

Example: Consider the following equality-form linear program:

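$$
\begin{aligned}
\underset{x_{1}, x_{2}, x_{3}}{\text{minimize}} \quad & c_{1} x_{1} + c_{2} x_{2} + c_{3} x_{3} \\
\text{subject to} \quad & 2 x_{1} - x_{2} + 2 x_{3} = 1 \\
& 5 x_{1} + x_{2} - 3 x_{3} = -2 \\
& x_{1}, x_{2}, x_{3} \geq 0
\end{aligned}
$$

We can identify a feasible vertex by solving

$$
\begin{aligned}
\underset{x_{1}, x_{2}, x_{3}, z_{1}, z_{2}}{\text{minimize}} \quad & z_{1} + z_{2} \\
\text{subject to} \quad & 2 x_{1} - x_{2} + 2 x_{3} + z_{1} = 1 \\
& 5 x_{1} + x_{2} - 3 x_{3} - z_{2} = -2 \\
& x_{1}, x_{2}, x_{3}, z_{1}, z_{2} \geq 0
\end{aligned}
$$

with an initial vertex defined by $B = \{4, 5\}$. The initial vertex has

$$
x_{B}^{(1)} = A_{B}^{-1} b = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}^{-1} \begin{bmatrix} 1 \\ -2 \end{bmatrix} = \begin{bmatrix} 1 \\ 2 \end{bmatrix}
$$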

and thus, $x^{(1)} = [0, 0, 0, 1, 2]$ . From here we can then begin the optimization phase to get the feasible vertex $[0.045, 1.713, 1.312, 0, 0]$ in the auxiliary problem, or $[0.045, 1.713, 1.312]$ in the original problem.
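Building the auxiliary problem and its starting vertex is mechanical, as the sketch below suggests. The helper name `auxiliary_lp` and the 0-based indexing are assumptions for illustration; the coefficients are those of the example above. The last lines only check that the rounded feasible point quoted above satisfies the original equality constraints to about three decimal places.

```python
import numpy as np

def auxiliary_lp(A, b):
    """Build the auxiliary problem data and the starting basic indices (0-based)."""
    m, n = A.shape
    Z = np.diag(np.where(b >= 0, 1.0, -1.0))     # Z_ii = +1 if b_i >= 0, else -1
    A_aux = np.hstack([A, Z])
    c_aux = np.concatenate([np.zeros(n), np.ones(m)])
    B0 = list(range(n, n + m))                   # select only the z columns
    return A_aux, c_aux, B0

A = np.array([[2.0, -1.0, 2.0],
              [5.0, 1.0, -3.0]])
b = np.array([1.0, -2.0])

A_aux, c_aux, B0 = auxiliary_lp(A, b)

# Initial vertex of the auxiliary problem: x = 0 and z = |b|.
x0 = np.zeros(A_aux.shape[1])
x0[B0] = np.linalg.solve(A_aux[:, B0], b)

# Residual of the rounded feasible point quoted in the text.
x_quoted = np.array([0.045, 1.713, 1.312])
residual = A @ x_quoted - b
```

Running an optimization phase on `(A_aux, b, c_aux)` from `B0` would then drive the $z$-values toward zero; that step is omitted here to keep the sketch short.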
