
A CONSTRUCTIVE GLOBAL CONVERGENCE OF THE MIXED BARRIER-PENALTY METHOD FOR MATHEMATICAL OPTIMIZATION PROBLEMS

ABSTRACT

In this paper we develop a generic mixed bi-parametric barrier-penalty method, based upon generic barrier and penalty algorithms, for constrained nonlinear programming problems. When the feasible set is defined by equality and inequality functional constraints, it is possible to provide explicit barrier and penalty functions. In such a case, the continuity and differentiability properties of the constraints and of the objective function are inherited by the penalized function.

The main contribution of this work is a constructive proof of the global convergence of the sequence generated by the proposed mixed method. The proof separately uses the main global convergence results for barrier and penalty methods. Finally, for a simple nonlinear problem, we explicitly derive the mixed barrier-penalty function and illustrate all the functions defined in this work. We also implement MATLAB code to generate the iterates of the mixed method.

Keywords:
nonlinear programming; mixed barrier-penalty methods; convergence of mixed algorithm

1 INTRODUCTION

Mathematical optimization is one of the concepts most widely used to analyze complex decision or allocation problems. In order to make better use of available resources, optimization techniques allow the selection of values for a number of interrelated variables, with which we can measure the performance and quality of a decision through one or more objective functions.

Specifically, a mathematical optimization problem consists of minimizing or maximizing an objective function f(x) subject to the restriction $x \in \Omega$, where f is a real-valued continuous function defined on $\Omega \subseteq \mathbb{R}^n$. In this work, we consider a feasible set Ω comprising three types of restrictions

$$x \in \Omega_1, \quad x \in \Omega_2, \quad x \in \Omega_3, \qquad (1)$$

where Ω1 can be any restriction set that is difficult to handle, Ω2 is a robust set and Ω3 could be a simple set such as sign or bound restrictions. A robust set is one having a nonempty interior that is dense in the set; in other words, the set has an interior, and it is possible to reach any boundary point as the limit of a sequence of interior points, Luenberger & Ye (2008).

According to the specifications above, we consider the following optimization problem,

$$\min f(x) \quad \text{s.t.} \quad x \in \Omega_1, \; x \in \Omega_2, \; x \in \Omega_3. \qquad (2)$$

One of the most common formulations of nonlinear programming is the one in which the restrictions are characterized by equality and inequality functional constraints, Bazaraa et al. (2013), Luenberger & Ye (2008), Wright & Nocedal (1999), Griva et al. (2009). Given the continuous functions $f: \mathbb{R}^n \to \mathbb{R}$, $h: \mathbb{R}^n \to \mathbb{R}^m$, $g: \mathbb{R}^n \to \mathbb{R}^p$, the classical nonlinear optimization problem is

$$\min f(x) \quad \text{s.t.} \quad h(x) = 0, \;\; g(x) \le 0, \qquad (3)$$

where the restriction sets are given by $\Omega_1 = \{x \in \mathbb{R}^n : h(x) = 0\}$, $\Omega_2 = \{x \in \mathbb{R}^n : g(x) \le 0\}$ and $\Omega_3 = \mathbb{R}^n$.

For many decades, authors have proved theoretical results and proposed several algorithms for solving nonlinear optimization problems with penalty or barrier function methods. Luenberger & Ye (2008) and Fiacco & McCormick (1990) state convergence for both methods; Polyak (1971) showed the convergence rate of the penalty function method in Hilbert space; Bertsekas (1976) obtained convergence and rate-of-convergence results for the sequences of primal and dual variables generated by penalty and Lagrange multiplier methods, showing that the multiplier method is faster than the pure penalty method. Fiacco & McCormick (1990) demonstrate, by contradiction, the global convergence of the mixed penalty-barrier method; also, Breitfeld & Shanno (1995) proposed a composite algorithm of augmented Lagrangian, modified log-barrier, and classical log-barrier methods, for which they demonstrated global convergence to a first-order stationary point of the constrained problem, based on Breitfeld & Shanno (1994).

In this work, we develop the mixed barrier-penalty method for solving the general nonlinear problem (2), and we provide a generic bi-parametric algorithm. The main contribution is a constructive proof of the global convergence of the sequence generated by that mixed method, as an alternative to existing proofs under slightly different assumptions. Suñagua & Oliveira (2017) showed that computational experiments on NETLIB problems work successfully for large-scale linear optimization problems.

2 BARRIER METHODS OVERVIEW

Barrier methods are also called interior point or interior penalty methods. Some of their theoretical results were developed by Martínez & Santos (1995), Luenberger & Ye (2008), Nash & Sofer (1993), and Wright (1992). These methods are applicable to problems of the form

$$\min f(x) \quad \text{s.t.} \quad x \in \Omega, \qquad (4)$$

where f is a continuous function and Ω is a robust restriction set. Such a set often arises from inequality constraints, that is, $\Omega = \{x \in \mathbb{R}^n : g(x) \le 0\}$, for which there is a point $\bar{x} \in \Omega$ such that $g(\bar{x}) < 0$.

Barrier methods work by establishing a barrier on the boundary of the restriction set that prevents a search procedure from leaving the feasible region. A barrier function is a function B(·) defined on the interior set $\mathrm{Int}(\Omega) = \{x : g(x) < 0\}$ of Ω such that (i) B is continuous, (ii) $B(x) \ge 0$, (iii) $B(x) \to \infty$ as x approaches the boundary of Ω. For inequality constraints $g_i(x) \le 0$, $i = 1, 2, \ldots, p$, the barrier functions most commonly used in practice are the logarithmic and the inverse barrier functions, defined on Int(Ω) respectively by

$$B(x) = -\sum_{i=1}^{p} \log(-g_i(x)) \qquad \text{and} \qquad B(x) = -\sum_{i=1}^{p} \frac{1}{g_i(x)}.$$
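As a concrete illustration, the following minimal MATLAB encoding of these two barriers may be used (the function names and the convention that g is a handle returning the vector of constraint values are our assumptions; each function is saved in its own file):

```matlab
function B = logbarrier(g, x)
% Logarithmic barrier for constraints g(x) <= 0, where the handle g
% returns the vector [g_1(x); ...; g_p(x)]. Returning +Inf outside
% Int(Omega) makes an unconstrained inner solver avoid the boundary.
gx = g(x);
if any(gx >= 0)
    B = Inf;               % not in the interior: infinite barrier
else
    B = -sum(log(-gx));    % finite on Int(Omega)
end
end

function B = invbarrier(g, x)
% Inverse barrier for constraints g(x) <= 0, same conventions as above.
gx = g(x);
if any(gx >= 0)
    B = Inf;
else
    B = -sum(1 ./ gx);
end
end
```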

Now, the problem (4) can be transformed into a penalized subproblem

$$(P_\mu)\qquad \min f(x) + \mu B(x) \quad \text{s.t.} \quad x \in \mathrm{Int}(\Omega), \qquad (5)$$

where µ > 0 is called the barrier parameter and we take µ small (going to zero). In this approach, the main assumption is that the original problem (4) has a global solution $x^*$. Let $x(\mu)$ be a global solution of subproblem (5). When $\mu_k \to 0$, we expect $x(\mu_k)$ to converge to $x^*$.

Given $\phi(x, \mu) = f(x) + \mu B(x)$, we have the generic barrier method given in Algorithm 1; a MATLAB sketch follows the algorithm.

Algorithm 1
Barrier Algorithm
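The sketch below is one possible instantiation of Algorithm 1 (the inner solver fminsearch, the update $\mu_{k+1} = \mu_k / 10$ and the stopping test are our illustrative choices, not prescribed by the algorithm):

```matlab
function [x, mu] = barrier_method(f, B, x0, mu0, tol, maxit)
% Generic barrier method (Algorithm 1), illustrative sketch only.
% f, B : function handles; B must return +Inf outside Int(Omega),
%        which keeps the inner unconstrained solver in the interior.
% x0   : strictly feasible starting point, i.e. x0 in Int(Omega).
mu = mu0;
x  = x0;
for k = 1:maxit
    phi = @(y) f(y) + mu*B(y);   % phi(x, mu_k) = f(x) + mu_k B(x)
    x   = fminsearch(phi, x);    % approximate minimizer of (P_mu)
    if mu < tol                  % stop once the barrier weight is negligible
        break
    end
    mu = mu/10;                  % drive mu_k -> 0
end
end

% Example: min f(x) = x s.t. 1 - x <= 0 (solution x* = 1), using the
% logbarrier sketched above:
%   x = barrier_method(@(x) x, @(x) logbarrier(@(y) 1 - y, x), 2, 1, 1e-8, 20);
```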

The following lemma gives a set of inequalities that follow directly from the steps of Algorithm 1. The proof is based on Luenberger & Ye (2008) and Martínez & Santos (1995).

Lemma 2.1. Let $\{x_k\}$ be a sequence generated by Algorithm 1. Then

  1. $\phi(x_{k+1}, \mu_k) \le \phi(x_k, \mu_{k-1})$

  2. $B(x_{k+1}) \ge B(x_k)$

  3. $f(x_{k+1}) \le f(x_k)$.

Proof. Since $\{\mu_k\}$ is a monotone decreasing sequence, $x_{k+1}$ is a global minimizer of (6) and, recalling condition (ii) of the barrier definition, B is a non-negative function; then

$$\phi(x_{k+1}, \mu_k) = f(x_{k+1}) + \mu_k B(x_{k+1}) \le f(x_k) + \mu_k B(x_k) \le f(x_k) + \mu_{k-1} B(x_k) = \phi(x_k, \mu_{k-1}).$$

To establish the second inequality, we also have

$$\phi(x_{k+1}, \mu_k) = f(x_{k+1}) + \mu_k B(x_{k+1}) \le f(x_k) + \mu_k B(x_k) \qquad (7)$$

$$\phi(x_k, \mu_{k-1}) = f(x_k) + \mu_{k-1} B(x_k) \le f(x_{k+1}) + \mu_{k-1} B(x_{k+1}); \qquad (8)$$

now, adding (7) and (8) and cancelling the common terms, we get

$$(\mu_k - \mu_{k-1}) B(x_{k+1}) \le (\mu_k - \mu_{k-1}) B(x_k);$$

dividing by the common factor $\mu_k - \mu_{k-1} < 0$, which reverses the inequality, we prove item 2.

Finally, by the previous inequality,

$$f(x_{k+1}) + \mu_k B(x_{k+1}) \le f(x_k) + \mu_k B(x_k) \le f(x_k) + \mu_k B(x_{k+1}),$$

hence $f(x_{k+1}) \le f(x_k)$. □

The global convergence of the barrier method, in the sense that any limit point of the sequence is a solution of problem (4), can be verified from the previous Lemma.

Theorem 2.1. Let $\{x_k\}$ be a sequence generated by Algorithm 1, in which $\mu_k \to 0$. Then any limit point of the sequence is a global minimizer of problem (4).

Proof. Let $f_k = \min\{\phi(x, \mu_k) : x \in \mathrm{Int}(\Omega)\}$ be the global minimum value of $\phi(\cdot, \mu_k)$ on $\mathrm{Int}(\Omega)$, attained at $x_{k+1}$. By Lemma 2.1, $f_k \ge f_{k+1}$ for all k. If $f^* = \min\{f(x) : x \in \Omega\}$, then

$$f_0 \ge f_1 \ge \cdots \ge f_k \ge f_{k+1} \ge \cdots \ge f^*.$$

First we prove that the sequence $\{f_k\}$ converges to $f^*$; then we show that every convergent subsequence of $\{x_k\}$ converges to some global minimizer.

Indeed, $\{f_k\}$ is a monotone decreasing sequence bounded below, hence it converges to its infimum, say $\bar{f}$. If $\bar{f} \ne f^*$, then $\bar{f} > f^*$. Recalling that $x^*$ is the global minimizer of (4) and f is continuous, there is an open ball ℬ centered at $x^*$ (whose intersection with $\mathrm{Int}(\Omega)$ is nonempty, by the robustness of Ω) such that, for all $x \in \mathcal{B} \cap \mathrm{Int}(\Omega)$, we have

$$f(x) < \bar{f} - \tfrac{1}{2}(\bar{f} - f^*). \qquad (9)$$

Since $B(x) \ge 0$ for all $x \in \mathrm{Int}(\Omega)$ and $0 < \mu_{k+1} < \mu_k$ with $\mu_k \to 0$, we have $0 \le \mu_{k+1} B(x) \le \mu_k B(x)$ for all $x \in \mathrm{Int}(\Omega)$. Therefore,

$$\lim_{k \to \infty} \mu_k B(x) = 0, \quad \forall x \in \mathrm{Int}(\Omega). \qquad (10)$$

Thus, for any $x' \in \mathcal{B} \cap \mathrm{Int}(\Omega)$ and k large enough, we get

$$\mu_k B(x') < \tfrac{1}{4}(\bar{f} - f^*). \qquad (11)$$

Then, from (9) and (11), we have

$$\phi(x', \mu_k) < \bar{f} - \tfrac{1}{2}(\bar{f} - f^*) + \tfrac{1}{4}(\bar{f} - f^*) = \bar{f} - \tfrac{1}{4}(\bar{f} - f^*) < \bar{f},$$

which contradicts $f_k \ge \bar{f}$, since $f_k \le \phi(x', \mu_k)$. Therefore $\bar{f} = f^*$, that is,

$$f_{k+1} = \phi(x_{k+1}, \mu_k) \to f^*. \qquad (12)$$

Now, let $\bar{x} \in \Omega$ be any subsequential limit of $\{x_k\}$; more precisely, there is a subsequence $\{x_{k_l}\}$ such that $x_{k_l} \to \bar{x}$. If $\bar{x} \ne x^*$ with $f(\bar{x}) > f(x^*)$, then, by the continuity of f, the sequence $f(x_{k_l}) - f^* + \mu_{k_l - 1} B(x_{k_l})$ cannot converge to zero, which contradicts $f_k - f^* \to 0$. Therefore either $\bar{x} = x^*$, or $\bar{x} \ne x^*$ but $f(\bar{x}) = f(x^*)$. Thus, every limit point generated by Algorithm 1 is a global solution of problem (4). □

3 PENALTY METHODS OVERVIEW

Given a continuous function $f: \mathbb{R}^n \to \mathbb{R}$, we consider the problem

$$(GP)\qquad \min f(x) \quad \text{s.t.} \quad x \in \Omega_1, \; x \in \Omega_2, \qquad (13)$$

where Ω1 and Ω2 are arbitrary subsets of ℝⁿ. In most applications Ω1 is defined implicitly by functional restrictions such as $h(x) = 0$, where $h: \mathbb{R}^n \to \mathbb{R}^m$. In some cases, we assume that f and h are twice differentiable. A basic assumption is that problem (GP) admits a global minimizer; some theoretical results were established by Polyak (1971), Breitfeld & Shanno (1995), Nash (2010), and Luenberger & Ye (2008).

Given a restriction set Ω1, a penalty function is a function $\mathcal{P}: \mathbb{R}^n \to \mathbb{R}$ satisfying (i) 𝒫 is continuous, (ii) $\mathcal{P}(x) = 0$ if $x \in \Omega_1$, and (iii) $\mathcal{P}(x) > 0$ if $x \notin \Omega_1$.

In order to solve the problem (13), the penalty function method solves the following penalized subproblem

$$(Q_\rho)\qquad \min f(x) + \rho \mathcal{P}(x) \quad \text{s.t.} \quad x \in \Omega_2, \qquad (14)$$

where ρ > 0 is a constant called the penalty parameter. For large ρ, it is clear that a solution of (14) will lie in a region where 𝒫 is small. Thus, as ρ → ∞, it is expected that the corresponding optimal points approach the feasible set Ω1.

For functions $h: \mathbb{R}^n \to \mathbb{R}^m$ and $g: \mathbb{R}^n \to \mathbb{R}^p$ of class C², some useful penalty functions 𝒫, based on the type of restrictions h(x) = 0 or g(x) ≤ 0, are

  1. $\mathcal{P}(x) = \frac{1}{2}\|h(x)\|_2^2$, the quadratic penalty,

  2. $\mathcal{P}(x) = \|h(x)\|_1$,

  3. $\mathcal{P}(x) = \sum_{i=1}^{p} [\max\{0, g_i(x)\}]^2$,

  4. $\mathcal{P}(x) = \frac{1}{2}\|h(x)\|_2^2 + \sum_{i=1}^{p} [\max\{0, g_i(x)\}]^2$;

the quadratic penalty in item 1 preserves the C² property, whereas in items 3 and 4 𝒫 is only C¹, and the ℓ₁ penalty in item 2 is in general not differentiable.

Given $\psi(x, \rho) = f(x) + \rho \mathcal{P}(x)$, we have a generic penalty method, given in Algorithm 2, for solving problem (13); it works iteratively, updating the parameter ρ before solving the penalized subproblem (14).

Algorithm 2
Penalty Algorithm

In general, one suggestion for computing $\rho_k$ is to take $\rho_0 = 1$ and $\rho_{k+1} = 10\rho_k$, Fletcher (2013). However, when Ω1 is the set of equality constraints $h(x) = 0$, a basic rule that works well in practice is: if $\|h(x_k)\| > 0.1\,\|h(x_{k-1})\|$, then $\rho_{k+1} = 10\rho_k$; otherwise ρ does not change. That approach was successfully tested on linear programming problems, Suñagua & Oliveira (2017).
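A MATLAB sketch of Algorithm 2 incorporating this update rule is the following (the quadratic penalty and the inner solver fminsearch are our illustrative choices):

```matlab
function [x, rho] = penalty_method(f, h, x0, tol, maxit)
% Generic penalty method (Algorithm 2) for Omega_1 = {x : h(x) = 0},
% illustrative sketch with the quadratic penalty P(x) = ||h(x)||^2/2
% and the practical rule quoted above: rho grows tenfold only when
% the feasibility measure ||h(x_k)|| did not decrease enough.
rho   = 1;                                    % rho_0 = 1, Fletcher (2013)
x     = x0;
hprev = norm(h(x0));
for k = 1:maxit
    psi  = @(y) f(y) + (rho/2)*norm(h(y))^2;  % psi(x, rho_k)
    x    = fminsearch(psi, x);                % approximate minimizer of (Q_rho)
    hnow = norm(h(x));
    if hnow <= tol, break; end                % x is (nearly) feasible
    if hnow > 0.1*hprev
        rho = 10*rho;                         % insufficient progress: increase rho
    end
    hprev = hnow;
end
end
```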

The following lemma gives a set of inequalities that follow directly from the steps of Algorithm 2. The proof is based on Martínez & Santos (1995) and Luenberger & Ye (2008).

Lemma 3.1. Let $\{x_k\}$ be a sequence generated by Algorithm 2, in which $x_{k+1}$ is a global solution of problem $(Q_k)$. Then

  1. $\psi(x_k, \rho_{k-1}) \le \psi(x_{k+1}, \rho_k)$

  2. $\mathcal{P}(x_{k+1}) \le \mathcal{P}(x_k)$

  3. $f(x_k) \le f(x_{k+1})$.

Proof. Since $\{\rho_k\}$ is a monotone increasing sequence and $x_k$ is a global minimizer of subproblem (15), then

$$\psi(x_k, \rho_{k-1}) = f(x_k) + \rho_{k-1} \mathcal{P}(x_k) \le f(x_{k+1}) + \rho_{k-1} \mathcal{P}(x_{k+1}) \le f(x_{k+1}) + \rho_k \mathcal{P}(x_{k+1}) = \psi(x_{k+1}, \rho_k).$$

To establish the second inequality, recalling the optimality of $x_k$ and $x_{k+1}$, we have

$$\psi(x_k, \rho_{k-1}) = f(x_k) + \rho_{k-1} \mathcal{P}(x_k) \le f(x_{k+1}) + \rho_{k-1} \mathcal{P}(x_{k+1}) \qquad (16)$$

$$\psi(x_{k+1}, \rho_k) = f(x_{k+1}) + \rho_k \mathcal{P}(x_{k+1}) \le f(x_k) + \rho_k \mathcal{P}(x_k); \qquad (17)$$

adding (16) and (17) and cancelling the common terms, we get

$$(\rho_{k-1} - \rho_k) \mathcal{P}(x_k) \le (\rho_{k-1} - \rho_k) \mathcal{P}(x_{k+1}),$$

and, as $\rho_{k-1} < \rho_k$, dividing by the negative factor reverses the inequality, so $\mathcal{P}(x_{k+1}) \le \mathcal{P}(x_k)$. Finally, using this inequality,

$$f(x_k) + \rho_{k-1} \mathcal{P}(x_k) \le f(x_{k+1}) + \rho_{k-1} \mathcal{P}(x_{k+1}) \le f(x_{k+1}) + \rho_{k-1} \mathcal{P}(x_k),$$

hence $f(x_k) \le f(x_{k+1})$. □

Lemma 3.2. If $x^*$ is a global minimizer of (GP), then for k = 0, 1, 2, ...

$$f(x_k) \le \psi(x_k, \rho_{k-1}) \le f(x^*).$$

Consequently, $x_k \in \Omega_1$ if and only if $x_k$ is a global solution of (GP).

Proof. Since $\rho_k > 0$, $\mathcal{P}(x) \ge 0$ for all $x \in \mathbb{R}^n$, and $x_k$ is the global minimizer of $(Q_{k-1})$, then

$$f(x_k) \le f(x_k) + \rho_{k-1} \mathcal{P}(x_k) \le f(x^*) + \rho_{k-1} \mathcal{P}(x^*) = f(x^*),$$

where $\mathcal{P}(x^*) = 0$. □

The global convergence of the penalty method, in the sense that any limit point of the sequence is a solution, can be verified from the two previous lemmas.

Theorem 3.1 (Global convergence of the penalty method). Let $\{x_k\}$ be a sequence of global minimizers of $(Q_k)$ generated by Algorithm 2, in which $\rho_k \to +\infty$. Then any limit point of the sequence is a global minimizer of problem (13).

Proof. With a slight change of notation, the proof follows the demonstration of Martínez & Santos (1995). Indeed, let $\{x_{k_l}\}$ be a subsequence of $\{x_k\}$ such that $x_{k_l} \to \bar{x}$. By the continuity of f, we have

$$f(x_{k_l}) \to f(\bar{x}). \qquad (18)$$

Let $f^*$ be the optimal value of problem (GP). By Lemma 3.1 and Lemma 3.2, the sequence $\psi(x_k, \rho_{k-1})$ is nondecreasing and bounded above by $f^*$; then

$$\lim_{l \to \infty} \psi(x_{k_l}, \rho_{k_l - 1}) = \sup_{l \ge 1} \psi(x_{k_l}, \rho_{k_l - 1}) = p^* \le f^*. \qquad (19)$$

Thus, combining (18) and (19) yields

$$\lim_{l \to \infty} \rho_{k_l - 1} \mathcal{P}(x_{k_l}) = \lim_{l \to \infty} \left[ f(x_{k_l}) + \rho_{k_l - 1} \mathcal{P}(x_{k_l}) \right] - \lim_{l \to \infty} f(x_{k_l}) = p^* - f(\bar{x}).$$

Since $\mathcal{P}(x_{k_l}) \ge 0$ and $\rho_{k_l} \to \infty$, we conclude that $\lim_{l \to \infty} \mathcal{P}(x_{k_l}) = 0$. Using the continuity of 𝒫, $\mathcal{P}(\bar{x}) = 0$, thereby $\bar{x} \in \Omega_1$. To prove the optimality of $\bar{x}$, just note that, by Lemma 3.2, $f(x_{k_l}) \le f^*$; then

$$f(\bar{x}) = \lim_{l \to \infty} f(x_{k_l}) \le f^*,$$

which completes the proof, because obviously $f^* \le f(\bar{x})$ and then $f(\bar{x}) = f^*$. □

Furthermore, by the definition of ψ and (19),

$$f(x_{k_l}) \le \psi(x_{k_l}, \rho_{k_l - 1}) \le p^*, \quad \text{so that} \quad f(\bar{x}) \le p^* \le f^*.$$

Therefore $f(\bar{x}) = p^* = f^*$, and then

$$\lim_{l \to \infty} \rho_{k_l - 1} \mathcal{P}(x_{k_l}) = 0. \qquad (20)$$

And, using (19),

$$\lim_{l \to \infty} \psi(x_{k_l}, \rho_{k_l - 1}) = f^*. \qquad (21)$$

4 MIXED BARRIER-PENALTY METHOD

For a continuous function $f: \mathbb{R}^n \to \mathbb{R}$, we consider the general programming problem

$$(NLP)\qquad \min f(x) \quad \text{s.t.} \quad x \in \Omega_1, \; x \in \Omega_2, \; x \in \Omega_3, \qquad (22)$$

where Ω1, Ω2 and Ω3 are the restriction sets defined in (1).

As in the previous sections, we assume that problem (22) admits a global minimizer. Now, let 𝒫 be a penalty function related to Ω1 and B a barrier function related to Ω2. Then, taking the penalty parameter ρ > 0 and the barrier parameter µ > 0, we have the associated mixed barrier-penalty subproblem,

$$(BP_{\rho,\mu})\qquad \min f(x) + \rho \mathcal{P}(x) + \mu B(x) \quad \text{s.t.} \quad x \in \mathrm{Int}(\Omega_2), \; x \in \Omega_3. \qquad (23)$$

Since the general problem (NLP) admits a global minimizer, the problem $(BP_{\rho,\mu})$ in (23) also admits a global solution for any feasible parameter values. Therefore, we define

$$\Phi(x, \rho, \mu) = f(x) + \rho \mathcal{P}(x) + \mu B(x). \qquad (24)$$

In order to solve the general problem (22), we provide a generic algorithm, given in Algorithm 3, that works iteratively, updating the ρ and µ parameters before solving the penalized subproblem (23); a MATLAB sketch follows the algorithm.

Algorithm 3
Mixed barrier-penalty algorithm
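The sketch below is one possible instantiation (the inner solver and the tenfold parameter updates are our assumptions; any updates with $\rho_k \to +\infty$ and $\mu_k \to 0$ fit the framework):

```matlab
function x = mixed_bp_method(f, P, B, x0, rho0, mu0, tol, maxit)
% Generic mixed barrier-penalty method (Algorithm 3), illustrative sketch.
% P  : penalty function for Omega_1 (P = 0 on Omega_1, P > 0 outside).
% B  : barrier function for Omega_2, returning +Inf outside Int(Omega_2).
% x0 : starting point in Int(Omega_2).
rho = rho0;
mu  = mu0;
x   = x0;
for k = 1:maxit
    Phi = @(y) f(y) + rho*P(y) + mu*B(y);   % Phi(x, rho_k, mu_k), see (24)
    x   = fminsearch(Phi, x);               % approximate minimizer of (BP_{rho,mu})
    if mu < tol && rho*P(x) < tol           % both parametric terms negligible
        break
    end
    rho = 10*rho;                           % rho_k -> +infinity
    mu  = mu/10;                            % mu_k  -> 0
end
end
```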

To establish the global convergence of Algorithm 3, we first associate the additive terms in two convenient ways:

$$\Phi(x, \rho, \mu) = [f(x) + \rho \mathcal{P}(x)] + \mu B(x) = [f(x) + \mu B(x)] + \rho \mathcal{P}(x). \qquad (26)$$

Therefore, fixing ρ and µ respectively, we define $F_\rho(x) = f(x) + \rho \mathcal{P}(x)$ and $G_\mu(x) = f(x) + \mu B(x)$; then we associate to (NLP) the following two problems

$$(GP_\rho)\quad \min F_\rho(x) \;\;\text{s.t.}\;\; x \in \Omega_2, \; x \in \Omega_3; \qquad (GP_\mu)\quad \min G_\mu(x) \;\;\text{s.t.}\;\; x \in \Omega_1, \; x \in \mathrm{Int}(\Omega_2), \; x \in \Omega_3. \qquad (27)$$

Since the problem (NLP) admits a global minimizer, both $(GP_\rho)$ and $(GP_\mu)$ in (27) also admit global minimizers. Therefore, defining

$$\phi_\rho(x, \mu) = F_\rho(x) + \mu B(x) \qquad \text{and} \qquad \psi_\mu(x, \rho) = G_\mu(x) + \rho \mathcal{P}(x),$$

we have, respectively, the barrier and penalty subproblems

$$(BP_\rho)\quad \min \phi_\rho(x, \mu) \;\;\text{s.t.}\;\; x \in \mathrm{Int}(\Omega_2), \; x \in \Omega_3; \qquad (PP_\mu)\quad \min \psi_\mu(x, \rho) \;\;\text{s.t.}\;\; x \in \mathrm{Int}(\Omega_2), \; x \in \Omega_3. \qquad (28)$$

By fixing one of the parameters according to (27), the two problems in (28) are equivalent to $(BP_{\rho,\mu})$. In fact,

$$\phi_\rho(x, \mu) = \Phi(x, \rho, \mu) = \psi_\mu(x, \rho); \qquad (29)$$

therefore, we can apply the results obtained in the two preceding sections.

In order to understand the ideas of the mixed problem more clearly, we consider the following particular quadratic problem

$$\min x_1^2 + x_2^2 \quad \text{s.t.} \quad x_2 = 2, \;\; 1 - x_1 \le 0, \;\; -1 - x_2 \le 0. \qquad (30)$$

According to the contours of the objective function and the graphs of the restrictions in Figure 1, the optimal point is $x^* = (1, 2)$. First, if we consider the Lagrangian function $\mathcal{L}(x_1, x_2, \lambda, u_1, u_2) = x_1^2 + x_2^2 + \lambda(x_2 - 2) + u_1(1 - x_1) + u_2(-1 - x_2)$, the Karush-Kuhn-Tucker conditions (Kuhn & Tucker, 1951) are

$$2x_1 - u_1 = 0, \quad 2x_2 + \lambda - u_2 = 0, \quad x_2 - 2 = 0, \quad 1 - x_1 \le 0, \quad -1 - x_2 \le 0, \quad u_1(1 - x_1) = 0, \quad u_2(-1 - x_2) = 0, \quad u_1 \ge 0, \; u_2 \ge 0,$$

whose unique solution for the variables and Lagrangian multipliers is $x_1^* = 1$, $x_2^* = 2$, $u_1^* = 2$, $u_2^* = 0$, $\lambda^* = -4$ (a numerical check follows).
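The short MATLAB script below (ours) verifies stationarity, feasibility of the equality constraint, and complementarity at this point:

```matlab
% KKT check at x* = (1, 2) with u1* = 2, u2* = 0, lambda* = -4.
x = [1; 2]; lambda = -4; u = [2; 0];
gradL = [2*x(1) - u(1); ...             % dL/dx1 = 2 x1 - u1
         2*x(2) + lambda - u(2)];       % dL/dx2 = 2 x2 + lambda - u2
comp  = [u(1)*(1 - x(1)); ...           % u1 (1 - x1)  = 0
         u(2)*(-1 - x(2))];             % u2 (-1 - x2) = 0
disp([gradL; x(2) - 2; comp])           % all five entries are zero
```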

Figure 1
Convex Problem.

Now, we associate to (30) the mixed barrier-penalty subproblem

$$(QP_{\rho,\mu})\qquad \min \Phi(x, \rho, \mu) \quad \text{s.t.} \quad 1 - x_1 < 0, \;\; -1 - x_2 < 0, \qquad (31)$$

where the penalized objective function is

$$\Phi(x, \rho, \mu) = x_1^2 + x_2^2 + \frac{\rho}{2}(x_2 - 2)^2 - \mu\left[\log(x_1 - 1) + \log(x_2 + 1) - \log M\right],$$

where M is a positive number large enough that $x_1 > 1$, $x_2 > -1$ and $(x_1 - 1)(x_2 + 1) < M$; this region surely lies within the inequality constraints $1 - x_1 < 0$ and $-1 - x_2 < 0$. This condition ensures that the barrier function is non-negative in a region containing the optimal point.

It is easy to see that Φ is a smooth function; thereby, from the first-order necessary conditions for optimal points, we have

$$2x_1 - \frac{\mu}{x_1 - 1} = 0, \qquad 2x_2 + \rho(x_2 - 2) - \frac{\mu}{x_2 + 1} = 0. \qquad (32)$$

Solving this nonlinear system, subject to $x_1 > 1$ and $x_2 > -1$, by the substitution method, we obtain

$$x_1 = \frac{1 + \sqrt{1 + 2\mu}}{2} \xrightarrow[\mu \to 0]{} 1 = x_1^*, \qquad x_2 = \frac{\rho - 2 + \sqrt{(2 - \rho)^2 + 4(2 + \rho)(2\rho + \mu)}}{2(2 + \rho)} \xrightarrow[\rho \to \infty,\; \mu \to 0]{} 2 = x_2^*. \qquad (33)$$

Thus, for each optimal point in (33), the optimal value of problem $(QP_{\rho,\mu})$ in (31) is $\theta(\rho, \mu) = \Phi(x_1, x_2, \rho, \mu)$, whose graph is shown in Figure 2 with M = 2.

Figure 2
θ(ρ, µ), 0<µ<2.5, 0<ρ<20.

We can see that, for fixed µ, θ(ρ, µ) is an increasing function of ρ, while, for fixed ρ, it decreases as µ decreases. This fact will be shown theoretically in Theorem 4.1; a numerical evaluation of the minimizers (33) follows.
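The closed-form minimizers (33) can be evaluated directly; the following MATLAB snippet (ours) shows the points approaching $x^* = (1, 2)$ as µ decreases and ρ grows:

```matlab
% Evaluate the closed-form minimizers (33) of the subproblem (31).
x1 = @(mu)     (1 + sqrt(1 + 2*mu))/2;
x2 = @(rho,mu) (rho - 2 + sqrt((2 - rho)^2 + 4*(2 + rho)*(2*rho + mu))) ...
               / (2*(2 + rho));
for k = 0:4
    mu = 10^(-k); rho = 10^k;
    fprintf('mu = %7.0e  rho = %7.0e  x = (%.6f, %.6f)\n', ...
            mu, rho, x1(mu), x2(rho, mu));
end
```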

Furthermore, using (33), when µ → 0 and ρ → ∞, the following coefficients from the gradient conditions (32) converge to the optimal Lagrangian multipliers

$$u_1 = \frac{\mu}{x_1 - 1} = 1 + \sqrt{1 + 2\mu} \;\to\; 2 = u_1^*, \qquad u_2 = \frac{\mu}{x_2 + 1} = \frac{2\mu(2 + \rho)}{2 + 3\rho + \sqrt{A}} \;\to\; 0 = u_2^*, \qquad \lambda = \rho(x_2 - 2) = \frac{\rho\,(-10 - 3\rho + \sqrt{A})}{2(2 + \rho)} \;\to\; -4 = \lambda^*,$$

where $A = 4\mu(2 + \rho) + (2 + 3\rho)^2$. In addition, the Hessian matrix of Φ is

$$\nabla^2 \Phi = \begin{pmatrix} 2 + \frac{\mu}{(x_1 - 1)^2} & 0 \\ 0 & 2 + \rho + \frac{\mu}{(x_2 + 1)^2} \end{pmatrix} = \begin{pmatrix} 2 + \frac{u_1^2}{\mu} & 0 \\ 0 & 2 + \rho + \frac{u_2^2}{\mu} \end{pmatrix} \approx \begin{pmatrix} 2 + \frac{4}{\mu} & 0 \\ 0 & 2 + \rho \end{pmatrix}.$$

Then ∇²Φ is a positive definite matrix, which guarantees the minimality of $x_1$ and $x_2$ in (33). Moreover, the approximate condition number of this matrix is

$$\kappa(\nabla^2 \Phi) \approx \frac{2 + 4/\mu}{2 + \rho},$$

hence ∇²Φ is ill-conditioned for very small µ and small ρ; for large ρ, however, that condition number is reduced, as the numerical illustration below shows.
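A quick numerical illustration of this conditioning estimate (script and parameter choices are ours):

```matlab
% Condition number of the Hessian at the minimizers (33) for several
% (mu, rho), illustrating kappa ~ (2 + 4/mu)/(2 + rho).
x1 = @(mu)     (1 + sqrt(1 + 2*mu))/2;
x2 = @(rho,mu) (rho - 2 + sqrt((2 - rho)^2 + 4*(2 + rho)*(2*rho + mu))) ...
               / (2*(2 + rho));
for mu = [1e-2, 1e-4, 1e-6]
    for rho = [1, 1e2, 1e4]
        H = diag([2 + mu/(x1(mu) - 1)^2, ...
                  2 + rho + mu/(x2(rho, mu) + 1)^2]);
        fprintf('mu = %7.1e  rho = %7.1e  cond = %9.3e\n', mu, rho, cond(H));
    end
end
```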

Next, we have the global convergence theorem for the mixed barrier-penalty algorithm.

Theorem 4.1 (Global convergence of the mixed method). Let $\{x_k\}$ be a sequence of global minimizers of the $(BP_k)$ problem in (25) generated by the mixed Algorithm 3, in which $\rho_k \to +\infty$ and $\mu_k \to 0$. Then any limit point of the sequence is a global minimizer of the (NLP) problem.

Proof. In order to apply the results of the preceding sections, the idea is to fix one of the parameters in the $(BP_{\rho,\mu})$ subproblem in (23), one at a time, and apply the corresponding results to each subproblem in (27).

Firstly, fixing ρ, let $\{x_k^\rho\}$ be the sequence generated by Algorithm 1 for solving the $(GP_\rho)$ subproblem in (27). By applying Lemma 2.1, we get

$$\phi_\rho(x_{k+1}^\rho, \mu_k) \le \phi_\rho(x_k^\rho, \mu_{k-1}), \qquad F_\rho(x_{k+1}^\rho) \le F_\rho(x_k^\rho). \qquad (34)$$

By the monotonicity in (34) and by (12), the sequence $\phi_\rho(x_k^\rho, \mu_{k-1})$ converges to the global optimal value of the problem $(GP_\rho)$ in (27), that is,

$$\phi_\rho(x_k^\rho, \mu_{k-1}) \to \inf_{k \ge 1} \phi_\rho(x_k^\rho, \mu_{k-1}) = F_\rho(x^{*\rho}). \qquad (35)$$

In addition, from Theorem 2.1, every convergent subsequence of $\{x_k^\rho\}$ converges to a global minimizer of the problem $(GP_\rho)$ in (27).

Similarly, fixing µ, let $\{x_k^\mu\}$ be the sequence generated by the associated Algorithm 2 for solving the $(GP_\mu)$ subproblem in (27). By applying Lemma 3.1, we get

$$\psi_\mu(x_k^\mu, \rho_{k-1}) \le \psi_\mu(x_{k+1}^\mu, \rho_k), \qquad G_\mu(x_k^\mu) \le G_\mu(x_{k+1}^\mu). \qquad (36)$$

By the monotonicity in (36) and by (21), the sequence $\psi_\mu(x_k^\mu, \rho_{k-1})$ converges to the global optimal value of the $(GP_\mu)$ problem in (27), that is,

$$\psi_\mu(x_k^\mu, \rho_{k-1}) \to \sup_{k \ge 1} \psi_\mu(x_k^\mu, \rho_{k-1}) = G_\mu(x^{*\mu}). \qquad (37)$$

In addition, from Theorem 3.1, every convergent subsequence of $\{x_k^\mu\}$ converges to a global minimizer of the problem $(GP_\mu)$ in (27). And, by Lemma 3.2, we get $G_\mu(x_k^\mu) \le G_\mu(x^{*\mu})$ for all k.

Now let $\{x_k\}$ be the sequence of minimizers obtained by Algorithm 3 for the mixed problem. More precisely, let $x_{k+1} = x(\rho_k, \mu_k)$, which also minimizes (28), because, according to (29), we have

$$\Phi(x_{k+1}, \rho_k, \mu_k) = f(x_{k+1}) + \rho_k \mathcal{P}(x_{k+1}) + \mu_k B(x_{k+1}) = \phi_{\rho_k}(x_{k+1}, \mu_k) = \psi_{\mu_k}(x_{k+1}, \rho_k).$$

From (35) and (37), we have

$$F_{\rho_k}(x^{*\rho_k}) \le \phi_{\rho_k}(x_{k+1}, \mu_k) = \Phi(x_{k+1}, \rho_k, \mu_k) = \psi_{\mu_k}(x_{k+1}, \rho_k) \le G_{\mu_k}(x^{*\mu_k}). \qquad (38)$$

Let $x(\rho_k, \mu_k)$ be the solution of $(BP_{\rho,\mu})$ for $\mu = \mu_k$ and $\rho = \rho_k$. For $\mu_k < \mu_{k-1}$, we additionally solve $(BP_{\rho,\mu})$ for $\mu = \mu_{k-1}$ and $\rho = \rho_k$, whose solution is called $x(\rho_k, \mu_{k-1})$. Using (34),

$$\phi_{\rho_k}(x(\rho_k, \mu_k), \mu_k) \le \phi_{\rho_k}(x(\rho_k, \mu_{k-1}), \mu_{k-1}), \qquad (39)$$

and, by (29), for $x = x(\rho_k, \mu_k)$ and $y = x(\rho_k, \mu_{k-1})$, we have

$$\begin{aligned} \phi_{\rho_k}(x, \mu_k) &= f(x) + \rho_k \mathcal{P}(x) + \mu_k B(x) = \psi_{\mu_k}(x, \rho_k), \\ \phi_{\rho_k}(y, \mu_{k-1}) &= F_{\rho_k}(y) + \mu_{k-1} B(y) = f(y) + \rho_k \mathcal{P}(y) + \mu_{k-1} B(y) \\ &= f(y) + \mu_{k-1} B(y) + \rho_k \mathcal{P}(y) = G_{\mu_{k-1}}(y) + \rho_k \mathcal{P}(y) = \psi_{\mu_{k-1}}(y, \rho_k). \end{aligned} \qquad (40)$$

Using (39) and (40), we get

$$\psi_{\mu_k}(x(\rho_k, \mu_k), \rho_k) \le \psi_{\mu_{k-1}}(x(\rho_k, \mu_{k-1}), \rho_k), \qquad G_{\mu_k}(x^{*\mu_k}) \le G_{\mu_{k-1}}(x^{*\mu_{k-1}}).$$

Similarly, for $\rho_k > \rho_{k-1}$, we additionally consider the solution of $(BP_{\rho,\mu})$ for $\rho = \rho_{k-1}$ and $\mu = \mu_k$, which is called $x(\rho_{k-1}, \mu_k)$. Using (36),

$$\psi_{\mu_k}(x(\rho_{k-1}, \mu_k), \rho_{k-1}) \le \psi_{\mu_k}(x(\rho_k, \mu_k), \rho_k), \qquad (41)$$

and, by (29), for $x = x(\rho_k, \mu_k)$ and $z = x(\rho_{k-1}, \mu_k)$, we have

$$\begin{aligned} \psi_{\mu_k}(x, \rho_k) &= f(x) + \mu_k B(x) + \rho_k \mathcal{P}(x) = \phi_{\rho_k}(x, \mu_k), \\ \psi_{\mu_k}(z, \rho_{k-1}) &= G_{\mu_k}(z) + \rho_{k-1} \mathcal{P}(z) = f(z) + \mu_k B(z) + \rho_{k-1} \mathcal{P}(z) \\ &= f(z) + \rho_{k-1} \mathcal{P}(z) + \mu_k B(z) = F_{\rho_{k-1}}(z) + \mu_k B(z) = \phi_{\rho_{k-1}}(z, \mu_k). \end{aligned} \qquad (42)$$

Using (41) and (42), we get

$$\phi_{\rho_{k-1}}(x(\rho_{k-1}, \mu_k), \mu_k) \le \phi_{\rho_k}(x(\rho_k, \mu_k), \mu_k), \qquad F_{\rho_{k-1}}(x^{*\rho_{k-1}}) \le F_{\rho_k}(x^{*\rho_k}).$$

Let $x^*$ be a global minimizer of (NLP). Recalling that $x^{*\mu_k}$ is a solution of the problem $(GP_{\mu_k})$ in (27), with the additional assumption $x^{*\mu_k} \in \mathrm{Int}(\Omega_2)$, we can conclude that $f(x^*) \le G_{\mu_k}(x^{*\mu_k})$. Moreover, $x^*$ is a feasible point of the problem $(GP_\rho)$; then $F_{\rho_k}(x^{*\rho_k}) \le f(x^*)$. Therefore, $G_{\mu_k}(x^{*\mu_k})$ is a monotone nonincreasing sequence bounded below by $f(x^*)$ and, using (12), this sequence converges to its infimum $f(x^*)$. Also, $F_{\rho_k}(x^{*\rho_k})$ is a monotone nondecreasing sequence bounded above by $f(x^*)$ and, by (21), that sequence converges to its supremum $f(x^*)$; that is,

$$F_{\rho_k}(x^{*\rho_k}) \to \sup_{k \ge 1} F_{\rho_k}(x^{*\rho_k}) = f(x^*), \qquad G_{\mu_k}(x^{*\mu_k}) \to \inf_{k \ge 1} G_{\mu_k}(x^{*\mu_k}) = f(x^*). \qquad (43)$$

By applying the squeeze theorem (formulated in modern terms by Carl Friedrich Gauss) to (38) and (43), we show

$$\lim_{k \to \infty} \Phi(x_k, \rho_{k-1}, \mu_{k-1}) = f(x^*). \qquad (44)$$

Let $\{x_{k_l}\}$ be any subsequence of $\{x_k\}$ such that $x_{k_l} \to \bar{x}$. By the continuity of f, we get $f(x_{k_l}) \to f(\bar{x})$. The final step is done by contradiction, under the assumption $\bar{x} \ne x^*$ with $f(\bar{x}) > f(x^*)$. Using (10) for the problem $(GP_\rho)$, we have $\mu_{k_l - 1} B(x) \to 0$ for any $x \in \mathrm{Int}(\Omega_2)$. Furthermore, using (20) for the problem $(GP_\mu)$, we also have $\rho_{k_l - 1} \mathcal{P}(x_{k_l}) \to 0$; then, by the continuity of f, the sequence $\Phi(x_{k_l}, \rho_{k_l - 1}, \mu_{k_l - 1}) - f(x^*) = f(x_{k_l}) - f(x^*) + \rho_{k_l - 1} \mathcal{P}(x_{k_l}) + \mu_{k_l - 1} B(x_{k_l})$ cannot converge to zero, which contradicts (44). □

5 APPLICATIONS

5.1 Barrier-Penalty applied to convex problem

Algorithm 4, based on the generic Algorithm 3, is designed to solve the nonlinear problem (30).

Algorithm 4
Mixed barrier-penalty algorithm

For $\rho_0 = 1$, $\mu_0 = 1$ and $x_0 = (1.5, 1)$, we write a MATLAB script for Algorithm 4 (a sketch is given below) in order to compute a sequence of points that approach $x^* = (1, 2)$. The iterative results are shown in Table 1.
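One way such a script can be written (the inner solver fminsearch, the tenfold parameter updates and the iteration count are our assumptions; Φ follows (31) with M = 2):

```matlab
function mixed_bp_demo
% Illustrative driver for Algorithm 4 applied to problem (30).
rho = 1; mu = 1; x = [1.5; 1];       % rho_0 = mu_0 = 1, x_0 = (1.5, 1)
for k = 1:10
    x = fminsearch(@(y) Phi(y, rho, mu), x);
    fprintf('k = %2d  rho = %7.0e  mu = %7.0e  x = (%.6f, %.6f)\n', ...
            k, rho, mu, x(1), x(2));
    rho = 10*rho;                    % rho_k -> +infinity
    mu  = mu/10;                     % mu_k  -> 0
end
end                                  % the points approach x* = (1, 2), f(x*) = 5

function v = Phi(y, rho, mu)
% Penalized objective of (31); +Inf outside the barrier domain keeps
% fminsearch inside the region x1 > 1, x2 > -1.
if y(1) <= 1 || y(2) <= -1
    v = Inf;
else
    v = y(1)^2 + y(2)^2 + (rho/2)*(y(2) - 2)^2 ...
        - mu*(log(y(1) - 1) + log(y(2) + 1) - log(2));   % M = 2
end
end
```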

Table 1
Iterative results.

The path of the generated points is shown in Figure 3, where the last points are close to $x^*$. The exact solution, also computed with MATLAB, is $x^* = (1.000000, 2.000000)$ with $f(x^*) = 5.000000$.

Figure 3
Points generated by Algorithm 4.

5.2 Penalized standard linear programming problem

We consider the standard linear programming problem in which several variables have upper bounds

$$(LP)\qquad \min c^T x \quad \text{s.t.} \quad Ax = b, \;\; Ex \le u, \;\; x \ge 0, \qquad (46)$$

where A is an m × n matrix, $c, x \in \mathbb{R}^n$, $b \in \mathbb{R}^m$, and E is formed by the rows of the n × n identity matrix corresponding to the bounded variables; thereby Ex is the vector of bounded variables, for which u is the vector of upper bounds. In this case, it is usual to add a slack variable v such that $Ex + v = u$, where v ≥ 0.

In most computational packages that implement interior point methods for solving linear programming problems, only the barrier parameter is considered.

In order to solve the LP problem (46) using the quadratic penalty and logarithmic barrier functions, the objective function is penalized as follows:

$$\Phi(x, v, \rho, \mu) = c^T x + \frac{\rho}{2}\|b - Ax\|^2 - \mu \sum_{j=1}^{n} \log x_j - \mu \sum_{j=1}^{nb} \log v_j, \qquad (47)$$

where µ and ρ are respectively the barrier and penalty parameters, and nb is the number of bounded variables. Then the associated mixed barrier-penalty subproblem is

$$(LPP_{\rho,\mu})\qquad \min \Phi(x, v, \rho, \mu) \quad \text{s.t.} \quad x, v > 0. \qquad (48)$$

The function Φ(x, v, ρ, µ) is smooth on the open set x, v > 0. By applying the first-order necessary condition (eliminating v through $v = u - Ex$), we have

$$c - \rho A^T(b - Ax) - \mu X^{-1} e + \mu E^T V^{-1} e = 0.$$

Defining $y = \rho(b - Ax)$, $z = \mu X^{-1} e$, $w = \mu V^{-1} e$, we get

$$c - A^T y + E^T w - z = 0, \qquad Ex + v = u, \qquad XZe = \mu e, \qquad VWe = \mu e, \qquad y = \rho(b - Ax).$$

Taking $\delta = 1/\rho$, we rewrite $\delta y = b - Ax$; thus $Ax + \delta y = b$.

Therefore, the optimality conditions for subproblem $(LPP_{\rho,\mu})$ on (x, v) > 0 and (z, w) > 0 are

$$Ax + \delta y = b, \qquad Ex + v = u, \qquad A^T y + z - E^T w = c, \qquad XZe = \mu e, \qquad VWe = \mu e. \qquad (49)$$

In the interior point methods reviewed in Suñagua & Oliveira (2017), a search direction is found by applying Newton's method to the nonlinear system (49). In fact, the Newton directions satisfy

$$\begin{pmatrix} A & 0 & \delta I & 0 & 0 \\ E & I & 0 & 0 & 0 \\ 0 & 0 & A^T & I & -E^T \\ Z & 0 & 0 & X & 0 \\ 0 & W & 0 & 0 & V \end{pmatrix} \begin{pmatrix} d_x \\ d_v \\ d_y \\ d_z \\ d_w \end{pmatrix} = \begin{pmatrix} r_p \\ r_u \\ r_d \\ r_c \\ r_s \end{pmatrix}, \qquad \begin{aligned} r_p &= b - Ax - \delta y, \\ r_u &= u - Ex, \\ r_d &= c - A^T y - z + E^T w, \\ r_c &= \mu e - XZe, \\ r_s &= \mu e - VWe. \end{aligned} \qquad (50)$$

Solving these block linear equations, we find

$$d_z = X^{-1}(r_c - Z d_x), \qquad d_w = V^{-1}(r_s - W d_v), \qquad d_v = r_u - E d_x; \qquad (51)$$

replacing these in the third group of equations gives

$$A^T d_y - D^{-1} d_x = r_d - X^{-1} r_c + E^T V^{-1} r_s - E^T V^{-1} W r_u,$$

where $D^{-1} = X^{-1} Z + E^T V^{-1} W E$; then

$$d_x = D\left(A^T d_y - r_d + X^{-1} r_c - E^T V^{-1} r_s + E^T V^{-1} W r_u\right), \qquad (52)$$

and, using (52) and the first group of equations of (50), we get the normal equations

$$(ADA^T + \delta I)\, d_y = r_p + AD\left(r_d - X^{-1} r_c + E^T V^{-1} r_s - E^T V^{-1} W r_u\right). \qquad (53)$$

Close to the optimal point, the matrix D is very badly scaled, and then ADAᵀ is also very ill-conditioned. In this case, the penalty parameter δ improves that condition number, which is helpful when solving the symmetric positive definite system, for instance by the conjugate gradient method.

Alternatively to (52) and (53), $d_x$ and $d_y$ can also be obtained by solving the following augmented system

$$\begin{pmatrix} -D^{-1} & A^T \\ A & \delta I \end{pmatrix} \begin{pmatrix} d_x \\ d_y \end{pmatrix} = \begin{pmatrix} r_1 \\ r_p \end{pmatrix}, \qquad (54)$$

where $r_1 = r_d - X^{-1} r_c + E^T V^{-1} r_s - E^T V^{-1} W r_u$. This system is also symmetric, indefinite, and has a better condition number due to the penalty parameter; both solves are sketched below.
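A compact MATLAB sketch of both alternatives (random data stand in for a real interior point iterate; d is the diagonal of D, and all names are ours):

```matlab
% One Newton step via the normal equations (53) and via the augmented
% system (54), with delta = 1/rho as the penalty regularization.
rng(1); m = 40; n = 120;
A  = sprandn(m, n, 0.1);
d  = rand(n, 1) + 1e-8;              % diag(D): badly scaled near the optimum
rp = randn(m, 1); r1 = randn(n, 1);  % residuals r_p and r_1 of (53)-(54)
delta = 1e-8;

% Normal equations: delta*speye(m) bounds the smallest eigenvalue of the
% possibly near-singular matrix A*D*A' away from zero.
N  = A*spdiags(d, 0, n, n)*A' + delta*speye(m);
dy = N \ (rp + A*(d.*r1));           % or pcg(N, ...) for large-scale problems
dx = d.*(A'*dy - r1);                % recover dx as in (52)

% Augmented system (54): symmetric indefinite, one sparse solve.
K   = [-spdiags(1./d, 0, n, n), A'; A, delta*speye(m)];
sol = K \ [r1; rp];
dx2 = sol(1:n); dy2 = sol(n+1:end);  % agrees with (dx, dy) above
```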

For the computational experiments, we use the open-source package PCx (Czyzyk et al., 1997), which implements Mehrotra's predictor-corrector algorithm, in which the barrier parameter µ is already incorporated, in order to solve linear programming problems. By adding appropriate code to PCx, we incorporated the penalty parameter δ, obtaining a modified PCx that we call the predictor-corrector mixed algorithm with barrier and penalty parameters. The numerical results for several NETLIB LP problems were computed for the approaches proposed in Suñagua & Oliveira (2017), where the quality of the approaches was compared according to the performance profile criteria of Dolan & Moré (2002).

6 CONCLUSIONS

Firstly, we present a brief summary of the main concepts and results on barrier and penalty methods; for each method we show the global convergence theorem, in order to use these strategies in the proof of the global convergence theorem for the mixed algorithm.

In Section 4, we provide a mixed algorithm for solving the mixed barrier-penalty subproblem (23), and we give a constructive proof of the global convergence theorem for mixed barrier-penalty methods, as an alternative to those shown in Fiacco & McCormick (1990) and Breitfeld & Shanno (1995). For a simple convex nonlinear problem, we write MATLAB code to generate iterates that illustrate the penalty and barrier functions.

Finally, we develop applications to nonlinear programming problems with equality and inequality functional constraints, namely a quadratic programming problem and a standard linear programming problem. Since the functions involved are smooth on an open set, the optimality conditions for each class of problems are stated; these can be solved by applying interior point methods.

ACKNOWLEDGEMENTS

Thanks to CNPq, FAPESP (grant number 2010/06822-4) and Universidad Mayor de San Andrés (UMSA) for their financial support.

References

  • 1
    BAZARAA MS, SHERALI HD & SHETTY CM. 2013. Nonlinear programming: theory and algorithms. John Wiley & Sons.
  • 2
    BERTSEKAS DP. 1976. On penalty and multiplier methods for constrained minimization. SIAM Journal on Control and Optimization, 14(2): 216-235.
  • 3
    BREITFELD MG & SHANNO DF. 1994. A globally convergent penalty-barrier algorithm for nonlinear programming and its computational performance. Rutgers University. Rutgers Center for Operations Research [RUTCOR].
  • 4
    BREITFELD MG & SHANNO DF. 1995. A Globally Convergent Penalty-Barrier Algorithm for Nonlinear Programming. In: Operations Research Proceedings 1994. pp. 22-27. Springer.
  • 5
CZYZYK J, MEHROTRA S, WAGNER M & WRIGHT SJ. 1997. PCx user guide (Version 1.1). Optimization Technology Center, Northwestern University.
  • 6
DOLAN ED & MORÉ JJ. 2002. Benchmarking optimization software with performance profiles. Mathematical Programming, 91(2): 201-213.
  • 7
FIACCO AV & MCCORMICK GP. 1990. Nonlinear programming: sequential unconstrained minimization techniques. vol. 4. SIAM.
  • 8
    FLETCHER R. 2013. Practical methods of optimization. John Wiley & Sons, Chichester.
  • 9
GRIVA I, NASH SG & SOFER A. 2009. Linear and nonlinear optimization. vol. 108. SIAM.
  • 10
KUHN HW & TUCKER AW. 1951. Nonlinear Programming. In: Proceedings of the Second Berkeley Symposium on Mathematical Statistics and Probability. pp. 481-492. Berkeley, California: University of California Press. Available at: http://projecteuclid.org/euclid.bsmsp/1200500249
  • 11
LUENBERGER DG & YE Y. 2008. Linear and nonlinear programming. 3rd ed. Springer New York.
  • 12
MARTÍNEZ JM & SANTOS SA. 1995. Métodos computacionais de otimização. Colóquio Brasileiro de Matemática, Apostilas, 20. Available at: https://www.ime.unicamp.br/~martinez/mslivro.pdf
  • 13
NASH SG. 2010. Penalty and barrier methods. Wiley Encyclopedia of Operations Research and Management Science.
  • 14
    NASH SG & SOFER A. 1993. A barrier method for large-scale constrained optimization. ORSA Journal on Computing, 5(1): 40-53.
  • 15
    POLYAK BT. 1971. The convergence rate of the penalty function method. USSR Computational Mathematics and Mathematical Physics, 11(1): 1-12.
  • 16
    SUÑAGUA P & OLIVEIRA AR. 2017. A new approach for finding a basis for the splitting preconditioner for linear systems from interior point methods. Computational Optimization and Applications, 67(1): 111-127.
  • 17
WRIGHT MH. 1992. Interior methods for constrained optimization. Acta Numerica, 1: 341-407.
  • 18
    WRIGHT SJ & NOCEDAL J. 1999. Numerical optimization. vol. 2. Springer New York.
