
A New Hybrid Preconditioner for the Interior Point Method

ABSTRACT

This study aims to improve the computation of the search direction in the primal-dual Interior Point Method through preconditioned iterative methods. It concerns a hybrid approach that combines the Controlled Cholesky Factorization preconditioner and the Splitting preconditioner. This approach has shown good results; however, these preconditioners have factors that reduce their efficiency, such as faults on the diagonal when computing the incomplete Cholesky factorization and excessive memory demand, among others. Thus, some modifications are proposed in these preconditioners, as well as a new phase change, in order to improve the performance of the hybrid preconditioner. In the Controlled Cholesky Factorization, the parameters that control the fill-in and the correction of the faults occurring on the diagonal are modified; the modification considers the relationship between the components of the Controlled Cholesky Factorization obtained before and after the fault on the diagonal. In the Splitting preconditioner, in turn, a sparse basis is constructed through an appropriate ordering of the columns of the constraint matrix of the optimization problem. In addition, a theoretical result is presented which shows that, with the proposed ordering, the condition number of the Normal Equations matrix preconditioned with the Splitting preconditioner is uniformly bounded by a quantity that depends only on the original data of the problem and not on the Interior Point Method iteration. Numerical experiments with large-scale problems corroborate the robustness and computational efficiency of this approach.

Keywords:
Interior Point Method; Controlled Cholesky Factorization; Splitting preconditioner


1 INTRODUCTION

Among the Interior Point Methods (IPM) found in the literature today, the primal-dual infeasible method with Mehrotra's predictor-corrector technique is the most computationally efficient; see [8], [9], [21]. However, the greatest computational effort in any IPM is the computation of the search direction, since it comes from linear systems that become ill conditioned as the IPM approaches the optimal solution. Additionally, this computation may require excessive memory. For large-scale sparse problems, preconditioned iterative methods are recommended to overcome these difficulties; see [1], [3].

The search direction can be computed by solving either the Augmented System (AS), which has an indefinite matrix, or the Normal Equations System (NES), which has a positive definite matrix. In this paper the NES is solved using a hybrid preconditioning approach applied to the Conjugate Gradient Method (CGM). In the early iterations of the IPM, the Controlled Cholesky Factorization (CCF) preconditioner [4] is used, with the proposed modifications presented in Section 4.1; the objective of these contributions is to accelerate the construction of the CCF preconditioner by reducing the restarts caused by diagonal faults. It was shown in [10] that the condition number of the NES matrix is of order $O(\mu^{-2})$, where $\mu$ denotes the complementarity gap of the Linear Programming (LP) problem; that is, the performance of the CCF preconditioner, being a generic preconditioner, inevitably deteriorates as the IPM approaches an optimal solution. The Splitting Preconditioner (SP), proposed in [17], was in turn designed specifically to overcome the ill conditioning of the linear systems arising in the last IPM iterations.

The CCF preconditioner is obtained by performing an Incomplete Cholesky Factorization (ICF); the fill-in control in [4] allows the preconditioner to vary from a diagonal matrix to one with more nonzero entries than the classical ICF matrix. It is known that any ICF is susceptible to faults on the diagonal; however, if a symmetric matrix V is positive definite, there exists a constant $\alpha > 0$ such that an ICF of the matrix $V + \alpha\,\mathrm{diag}(V)$ exists; see [15]. Techniques of diagonal modification in the ICF can be found in [11], [12], [13].

In the original CCF construction proposed in [4], the faults that occur during the factorization are corrected with an exponential increase and the computation of the elements of the preconditioner is restarted. In this study, algebraic and geometric tools are used to obtain relationships between the elements that caused the fault and the new components of the matrix obtained with the increment. In addition, it was observed that the parameter that controls the fill-in of the CCF preconditioner is related to the diagonal increase. Using these relations, a modification of these parameters is proposed in order to reduce the number of factorization restarts required for the construction of the preconditioner.

The new hybrid preconditioner is compared with the version currently used in [20]. The computational tests show that the new proposal is more efficient and robust.

We present a criterion that evaluates the performance of the CCF preconditioner and indicates the moment of the preconditioner exchange that starts the second phase of hybrid preconditioning using the SP. The SP performance depends on a non-singular submatrix B of the constraint matrix A, and the columns of B are chosen by ordering the columns of A. The authors of the SP and, later, their collaborators [20] developed an efficient heuristic. However, there are problems where the approach fails or demands excessive computational time. Thus, with respect to the SP, the objective of this paper is to study the condition number of the NES preconditioned by the SP and, from this study, to order the columns of the constraint matrix of the LP problem in order to construct a sparse basis that provides a condition number bounded by a quantity independent of the IPM iteration.

2 SEARCH DIRECTIONS IN THE INTERIOR POINT METHOD

Consider the linear programming problem

$$(P)\quad \min\ c^T x \quad \text{s.t.}\quad Ax = b;\ \ x + s = u;\ \ x, s \ge 0, \qquad\text{and}\qquad (D)\quad \max\ b^T y - u^T w \quad \text{s.t.}\quad A^T y - w + z = c;\ \ w, z \ge 0;\ \ y \in \mathbb{R}^m,$$

where $x, s, w \in \mathbb{R}^n$ and $A \in \mathbb{R}^{m \times n}$. We assume that A has full row rank throughout this paper. The search direction in the infeasible Interior Point Method (IPM) is obtained by applying Newton's method to the optimality conditions of the problem

$$(P')\quad \min\ c^T x - \mu \sum_{i=1}^{n} \log x_i - \mu \sum_{i=1}^{n} \log s_i \quad \text{s.t.}\quad Ax = b;\ \ x + s = u;\ \ x, s > 0,$$

where the problem (P') results from applying the logarithmic barrier penalty to the nonnegativity constraints of the primal problem (P). Since (P') is a convex problem, the KKT conditions are necessary and sufficient for the optimal solution. Consider its Lagrangian $\ell$ and its partial derivatives,

$$\ell(x, s, y, w) = c^T x - \mu \sum_{i=1}^{n} \log x_i - \mu \sum_{i=1}^{n} \log s_i + y^T(b - Ax) + w^T(u - x - s),$$
$$\nabla_x \ell = c - \mu X^{-1} e - A^T y - w,\quad \nabla_s \ell = -\mu S^{-1} e - w,\quad \nabla_y \ell = b - Ax \quad\text{and}\quad \nabla_w \ell = u - x - s,$$

where $e^T = (1, \dots, 1) \in \mathbb{R}^n$, $X^{-1} = \mathrm{diag}(x_1^{-1}, \dots, x_n^{-1})$ and $S^{-1} = \mathrm{diag}(s_1^{-1}, \dots, s_n^{-1})$. If $z \in \mathbb{R}^n$ is defined as $z = \mu X^{-1} e$, the optimality conditions of the problem (P') are

$$Ax = b;\quad x + s = u,\ x, s > 0;\quad A^T y + z - w = c,\ z, w > 0;\quad SWe = \mu e;\quad XZe = \mu e, \tag{2.1}$$

The equations in (2.1) are an implicit parameterization of the central path [21].

In order to obtain the search direction $\Delta = (\Delta x, \Delta s, \Delta y, \Delta z, \Delta w)^T$ using (2.1), consider the map F given by

$$F(x, s, y, w, z) = (Ax - b,\ x + s - u,\ A^T y + z - w - c,\ XZe - \mu e,\ SWe - \mu e);$$

applying Newton's method to F implies solving the following linear system:

$$\begin{pmatrix} A & 0 & 0 & 0 & 0 \\ I_n & I_n & 0 & 0 & 0 \\ 0 & 0 & A^T & I_n & -I_n \\ Z & 0 & 0 & X & 0 \\ 0 & W & 0 & 0 & S \end{pmatrix} \begin{pmatrix} \Delta x \\ \Delta s \\ \Delta y \\ \Delta z \\ \Delta w \end{pmatrix} = \begin{pmatrix} r_b \\ r_u \\ r_c \\ r_1 \\ r_2 \end{pmatrix}, \tag{2.2}$$

where $r_b = b - Ax$, $r_u = u - x - s$, $r_c = c + w - z - A^T y$, $r_1 = \mu e - XZe$, $r_2 = \mu e - SWe$, $e^T = (1, \dots, 1) \in \mathbb{R}^n$, $X = \mathrm{diag}(x_1, \dots, x_n)$, $Z = \mathrm{diag}(z_1, \dots, z_n)$, $S = \mathrm{diag}(s_1, \dots, s_n)$, $W = \mathrm{diag}(w_1, \dots, w_n)$.

The predictor-corrector method modifies the right-hand side in (2.2) by

$$r_1 = \sigma \mu e - XZe, \qquad r_2 = \sigma \mu e - SWe,$$

where $\sigma \in [0, 1]$ is known as the centering parameter.

Substituting the variables

$$\Delta s = r_u - \Delta x,\quad \Delta z = X^{-1}(r_1 - Z\Delta x) \quad\text{and}\quad \Delta w = S^{-1}(r_2 - W\Delta s)$$

in the third equation in (2.2) we have:

$$A^T \Delta y - (X^{-1}Z + S^{-1}W)\Delta x = r_c - X^{-1}r_1 + S^{-1}r_2 - S^{-1}W r_u. \tag{2.3}$$

Considering the equation in (2.3) and the first equation in the system (2.2), we obtain the Augmented System

$$\begin{pmatrix} -\Theta^{-1} & A^T \\ A & 0 \end{pmatrix} \begin{pmatrix} \Delta x \\ \Delta y \end{pmatrix} = \begin{pmatrix} r \\ h \end{pmatrix}, \tag{2.4}$$

where $\Theta^{-1} = X^{-1}Z + S^{-1}W$, $r = r_c - X^{-1}r_1 + S^{-1}r_2 - S^{-1}W r_u$ and $h = r_b$. Substituting $\Delta x = \Theta A^T \Delta y - \Theta r$ into the equation $A\Delta x = h$, we obtain the Normal Equations system

$$A \Theta A^T \Delta y = h + A\Theta r, \tag{2.5}$$

whose matrix is symmetric and positive definite. In the next section, a Hybrid Preconditioner (HP) is used for the preconditioning of the matrix in (2.5).
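To make the derivation concrete, the sketch below assembles one predictor-corrector right-hand side and solves (2.5) for $\Delta y$, recovering the remaining directions by the substitutions above. It is a minimal dense illustration on synthetic data (the matrix A and all vectors are randomly generated placeholders); an actual IPM implementation keeps A sparse and replaces the dense solve by a factorization or a preconditioned iterative method.

```python
import numpy as np

# Synthetic data: a feasible interior point, so r_b = r_u = r_c = 0 and
# the step is driven only by the centrality residuals r_1 and r_2.
rng = np.random.default_rng(0)
m, n = 3, 6
A = rng.standard_normal((m, n))
x, s, z, w = (rng.uniform(0.5, 2.0, n) for _ in range(4))
y = rng.standard_normal(m)
u, b, c = x + s, A @ x, A.T @ y + z - w
sigma, mu = 0.1, (x @ z + s @ w) / (2 * n)

# Residuals of (2.2), with the centering terms of the predictor-corrector
r_b, r_u, r_c = b - A @ x, u - x - s, c + w - z - A.T @ y
r_1, r_2 = sigma * mu - x * z, sigma * mu - s * w   # componentwise

# Theta = (X^{-1}Z + S^{-1}W)^{-1} and the reduced right-hand side of (2.4)
theta = 1.0 / (z / x + w / s)
r = r_c - r_1 / x + r_2 / s - (w / s) * r_u
h = r_b

# Normal equations (2.5): (A Theta A^T) dy = h + A Theta r
dy = np.linalg.solve(A @ np.diag(theta) @ A.T, h + A @ (theta * r))

# Back-substitution for the remaining directions
dx = theta * (A.T @ dy - r)
ds = r_u - dx
dz = (r_1 - z * dx) / x
dw = (r_2 - w * ds) / s
```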

3 PRECONDITIONER FOR THE NORMAL EQUATION SYSTEM

The main objective of the preconditioning technique is to accelerate the convergence of iterative methods for the solution of linear systems.

In this paper, two-sided preconditioning is used, that is, a preconditioner given by K=K1K2. The iterative method is applied to

$$K_1^{-1} A K_2^{-1} z = K_1^{-1} b \quad \text{instead of} \quad Ax = b, \tag{3.1}$$

where $x = K_2^{-1} z$. Observe that in (3.1) the system $Ax = b$ represents the system in (2.5), and the matrices $K_1$ and $K_2$ are given by the Controlled Cholesky Factorization (CCF) preconditioner in the first phase and by the Splitting Preconditioner (SP) in the second phase. More precisely, in this paper we denote as the Hybrid Preconditioner (HP) for the Interior Point Method (IPM) the approach that computes the search direction via the Conjugate Gradient Method (CGM) in two phases: the early iterations use the CCF preconditioner and, after a phase change criterion is met, the SP is used. The early iterations use the CCF preconditioner proposed in [4] successfully, but its performance degrades as the IPM approaches the optimal solution. This is justified by the fact that the condition number of the non-preconditioned Normal Equations System, see equation (2.5), is of order $O(\mu^{-2})$, where $\mu$ denotes the duality gap of the Linear Programming (LP) problem; see [10]. The SP, however, uses this characteristic in its favor to overcome the resulting ill conditioning. We now describe the preconditioners used in this paper.
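A minimal sketch of the CGM with this two-sided preconditioning is given below, under the assumption $K_2 = K_1^T$, which keeps the preconditioned matrix symmetric positive definite, as CG requires; in phase one $K_1$ is the CCF factor $\hat{L}$ and in phase two it is the SP factor of (3.9). The names matvec, k1_solve and k1t_solve are placeholders for whatever sparse kernels an implementation provides.

```python
import numpy as np

def pcg(matvec, b, k1_solve, k1t_solve, tol=1e-8, maxiter=1000):
    """Preconditioned conjugate gradients for (3.1) with K2 = K1^T.

    matvec(v)    -> (A Theta A^T) v, the NES matrix of (2.5);
    k1_solve(v)  -> K1^{-1} v  (e.g. a forward solve with L-hat);
    k1t_solve(v) -> K1^{-T} v  (the corresponding backward solve).
    """
    x = np.zeros_like(b)
    r = b - matvec(x)
    z = k1t_solve(k1_solve(r))          # z = (K1 K1^T)^{-1} r
    p = z.copy()
    rz = r @ z
    for _ in range(maxiter):
        Ap = matvec(p)
        alpha = rz / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        if np.linalg.norm(r) <= tol * np.linalg.norm(b):
            break
        z = k1t_solve(k1_solve(r))
        rz_new = r @ z
        p = z + (rz_new / rz) * p
        rz = rz_new
    return x
```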

3.1 Controlled Cholesky Factorization preconditioner

Preconditioners based on an Incomplete Cholesky Factorization (ICF) present good performance if the fill-in of the preconditioning matrix is controlled; see [14]. Consider the matrix $A\Theta A^T$ from (2.5), and let L and $\tilde{L}$ be the lower triangular matrices from the Cholesky factorization and from an Incomplete Cholesky Factorization of $A\Theta A^T$, respectively, that is,

$$LL^T = A\Theta A^T = \tilde{L}\tilde{L}^T + R,$$

where R is the residual matrix. We define the matrix $E = L - \tilde{L}$; then,

$$\tilde{L}^{-1} A\Theta A^T \tilde{L}^{-T} = (I + \tilde{L}^{-1}E)(I + \tilde{L}^{-1}E)^T;$$

observe that if $\tilde{L} \to L$ then $E \to 0$ and, therefore, $\tilde{L}^{-1} A\Theta A^T \tilde{L}^{-T} \to I_m$; this fact motivates the construction of the Controlled Cholesky Factorization (CCF) preconditioner. More precisely, the CCF preconditioner is a type of ICF based on the minimization of the Frobenius norm of the matrix E, that is, $\min \|E\|_F^2$. For this, the following minimization problem is considered:

$$\min \sum_{j=1}^{m} c_j, \quad \text{where } c_j = \sum_{i=1}^{m} |l_{ij} - \tilde{l}_{ij}|^2; \tag{3.2}$$

rewriting this problem, we have:

$$\min \sum_{j=1}^{m} \left[ \sum_{k=1}^{m_j + \eta} |l_{i_k j} - \tilde{l}_{i_k j}|^2 + \sum_{k = m_j + \eta + 1}^{m} |l_{i_k j}|^2 \right], \tag{3.3}$$

where m is the order of the matrix, $m_j$ is the number of nonzero entries below the diagonal in the j-th column of the matrix $A\Theta A^T$, and $\eta$ is the extra number of nonzero entries allowed per column. Note that, in order to minimize (3.3), the entries $\tilde{l}_{i_k j}$ kept must be the largest ones in absolute value. We denote by $\hat{L}$ the CCF matrix, that is, the matrix containing the largest elements of the ICF. Thus, the NES given in (2.5) preconditioned by the CCF preconditioner is:

$$\hat{L}^{-1} A\Theta A^T \hat{L}^{-T} \widehat{\Delta y} = \hat{L}^{-1}(h + A\Theta r), \tag{3.4}$$

where $\widehat{\Delta y} = \hat{L}^T \Delta y$.

In the first iteration the number of nonzero entries allowed is given by:

$$\eta_0 = \begin{cases} \mathrm{nnz}(A\Theta A^T)/m, & \text{if } \mathrm{nnz}(A\Theta A^T) < 10m; \\ -\mathrm{nnz}(A\Theta A^T)/m, & \text{otherwise}. \end{cases} \tag{3.5}$$

As the number of CGM iterations grows, it becomes necessary to increase the value of $\eta$: if the number of CGM iterations exceeds m/5, the value of $\eta$ is increased by 10; see [3]. When $\eta > 10$, the phase change of the Hybrid Preconditioner (HP) is performed; see [20].
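For reference, rule (3.5) can be transcribed directly; the function and argument names below are ours:

```python
def eta0_original(nnz_nes, m):
    """Initial fill-in parameter (3.5): extra fill is allowed when the
    NES matrix is sparse (nnz < 10m); otherwise entries are dropped
    (negative eta). nnz_nes stands for nnz(A Theta A^T)."""
    q = nnz_nes // m
    return q if nnz_nes < 10 * m else -q
```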

In the construction of the CCF preconditioner it is possible to encounter diagonal faults; these faults are corrected with an exponential increase. The increment value is:

$$\alpha_t = 5 \cdot 10^{-4} \cdot 2^{t-1}, \tag{3.6}$$

where $t = 1, \dots, 15$ represents the number of allowable restarts of the CCF; see [3]. There are other sequences to compute an increase of the diagonal; see [11], [12], [15].

In this way, every time a diagonal fault occurs, the computation of the elements of $\tilde{L}$ is restarted and, after 15 restarts, the value of $\alpha$ is no longer small. In order to avoid restarts, an approach is proposed in [18], based on the paper [2]. In Section 4 we present an approach that reduces the number of restarts needed to compute the CCF preconditioner by seeking an increment of the main diagonal close to the value proposed by [15].
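The classical restart strategy just described can be sketched as follows; `incomplete_cholesky` is a dense $LDL^T$ stand-in for the CCF kernel (the real code is sparse and keeps only the largest entries per column), and the fault tolerance eps is a hypothetical default:

```python
import numpy as np

class DiagonalFault(Exception):
    """Raised when a pivot d_j falls below the tolerance eps."""

def incomplete_cholesky(M, eps):
    # Dense LDL^T stand-in: the CCF would also drop all but the largest
    # m_j + eta entries of each column.
    m = M.shape[0]
    L, d = np.eye(m), np.zeros(m)
    for j in range(m):
        d[j] = M[j, j] - (d[:j] * L[j, :j] ** 2).sum()
        if d[j] < eps:
            raise DiagonalFault
        for i in range(j + 1, m):
            L[i, j] = (M[i, j] - (L[i, :j] * d[:j] * L[j, :j]).sum()) / d[j]
    return L, d

def ccf_with_restarts(M, eps=1e-8, max_restarts=15):
    """On a diagonal fault, shift the diagonal by alpha_t of (3.6) and
    restart the factorization, up to 15 times."""
    for t in range(max_restarts + 1):
        alpha = 0.0 if t == 0 else 5e-4 * 2.0 ** (t - 1)   # (3.6)
        try:
            return incomplete_cholesky(M + alpha * np.eye(M.shape[0]), eps)
        except DiagonalFault:
            continue
    raise RuntimeError("fault persists: fall back to a diagonal preconditioner")
```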

3.2 Splitting preconditioner applied to the normal equation system

The Splitting Preconditioner is based on the complementary slackness conditions of the Linear Programming problems (P) and (D), that is,

$$x_i z_i = 0 \quad\text{and}\quad s_i w_i = 0 \quad \text{for all } i = 1, \dots, n. \tag{3.7}$$

Note that the components $\theta_j = (z_j/x_j + w_j/s_j)^{-1}$ of the diagonal matrix $\Theta$ given in (2.4) and (2.5) change at each IPM iteration. In particular, near the optimal solution, due to (3.7) and the nonnegativity of the variables x, z, s, w, there will be indexes $j \in \{1, \dots, n\}$ such that $\theta_j \to 0$ or $\theta_j \to \infty$. This feature is the reason for the good performance of the Splitting preconditioner in the last IPM iterations.

In each IPM iteration, consider the ordering $\theta_{\sigma(1)} \ge \dots \ge \theta_{\sigma(m)} \ge \dots \ge \theta_{\sigma(n)}$, where $\sigma$ is a permutation of the set $\{1, \dots, n\}$ that changes from iteration to iteration. The index sets are denoted by $\mathcal{B} = \{\sigma(1), \dots, \sigma(m)\}$ and $\mathcal{N} = \{\sigma(m+1), \dots, \sigma(n)\}$. If the columns of A and $\Theta$ are reordered according to $\sigma$, the matrix in (2.5) can be written as

$$A\Theta A^T = A_B \Theta_B A_B^T + A_N \Theta_N A_N^T. \tag{3.8}$$

If the submatrix $A_B$ is non-singular, the Splitting preconditioner for the normal equations system is given by the matrix

$$P = A_B \Theta_B^{1/2}; \tag{3.9}$$

in this case, $\mathcal{B}$ and $A_B$ are known as the basic indexes and the base of the SP, respectively.

Preconditioning the matrix given in (3.8) by P, we obtain

$$P^{-1} A\Theta A^T P^{-T} = I_m + WW^T, \quad \text{where } W = \Theta_B^{-1/2} A_B^{-1} A_N \Theta_N^{1/2}.$$

An ideal situation would occur if $\Theta_B^{-1/2} \to 0$ and $\Theta_N^{1/2} \to 0$, implying $W \to 0$ and, thus, $P^{-1}(A\Theta A^T)P^{-T} \to I_m$. However, nothing guarantees that the matrix $A_B$ is non-singular and, even if it is, not every $\theta_j$ with $j \in \mathcal{B}$ is a large value. In fact, close to the optimal solution there are at least $n - m$ values of $\theta_j$ close to zero, which implies that at most m values are not small.
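The following toy check illustrates this mechanism on synthetic data: m entries of $\theta$ are made large and $n - m$ tiny, the base takes the columns with the largest $\theta_j$ (assumed non-singular, which holds almost surely for random data), and the preconditioned matrix $I_m + WW^T$ comes out with a condition number close to 1, while $A\Theta A^T$ itself is severely ill conditioned.

```python
import numpy as np

rng = np.random.default_rng(1)
m, n = 4, 10
A = rng.standard_normal((m, n))
theta = np.concatenate([10.0 ** rng.uniform(2, 8, m),       # theta_j large
                        10.0 ** rng.uniform(-8, -2, n - m)]) # theta_j tiny
theta = theta[rng.permutation(n)]

order = np.argsort(-theta)                 # decreasing theta, as above
Bi, Ni = order[:m], order[m:]
AB, AN = A[:, Bi], A[:, Ni]
W = np.diag(theta[Bi] ** -0.5) @ np.linalg.solve(AB, AN) @ np.diag(theta[Ni] ** 0.5)

print(np.linalg.cond(A @ np.diag(theta) @ A.T))   # huge
print(np.linalg.cond(np.eye(m) + W @ W.T))        # close to 1
```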

However, if $\mathcal{B} = \{\sigma(1), \dots, \sigma(m)\}$ is the index set used in an IPM iteration for the SP construction, an advantageous property of this preconditioner is that the same indexes may be reused for several iterations, making these iterations much cheaper. The papers [1], [5], [9], [19] study the choice of basic indexes for this preconditioner.

In order to study the condition number of the preconditioned matrix, assume that $\lambda$ and v are an eigenvalue and an eigenvector of the matrix $I + WW^T$, that is, $v + WW^T v = \lambda v$; multiplying this equation by $v^T$, we note that $\lambda \ge 1$, so that

$$\kappa(P^{-1} A\Theta A^T P^{-T}) = \frac{\lambda_{\max}}{\lambda_{\min}} \le \lambda_{\max}.$$

On the other hand, we observe that $\lambda_{\max}(P^{-1}A\Theta A^T P^{-T}) = \|P^{-1}A\Theta^{1/2}\|_2^2$ and

$$\lambda_{\max}(P^{-1} A\Theta A^T P^{-T}) \le \|P^{-1} A \Theta^{1/2}\|_F^2 = \sum_{j=1}^{n} \theta_j \|P^{-1} A_j\|_2^2; \tag{3.10}$$

we use (3.10) to find an upper bound for the condition number $\kappa(P^{-1}(A\Theta A^T)P^{-T})$ in Section 4.3. Finding the linearly independent columns of A that form the base of the SP may require excessive memory, because it is done through an LU factorization of A, where a zero or small pivot indicates that the corresponding column is linearly dependent. The technique proposed in [17] to deal with the excessive fill-in is to interrupt the factorization and reorder the independent columns found so far by their number of nonzero entries. In [17], the authors of the SP suggested choosing the first m linearly independent columns of A reordered by giving priority to the indexes j with the $\theta_j/\|A_j\|_1$ values in decreasing order. In [20], a new column ordering according to the $\theta_j/\|A_j\|_2$ values in decreasing order was proposed, which achieved better results in the SP performance.

4 NEW PROPOSALS

4.1 Fault correction parameter

Hereafter, the matrix $A\Theta A^T$ is denoted by $\mathcal{A}$. Suppose $\mathcal{A}$ is a scaled matrix, that is, $a_{jj} = 1$ and $|a_{ij}| \le 1$ for $i, j = 1, \dots, m$. In the construction of the CCF preconditioner, we say that there is a diagonal fault when $d_j < \varepsilon$ for some $j = 1, \dots, m$.

The proposal for the computation of the new increment $\alpha_t$ considers the $LDL^T$ factorizations of $\mathcal{A}$ and $\mathcal{A} + \alpha I$. That is, if $\bar{\mathcal{A}} = \mathcal{A} + \alpha I$, we look for matrices L, D, $\bar{L}$ and $\bar{D}$ such that $\mathcal{A} = LDL^T$ and $\bar{\mathcal{A}} = \bar{L}\bar{D}\bar{L}^T$. The subscript t of $\alpha_t$ indicates the number of attempts to correct the diagonal fault; the CCF preconditioner allows up to fifteen attempts, that is, up to fifteen restarts in its construction. From now on, for simplicity, $\alpha_t$ is denoted only by $\alpha$. Next, we establish the dependence of the entries of the matrices L and $\bar{L}$ on the parameter $\alpha$:

$$d_j = a_{jj} - \sum_{k=1}^{j-1} d_k l_{jk}^2; \tag{4.1a}$$

$$l_{ij} = \frac{1}{d_j}\left( a_{ij} - \sum_{k=1}^{j-1} l_{ik} d_k l_{jk} \right); \tag{4.1b}$$

given that $\bar{\mathcal{A}} = \mathcal{A} + \alpha I$, we get:

$$\bar{d}_j = a_{jj} + \alpha - \sum_{k=1}^{j-1} \bar{d}_k \bar{l}_{jk}^2; \tag{4.2a}$$

$$\bar{l}_{ij} = \frac{1}{\bar{d}_j}\left( a_{ij} - \sum_{k=1}^{j-1} \bar{l}_{ik} \bar{d}_k \bar{l}_{jk} \right), \tag{4.2b}$$

for $j = 1, \dots, m$ and $i = j+1, \dots, m$. We want to obtain $\bar{d}_j > \varepsilon$ in (4.2a) each time a value $\alpha$ is added to the diagonal of $\mathcal{A}$, which would require the numerical value of the summation $\sum_{k=1}^{j-1} \bar{d}_k \bar{l}_{jk}^2$ to decrease. Computationally, this was verified in [4], and in this study Proposition 4.1 is presented to justify this fact.

Proposition 4.1. If $d_j < \varepsilon$, the functions $F_u : \mathbb{R}_+ \to \mathbb{R}$, given by

$$\alpha \mapsto \sum_{k=1}^{u-1} \bar{d}_k \bar{l}_{uk}^2, \tag{4.3}$$

are decreasing, where $u = 2, \dots, j$ and $F_u(0) = \sum_{k=1}^{u-1} d_k l_{uk}^2$. Furthermore,

  • (i) $\bar{d}_j > \varepsilon$ for all $\alpha \ge \varepsilon - d_j$.

  • (ii) If $0 < d_j < \varepsilon$, then $\bar{d}_j > \varepsilon$ for all $\alpha \ge \varepsilon$.

Proof. Let j be such that $d_j < \varepsilon$, and assume that the function $F_j$ is decreasing (this fact is proved later). Then, for every $\alpha > 0$, we have $F_j(0) > F_j(\alpha)$, or

$$\sum_{k=1}^{j-1} \left( d_k l_{jk}^2 - \bar{d}_k \bar{l}_{jk}^2 \right) > 0. \tag{4.4}$$

This inequality is used to show (i) and (ii):

  • (i) In fact, by equation (4.2a):

$$\bar{d}_j = a_{jj} - \sum_{k=1}^{j-1} d_k l_{jk}^2 + \alpha + \sum_{k=1}^{j-1} d_k l_{jk}^2 - \sum_{k=1}^{j-1} \bar{d}_k \bar{l}_{jk}^2 = d_j + \alpha + \sum_{k=1}^{j-1} \left( d_k l_{jk}^2 - \bar{d}_k \bar{l}_{jk}^2 \right) > d_j + \alpha \ge \varepsilon, \tag{4.5}$$

  where the first inequality follows from (4.4) and the second from $\alpha \ge \varepsilon - d_j$.

  • (ii) Observe that, for $\alpha \ge \varepsilon$, we have:

$$\bar{d}_j = d_j + \alpha + \sum_{k=1}^{j-1} \left( d_k l_{jk}^2 - \bar{d}_k \bar{l}_{jk}^2 \right) > d_j + \alpha \ge d_j + \varepsilon > \varepsilon, \tag{4.6}$$

  since $\alpha \ge \varepsilon$ and $d_j > 0$.

In this way, the next step is to prove that the function $F_u$ defined in (4.3) is decreasing for $u = 2, \dots, j$, where $j = 2, \dots, m$.

For every j, differentiating the function $\bar{d}_k \bar{l}_{jk}$ with respect to the variable $\alpha$:

$$(\bar{d}_k \bar{l}_{jk})' = \bar{d}_k' \bar{l}_{jk} + \bar{d}_k \bar{l}_{jk}'. \tag{4.7}$$

Since $F_j'(\alpha) = \sum_{k=1}^{j-1} \left( \bar{d}_k' \bar{l}_{jk}^2 + 2 \bar{l}_{jk} \bar{d}_k \bar{l}_{jk}' \right)$, we use Equation (4.7) to obtain:

$$F_j'(\alpha) = \sum_{k=1}^{j-1} \left( -\bar{d}_k' \bar{l}_{jk}^2 + 2 \bar{l}_{jk} (\bar{d}_k \bar{l}_{jk})' \right). \tag{4.8}$$

The following demonstrates that, for every $u = 2, \dots, j$, $F_u'(\alpha) \le 0$ and, consequently, $F_j$ is decreasing. In fact, using Mathematical Induction:

  • a) Basis: when $u = 2$, from the equations in (4.2):

$$\bar{d}_1' = (d_1 + \alpha)' = 1 \quad\text{and}\quad (\bar{d}_1 \bar{l}_{21})' = (a_{21})' = 0. \tag{4.9}$$

Substituting the equations from (4.9) in (4.8):

$$F_2'(\alpha) = -\bar{d}_1' \bar{l}_{21}^2 + 2 \bar{l}_{21} (\bar{d}_1 \bar{l}_{21})' = -\bar{l}_{21}^2 \le 0.$$

  • b) Inductive step: Assume that, for every $u = 2, \dots, j-1$, $F_u'(\alpha) \le 0$. It must then be shown that $F_j'(\alpha) \le 0$. By contradiction, assume $F_j'(\alpha) > 0$; from equation (4.8):

$$0 < F_j'(\alpha) = \sum_{k=1}^{j-1} \left( -\bar{d}_k' \bar{l}_{jk}^2 + 2 \bar{l}_{jk} (\bar{d}_k \bar{l}_{jk})' \right),$$

and since $0 \le \sum_{k=1}^{j-1} \bar{d}_k' \bar{l}_{jk}^2$, we have:

$$0 < \sum_{k=1}^{j-1} 2 \bar{l}_{jk} (\bar{d}_k \bar{l}_{jk})'. \tag{4.10}$$

If the inequality (4.10) were true, it would imply that there exists an index $r \in \{1, \dots, j-1\}$ such that $\bar{l}_{jr}(\bar{d}_r \bar{l}_{jr})' > 0$, and it would follow from (4.10) that the function $(\bar{d}_r \bar{l}_{jr})^2$ is increasing. First note that, for all $u \in \{1, \dots, j-1\}$, the function $\bar{d}_u$ is increasing, since from equation (4.2a) and the inductive hypothesis:

$$\bar{d}_u' = 1 - \left( \sum_{k=1}^{u-1} \bar{d}_k \bar{l}_{uk}^2 \right)' = 1 - F_u'(\alpha) > 0, \tag{4.11}$$

for every $u = 1, \dots, j-1$. Thus, when $\alpha > 0$, we use (4.11) to obtain (observe that, if a diagonal fault occurs in column j, then $d_u > \varepsilon$ for all $u = 1, \dots, j-1$):

$$\bar{d}_u > d_u > \varepsilon > 0;$$

that is, $\bar{d}_u > 0$ for all $u \in \{1, \dots, j-1\}$. In order to prove that the function $(\bar{d}_r \bar{l}_{jr})^2$ is increasing, we consider $u = r$ to guarantee $\bar{d}_r > 0$. Thus, from (4.10):

$$\left( (\bar{d}_r \bar{l}_{jr})^2 \right)' = 2\, \bar{d}_r \bar{l}_{jr} (\bar{d}_r \bar{l}_{jr})' > 0, \tag{4.12}$$

that is, the function $(\bar{d}_r \bar{l}_{jr})^2$ is increasing. We use (4.12) to obtain a contradiction. If $(\bar{d}_r \bar{l}_{jr})^2$ is increasing, then, for all $\alpha > 0$,

$$(d_r l_{jr})^2 < (\bar{d}_r \bar{l}_{jr})^2,$$

and consequently, from the equations (4.1b) and (4.2b), for all $\alpha > 0$ it would follow that:

$$(d_r l_{jr})^2 < (\bar{d}_r \bar{l}_{jr})^2 = \left( d_r l_{jr} + \sum_{s=1}^{r-1} \left( l_{js} d_s l_{rs} - \bar{l}_{js} \bar{d}_s \bar{l}_{rs} \right) \right)^2.$$

However, when $\alpha$ approaches zero:

$$(d_r l_{jr})^2 < \lim_{\alpha \to 0} \left( d_r l_{jr} + \sum_{s=1}^{r-1} \left( l_{js} d_s l_{rs} - \bar{l}_{js} \bar{d}_s \bar{l}_{rs} \right) \right)^2 = (d_r l_{jr})^2,$$

which is a contradiction.

Therefore, $F_u'(\alpha) \le 0$ for every $u = 2, \dots, j$. Since j is arbitrary, the function $F_j$ is decreasing for every $j = 2, \dots, m$. ◻

As a consequence of Proposition 4.1, the value $\alpha = \varepsilon - d_j$ could be used. However, it is necessary that $\alpha$ be as small as possible so that $\bar{\mathcal{A}} \approx \mathcal{A}$, and there is nothing to guarantee that $\varepsilon - d_j$ is a small value. Thus, it is proposed to solve the following problem:

$$(P_\alpha)\quad \min_{\alpha > 0} \alpha \quad \text{s.t.} \quad \sum_{k=1}^{j-1} \bar{d}_k \bar{l}_{jk}^2 \le a_{jj} + \alpha - \varepsilon. \tag{4.13}$$

This approach is a consequence of equation (4.2a) and of the fact that, for each $\alpha > 0$, $\bar{d}_j > \varepsilon$ is satisfied if, and only if, $\sum_{k=1}^{j-1} \bar{d}_k \bar{l}_{jk}^2 \le a_{jj} + \alpha - \varepsilon$. Observe that, in this case, the values $\bar{d}_k$ and $\bar{l}_{jk}$ are not known for all $k = 1, \dots, j-1$, because the factorization that yields $\bar{L}\bar{D}\bar{L}^T$ has not been computed yet. In order to get an approximation of the solution of the problem $(P_\alpha)$, we look for a function that is equivalent to $F_j$, given by $\alpha \mapsto \sum_{k=1}^{j-1} \bar{d}_k \bar{l}_{jk}^2$, when $\alpha$ approaches zero (the functions $f(x)$ and $g(x)$ are called equivalent when x approaches a if $\lim_{x \to a} f(x)/g(x) = 1$; this is denoted by $f \sim_{x \to a} g$), for some $j \in \{2, \dots, m\}$.

Using Proposition 4.1, for every $\alpha > 0$ we have $F_j(\alpha) < F_j(0)$, or

$$\sum_{k=1}^{j-1} \bar{d}_k \bar{l}_{jk}^2 < \sum_{k=1}^{j-1} d_k l_{jk}^2, \tag{4.14}$$

where $j = 2, \dots, m$. Consider the functions $f_j : \mathbb{R} \to \mathbb{R}$ and $g_j : \mathbb{R} \to \mathbb{R}$ given by

$$\alpha \mapsto \sum_{k=1}^{j-1} \frac{d_k^2\, l_{jk}^2}{d_k + \alpha} \quad\text{and}\quad \alpha \mapsto \sum_{k=1}^{j-1} \frac{\alpha}{d_k + \alpha}\, d_k l_{jk}^2, \tag{4.15}$$

respectively. We use the functions $f_j$ and $g_j$ because, for every $\alpha > 0$, we have:

$$(f_j + g_j)(\alpha) = \sum_{k=1}^{j-1} d_k l_{jk}^2;$$

furthermore, from Proposition 4.1 we obtain $\sum_{k=1}^{j-1} \bar{d}_k \bar{l}_{jk}^2 \le (f_j + g_j)(\alpha)$.

Since $f_j(\alpha) \sim \sum_{k=1}^{j-1} \bar{d}_k \bar{l}_{jk}^2$ when $\alpha$ approaches zero, we look for the solution of the following problem:

$$(\bar{P}_\alpha)\quad \min_{\alpha > 0} \alpha \quad \text{s.t.} \quad f_j(\alpha) \le a_{jj} - \varepsilon + \alpha,$$

in order to obtain an approximate solution of $(P_\alpha)$.

Since the function $f_j$ is decreasing, $\alpha$ is a solution of the problem $(\bar{P}_\alpha)$ if, and only if, $f_j(\alpha) = a_{jj} - \varepsilon + \alpha$. We use the Newton-Raphson method to compute the numerical value of $\alpha$, as in Algorithm 1.

Algorithm 1: Find the value $\alpha$ that solves the equation $f_j(\alpha) = a_{jj} - \varepsilon + \alpha$.
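A sketch of Algorithm 1 follows (the names, tolerances and iteration limit are our assumptions, not the authors' code). Since $g(\alpha) = f_j(\alpha) - (a_{jj} - \varepsilon + \alpha)$ is convex and strictly decreasing, with $g(0) = \varepsilon - d_j > 0$ at a fault, the Newton iterates started from zero increase monotonically toward the root:

```python
import numpy as np

def alpha_newton(d, l_j, a_jj, eps=1e-8, tol=1e-12, maxiter=50):
    """Solve f_j(alpha) = a_jj - eps + alpha by Newton-Raphson, where
    f_j(alpha) = sum_k d_k^2 l_jk^2 / (d_k + alpha) and d = (d_1, ...,
    d_{j-1}), l_j = (l_j1, ..., l_j,j-1) come from the factorization
    pass that detected the fault in column j."""
    w = (d * l_j) ** 2                      # numerators d_k^2 l_jk^2
    alpha = 0.0                             # g(0) = eps - d_j > 0 at a fault
    for _ in range(maxiter):
        g = (w / (d + alpha)).sum() - (a_jj - eps + alpha)
        dg = -(w / (d + alpha) ** 2).sum() - 1.0    # always negative
        step = g / dg
        alpha -= step
        if abs(step) < tol:
            break
    return alpha
```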

4.2 Modification in the fill-in of the Controlled Cholesky Factorization preconditioner

We denote by $\eta$ the fill-in parameter of the Controlled Cholesky Factorization (CCF) preconditioner. The objective of this proposal is to ensure that the number of nonzero entries (nnz) of the matrix $\hat{L}$ in equation (3.4) is at most $(\mathrm{nnz}(\mathcal{A}) + 3m)/2$ when the CCF preconditioner is used in the IPM iterations. Thus, from (3.4) we have $\mathrm{nnz}(\hat{L}) = \mathrm{nnz}(\tilde{L})$.

In order to determine the initial parameter $\eta$, denoted by $\eta_0$, the quotient $\mathrm{nnz}(\mathcal{A})/\mathrm{nnz}(A)$ is computed, and from it we have the following cases:

  • (i) $\eta_0 = 1$, if $1 \le \mathrm{nnz}(\mathcal{A})/\mathrm{nnz}(A) < 2$;

  • (ii) $\eta_0 = \mathrm{nnz}(A)/m$, otherwise.

If the number of preconditioned CGM iterations is greater than m/5, the heuristic to determine the increment of $\eta$ is given by:

  • (i) $\eta_k = 1$, when $\eta_0 = 1$;

  • (ii) if, for all $j = 1, \dots, m$ and $i = j+1, \dots, m$, $|l_{ij}| < 1/(1+\alpha)$ in the iteration $k-1$, then the value $\eta_k$ will be incremented: $\eta_k = \eta_{k-1}/2$, if $\eta_{k-1} < 0$.

The final $\eta$, denoted by $\eta_f$, in both cases (i) and (ii) satisfies $\eta_f \le 1$. Thus, the largest fill-in allowed for $\hat{L}$ will be $(\mathrm{nnz}(\mathcal{A}) + m)/2 + m$, as sketched below.
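The two rules can be sketched as follows; the function names and the truncation toward zero in the update are our reading of the heuristic:

```python
def eta_initial(nnz_nes, nnz_a, m):
    """Initial fill-in parameter of Section 4.2, where nnz_nes and
    nnz_a stand for nnz(A Theta A^T) and nnz(A)."""
    ratio = nnz_nes / nnz_a
    return 1 if 1 <= ratio < 2 else nnz_a // m

def eta_update(eta_prev, eta0, max_offdiag, alpha):
    """Increment applied when the preconditioned CGM exceeds m/5
    iterations: eta stays at 1 if it started there; otherwise, if every
    |l_ij| < 1/(1+alpha) in the previous IPM iteration and eta is still
    negative, it is halved toward zero (i.e. fewer entries dropped)."""
    if eta0 == 1:
        return 1
    if max_offdiag < 1.0 / (1.0 + alpha) and eta_prev < 0:
        return int(eta_prev / 2)    # e.g. -5 -> -2: moves toward zero
    return eta_prev
```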

4.3 New ordering criterion of basic indexes for the Splitting Preconditioner

Based on the observations presented in Section 3.2, the idea arises of an ordering that simultaneously considers good conditioning and sparsity for the base of the Splitting Preconditioner (SP). We denote by $\mathrm{nnz}(A_j)$ the number of nonzero entries in column $A_j$ of the constraint matrix $A \in \mathbb{R}^{m \times n}$ of the LP problem, for $j = 1, \dots, n$.

Observe that $1 \le \mathrm{nnz}(A_j) \le m$ for every column $A_j$ of A; however, in sparse problems $\mathrm{nnz}(A_j) \ll m$. Define $k_j = \theta_j^{1/2}/\mathrm{nnz}(A_j)$ and perform a decreasing ordering of the $k_j$ elements; with this ordering, Algorithm 2 is proposed.

Algorithm 2: Find the basic indexes of the Splitting preconditioner.
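A sketch of Algorithm 2 on a dense matrix is given below; the independence test via Gram-Schmidt and the tolerance are our simplifications for clarity (the actual implementation uses a sparse LU factorization of the reordered columns, as in Section 3.2):

```python
import numpy as np

def splitting_basis(A, theta, tol=1e-10):
    """Order the columns by k_j = sqrt(theta_j)/nnz(A_j), non-increasing,
    and greedily keep the first m linearly independent ones; the columns
    are assumed nonzero."""
    m, _ = A.shape
    k = np.sqrt(theta) / (A != 0).sum(axis=0)
    basis, Q = [], []                       # Q spans the selected columns
    for j in np.argsort(-k):                # non-increasing k_j
        v = A[:, j].astype(float)
        for q in Q:                         # project out the current span
            v -= (q @ v) * q
        if np.linalg.norm(v) > tol * np.linalg.norm(A[:, j]):
            Q.append(v / np.linalg.norm(v))
            basis.append(j)
            if len(basis) == m:
                break
    return np.array(basis)                  # the basic index set B
```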

The non-increasing ordering of the values $k_j$ is motivated by the following reason: if two columns $A_{j_1}$ and $A_{j_2}$ satisfy $\mathrm{nnz}(A_{j_1}) \le \mathrm{nnz}(A_{j_2})$, that is, $A_{j_1}$ is sparser than $A_{j_2}$, then

$$1/\mathrm{nnz}(A_{j_1}) \ge 1/\mathrm{nnz}(A_{j_2}).$$

Therefore, the column $A_{j_1}$ will have priority over $A_{j_2}$ whenever $\theta_{j_1} \ge \theta_{j_2}$. Thus, while the values $\theta_j^{1/2}$ are used in Theorem 4.1 to take care of the conditioning, the values $\mathrm{nnz}(A_j)$ give priority to the sparse columns. Algorithm 2 and the proof of Theorem 4.1 are based on [16], adding a condition that takes into account the sparsity of the columns of A. To simplify the notation, we consider the permutation $\sigma = \mathrm{id}$, where id is the identity permutation; in addition, we denote $A_B$ simply by B.

Theorem 4.1. Suppose that the basic and non-basic index sets $\mathcal{B}$ and $\mathcal{N}$ of the Splitting preconditioner are obtained by Algorithm 2. Then:

  1. $\theta_j^{1/2} \|\Theta_B^{-1/2} B^{-1} A_j\| = 1$ for $j \in \mathcal{B}$;

  2. $\theta_j^{1/2} \|\Theta_B^{-1/2} B^{-1} A_j\| \le \mathrm{nnz}(A_j) \|B^{-1} A_j\|$ for $j \in \mathcal{N} = \{1, \dots, n\} \setminus \mathcal{B}$.

Also, $\kappa(P^{-1} A\Theta A^T P^{-T}) \le m K^2 \|B^{-1} A\|^2$, where $K = \max\{\mathrm{nnz}(A_j) : j = 1, \dots, n\}$.

Proof. The proof of this theorem considers two cases.

Case 1. If $j \in \mathcal{B}$, then $B^{-1} A_j = e_j$, where $e_j$ is the j-th unit vector of $\mathbb{R}^m$; so,

$$\theta_j^{1/2} \|\Theta_B^{-1/2} B^{-1} A_j\| = \theta_j^{1/2} \|\Theta_B^{-1/2} e_j\| = \theta_j^{1/2} \theta_j^{-1/2} \|e_j\| = 1. \tag{4.16}$$

Case 2. If $j \in \mathcal{N}$, two cases are considered.

Case 2.1. The column $A_j$ was not considered to enter the base according to Algorithm 2, that is, $j > b_i$; thus

$$k_{b_i} \ge k_j \quad \text{for all } b_i \in \mathcal{B}. \tag{4.17}$$

Let $\theta_0^{1/2} = \min\{\theta_{b_i}^{1/2} : b_i \in \mathcal{B}\}$ and define $k_0 := \theta_0^{1/2}/\mathrm{nnz}(A_0)$, where $A_0$ is the corresponding column; since $b_m$ is the last basic index, we have $k_0 \ge k_{b_m}$ and, using (4.17), $k_0 \ge k_j$; thus

$$\theta_j^{1/2} \|\Theta_B^{-1/2} B^{-1} A_j\| \le \frac{\theta_j^{1/2} \|B^{-1} A_j\|}{\min\{\theta_{b_i}^{1/2} : b_i \in \mathcal{B}\}} = \frac{k_j\, \mathrm{nnz}(A_j)}{k_0\, \mathrm{nnz}(A_0)} \|B^{-1} A_j\| \le \mathrm{nnz}(A_j) \|B^{-1} A_j\|. \tag{4.18}$$

Case 2.2. The column $A_j$ was considered to be the r-th column of B; however, $A_j$ turned out to be linearly dependent on the columns $A_{b_1}, A_{b_2}, \dots, A_{b_{r-1}}$, that is, $A_j = B(u, 0)^T$, where $u \in \mathbb{R}^{r-1}$; observe that $u = B^{-1} A_j$. Furthermore, $k_{b_i} \ge k_j$ for all $i = 1, \dots, r-1$. If $\theta_0^{1/2} = \min\{\theta_{b_1}^{1/2}, \dots, \theta_{b_{r-1}}^{1/2}\}$ and we define $k_0 := \theta_0^{1/2}/\mathrm{nnz}(A_0)$, then $k_0 \ge k_{b_{r-1}} \ge k_j$; thus,

$$\theta_j^{1/2} \|\Theta_B^{-1/2} B^{-1} A_j\| = \theta_j^{1/2} \left( \sum_{i=1}^{r-1} \theta_{b_i}^{-1} u_i^2 \right)^{1/2} \le \frac{k_j\, \mathrm{nnz}(A_j)}{k_0\, \mathrm{nnz}(A_0)} \|B^{-1} A_j\| \le \mathrm{nnz}(A_j) \|B^{-1} A_j\|. \tag{4.19}$$

Using (3.10), we have that

$$\lambda_{\max} = \|\Theta_B^{-1/2} B^{-1} A \Theta^{1/2}\|_2^2 \le \sum_{j=1}^{n} \theta_j \|\Theta_B^{-1/2} B^{-1} A_j\|_2^2; \tag{4.20}$$

substituting (4.16), (4.18) and (4.19) in (4.20), we have

$$\lambda_{\max}(P^{-1} A\Theta A^T P^{-T}) \le K^2 \sum_{j=1}^{n} \|B^{-1} A_j\|^2 = K^2 \|B^{-1} A\|_F^2 \le m K^2 \|B^{-1} A\|^2.$$

Furthermore, we have

$$\kappa(P^{-1} A\Theta A^T P^{-T}) \le m K^2 \|B^{-1} A\|^2, \tag{4.21}$$

since $\lambda_{\min}(P^{-1} A\Theta A^T P^{-T}) \ge 1$. ◻

Note that the condition number of the preconditioned matrix in (4.21) is uniformly bounded by a quantity that depends on the data of the problem and not on the Interior Point Method iteration.
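A toy numerical check of the bound (4.21) on synthetic data follows, reusing splitting_basis from the sketch of Algorithm 2 above; for a dense random A every column has $\mathrm{nnz}(A_j) = m$, so $K = m$:

```python
import numpy as np

rng = np.random.default_rng(2)
m, n = 4, 9
A = rng.standard_normal((m, n))
theta = 10.0 ** rng.uniform(-8, 8, n)      # widely spread, as near the optimum

Bi = splitting_basis(A, theta)             # Algorithm 2 sketch above
B = A[:, Bi]
P = B @ np.diag(np.sqrt(theta[Bi]))        # splitting preconditioner (3.9)
M = np.linalg.solve(P, A @ np.diag(theta) @ A.T) @ np.linalg.inv(P).T

K = (A != 0).sum(axis=0).max()             # K = max_j nnz(A_j)
bound = m * K**2 * np.linalg.norm(np.linalg.solve(B, A), 2) ** 2
print(np.linalg.cond(M), "<=", bound)      # the bound is iteration-independent
```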

5 NUMERICAL EXPERIMENTS

The PCx code was originally proposed in [6]; to perform the numerical experiments of this paper, PCx was modified: the direct method used in PCx to solve the linear systems was replaced by an iterative method [3].

The tests performed compare the Hybrid Preconditioner (HP) proposed in [20] with the preconditioner presented in this study, denoted by HPmod. In the SP, the base B can be maintained over some IPM iterations; this base is changed when $8 n_g \ge m$, where $n_g$ denotes the number of preconditioned CGM iterations in an IPM iteration. The problems used are in the public domain, from the Netlib, QAP, and Kennington repositories.

The first two columns of Table 1 indicate the number of rows and columns of the preprocessed problems. The following columns report the number of CCF restarts over all iterations of the first phase, the time needed to solve each problem, measured in seconds, and, in the last two columns, the number of IPM iterations needed to solve each problem. For the comparison of the approaches, the results presented in Table 1 are summarized in performance profiles. These profiles use a logarithmic scale in base 2; see [7].

Table 1:
Performance of the approaches.

Large-scale problems were tested; the criterion for choosing them was that the number of rows or columns be greater than 5000. The most significant differences are shown in bold. The symbol “−” indicates that the problem was not solved; the symbol “‡” means that the total number of restarts was greater than 15 in more than one iteration of the IPM.

The HPmod works well compared with the HP preconditioner when evaluating the total number of iterations; see Fig. 1. The motivation for a hybrid approach is that neither the CCF nor the SP worked well on its own in most of the problems tested. According to [3], the CCF preconditioner shows good results in the initial iterations of the IPM, but it may deteriorate in the later ones, since the matrix $\mathcal{A}$ becomes ill-conditioned. If the SP is used in the early iterations, it is possible that the optimal solution is not found, which happened in the problems nug05-3rd, nug06-3rd, nug07-3rd and nug08-3rd. In particular, in these problems the HP changes phase, which does not happen with the HPmod; that is, at least in these problems it was not necessary to carry out the phase change and, therefore, the optimal solution was obtained with the CCF preconditioner. In the problems where there was a phase change, with the HPmod the SP computation is performed in less time compared with the SP proposed in [20], due to the sparse columns that were used.

Figure 1:
Performance profile for iterations of IPM.

Note that the CCF preconditioner that we propose can solve problems osa-14, osa-30 and osa-60 without restarts; see Table 1. Therefore, to elaborate the performance profile presented in Fig. 2, the value 0.1 was considered instead of 0. With the modified CCF preconditioner, the diagonal fault correction parameter allows a smaller increment than the one computed with the original CCF.

Figure 2:
Performance profile for restarts to compute the CCF preconditioner.

The computation of the CCF preconditioner proposed in [4] works with a diagonal matrix when the number of restarts is greater than 15. Observe that preconditioning the system in the early iterations using a diagonal matrix seems a good strategy; however, in later iterations it may result in an increase in time or even in failure to find the optimal solution. It can be seen in Fig. 3 that the HPmod performed better than the other approach in 32 problems. The HPmod solves all the problems, since the curve of its performance profile reaches 1.

Figure 3:
Performance profile for time of IPM.

It is observed that in 14 problems the number of IPM iterations was reduced when the HPmod was used. Problems qap12, osa-14, osa-30, osa-60, nug05-3rd, nug06-3rd, nug07-3rd, and nug08-3rd were not solved by the HP. With respect to the time used to solve the problems, the HPmod was superior in 32 of the 34 problems presented. Finally, in the fifth column of Table 1 it can be seen that, with the HPmod approach, fewer than 15 restarts were performed to compute the CCF preconditioner in all IPM iterations.

6 CONCLUSIONS

The modifications in the CCF preconditioner, in both the diagonal fault correction parameter $\alpha$ and the fill-in parameter $\eta$, reduced the number of restarts in the computation of this preconditioner. With the diagonal fault correction parameter of the original CCF preconditioner, see (3.6), up to fifteen attempts to build the preconditioner were needed in more than one IPM iteration; with the new proposal, that did not happen in any IPM iteration. As a result, the number of preconditioned CGM iterations was reduced and, therefore, the processing time corresponding to the first phase of the HPmod also decreased.

In the SP, the computation of the base was accelerated because the sparse columns generated less fill-in; in addition, the proposed ordering for the SP performed well, since the number of preconditioned CGM iterations did not increase and the effort to compute B decreased.

ACKNOWLEDGMENT

We would like to thank the agencies CNPq and FAPESP for grants which supported this research.

REFERENCES

  • 1
G. Al-Jeiroudi, J. Gondzio & J. Hall. Preconditioning indefinite systems in interior point methods for large scale linear optimisation. Optimization Methods and Software, 23(3) (2008), 345-363. doi:10.1080/10556780701535910.
  • 2
    S. Bellavia, V. Simone, D. Serafino & B. Morini. A preconditioning framework for sequences of diagonally modified linear systems arising in optimization. SIAM Journal on Numerical Analysis, 50(6) (2012), 3280-3302.
  • 3
    S. Bocanegra, F.F. Campos & A.R.L. Oliveira. Using a hybrid preconditioner for solving large-scale linear systems arising from interior point methods. Computational Optimization and Applications, 36(2-3) (2007), 149-164.
  • 4
    F.F. Campos . “Analysis of conjugate gradients-type methods for solving linear equations.”. Ph.D. thesis, University of Oxford (1995).
  • 5
L. Casacio, C. Lyra, A.R.L. Oliveira & C.O. Castro. Improving the Preconditioning of Linear Systems from Interior Point Methods. Computers & Operations Research, 85(C) (2017), 129-138. doi:10.1016/j.cor.2017.04.005.
  • 6
J. Czyzyk, S. Mehrotra, M. Wagner & S.J. Wright. PCx: An Interior-Point Code for Linear Programming. Optimization Methods & Software, 11 (1999), 397-430.
  • 7
    E.D. Dolan & J.J. Moré. Benchmarking optimization software with performance profiles. Mathematical programming, 91(2) (2002), 201-213.
  • 8
J. Gondzio. Multiple centrality corrections in a primal-dual method for linear programming. Computational Optimization and Applications, 6(2) (1996), 137-156. doi:10.1007/BF00249643.
  • 9
    J. Gondzio . Interior point methods 25 years later. European Journal of Operational Research, 218(3) (2012), 587-601.
  • 10
J. Gondzio. Matrix-Free Interior Point Method. Computational Optimization and Applications, 51 (2012), 457-480.
  • 11
    M.R. Heredia & A.R.L. Oliveira . Uma nova proposta para modificar a Fatoração Controlada de Cholesky no método dos pontos interiores. 1 (2015), 2912-2923.
  • 12
    M.T. Jones & P.E. Plassmann. An improved incomplete Cholesky factorization. ACM Transactions on Mathematical Software (TOMS), 21(1) (1995), 5-17.
  • 13
    C.J. Lin & J.J. Moré . Incomplete Cholesky factorizations with limited memory. SIAM Journal on Scientific Computing, 21(1) (1999), 24-45.
  • 14
I.J. Lustig, R.E. Marsten & D.F. Shanno. On implementing Mehrotra's predictor-corrector interior-point method for linear programming. SIAM Journal on Optimization, 2(3) (1992), 435-449.
  • 15
T.A. Manteuffel. An incomplete factorization technique for positive definite linear systems. Mathematics of Computation, 34(150) (1980), 473-497.
  • 16
    R.D. Monteiro, J.W. O’Neal & T. Tsuchiya. Uniform boundedness of a preconditioned normal matrix used in interior-point methods. SIAM Journal on Optimization, 15(1) (2004), 96-100.
  • 17
    A.R.L. Oliveira & D. Sorensen. A new class of preconditioners for large-scale linear systems from interior point methods for linear programming. Linear Algebra and its applications, 394 (2005), 1-24.
  • 18
    L.M. Silva & A.R.L. Oliveira . Melhoria do desempenho da fatoração controlada de Cholesky no precondicionamento de sistemas lineares oriundos dos métodos de pontos interiores. In “Proceeding Series of the Brazilian Society of Computational and Applied Mathematics”, volume 3. SBMAC (2015), pp. 1-7.
  • 19
P. Suñagua & A.R.L. Oliveira. A new approach for finding a basis for the splitting preconditioner for linear systems from interior point methods. Computational Optimization and Applications, 67(1) (2017), 111-127. doi:10.1007/s10589-016-9887-0.
  • 20
M.I. Velazco, A.R.L. Oliveira & F.F. Campos. A note on hybrid preconditioners for large-scale normal equations arising from interior-point methods. Optimization Methods and Software, 25(2) (2010), 321-332. doi:10.1080/10556780902992829.
  • 21
S.J. Wright. “Primal-Dual Interior-Point Methods”. SIAM e-books. Society for Industrial and Applied Mathematics (SIAM) (1997).

Publication Dates

  • Publication in this collection
    16 Sept 2019
  • Date of issue
    May-Aug 2019

History

  • Received
    19 Dec 2017
  • Accepted
    02 Mar 2019