
A DISCRETE DYNAMICAL SYSTEM AND ITS APPLICATIONS

ABSTRACT

The main goal of this manuscript is to introduce a discrete dynamical system defined by symmetric matrices and a real parameter. By construction, we rediscover the Power Iteration Method from the Projected Gradient Method. Convergence of the discrete dynamical system solution is established. Finally, we consider two applications: the first consists of finding a solution of a nonlinear equation problem, and the other verifies the optimality conditions when solving quadratic optimization problems with linear equality constraints.

Keywords:
Projected gradient type-method; Power Iteration Method; Symmetric matrix; Discrete Dynamical System

1 INTRODUCTION

Discrete dynamical systems appear as a tool for understanding differential equations from a numerical point of view (for more details, see Galob (2007) and chapter 6 in Loneli & Rumbos (2003)). The classical model, in a finite dimensional space, is as follows:

x_{k+1} = F(x_k)    (1)

where F : Ω ⊆ ℝⁿ → ℝⁿ is an operator and Ω is a subset of the domain of F (the domain of F is denoted by dom(F)). According to the literature, equation (1) is not exclusive to differential equations; for example, it appears when finding fixed points of contractive operators (recall that F is contractive if ‖F(x) − F(y)‖ ≤ λ‖x − y‖, with λ ∈ (0, 1), for all x, y ∈ dom(F)). For details about contractive operators, see classical books on functional analysis, general topology or fixed point theory, for instance Brezis (1983), Istrǎţescu (1981), Kelley (1955). Another example is the autoregressive model (for more details see Shumway & Stoffer (2017)).

Given a symmetric matrix A and a real number λ such that −λ⁻¹ ∉ σ(A), we consider the following operator Tλ : S → S defined by

Tλ(x) = (I + λA)x / ‖(I + λA)x‖    (2)

where I is the identity matrix, S = {x ∈ ℝⁿ : ‖x‖ = 1} and σ(A) is the eigenvalue set of the matrix A. Note that, in this setting, I + λA is a nonsingular matrix, so the operator Tλ is well defined.

The focus of this manuscript is the operator defined by equation (2), which is very interesting, because:

  1. when either λ = 0 or A is the null matrix, the operator is the identity. Here, every x ∈ S is a fixed point of Tλ.

  2. when B = I + λA has a dominant eigenvalue (i.e. there exists an eigenvalue α* such that |α*| > |α| for every eigenvalue α ≠ α*), the operator Tλ was used in the famous Power Iteration Method introduced by R. von Mises and H. Pollaczek-Geiringer in 1929 (see Mises & Pollaczek-Geiringer (1929)).

  3. when λ⁻¹ ∈ (na, +∞), where a = max{|a_{i,j}| : A = (a_{i,j})} and n is the size of A, B = I + λA is a strongly monotone operator. Moreover, each eigenvector of A belonging to S is a fixed point of Tλ (we prove it in Section 2).
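The fixed-point property in item 3 is easy to check numerically. The following is a minimal NumPy sketch (the function name `T` and the test matrix are our own choices, not from the paper's code):

```python
import numpy as np

def T(x, A, lam):
    """The operator of equation (2): T_lam(x) = (I + lam*A)x / ||(I + lam*A)x||."""
    y = x + lam * (A @ x)
    return y / np.linalg.norm(y)

# With lam^{-1} = n*a + 1 > n*a (item 3), unit eigenvectors of A are fixed points.
A = np.array([[2.0, 1.0],
              [1.0, 2.0]])                 # symmetric, n = 2, a = 2
lam = 1.0 / (2 * 2 + 1)
v = np.array([1.0, 1.0]) / np.sqrt(2.0)    # unit eigenvector of A (eigenvalue 3)
print(np.allclose(T(v, A, lam), v))        # -> True: v is a fixed point of T
```

Here T(v) = (1 + 3λ)v/(1 + 3λ) = v, since 1 + 3λ > 0 by the choice of λ.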

1.1 The Power Iteration Method

In order to understand the Power Iteration Method, consider the so-called "Rayleigh quotient", defined for each x ≠ 0 by

r(x) = ⟨x, Bx⟩ / ⟨x, x⟩.    (3)

If x is an eigenvector, then Bx = r(x)x (i.e. r(x) is the eigenvalue corresponding to x). Suppose that {v_i}_{i=1}^n is a set of eigenvectors of B which is a basis of ℝⁿ, with Bv_i = λ_i v_i and |λ₁| > |λ₂| ≥ ··· ≥ |λ_n|. Taking v₀ ≠ 0 with ‖v₀‖ = 1, we have that v₀ = Σ_{i=1}^n α_i v_i.

Then,

Bv₀ = α₁λ₁v₁ + α₂λ₂v₂ + ··· + α_nλ_nv_n,

and so

B^k v₀ = λ₁^k (α₁v₁ + α₂(λ₂/λ₁)^k v₂ + ··· + α_n(λ_n/λ₁)^k v_n).

Here, B^k v₀/‖B^k v₀‖ converges to v₁ (up to sign), provided α₁ ≠ 0, because lim_{k→+∞} (λ_i/λ₁)^k = 0 for all i ≥ 2. The Power Iteration Method is elegant and simple, and can be stated as follows

(PIM): pick a starting vector x₀ with ‖x₀‖ = 1.
For k = 1, 2, ···: let x_k = Tλ(x_{k−1}), where A = (1/λ)(B − I),

but convergence is only guaranteed if the following two assumptions hold:

  1. Non singular matrix B has an eigenvalue that is strictly greater in absolute value than its other eigenvalues.

  2. The starting vector x₀ has a nonzero component in the direction of an eigenvector associated with the dominant eigenvalue.

The reader can verify that for B = [0 1; 1 0] (B is nonsingular and symmetric), the sequence generated by the Power Iteration Method diverges for any starting point x different from an eigenvector of B, because B does not have a dominant eigenvalue; however, B̄ = I + (1/3)B is positive definite (all its eigenvalues are strictly positive) and it has a dominant eigenvalue.
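This failure and its repair can be observed directly. A hedged sketch (our own helper `power_iteration`, not the paper's SciLab code):

```python
import numpy as np

def power_iteration(B, x0, iters):
    """Plain power iteration: repeatedly apply B and renormalize."""
    x = x0 / np.linalg.norm(x0)
    for _ in range(iters):
        x = B @ x
        x = x / np.linalg.norm(x)
    return x

B = np.array([[0.0, 1.0],
              [1.0, 0.0]])                 # eigenvalues +1 and -1: no dominant one
x0 = np.array([2.0, 1.0])
even = power_iteration(B, x0, 200)         # applying B just swaps the components,
odd = power_iteration(B, x0, 201)          # so the iterates oscillate forever
print(even, odd)

Bbar = np.eye(2) + B / 3.0                 # eigenvalues 4/3 and 2/3: dominant exists
v = power_iteration(Bbar, x0, 200)
print(v)                                   # -> close to (1, 1)/sqrt(2)
```

The shifted matrix B̄ converges because the eigenvalue ratio (2/3)/(4/3) = 1/2 drives the non-dominant component to zero.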

1.2 The Projected Gradient Method

The Projected Gradient Method was introduced by Goldstein (for more details see Goldstein (1964)) for solving the following differentiable optimization problem

(P)  maximize f(x) subject to x ∈ C    (4)

where f : C → ℝ is differentiable at each point of a nonempty closed subset C of ℝⁿ.

The essence of the Projected Gradient method is based on two facts:

  1. First order necessary optimality condition: if x̄ is a solution of (P), then ⟨∇f(x̄), d⟩ ≤ 0 for all d ∈ T(C, x̄) (here T(C, x̄) is the tangent cone of C at x̄; for more details see Crouzeix et al. (2011)).

  2. Orthogonal projection: if u is the orthogonal projection of v onto C (in short, u := P_C(v)), then ⟨v − u, y − u⟩ ≤ 0 for all y ∈ C.

Now, given a symmetric matrix A, we know that all eigenvalues of A are real numbers, and we can take n eigenvectors of A as a basis of ℝⁿ. Moreover, consider problem (P) with f : ℝⁿ → ℝ defined by f(x) = ⟨x, Ax⟩ and C = {x ∈ F : ⟨x, x⟩ = 1}, where F is a subspace of ℝⁿ generated by eigenvectors of the matrix A. The optimal value is an eigenvalue of A, and any maximizer is a normalized eigenvector associated to the optimal value.

Here, if x̄ ∈ C, then T(C, x̄) is a hyperplane in F defined by the normal vector x̄ ≠ 0 and containing the vector x̄ (from now on H(x̄, 1) := T(C, x̄)). So, the necessary condition reduces to ⟨∇f(x̄), y − x̄⟩ = 0 for all y ∈ H(x̄, 1), which is equivalent to ⟨x̄ + λ∇f(x̄) − x̄, y − x̄⟩ = 0 for all y ∈ H(x̄, 1) and λ ≠ 0 fixed, which is also equivalent to x̄ = P_{H(x̄,1)}(x̄ + λ∇f(x̄)), λ ≠ 0 fixed.

So,

x_{k+1} = P_{H(x_{k+1},1)}(x_k + λAx_k) / ‖P_{H(x_{k+1},1)}(x_k + λAx_k)‖ = (x_k + λAx_k)/‖x_k + λAx_k‖ = Tλ(x_k)

If we define B = I + λA, then the matrices A and B have the same eigenvectors. Moreover, δ is an eigenvalue of A with associated eigenvector u (Au = δu) if and only if δu = Au = λ⁻¹(B − I)u, if and only if Bu = (1 + δλ)u, if and only if (1 + δλ) is an eigenvalue of B with u an associated eigenvector. In the next section we introduce an easy result which establishes that for each symmetric matrix A and each λ ≠ 0 such that 1 + λδ > 0 for every eigenvalue δ of A, the matrix B = I + λA is a Symmetric Positive Definite (SPD) matrix.
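The spectral correspondence σ(B) = 1 + λσ(A) can be checked numerically. A small sketch on a random symmetric matrix (test data of our choosing):

```python
import numpy as np

rng = np.random.default_rng(0)
M = rng.standard_normal((4, 4))
A = (M + M.T) / 2.0                 # an arbitrary symmetric matrix
lam = 0.1
B = np.eye(4) + lam * A

eigs_A = np.sort(np.linalg.eigvalsh(A))
eigs_B = np.sort(np.linalg.eigvalsh(B))
# delta is an eigenvalue of A  <=>  1 + lam*delta is an eigenvalue of B
print(np.allclose(eigs_B, 1.0 + lam * eigs_A))   # -> True
```

Since λ > 0, the map δ ↦ 1 + λδ is increasing, so sorting both spectra aligns corresponding eigenvalues.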

In Section 2, we introduce a discrete dynamical system defined by symmetric matrices and a real parameter λ. We show that, under some conditions on the parameter λ, any sequence generated by the discrete dynamical system converges to a fixed point of the operator which defines the system. Moreover, there is an equivalence between the fixed points of the operator and the eigenvectors of the symmetric matrix.

In Section 3 we consider two applications: the first consists of finding a solution of a nonlinear equation problem, and the second verifies the optimality conditions when solving quadratic optimization problems with linear equality constraints.

2 A DISCRETE DYNAMICAL SYSTEM

We start this section with two elementary results.

Lemma 1. Let A be a non null symmetric matrix of size n and a = max{|a_{i,j}| : A = (a_{i,j})}. If λ ∈ ℝ with λ⁻¹ > na, then 1 + λβ > 0 for all β ∈ σ(A).

Proof. Take β̄ = max{|β| : β ∈ σ(A)} and consider a unit eigenvector x̄ such that β̄ = |x̄ᵀAx̄|. Then, for every β ∈ σ(A),

|β| ≤ β̄ = |x̄ᵀAx̄| = |Σ_{i,j} x̄_i x̄_j ⟨e_i, Ae_j⟩| ≤ Σ_{i,j} |x̄_i||x̄_j||a_{i,j}|.

So, we have that

|β| ≤ a Σ_{i,j} |x̄_i||x̄_j| = a (Σ_i |x̄_i|)².

But max{Σ_i |x_i| : Σ_i x_i² = 1} = n^{1/2}, attained at |x_i| = n^{−1/2} (this follows directly by applying optimality conditions). It implies that |β| ≤ na < λ⁻¹, and so the statement follows. □

Note that the eigenvalue set σ(A) of A exists, but its elements are not explicitly known in the previous Lemma.

Lemma 2. If A is a symmetric matrix and β̄ = max{|β| : β ∈ σ(A)}, then for all λ with λ⁻¹ > β̄ the matrix B = I + λA is an SPD matrix. Moreover, for each δ ∈ σ(B) we have that Bu = δu implies Au = λ⁻¹(δ − 1)u, i.e. σ(A) = λ⁻¹(σ(B) − 1).

Proof. If β ∈ σ(A) with eigenvector x, then

Bx = (I + λA)x = x + λAx = (1 + λβ)x.

But λ⁻¹ > β̄ ≥ |β|, so 1 + λβ > 0 for all β ∈ σ(A). Then, the statement follows. □

From now on, for each non null symmetric matrix A, define the operator Tλ : S → S, with S = {x ∈ ℝⁿ : ‖x‖ = 1}, by

Tλ(x) = (I + λA)x/‖(I + λA)x‖ = Bx/‖Bx‖    (5)

where λ ∈ ℝ\{0} is such that −λ⁻¹ ∉ σ(A) and B = I + λA.

Theorem 1. Let A be a non null symmetric matrix and λ such that −λ⁻¹ ∉ σ(A). A vector x* is a fixed point of Tλ if and only if there exists δ ∈ σ(A) such that Ax* = δx* and x* ∈ S.

Proof. If x* is a fixed point of Tλ, then x* = Bx*/‖Bx*‖. So Bx* ≠ 0 and ‖x*‖ = 1. Let σ(B) = {λ₁, ···, λ_n} and let {u₁, ···, u_n} ⊂ S be an eigenvector set of B such that Bu_i = λ_i u_i for all i ∈ {1, ..., n}. Here, {u₁, ···, u_n} is a basis of ℝⁿ, so x* = Σ_{i=1}^n α_i u_i = Σ_{i∈I} α_i u_i, where I = {i ∈ {1, ···, n} : α_i ≠ 0}. Note that I ≠ ∅, because x* ≠ 0. On the other hand, Σ_{i=1}^n α_i u_i = x* = Tλ(x*) = Σ_{i=1}^n α_i (λ_i/‖Bx*‖) u_i. Since {u₁, ···, u_n} is a basis of ℝⁿ, α_i = α_i λ_i/‖Bx*‖ for all i ∈ {1, ···, n}. It implies that, for all i ∈ I, 1 = λ_i/‖Bx*‖. So, λ_i = ‖Bx*‖ > 0 for all i ∈ I. Finally,

Bx* = B Σ_{i∈I} α_i u_i = Σ_{i∈I} α_i Bu_i = Σ_{i∈I} α_i λ_i u_i = ‖Bx*‖ Σ_{i∈I} α_i u_i = ‖Bx*‖ x*.

The statement follows from Lemma 2, taking δ = λ⁻¹(‖Bx*‖ − 1).

Conversely, if there exists δ ∈ σ(A) such that Ax* = δx* and x* ∈ S, then Bx* = (1 + λδ)x* and ‖Bx*‖ = 1 + λδ. The statement follows because Tλ(x*) = Bx*/‖Bx*‖ = (1 + λδ)x*/(1 + λδ) = x*. □

Now, we are able to find the solution of the discrete dynamical system, for each non null symmetric matrix A and λ = 1/(na + 1), where n = size(A) and a = max{|a_{i,j}| : A = (a_{i,j})}.

Initial step. Given a non null symmetric matrix A:

n = size(A);

a = max{|a_{i,j}| : A = (a_{i,j})};

λ = 1/(na + 1);

k = 0;

x₀ ∈ S.

Iterative step. Calculate:

x_{k+1} = Tλ(x_k) = (I + λA)x_k/‖(I + λA)x_k‖ = Bx_k/‖Bx_k‖ = B^{k+1}x₀/‖B^{k+1}x₀‖;  k = k + 1.

The following result establishes that the sequence generated by the discrete dynamical system (the solution of the discrete dynamical system) is asymptotically stable for any starting point x0S.

Theorem 2. Let A be a non null symmetric matrix. For each x₀ ∈ S, the sequence {x_k} generated by the discrete scheme converges to an eigenvector of A belonging to S, and the sequence {⟨Ax_k, x_k⟩} converges to the respective eigenvalue.

Proof. From Lemma 1, we have that B is an SPD matrix. It implies that Bx_k ≠ 0 for all k, and so x_{k+1} = Bx_k/‖Bx_k‖ is well defined for all k. Let {λ₁, ···, λ_n} be the eigenvalue set of B and {u₁, ···, u_n} ⊂ S a respective eigenvector set. Without loss of generality, consider 0 < λ_i ≤ λ_{i+1} for all i ∈ {1, ···, n − 1}. Since {u₁, ···, u_n} is an orthonormal basis of ℝⁿ, we have x₀ = Σ_{i=1}^n ε_i u_i. Here, B^k x₀ = Σ_{i=1}^n ε_i λ_i^k u_i and ‖B^k x₀‖ = (Σ_{i∈I} ε_i² λ_i^{2k})^{1/2}, where I = {i ∈ {1, ···, n} : ε_i ≠ 0}. Taking j = max{i : i ∈ I} and I_j = {l ∈ I : λ_l = λ_j}, we have

x_k = Σ_{i∈I} ε_i λ_i^k u_i / (Σ_{i∈I} ε_i² λ_i^{2k})^{1/2} = Σ_{i∈I} ε_i (λ_i/λ_j)^k u_i / (Σ_{i∈I} ε_i² (λ_i/λ_j)^{2k})^{1/2}.

Note that for any i ∈ I \ I_j we have 0 < λ_i/λ_j < 1. It implies that the sequence {x_k} converges to Σ_{i∈I_j} ε_i u_i / (Σ_{i∈I_j} ε_i²)^{1/2}. It is easy to verify that this limit is a normalized eigenvector of B associated to the eigenvalue λ_j. Since A and B have the same eigenvector set, the statement follows. □
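The whole scheme fits in a few lines. A hedged NumPy sketch of the discrete dynamical system (names and the diagonal test matrix are ours; the iteration count is arbitrary):

```python
import numpy as np

def dds_eigenpair(A, x0, iters=500):
    """The scheme above: lam = 1/(n*a+1), B = I + lam*A, x_{k+1} = B x_k/||B x_k||."""
    n = A.shape[0]
    a = np.max(np.abs(A))
    lam = 1.0 / (n * a + 1.0)
    B = np.eye(n) + lam * A
    x = x0 / np.linalg.norm(x0)
    for _ in range(iters):
        x = B @ x
        x = x / np.linalg.norm(x)
    return x, float(x @ A @ x)             # eigenvector estimate, Rayleigh value

A = np.diag([2.0, -5.0, 1.0])
x0 = np.ones(3)                            # nonzero component along every u_i
v, mu = dds_eigenpair(A, x0)
print(mu)                                  # -> close to 2.0, the largest eigenvalue
print(np.allclose(A @ v, mu * v, atol=1e-8))
```

Consistent with Theorem 2, the iterates select the eigenvalue δ of A maximizing 1 + λδ (here δ = 2), since B is SPD.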

3 APPLICATIONS

In this section we consider two applications.

3.1 The Non Linear Equation Problem

This problem consists of finding a feasible point of a nonlinear equation defined by a function f : ℝⁿ → ℝ (here, f is twice differentiable on ℝⁿ) and λ ∈ ℝ. The mathematical model is:

(NLEP)  Find x ∈ ℝⁿ such that f(x) = λ.    (6)

Take x ∈ ℝⁿ; the Taylor representation around x is

f(y) ≈ f(x) + ⟨∇f(x), y − x⟩ + (1/2)⟨∇²f(x)(y − x), y − x⟩.

Taking y = x + td with ‖d‖ = 1, we have that

f(x + td) ≈ f(x) + t⟨∇f(x), d⟩ + (t²/2)⟨∇²f(x)d, d⟩.

Using this approach, the problem consists of finding a direction d such that the function h : ℝ → ℝ defined by h(t) = f(x + td) attains the value λ for at least one real t.

Definition 1. Given a twice differentiable function f : ℝⁿ → ℝ, let (x, λ) ∈ ℝⁿ⁺¹. A vector d ∈ S is called a feasible direction for the problem (NLEP) if the equation h(t) = f(x + td) = λ has at least one real root.

We point out the case where f is a linear function. Here f(x) = ⟨a, x⟩ for some a ∈ ℝⁿ\{0}. In this case, h(t) = f(x + td) = ⟨a, x⟩ + t⟨a, d⟩. Note that d = a/‖a‖ is a feasible direction. Moreover, x + t̄d is the orthogonal projection of x onto the hyperplane {x ∈ ℝⁿ : ⟨a, x⟩ = λ}, where t̄ = (λ − ⟨a, x⟩)/‖a‖.
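The linear case can be verified in a couple of lines. A small sketch with data of our choosing:

```python
import numpy as np

# Linear f(x) = <a, x>: d = a/||a|| is feasible and x + t_bar*d is the
# orthogonal projection of x onto the hyperplane {y : <a, y> = lam}.
a = np.array([1.0, 2.0, 2.0])              # ||a|| = 3
x = np.array([1.0, 0.0, -1.0])
lam = 6.0

d = a / np.linalg.norm(a)
t_bar = (lam - a @ x) / np.linalg.norm(a)
p = x + t_bar * d
print(a @ p)                               # -> 6.0 (up to rounding): p lies on the hyperplane
```

Since p − x = t̄d is parallel to the normal a, p is indeed the closest point of the hyperplane to x.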

Lemma 3. Let f : ℝⁿ → ℝ be a quadratic function and (x, λ) ∈ ℝⁿ⁺¹. The following statements hold:

  1. The matrix D = ∇f(x)∇f(x)ᵀ + 2(λ − f(x))∇²f(x) is symmetric.

  2. If σ(D) ⊂ (−∞, 0), then the problem (NLEP) has no solution.

  3. If σ(D) ∩ (0, +∞) ≠ ∅, then the problem (NLEP) has at least one solution. Moreover, any eigenvector associated to a positive eigenvalue is a feasible direction.

  4. If d is an eigenvector associated to a null eigenvalue and ⟨∇f(x), d⟩ = 0, then d is not a feasible direction when f(x) ≠ λ.

  5. If d is an eigenvector associated to a null eigenvalue and ⟨∇f(x), d⟩ ≠ 0, then d is a feasible direction.

Proof. If f is a quadratic function, then

f(x + td) = f(x) + t⟨∇f(x), d⟩ + (t²/2)⟨∇²f(x)d, d⟩.

So, the equation f(x + td) = λ has a real solution if and only if the discriminant

⟨Dd, d⟩ = ⟨∇f(x), d⟩² − 2(f(x) − λ)⟨∇²f(x)d, d⟩ ≥ 0.

All items follow because the discriminant needs to be nonnegative in order to find real roots of the quadratic equation. □
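Lemma 3 can be exercised on a concrete quadratic. A hypothetical instance (f, x and λ chosen by us, with ∇f(x) = Qx and ∇²f(x) = Q):

```python
import numpy as np

# Hypothetical quadratic instance: f(x) = 0.5*<x, Qx>.
Q = np.diag([2.0, -2.0])
x = np.array([1.0, 0.0])
lam = 5.0

fx = 0.5 * x @ Q @ x                       # f(x) = 1
g = Q @ x                                  # gradient (2, 0)
D = np.outer(g, g) + 2.0 * (lam - fx) * Q  # the matrix of Lemma 3, item 1
assert np.allclose(D, D.T)                 # item 1: D is symmetric

w, U = np.linalg.eigh(D)
d = U[:, -1]                               # eigenvector of the largest (positive) eigenvalue
disc = (g @ d) ** 2 - 2.0 * (fx - lam) * (d @ Q @ d)

# h(t) = lam is the quadratic (1/2)<Qd,d> t^2 + <g,d> t + (fx - lam) = 0
qa, qb, qc = 0.5 * d @ Q @ d, g @ d, fx - lam
t = (-qb + np.sqrt(qb * qb - 4.0 * qa * qc)) / (2.0 * qa)
y = x + t * d
print(0.5 * y @ Q @ y)                     # -> 5.0 (up to rounding): d is feasible
```

Here ⟨Dd, d⟩ equals the discriminant qb² − 4·qa·qc of the quadratic in t, which is exactly the mechanism behind items 2-5.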

3.2 Linearly Constrained Quadratic Programming Problems

This problem can be formulated as follows:

(LCQPP)  minimize (1/2)⟨Qx, x⟩ − ⟨a, x⟩ subject to Px = b    (7)

where, without loss of generality, Q is an n × n non null symmetric matrix, P is an m × n non null matrix, a ∈ ℝⁿ and b ∈ ℝᵐ.

For the next result, we use the following notation: {λ₁, ···, λ_n} is the eigenvalue set of the matrix PᵀP, {u₁, ···, u_n} ⊂ S is an eigenvector set (i.e. PᵀPu_i = λ_i u_i for all i), I(0) = {i : λ_i = 0}, and span{u_i : i ∈ I(0)} is the subspace generated by {u_i : i ∈ I(0)}. By convention, span ∅ = {0}.

Lemma 4. If P is a non null matrix of size m × n, then

Ker(P) = Ker(PᵀP) = span{u_i : i ∈ I(0)}.

Proof. If Ker(P) = {0}, then PᵀP is nonsingular, so I(0) = ∅ and span{u_i : i ∈ I(0)} = {0}. If not, take h ∈ Ker(P)\{0}; then Ph = 0 and so PᵀPh = 0. It implies that h is an eigenvector of PᵀP associated to the null eigenvalue, and so h ∈ span{u_i : i ∈ I(0)}. Conversely, if h ∈ span{u_i : i ∈ I(0)}, then h = Σ_{i∈I(0)} α_i u_i. Hence, PᵀPh = Σ_{i∈I(0)} α_i PᵀPu_i = 0. Then 0 = ⟨PᵀPh, h⟩ = ‖Ph‖² and so Ph = 0. □

Now, consider the matrix V : ℝⁿ → ℝⁿ of the orthogonal projection onto Ker(P) = Ker(PᵀP). The matrix V can be calculated as follows: apply our scheme to obtain σ(PᵀP) = {λ₁, ···, λ_n} and {u₁, ···, u_n} such that PᵀPu_i = λ_i u_i; then V = I − Σ_{i∈I} u_i u_iᵀ, where I = {i : λ_i ≠ 0}. The following result is important because we can verify the condition ⟨Qh, h⟩ ≥ 0 for all h ∈ Ker(P) by verifying that VQV is positive semidefinite (i.e. all eigenvalues of VQV are nonnegative real values).

Corollary 1. ⟨Qh, h⟩ ≥ 0 for all h ∈ Ker(P) if and only if VQV is symmetric positive semidefinite.
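The construction of V is easy to test. A sketch on a small constraint matrix (the example data is ours; NumPy's `eigh` stands in for the paper's scheme):

```python
import numpy as np

P = np.array([[2.0, 0.0, -1.0, 2.0],
              [0.0, 1.0,  2.0, 1.0]])      # a small example constraint matrix
w, U = np.linalg.eigh(P.T @ P)             # eigenpairs of P^T P
V = np.eye(4)
for i in range(4):
    if w[i] > 1e-10:                       # i in I = {i : lambda_i != 0}
        V = V - np.outer(U[:, i], U[:, i]) # V = I - sum_{i in I} u_i u_i^T

# V is the orthogonal projection onto Ker(P): P V = 0 and V^2 = V = V^T
print(np.allclose(P @ V, 0.0))
print(np.allclose(V @ V, V))
```

Because the u_i are orthonormal, subtracting the rank-one terms over I leaves exactly the projector onto span{u_i : i ∈ I(0)} = Ker(P).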

For the next result, consider L = {x ∈ ℝⁿ : Px = b} and f : ℝⁿ → ℝ defined by f(x) = (1/2)⟨Qx, x⟩ − ⟨a, x⟩.

Theorem 3. The problem (LCQPP) has a solution if and only if

E = {z ∈ ℝⁿ⁺ᵐ : Cz = c} ≠ ∅ and ⟨Qh, h⟩ ≥ 0 for all h ∈ Ker(P),    (8)

where C = [Q Pᵀ; P 0] and c = (a, b). Moreover, if z̄ = (x̄, ȳ) ∈ E, then x̄ is a solution of (LCQPP).

Proof. If x̄ is a solution of (LCQPP), then the KKT optimality conditions imply that ∇f(x̄) + Pᵀȳ = Qx̄ − a + Pᵀȳ = 0 and Px̄ = b. So Cz̄ = c for z̄ = (x̄, ȳ). The first order necessary optimality condition tells us that (Qx̄ − a)ᵀh = 0 for all h ∈ Ker(P) (because the tangent cone of L at the point x̄ is equal to the kernel of P, denoted by Ker(P)). So, for any h ∈ Ker(P), x = x̄ + h ∈ L and f(x) = f(x̄ + h) = f(x̄) + (1/2)⟨Qh, h⟩ ≥ f(x̄), and so ⟨Qh, h⟩ ≥ 0.

Now, if E = {z ∈ ℝⁿ⁺ᵐ : Cz = c} ≠ ∅ and ⟨Qh, h⟩ ≥ 0 for all h ∈ Ker(P), then taking z̄ = (x̄, ȳ) ∈ E we claim that x̄ is a solution of (LCQPP). Indeed, Cz̄ = c implies that Px̄ = b and Qx̄ − a = −Pᵀȳ. We need to show that f(x) ≥ f(x̄) for all x such that Px = b.

Note that for h = x − x̄ we have Ph = 0. So hᵀ∇f(x̄) = hᵀ(Qx̄ − a) = −hᵀPᵀȳ = 0. It implies that f(x) = f(x̄ + h) = f(x̄) + (1/2)hᵀQh ≥ f(x̄), and the claim follows. □
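Theorem 3 suggests a direct numerical recipe: assemble the block system Cz = c, solve it, and check the optimality certificate. A small assumed instance of our choosing:

```python
import numpy as np

# Assumed instance: Q is SPD (so <Qh,h> >= 0 on Ker(P)) with one constraint.
Q = np.array([[4.0, 1.0],
              [1.0, 3.0]])
P = np.array([[1.0, 1.0]])
a = np.array([1.0, 2.0])
b = np.array([1.0])

n, m = 2, 1
C = np.block([[Q, P.T],
              [P, np.zeros((m, m))]])      # C = [Q P^T; P 0]
c = np.concatenate([a, b])                 # c = (a, b)
z = np.linalg.solve(C, c)                  # z = (x, y) solves C z = c
x, y = z[:n], z[n:]

print(np.allclose(P @ x, b))               # feasibility: P x = b
print(np.allclose(Q @ x - a + P.T @ y, 0)) # stationarity: Qx - a + P^T y = 0
h = np.array([1.0, -1.0])                  # h spans Ker(P)
print(0.5 * x @ Q @ x - a @ x <= 0.5 * (x + h) @ Q @ (x + h) - a @ (x + h))
```

For this instance the solution is x = (0.2, 0.8) with multiplier y = (−0.6), and feasible perturbations along Ker(P) only increase the objective, as the theorem asserts.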

4 NUMERICAL EXPERIMENTS

Here, we show numerical experiments using a program code written in the SciLab software. Of course, this program code is very simple and was developed by an amateur in Computer Science (Wilfredo Sosa).

The following numerical experiment verifies the optimality conditions when solving a linearly constrained quadratic programming problem. Let

A = [0 2 7 −17; 2 8 −6 −6; 7 −6 −6 0; −17 −6 0 −2],  a = (1, 2, 3, 4)ᵀ,  b = (3, 2)ᵀ  and  P = [2 0 −1 2; 0 1 2 1].

Running our program code in SciLab, we obtain the spectral set σ(PᵀP) = {9, 6, 0, 0} and the respective eigenvector set

u₁ = (−0.6666667, 1.351×10⁻⁸, 0.3333334, −0.6666667),
u₂ = (2.371×10⁻⁸, 0.4082483, 0.8164966, 0.4082483),
u₃ = (−7.366×10⁻¹⁰, 0.9128709, −0.3651484, −0.1825742),
u₄ = (0.7453560, 0, 0.2981424, −0.5962848).

By definition,

V = (eye(4,4) − CP(:,1)·CP(:,1)ᵀ)(eye(4,4) − CP(:,2)·CP(:,2)ᵀ),

where CP(:,1) and CP(:,2) are the eigenvectors associated to the nonzero eigenvalues.

Again applying our program code, now to VAV, we have that

σ(VAV) = {19.018037, 9.7597408, 0, 0}.

It implies that the vector

z̄ = (0.6662676, 0.4182381, 0.2992118, 0.9833383, 7.3928963, 5.0168612)

is a solution of Cz = c, and so x̄ is a solution of (LCQPP) (where z̄ = (x̄, ȳ)).

We also applied our scheme to find eigenvalues and eigenvectors of symmetric matrices. We simulate symmetric matrices and then calculate their eigenvalues and eigenvectors using our program code. We build symmetric matrices as follows: given two matrices (data), a diagonal matrix D and a unitary matrix V (i.e. VᵀV = VVᵀ = I), we define A := VDVᵀ; here the diagonal entries of D are the eigenvalues of A and the column vectors of V are eigenvectors of A. We generate V using the Gram-Schmidt process.
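The V·D·Vᵀ construction above can be sketched as follows (we use a QR factorization of a random matrix as a numerically stable stand-in for the Gram-Schmidt process, and a small analogue of the first test matrix):

```python
import numpy as np

# Build A = V D V^T with prescribed eigenvalues.
rng = np.random.default_rng(42)
eigenvalues = np.array([-30.0] * 3 + [0.0] * 3 + [30.0] * 3)   # small analogue of case 1
D = np.diag(eigenvalues)
V, _ = np.linalg.qr(rng.standard_normal((9, 9)))               # V^T V = V V^T = I
A = V @ D @ V.T

recovered = np.sort(np.linalg.eigvalsh(A))
print(np.allclose(recovered, np.sort(eigenvalues)))            # -> True
print(np.allclose(A, A.T))                                     # A is symmetric
```

This gives test matrices whose exact spectrum is known in advance, so any eigensolver's output can be checked against it.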

  1. The first matrix was built with 10 eigenvalues equal to -30; 10 eigenvalues equal to zero; and 10 eigenvalues equal to 30.

  2. The second matrix was built with 20 eigenvalues equal to -2000; 20 eigenvalues equal to zero; and 20 eigenvalues equal to 2000.

  3. The next matrix was built with eigenvalues following the rule λi+1=λi+1 , starting with λ1=-29 until λ59=29 and λ60=0 .

  4. The next matrix was built with 20 eigenvalues equal to -3000; 10 eigenvalues equal to -3; 20 eigenvalues equal to zero; 10 eigenvalues equal to 3; and 20 eigenvalues equal to 3000.

  5. The next matrix was built with 50 eigenvalues equal to -100 and 50 eigenvalues equal to 100.

Of course, our scheme finds all eigenvalues and an eigenvector set. Unfortunately, the Power Iteration Method does not work for the generated matrices, because the first four matrices have null eigenvalues and, in the last one, all eigenvalues have absolute value equal to 100 (so the matrix has no dominant eigenvalue).

ACKNOWLEDGEMENTS

Wilfredo Sosa was partially supported by Fundação de Apoio à Pesquisa do Distrito Federal (FAP-DF) [grants 0193.001695/2017 and 00193.00002100/2018-51] and CNPq [grants 302074/2012-0 and 471168/2013-0]. Part of this research was carried out during visits to IMPA.

References

  • 1
    BREZIS H. 1983. Analyse fonctionnelle - Théorie et applications. Masson, Paris.
  • 2
    CROUZEIX J, KERAGEL A & SOSA W. 2011. Programación Matematica diferenciable. Universidad Nacional de Ingenieria.
  • 3
    GALOB O. 2007. Discrete Dynamical Systems. Springer Verlag, Berlin Heidelberg.
  • 4
GOLDSTEIN AA. 1964. Convex programming in Hilbert space. Bulletin of The American Mathematical Society, 70: 709-710.
  • 5
    ISTRǍŢESCU VI. 1981. Fixed point theory. D. Reidel Publishing Co., Dordrecht.
  • 6
    KELLEY JL. 1955. General Topology. Van Nostrand Co.. Princeton.
  • 7
    LONELI OHE & RUMBOS PIB. 2003. Métodos dinámicos en Economia. Otra busqueda del tiempo perdido. International Thompson Editores, Mexico.
  • 8
    MISES RV & POLLACZEK-GEIRINGER H. 1929. Praktische Verfahren der Gleichungsauflösung. ZAMM - Zeitschrift für Angewandte Mathematik und Mechanik, 9: 152-164.
  • 9
    SHUMWAY RH & STOFFER DS. 2017. Time series analysis and its applications. Springer International Publishing AG.
Mathematics Subject Classification (2000): 15A18, 90C52, 90C20

APPENDIX

In this section we present the program code of our scheme, written in the SciLab software. Of course, this program code is very simple and we do not use numerical strategies in order to reduce the computation time or the accumulation of errors. For these reasons, we do not compare our program code with others in the literature, because that is not our subject. The criteria for the program code are the following:

1. Try to find a large eigenvalue in absolute value; to do so, we find j such that |A(j, j)| ≥ |A(i, i)| for all i ≠ j.

2. To build the matrix B, we consider α = 10 and L = max(|A|)·n^{1/2} + 1.

3. If max(|A|) < 10⁻⁸, then we consider the matrix A as a null matrix.

4. The error in an eigenvalue is required to be less than 10⁻¹⁶.

The following function calculates an eigenvector of a symmetric matrix A.
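The SciLab listing itself is not reproduced here. As a hedged sketch of such a routine under the criteria above (the function name, the iteration cap, and the choice of starting vector at the largest |A(j,j)| are our own interpretations; we use λ = 1/(na + 1) from Section 2 rather than criterion 2):

```python
import numpy as np

def eigenpair(A, tol=1e-16, max_iter=200000):
    """One eigenpair of symmetric A via x_{k+1} = B x_k/||B x_k||, B = I + lam*A."""
    n = A.shape[0]
    a = np.max(np.abs(A))
    if a < 1e-8:                            # criterion 3: treat A as the null matrix
        return np.eye(n)[:, 0], 0.0
    lam = 1.0 / (n * a + 1.0)
    B = np.eye(n) + lam * A
    j = int(np.argmax(np.abs(np.diag(A))))  # criterion 1: start at the largest |A(j,j)|
    x = np.eye(n)[:, j]
    mu = x @ A @ x
    for _ in range(max_iter):
        x = B @ x
        x = x / np.linalg.norm(x)
        mu_new = x @ A @ x
        if abs(mu_new - mu) < tol:          # criterion 4: eigenvalue change below 1e-16
            return x, float(mu_new)
        mu = mu_new
    return x, float(mu)

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])
v, mu = eigenpair(A)
print(mu)                                   # -> close to 3.0, the largest eigenvalue
```

Remaining eigenpairs could then be obtained by deflation, i.e. by projecting A onto the orthogonal complement of the eigenvectors already found.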


The main part of the code is the following. Of course, it is necessary to first read a matrix DC.


Publication Dates

  • Publication in this collection
    2 Dec 2019
  • Date of issue
    Sep-Dec 2019

History

  • Received
    28 Nov 2018
  • Accepted
    11 Sept 2019
Sociedade Brasileira de Pesquisa Operacional Rua Mayrink Veiga, 32 - sala 601 - Centro, 20090-050 Rio de Janeiro RJ - Brasil, Tel.: +55 21 2263-0499, Fax: +55 21 2263-0501 - Rio de Janeiro - RJ - Brazil
E-mail: sobrapo@sobrapo.org.br