AVOIDING LOOK-AHEAD IN THE LANCZOS METHOD

(1)

E. H. A Y A C H O U R (Lille)

AVOIDING LOOK-AHEAD IN THE LANCZOS METHOD

AND PAD´ E APPROXIMATION

Abstract. In the non-normal case, it is possible to use various look-ahead strategies for computing the elements of a family of regular orthogonal polynomials. These strategies consist in jumping over non-existing and singular orthogonal polynomials by solving triangular linear systems. We show how to avoid them by using a new method called ALA (Avoiding Look- Ahead), for which we give three principal implementations. The application of ALA to Pad´e approximation, extrapolation methods and Lanczos method for solving systems of linear equations is discussed.

1. Introduction. A Hankel system comes up implicitly in the Lanczos method, in Padé approximation and in extrapolation methods. The principal submatrices of a Hankel matrix are Hankel matrices of linear systems which are solved by using orthogonal polynomials. It is well known that these orthogonal polynomials satisfy a three-term recurrence relation. When some of them are singular, a breakdown (or a so-called true breakdown [8]) problem occurs in this recurrence relation. To avoid such a problem, Draux [16] has shown how to compute regular orthogonal polynomials by using look-ahead strategies. A look-ahead strategy consists in jumping over the non-existing orthogonal polynomials. Draux and Van Ingelandt applied this technique to Padé approximation in [17] where they give algorithms which allow moving in the Padé table along a diagonal, a row, a staircase consisting of two adjacent diagonals and a sawtooth consisting of two adjacent rows.

Gutknecht and Hochbruck used the Levinson–Schur type recurrences with look-ahead strategies for computing Pad´e approximants [23, 22]. Brezinski, Redivo Zaglia and Sadok have applied these look-ahead strategies to the

1991 Mathematics Subject Classification: 42C05, 41A21, 65F10, 65B05, 65B10.

Key words and phrases : orthogonal and biorthogonal polynomials, Lanczos method, Pad´e approximation, extrapolation methods.

[33]

(2)

Lanczos method [10, 12]. These strategies have also been applied to QMR by Freund, Gutknecht and Nachtigal [18, 25].

In this paper, we will show how to substitute new intermediate polynomials instead of look-ahead strategies. These intermediate polynomials are biorthogonal and satisfy a simple three-term recurrence relation. In addi- tion, they can be considered as an alternative for orthogonal polynomials which are singular or non-existent.

For a given integer n ∈ Z, we consider the linear functional C

⁽ⁿ⁾

defined on the space C[X] of polynomials by C

⁽ⁿ⁾

(x

ⁱ

) = c

_n+i

. By convention, we set c

_i

= 0 when i < 0. We denote by H

_k^θⁿ

the following determinant:

H

_k^θⁿ

=

c

_θ_n(0)+n

c

_θ_n(0)+n+1

. . . c

_θ_n(0)+n+k−1

c

_θ_n_(1)+n

c

_θ_n_(1)+n+1

. . . c

_θ_n_(1)+n+k−1

.. . .. . .. .

c

_θ_n_(k−1)+n

c

_θ_n_(k−1)+n+1

. . . c

_θ_n(k−1)+n+k−1

where θ

_n

is a permutation of N recursively defined by associating with every j ∈ N the smallest integer θ

ⁿ

(j) satisfying H

_j+1^θⁿ

6= 0. So, θ

ⁿ

(0) = i

0

if i

0

is the smallest integer such that c

_i₀+n

6= 0, θ

n

(1) is the smallest integer such that H

₂^θⁿ

6= 0, and so on.

2. Orthogonality. For a fixed integer n, let {P

i⁽ⁿ⁾

}

i

be the family of orthogonal polynomials such that, for all i, P

_i⁽ⁿ⁾

has degree i and

(1) C

⁽ⁿ⁾

(x

^j

P

_i⁽ⁿ⁾

(x)) = 0 for j = 0, 1, . . . , i − 1.

For every i and n, if the set of all solutions of (1) is a subspace of dimen- sion 1, then P

_i⁽ⁿ⁾

is called regular. The explicit expression of each orthogonal polynomial P

_i⁽ⁿ⁾

is given in [8]:

P

_i⁽ⁿ⁾

(x) =

c

n

c

n+1

. . . c

n+i

c

_n+1

c

_n+2

. . . c

_n+i+1

.. . .. . .. . c

n+i−1

c

n+i

. . . c

n+2i−1

1 x . . . x

ⁱ

/d

⁽ⁿ⁾_i

,

where d

⁽ⁿ⁾_i

is the determinant

c

_n

c

n+1

. . . c

n+i

c

n+1

c

n+2

. . . c

n+i+1

.. . .. . .. . c

_n+i−1

c

_n+i

. . . c

_n+2i−1

a

^n,i₀

a

^n,i₁

. . . a

^n,i_i

.

Each choice of the coefficients {a

^n,ij

}

^j=0,...,i

corresponds to a different nor-

(3)

malization of the orthogonal polynomial P

_i⁽ⁿ⁾

. In the sequel, we will examine three normalizations:

1. In Pad´e approximation, we choose a

^n,i₀

= a

^n,i₁

= . . . = a

^n,i_i−1

= 0 and a

^n,i_i

= 1. Thus P

_i⁽ⁿ⁾

is a monic polynomial of degree i.

2. For the Lanczos method, we set a

^n,i₁

= a

^n,i₂

= . . . = a

^n,i_i

= 0 and a

^n,i₀

= 1, which is equivalent to the condition P

_i⁽ⁿ⁾

(0) = 1.

3. In extrapolation methods, we choose a

^n,i₀

= a

^n,i₁

= . . . = a

^n,i_i

= 1, which corresponds to P

_i⁽ⁿ⁾

(1) = 1.

As we can see from the above explicit expression of P

_i⁽ⁿ⁾

, the determinant d

⁽ⁿ⁾_i

can be zero. This depends on the values assigned to a

^n,i₀

, a

^n,i₁

, . . . , a

^n,i_i

, that is, on the normalization. When P

_i⁽ⁿ⁾

is singular, d

⁽ⁿ⁾_i

is zero. In this situation, we say that there is a breakdown. The aim of this work is to introduce new regular biorthogonal polynomials with some normalization and to use them in the computation of regular orthogonal polynomials in order to avoid breakdown problems.

Let {P

i^θⁿ

}

n,i

be a family of monic polynomials such that, for all i, P

_i^θⁿ

has degree i and

(2) C

⁽ⁿ⁾

(x

^θⁿ^(j)

P

_i^θⁿ

) = 0, j = 0, 1, . . . , i − 1.

The family {P

i^θⁿ

}

n,i

contains all the monic regular orthogonal polynomials with respect to C

⁽ⁿ⁾

. The explicit expression of each polynomial P

_i^θⁿ

is

P

_i^θⁿ

(x) =

c

_θ_n_(0)+n

c

_θ_n_(0)+n+1

. . . c

_θ_n_(0)+n+i

c

_θ_n(1)+n

c

_θ_n(1)+n+1

. . . c

_θ_n(1)+n+i

.. . .. . .. .

c

_θ_n_(i−1)+n

c

_θ_n_(i−1)+n+1

. . . c

_θ_n_(i−1)+n+i

1 x . . . x

ⁱ

/H

_i^θⁿ

.

This shows that P

_i^θⁿ

is a regular orthogonal polynomial if and only if θ

_n

({0, 1, . . . , i − 1}) = {0, 1, . . . , i − 1}.

In particular, when θ

_n

is the identity, we recover the explicit expressions of adjacent orthogonal polynomials which are studied, in the normal case, in [5]. When P

_i^θⁿ

is not orthogonal with respect to C

⁽ⁿ⁾

, it is, in fact, a biorthogonal polynomial, as defined in [2].

3. Recurrence relations. Assume that there exists a regular orthogonal polynomial P

_k^θⁿ

such that

C

⁽ⁿ⁾

(x

^k

P

_k^θⁿ

) = 0.

(4)

From the explicit expression of P

_k^θⁿ

, we get C

⁽ⁿ⁾

(x

^θⁿ^(k)

P

_k^θⁿ

) = H

_k+1^θⁿ

/H

_k^θⁿ

6= 0. In the following theorem, we see how to compute the biorthogonal polynomials P

_k+1^θⁿ

, P

_k+2^θⁿ

, . . . , P

_θ^θ_nⁿ_(k)

of the family {P

i^θⁿ

}

ⁱ

, for a fixed integer n.

Theorem 3.1. For i ∈ {k, . . . , θ

ⁿ

(k) − 1}, we have (i) P

_i+1^θⁿ

= xP

_i^θⁿ

+ α

_i

P

_k^θⁿ

with

α

i

= −C

⁽ⁿ⁾

(x

^θⁿ^(k)+1

P

_i^θⁿ

)/C

⁽ⁿ⁾

(x

^θⁿ^(k)

P

_k^θⁿ

),

(ii) C

⁽ⁿ⁾

(x

^j

P

_i+1^θⁿ

) = 0 for j = 0, . . . , θ

_n

(k) and j = θ

_n

(k) + k − (i + 1), (iii) θ

_n

(i + 1) = θ

_n

(k) + k − (i + 1) and

C

⁽ⁿ⁾

(x

^θⁿ⁽ⁱ⁺¹⁾

P

_i+1^θⁿ

) = C

⁽ⁿ⁾

(x

^θⁿ^(k)

P

_k^θⁿ

) 6= 0.

P r o o f. The proof is by induction on i from i = k. It consists in proving that xP

_i^θⁿ

+ α

_i

P

_k^θⁿ

satisfies the orthogonality condition (2) for P

_i+1^θⁿ

.

This theorem shows that P

_θ(k)+1^θⁿ

is a regular orthogonal polynomial and that P

_k^θⁿ

divides P

_i^θⁿ

for i = k, k + 1, . . . , θ

_n

(k).

Theorem 3.2. Let P

_k^θⁿ

0

be the regular orthogonal polynomial of the highest degree preceding P

_k^θⁿ

. Then P

_θ^θⁿ

n(k)+1

can be computed from the recurrence relation

P

_θ^θⁿ

n(k)+1

= xP

_θ^θⁿ

n(k)

+

θn(k)

X

i=k+1

α

_i

P

_i^θⁿ

+ α

_k

P

_k^θⁿ

+ α

k−1

P

_k^θ₀ⁿ

with

α

k−1

= −C

⁽ⁿ⁾

(x

^θⁿ^(k)

P

_k^θⁿ

)/C

⁽ⁿ⁾

(x

^θⁿ^(k⁰⁾

P

_k^θⁿ

0

) 6= 0,

α

k

= −(C

⁽ⁿ⁾

(x

^θⁿ^(k)+1

P

_θ^θ_nⁿ_(k)

) + α

k−1

C

⁽ⁿ⁾

(x

^θⁿ^(k)

P

_k^θ₀ⁿ

))/C

⁽ⁿ⁾

(x

^θⁿ^(k)

P

_k^θⁿ

), α

_i

= C

⁽ⁿ⁾

(x

^θⁿ⁽ⁱ⁾

P

_k^θⁿ

0

)/C

⁽ⁿ⁾

(x

^θⁿ^(k⁰⁾

P

_k^θⁿ

0

), i = k + 1, . . . , θ(k).

P r o o f. Since P

_k^θⁿ

0

is the regular orthogonal polynomial of the highest degree preceding P

_k^θⁿ

, we have θ

_n

(k

0

) = k − 1. For fixed n, the set {P

i^θⁿ

}

i

is a basis of C[X], so we can write

(3) P

_θ^θ_nⁿ_(k)+1

= xP

_θ^θ_nⁿ_(k)

+

θn(k)

X

i=0

α

i

P

_i^θⁿ

.

Multiplying (3) by x

^θⁿ^(j)

and applying C

⁽ⁿ⁾

gives the expressions for α

_i

and shows that (3) is equivalent to

(4) P

_θ^θⁿ

n(k)+1

= xP

_θ^θⁿ

n(k)

+

θn(k)

X

i=k+1

α

_i

P

_i^θⁿ

+ α

_k

P

_k^θⁿ

+ α

_k₀

P

_k^θⁿ

0

.

(5)

By Theorems 3.1 and 3.2, there exists a polynomial W

_θ_n_(k)−k+1

(x) of degree θ

_n

(k)−k+1 such that P

θ(k)+1^θⁿ

(x) = W

_θ_n_(k)−k+1

(x)P

_k^θⁿ

(x)+α

k−1

P

_k^θⁿ

0

(x).

It is sufficient to remark that P

_k

divides P

_i^θⁿ

for i = k, . . . , θ

_n

(k). Different proofs of this property were given by Draux [16], Gragg and Lindquist [19]

and Gutknecht [21]. Notice that our proof is shorter and simpler than that of Gutknecht [21].

The polynomials P

_q^θⁿ

can be displayed in an array called the table P . We suppose that this table P contains a square block of order θ

_n

(k) − k at its kth column. This can be illustrated by the scheme

P

_k^θⁿ

P

_k+1^θⁿ⁻¹

. . . P

_k^θⁿ⁺¹

P

_k+1^θⁿ

. . . P

_k^θⁿ⁺²

P

_k+1^θⁿ⁺¹

. . . .. . .. . . ..

where P

_k^θⁿ

is regular. From the preceding results, we obtain the two relations ( P

_k^θ^m+1

= P

_k^θ^m

− e

^θ_k^m

P

_θ^θ^m+1

m+1(k−1)

,

e

^θ_k^m

= C

^(m)

(x

^k

P

_k^θ^m

)/C

^(m+1)

(x

^k−1

P

_θ^θ^m+1

m+1(k−1)

), for m = n, . . . , n + θ

_n

(k) − k, and

( P

_i+1^θ^m−1

= xP

_i^θ^m

− q

i+1^θ^m−1

P

_θ^θ^m−1

m−1(i)

, q

^θ_i+1^m−1

= C

^(m)

(x

ⁱ

P

_i^θ^m

)/C

^(m−1)

(x

ⁱ

P

_θ^θ^m−1

m−1(i)

),

for i = k, . . . , θ

_n

(k), m = n + k − i. These relations yield some properties of blocks of the table P :

Theorem 3.3 For every n ∈ Z, if the table P contains a block at its kth column as described above, then

P

_k^θⁿ⁺ⁱ

= P

_k^θⁿ

, P

_k+i^θⁿ⁻ⁱ

= x

ⁱ

P

_k^θⁿ

, i = 0, . . . , θ

_n

(k) − k,

with θ

_n−i

(k + i) = θ

_n

(k) and θ

n+i

(k) = θ

_n

(k) − i for i = 0, . . . , θ

n

(k) − k.

This was proved by Draux in [16]. Here, we only made the connection with the permutation θ

n

, which simplifies the recurrence relations. We also note that a simple proof of this theorem can be deduced from (5) and (6).

The new biorthogonal polynomials defined above are displayed inside the blocks of the table P .

Theorem 3.4. For every n ∈ Z, if the table P has a block at its kth

column as described above, then for i = k, . . . , θ

_n

(k) and m = 0, . . . , θ

_n

(k)− i,

(6)

we have

P

_i^θ^n+m

= P

_i^θⁿ

, P

_i+m^θ^n−m

= x

^m

P

_i^θⁿ

, θ

_n−m

(i + m) = θ

_n

(i), θ

n+m

(i) = θ

_n

(i) − m.

P r o o f. We use the properties of the permutation θ

_n

given in Theorems 3.3 and 3.1. Indeed, from Theorem 3.1, θ

_n−m

(i + m) = θ

_n−m

(i + m − 1) + 1, and from Theorem 3.3, θ

_n−m

(i + m − 1) + 1 = θ

^n−m+1

(i + m − 1). Thus, by applying Theorem 3.1 and then Theorem 3.3, m times, we deduce that θ

_n−m

(i + m) = θ

_n

(i).

Theorem 3.3 shows that θ

_n+m

(i) = θ

_n+m−1

(i) − 1. By applying Theo- rem 3.3, m times, we get θ

n+m

(i) = θ

_n

(i) − m.

These relations between the permutations θ

_j

show that P

_i^θⁿ

and x

^m

P

_i^θⁿ

have the properties of P

_i^θ^n+m

and P

_i+m^θ^n−m

respectively, so P

_i^θ^n+m

= P

_i^θⁿ

and P

_i+m^θ^n−m

= x

^m

P

_i^θⁿ

.

Theorem 3.4 is a generalization of Theorem 3.3.

By using the recurrence relations of Theorems 3.1 and 3.2, we can derive an algorithm for computing the regular orthogonal polynomials with respect to the functional C

⁽ⁿ⁾

. Actually, this algorithm allows one to move along a diagonal of the table P . It makes use of the intermediate biorthogonal polynomials for computing the regular orthogonal ones. The procedure is called ALA (Avoiding Look-Ahead strategy).

3.1. Implementation of ALA. Define a symmetric bilinear form g

1

on C [X] by

g

1

(ψ, ϕ) = C(ψϕ), ∀ψ, ϕ ∈ C[X].

For simplicity, we write C and θ instead of C

⁽ⁿ⁾

and θ

n

since n is fixed.

Definition 3.1. Let D = {p

0

, p

₁

, . . .} and Q = {q

0

, q

₁

, . . .} be two sets of polynomials. If g

1

(p

_i

, q

_j

) = 0 for i 6= j, then we say that D and Q are g

1

-biorthogonal.

We consider two bases {v

0

, v

₁

, . . .} and {w

0

, w

₁

, . . .} of C[X] such that, for every integer i, the polynomials v

_i

and w

_i

are of degree i. We assume that (C[X], g

1

) is regular. A subspace L ×L

^′

of C[X]×C[X] is called regular if the right-orthogonal of L,

L

^⊥^g1

= {z ∈ C[X] | ∀y ∈ L, g

¹

(y, z) = 0},

does not contain any element of L

^′

, and the left-orthogonal of L

^′

, L

^′^g1^⊥

= {y ∈ C[X] | ∀z ∈ L

^′

, g

1

(y, z) = 0}, is such that L ∩ L

^′^g1^⊥

= {0}.

As (C[X], g

₁

) is regular, we can choose two permutations σ and θ of N

such that, for every integer i, the subspace V

_i^σ

× W

i^θ

generated by {v

σ(0)

, . . .

(7)

. . . , v

_σ(i)

} × {w

θ(0)

, . . . , w

_θ(i)

} is regular. This choice enables us to build two g

1

-biorthogonal bases D = {p

⁰

, p

1

, . . .} and Q = {q

⁰

, q

1

, . . .} of monic polynomials such that, for every integer i, p

i

∈ V

i^σ

\V

i−1^σ

and q

i

∈ W

i^θ

\W

i−1^θ

. For more details, see [2].

To each pair of bases {v

⁰

, v

1

, . . .} and {w

⁰

, w

1

, . . .}, there corresponds another pair of bases which are g

₁

-biorthogonal. We are interested in the choice of {v

⁰

, v

1

, . . .} and {w

⁰

, w

1

, . . .} which yields all the regular orthogonal polynomials with respect to the functional C. This means that the corresponding g

₁

-biorthogonal bases {p

0

= P

₀^θ

, p

₁

= P

₁^θ

, . . .} and {q

0

= Q

^θ_θ(0)

, q

1

= Q

^θ_θ(1)

, . . .} satisfy Q

^θi

= P

_i^θ

= P

_i

for every monic regular orthogonal polynomial P

i

of degree i.

We will introduce three interesting choices which give three different ways for implementing the ALA method. These choices will be called C1, C2 and C3.

We now give the recurrence relations connecting the polynomials of {P

0^θ

, P

₁^θ

, . . .} and {Q

^θθ(0)

, Q

^θ_θ(1)

, . . .}, in order to apply them to the Lan- czos method. This is equivalent to substituting y for the variable x of the polynomials Q

^θ_i

, in order to have two biorthogonal bases with respect to the bilinear form g

2

defined on C[X] × C[Y ] by g

²

(x

ⁱ

, y

^j

) = C(x

^i+j

) for i, j ∈ N.

• C1 is obtained by taking for σ the identity permutation and by choosing recursively the polynomials of the bases {v

0

, v

₁

, . . .} and {w

0

, w

₁

, . . .} with v

_j

= xP

_j−1^θ

and w

_j

= x

^j

for j = 1, 2, . . . This choice was already studied in Section 2 and before this subsection.

• C2 consists in setting

v

_j

= xP

_j−1^θ

, j = 1, 2, . . . ,

w

j

= x

^j−k

Q

^θ_k

, j = k + 1, . . . , θ(k) + 1,

k = 0, θ(0) + 1, θ(θ(0) + 1) + 1, . . .

The degrees of the polynomials v

_j

and w

_j

are equal to their indices. The definitions of P

_j^θ

and Q

^θ_j

yield the recurrence relations

Q

^θ_j

= x

^j−k

Q

^θ_k

, j = k + 1, . . . , θ(k),

(7) (

α

j−1

= C(xP

_j−1^θ

Q

^θ_θ(k)

)/C(P

_k^θ

Q

^θ_θ(k)

),

P

_j^θ

= xP

_j−1^θ

− α

^j−1

P

_k^θ

, j = k + 1, . . . , θ(k), (8)

 

 

 

 



α

θ(k)

= C(xP

_θ(k)^θ

Q

^θ_θ(k)

)/C(P

_k^θ

Q

^θ_θ(k)

), β

_θ(k)

= C(P

_θ(k)^θ

Q

^θ_k

)/C(P

_θ(k−1)^θ

Q

^θ_k−1

), P

_θ(k)+1^θ

= xP

_θ(k)^θ

− α

θ(k)

P

_k^θ

− β

θ(k)

P

_θ(k−1)^θ

, Q

^θ_θ(k)+1

= xQ

^θ_θ(k)

−

θ(k)

X

i=k+1

α

_θ(i)

Q

^θ_i

− α

θ(k)

Q

^θ_k

− β

θ(k)

Q

^θ_θ(k−1)

.

(9)

(8)

For every i, if the polynomials Q

^θ_i

and P

_i^θ

are both orthogonal, then Q

^θ_i

= P

_i^θ

. Replacing Q

^θ_i

by P

_i^θ

if Q

^θ_i

= P

_i^θ

, these three equations lead to an implementation of the ALA method where only three vectors need to be stored.

The coefficients α

θ(i)

of the relation which gives Q

^θ_θ(k)+1

can also be computed by using the polynomials of the set {P

k−1^′^θ

, P

_k^′^θ

, . . . , P

_θ(k)^′^θ

, P

_θ(k)+1^′^θ

} which is g

1

-biorthogonal to {xQ

^θθ(k)

, Q

^θ_θ(k)

, Q

^θ_θ(k)−1

, . . . , Q

^θ_k

, Q

^θ_θ(k−1)

} with P

_k−1^′^θ

= P

_k−1^θ

. The computation of the polynomials P

_i^′^θ

is via the following relation which connects them to P

_i^θ

:

(10)

( λ

_i

= C(P

_i^θ

xQ

^θ_θ(k)

)/C(P

_k^θ

Q

^θ_θ(k)

),

P

_i^′^θ

= P

_i^θ

− λ

i

P

_k−1^θ

, i = k, k + 1, . . . , θ(k) + 1.

Thanks to these polynomials, the expression for α

_θ(i)

is α

θ(i)

= −β

^θ(k)

C(Q

^θ_θ(k−1)

P

_θ(i)^′^θ

)/C(Q

^θ_i

P

_θ(i)^′^θ

).

If we only need to compute P

_j^θ

, then we can use the simpler relation

(11)

 



 

α

_j

= C(x

^θ(k)−k+1

P

_j^θ

P

_k^θ

)/C(x

^θ(k)−k

P

_k^θ

P

_k^θ

), β

j

= C(P

_j^θ

P

_k^θ

)/C(P

_θ(k−1)^θ

P

_k−1^θ

),

P

_j+1^θ

= xP

_j^θ

− α

^j

P

_k^θ

− β

^j

P

_θ(k−1)^θ

,

j = k, k + 1, . . . , θ(k),

for k = 0, θ(0) + 1, θ(θ(0) + 1) + 1, . . . The initializations of this recurrence relation are P

₀^θ

= 1 and P

₋₁^θ

= 0 with θ(−1) = −1.

• C3 consists in taking

v

_j

= x

^j−k

P

_k^θ

, j = k + 1, k + 2, . . . , n

_k

, v

j+1

= xP

_j^θ

, j = n

_k

, n

_k

+ 1, . . . , θ(k), w

_j

= x

^j−k

Q

^θ_k

, j = k + 1, k + 2, . . . , n

_k

, w

_j+1

= xQ

^θ_j

, j = n

_k

, n

_k

+ 1, . . . , θ(k),

for k = 0, θ(0) + 1, θ(θ(0) + 1) + 1, . . . , with n

k

= ⌊(θ(k) + k + 1)/2⌋. The degrees of v

_j

and w

_j

are equal to their indices. For a complete study of this choice, see [2].

For C2, we deduce from (11) the following theorem which generalizes the classical recurrence relation for regular orthogonal polynomials.

Theorem 3.5. Every regular orthogonal polynomial P

_θ(k)+1^θ

satisfies a recurrence relation of the form

P

_θ(k)+1^θ

= xP

_θ(k)^θ

− α

θ(k)

P

_k^θ

− β

θ(k)

P

_θ(k−1)^θ

,

where P

_k^θ

is the regular orthogonal polynomial of the highest degree preceding

(9)

P

_θ(k)+1^θ

. The degrees of P

_θ(k)+1^θ

, P

_θ(k)^θ

, P

_k^θ

and P

_θ(k−1)^θ

are equal to their lower indices.

In the following, we are most interested in the application of C2 because of its particular characteristics which are detailed in [2].

4. Application to the Lanczos method. Let us begin by describing the Lanczos method following [13, 14].

4.1. Description. We want to find the solution of the linear system Ax = b, where A ∈ C

^n×n

is supposed to be non-singular, b ∈ C

ⁿ

and x ∈ C

ⁿ

.

Let x

0

and y

0

be arbitrary vectors in C

ⁿ

and define two sequences (x

_k

)

_k

and (r

k

)

k

of vectors by

x

_k

− x

⁰

∈ K

k

(A, r

0

), (12)

r

_k

= b − Ax

k

⊥ K

k

(A

^∗

, y

0

), (13)

where K

_k

(A, r) = span(r, Ar, . . . , A

^k−1

r) and A

^∗

denotes the conjugate transpose of the matrix A.

The Lanczos method is completely defined by (12) and (13). It consists recursively in projecting the initial residual r

0

on the Krylov space K

_k

(A, Ar

0

), orthogonally to K

_k

(A

^∗

, y

0

) with respect to the Hermitian product h·, ·i of C

ⁿ

. Here h·, ·i replaces the form g

1

introduced before. From (12), we can write

(14) x

k

− x

⁰

= −α

¹

r

0

− α

²

Ar

0

− . . . − α

^k

A

^k−1

r

0

. Multiplying (14) by A and subtracting b, we obtain

r

_k

= r

₀

+ α

₁

Ar

₀

+ . . . + α

_k

A

^k

r

₀

. (13) implies

(15) hr

k

, A

^∗ⁱ

y

₀

i = 0 for i = 0, . . . , k − 1.

If we consider the polynomial P

_k

(ξ) = 1 + α

1

ξ + . . . + α

_k

ξ

^k

, then r

_k

= P

k

(A)r

0

. Let us now define the linear functional C on C[X] by C(ξ

ⁱ

) = c

_i

= hA

ⁱ

r

0

, y

0

i, i = 0, 1, . . . , and the functional C

⁽¹⁾

by C

⁽¹⁾

(ξ

ⁱ

) = C(ξ

ⁱ⁺¹

), i = 0, 1, . . . The polynomial P

_k

satisfies

C(ξ

ⁱ

P

_k

(ξ)) = 0 for i = 0, . . . , k − 1, P

_k

(0) = 1.

So, P

_k

is a formal orthogonal polynomial with respect to the linear functional C normalized by the condition P

_k

(0) = 1.

Let P

_k⁽¹⁾

be the monic polynomial of degree k satisfying C

⁽¹⁾

(ξ

ⁱ

P

_k⁽¹⁾

(ξ)) = 0 for i = 0, . . . , k − 1.

(P

_k⁽¹⁾

)

_k

and (P

_k

)

_k

are called adjacent families [29]. We can easily see that,

for each k ∈N

^∗

, P

_k

and P

_k⁽¹⁾

exist and are unique if and only if the Hankel

(10)

determinant

H

_k⁽¹⁾

=

c

1

c

2

. . . c

k

c

2

c

3

. . . c

k+1

.. . .. . .. . c

_k

c

k+1

. . . c

2k−1

is different from zero. In order to define uniquely the two sequences (P

_k^θ

)

_k

and (P

_k^θ¹

)

k

with only one permutation θ, (P

_k^θ

)

k

and (P

_k^θ¹

)

k

will be normalized by P

_k^θ

(0) = 1 and P

_k^θ¹

monic of degree k.

Even if P

_k^θ

is not orthogonal, we set r

_k

= P

_k^θ

(A)r

0

. The polynomial P

_k^θ

satisfies

C(ξ

^θ(i)

P

_k^θ

(ξ)) = 0 for i = 0, . . . , k − 1, P

_k^θ

(0) = 1.

Consequently, (15) becomes

(16) hr

k

, A

^∗^θ(i)

y

₀

i = 0 for i = 0, . . . , k − 1.

(16) is equivalent to the linear system

(S)

 



 

α

1

c

_θ(0)+1

+ α

2

c

_θ(0)+2

+ . . . + α

_k

c

_θ(0)+k

= −c

θ(0)

, α

1

c

θ(1)+1

+ α

2

c

θ(1)+2

+ . . . + α

k

c

θ(1)+k

= −c

^θ(1)

, . . .

α

1

c

θ(k−1)+1

+ α

2

c

θ(k−1)+2

+ . . . + α

_k

c

θ(k−1)+k

= −c

^θ(k−1)

.

According to the definition of θ, the determinant of (S) is H

_k^θ¹

6= 0. So, (S) has a unique solution.

A survey of the various algorithms for implementing the Lanczos method is given in [14]. Here, we only present the application of ALA to Lan- czos/Orthodir which is described in [24, 31].

4.2. Lanczos/Orthodir. Several Lanczos/Orthodir type algorithms were given, for example, in [14]. In particular, we cite the algorithm known as Biodir [21].

According to C2, we apply ALA to Lanczos/Orthodir. This is also equivalent to applying ALA to Biodir.

In order to compute P

k+1

, we use the formula (17)

( P

_i+1^θ

= P

_i^θ

− λ

i

ξP

_i^θ¹

,

λ

_i

= C(P

_k^θ

Q

^θ_θ(i)¹

)/C(ξP

_k^θ¹

Q

^θ_θ(k)¹

), i = k, k + 1, . . . , θ(k).

Its proof is by induction from i = k to i = θ(k), it consists in proving that

P

_i^θ

− λ

ⁱ

ξP

_i^θ¹

satisfies the orthogonality condition (2) for P

_i+1^θ

. This formula

requires the knowledge of the polynomials P

_i^θ¹

, Q

^θ_i¹

. These polynomials are

obtained from (7)–(9) if we substitute C

⁽¹⁾

for C and θ

1

for θ. Therefore,

to apply ALA to Lanczos/Orthodir according to C2, we use the formulas

(7)–(9) and (17).

(11)

Now, we are able to give an algorithm which allows avoiding the look- ahead strategy in Biodir. This algorithm consists of three steps:

• initialization,

• determination of the next existing regular orthogonal polynomial, which is equivalent to determining σ(k) at the iteration k,

• computation of the iterates x

^k+1

, z

k+1

and the residual vector r

k+1

at the iteration k.

Set z

_k

= P

_k^θ¹

(A)r

0

and y

_k

= Q

^θ_k¹

(A

^∗

)y

0

for k = 0, 1, . . . Algorithm 1

• Step 1 (Initialization): Choose x

⁰

and y

0

arbitrary in C

ⁿ

, set r

0

= b − Ax

⁰

, z

0

= r

0

, z

−1

= y

−1

= (0, 0, . . . , 0)

^t

, h

−1

= 1, θ(−1) = −1 and k = 0.

• Step 2 (the determination of σ(k)):

1 i = 0

2 e

_i

= hy

k+i

, r

_k

i y

k+i+1

= A

^∗

y

k+i

h

k+i

= hy

^k+i+1

, z

_k

i

if |h

^k+i

| < tol for some tolerance tol, then i = i + 1, go to 2

end (if) θ(k) = k + i.

• Step 3:

b

_k

= h

θ(k)

/h

k−1

for i = k, . . . , θ(k) λ

i

= e

θ(k)−i

/h

θ(k)

x

_i+1

= x

_i

+ λ

_i

z

_i

r

_i+1

= r

_i

− λ

i

Az

_i

β

_i

= hy

θ(k)+1

, Az

_i

i/h

θ(k)

z

i+1

= Az

_i

− β

i

z

_k

y

θ(k)+1

← y

^θ(k)+1

− β

i

y

θ(i)

end (for)

z

θ(k)+1

← z

^θ(k)+1

− b

^k

z

θ(k−1)

y

_θ(k)+1

← y

θ(k)+1

− b

k

y

_θ(k−1)

k = θ(k) + 1

go to 1

end.

(12)

It is important to notice that, for each iteration of this algorithm, we have a product of A and A

^∗

by a vector and three inner products. The coding of this algorithm needs the storage of 9 + m vectors, where m = max

_k

(θ(k) − k + 1) < n.

4.3. Numerical results. First, let us mention that the computations were performed on a computer working with 16 decimal digits in double precision and our tests were run using FORTRAN 77.

Let kr

k

k be the residual norm obtained, at iteration k, by Algorithm 1.

The algorithm is stopped at the kth iteration if kr

^k

k < eps, where eps is a given tolerance.

Example 1 . Consider the example of [12]:



 



0 0 0 . . . 0 −1 1 0 0 . . . 0 0 0 1 0 . . . 0 0 .. . .. . .. . .. . .. . 0 0 0 . . . 1 0



 





 

 1 2 3 .. . n



 



=



 



−n 1 2 .. . n − 1



 

 .

We take n = 1000 and choose y

0

= (1, 0, 0, . . . , 0, 0, 1)

^t

, x

0

= (0, 0, . . . , 0)

^t

. For tol = 10

⁻¹

, 10

⁻²

, . . . , 10

⁻¹⁶

, eps = 10

⁻¹²

, we get

θ(0) = 0, θ(k) = 999 − k for k = 1, . . . , 998, θ(999) = 999.

There is stagnation from k = 1 until iteration k = 998. At the end of this stagnation, we obtain kr

999

k = 1.58 · 10

⁴

and kr

1000

k = 9.55 · 10

⁻⁶

.

Example 2 . We consider a matrix obtained from discretization of the elliptic partial differential equation

Lu = f on [0, 1] × [0, 1], where

Lu = −∆u + s ∂u

∂x ,

with Dirichlet boundary conditions u = 0, using a five-point centered finite difference scheme on a uniform 20 × 20 grid with mesh size h = 1/21.

This yields a sparse non-symmetric matrix of order n = 400 with 1920

non-zero elements. We choose s = 10

⁴

. By applying Algorithm 1 to this

matrix with tol = 10

⁻¹

, 10

⁻²

, . . . , 10

⁻¹⁶

, eps = 10

⁻⁸

, y

₀

= (0, 0, . . . , 0, 0, 1)

^t

,

x

0

= (0, 0, . . . , 0)

^t

, b = (1, 0, 0, . . . , 0)

^t

, we get

(13)

0 10 20 30 40 50 60 70

−9

−8

−7

−6

−5

−4

−3

−2

−1 0

Iterations

log10 of residual norm

figure 1, example 2, n=400

As for the first example, there is stagnation at the beginning. Afterwards, we obtain a good convergence to the exact solution. We also remark that the convergence curve presents some peaks. It is well known that these peaks are characteristic of Lanczos type methods.

Example 3. We consider a matrix arising from discretization of the 3-dimensional partial differential equation

Lu = f on [0, 1] × [0, 1] × [0, 1], where

Lu = −∆u + x ∂u

∂x + y ∂u

∂y + z ∂u

∂z − u,

with Dirichlet boundary conditions u = 0. The operator was discretized

using a seven-point centered finite difference scheme on a uniform 5 × 5 × 5

grid with mesh size h equal to 1/6. This yields a sparse non-symmetric

matrix of order n = 125, with 725 non-zero elements. By using Algorithm 1

with tol = 10

⁻¹

, 10

⁻²

, . . . , 10

⁻¹⁶

, eps = 10

⁻¹⁴

, y

0

= (0, 0, . . . , 0, 0, 1)

^t

,

x

₀

= (0, 0, . . . , 0)

^t

, b = (1, 0, 0, . . . , 0)

^t

, we obtain the following convergence

curve:

(14)

0 5 10 15 20 25 30

−15

−10

−5 0

Iterations

log10 of residual norm

figure 2, example 3, n=125

For the values of the permutation θ, we get θ(k) =

12 − k for k = 0, 1, . . . , 12, k for k = 13, 14, . . .

At the beginning, we have stagnation from k = 0 until k = 12. After this stagnation, the residual norm converges quickly to zero. At iteration k = 28, we obtain kr

²⁸

k = 7.52 · 10

⁻¹⁵

.

Let us indicate that we have compared our estimation of the residual norm given by Algorithm 1 with the actual one. This comparison shows that both our estimation and the actual residual norm coincide.

4.4. Application to the non-hermitian Lanczos process. Define the symmetric bilinear form g

1

by g

1

(u, w) = w

^t

u for all u, w ∈ C

ⁿ

. According to C2, we get the following process.

Process 1 . Choose v

1

, v

2

∈ C

ⁿ

and set λ

1

p

1

= v

1

, µ

1

q

1

= v

2

, p

0

= 0, k = 1 (λ

1

and µ

1

are chosen such that kp

¹

k = kq

¹

k = 1).

Compute 1 i = 0

2 d

_k

= g

1

(p

_k

, q

k+i

)

(15)

if |d

k

| < tol (for some tolerance tol), then i = i + 1

µ

k+i

q

k+i

= A

^∗

q

k+i−1

(µ

k+i

is chosen such that kq

^k+i

k = 1) go to 2

end (if)

θ(k + j) = k + i − j, j = 0, 1, . . . , i q

_θ(k)+1

= A

^∗

q

_θ(k)

for j = k, k + 1, . . . , θ(k):

α

j

= g

1

(Ap

j

, q

θ(k)

)/g

1

(p

k

, q

θ(k)

) p

j+1

= Ap

_j

− α

j

p

_k

β

_j

= g

1

(q

_θ(k)+1

, p

_j

)/g

1

(q

_θ(j)

, p

_j

) q

_θ(k)+1

← q

θ(k)+1

− β

j

q

_θ(j)

if j = θ(k), then

α

^′_j

= g

1

(Ap

j

, q

k−1

)/g

1

(p

θ(k−1)

, q

k−1

) p

j+1

← p

^j+1

− α

^′j

p

θ(k−1)

β

_j^′

= g

1

(q

_θ(k)+1

, p

k−1

)/g

1

(q

_θ(k−1)

, p

k−1

) µ

_θ(k)+1

q

_θ(k)+1

← q

θ(k)+1

− β

j^′

q

_θ(k−1)

(µ

_θ(k)+1

is chosen such that kq

θ(k)+1

k = 1) end (if)

λ

j+1

p

j+1

← p

^j+1

(λ

j+1

is chosen such that kp

^j+1

k = 1) end (for)

k = θ(k) + 1, go to 1 end.

For solving a linear system, we use a process which allows us to triangu- larize, tridiagonalize or transform the matrix of the system to another one for which we have to find its inverse, as for example the Hessenberg matrix.

Here, for each iteration k of Process 1, we get the following factorization:

A(p

_k

p

k+1

. . . p

_θ(k)

) = (p

_θ(k−1)

p

_k

p

k+1

. . . p

_θ(k)

p

_θ(k)+1

)

d

^′_k^t

H e

_k^′

where d

^′_k^t

= α

^′_θ(k)

(0, 0, . . . , 0, 1) ∈ C

^θ(k)−k+1

, and the matrix e H

^′_k

with θ(k) − k + 2 rows and θ(k) − k + 1 columns is



 



α

_k

α

_k+1

. . . α

_θ(k)

λ

k+1

λ

k+2

. ..

λ

_θ(k)+1



 

 .

Let us discuss the stopping criterion for this process. We consider two

Krylov subspaces W

₂

= K

_n

(A, v

₁

) = span(v

₁

, Av

₁

, A

²

v

₁

. . .) and W

₃

=

K

_n

(A

^∗

, v

2

) = span(v

2

, A

^∗

v

2

, A

^∗²

v

2

, . . .). There are two cases to consider:

(16)

• The first one corresponds to not having a breakdown at iteration k if k ≤ min{l, l

^′

} with l = dim W

²

and l

^′

= dim W

3

. This means that the subspace (W

2

× W

³

, g

1

) is regular.

• The second case is the situation where there is a serious incurable breakdown. It corresponds to a breakdown occurring at iteration k with k ≤ min{l, l

^′

} and it means that (W

²

×W

³

, g

1

) is not regular. Consequently, Process 1 cannot be used and the solution is to make another choice of the vectors v

1

and v

2

in C

ⁿ

.

Remark 4.1. This process needs the storage of m + 5 vectors of C

ⁿ

, where m = max

_k

{θ(k) − k + 1}. m + 5 is smaller than the number 2m + 4 of vectors used in look-ahead strategies. In the regular case, the classical Lanczos process only needs 6 vectors. This number coincides with m + 5, since in the regular case, θ(k) = k, which implies that m = 1 for all k.

We note that the factorization of Process 1 has also been used by Ziegler in [32, 33], where he talks about a special look-ahead strategy.

Remark 4.2. We have shown how to apply C2 to the Lanczos method.

We can do the same for the CGM-type (Conjugate Gradient Multiplied) methods which have been simultaneously introduced by Brezinski [7] and Gutknecht [20], and which are also known under the name of “product-type methods”. The CGM class contains CGS (Conjugate Gradient Squared) due to Sonneveld [27] and Bi-CGSTAB due to Van Der Vorst [28].

5. Application to Pad´ e approximation. Orthogonal polynomials and their associates implicitly come up in the computation of Padé approximants. Blocks of a non-normal Padé table are due to the non-existence and singularity of some orthogonal polynomials. In this section, we give relations between orthogonal, reciprocal, associated and intermediate polynomials introduced in [1], and we show how to apply them to the recursive computation of Padé approximants.

Let f be a formal power series f (t) = c

0

+ c

1

t

¹

+ c

2

t

²

+ . . . with c

_i

∈ C for i ∈ N. We look for a rational fraction

R(t) = Q(t)

P (t) = a

0

+ a

1

t + . . . + a

_p

t

^p

b

0

+ b

1

t + . . . + b

_q

t

^q

whose power series expansion in ascending powers of t agrees with f as far

as possible, which means that f (t) − R(t) = O(t

^p+q+1

) (t → 0). Such a

rational fraction is called a Pad´e approximant of f and it is denoted by

[p/q]

_f

(t). Usually these approximants are displayed in a two-dimensional

array called the Pad´e table. Identical Pad´e approximants can only occur in

square blocks in the Pad´e table. If there is no block, we say that the Pad´e

table is normal. Otherwise, it is called non-normal.

(17)

For every n ∈ Z, define the linear functional C

⁽ⁿ⁾

on the space of complex polynomials by C

⁽ⁿ⁾

(x

ⁱ

) = c

n+i

with the convention that c

_i

= 0 for i < 0.

C

⁽ⁿ⁾

is associated with the formal power series

f

_n

(t) = c

_n

+ c

n+1

t + c

n+2

t

²

+ . . .

5.1. The associated polynomials. For every P

_q^θⁿ

, we consider the associated polynomial

Q

^θ_qⁿ

(t) = C

⁽ⁿ⁾

P

_q^θⁿ

(x) − P

q^θⁿ

(t) x − t

where C

⁽ⁿ⁾

acts on x.

Lemma 5.1. If Q

^θ_kⁿ

is associated with the polynomial P

_k^θⁿ

of degree k, then

Q

^θ_kⁿ

(t) = X

m i=0

t

ⁱ

C

^(n−i−1)

(P

_k^θⁿ

(x))

where C

^(n−i−1)

acts on x and m = n + k − 1 − θ

ⁿ

(0). Q

^θ_kⁿ

(t) has degree m if m ≥ 0, otherwise Q

^θ_kⁿ

(t) = 0.

P r o o f. Q

^θ_kⁿ

(t) is equal to C

⁽ⁿ⁾

[(P

_k^θⁿ

(x) − P

k^θⁿ

(t))/(x − t)]. By using the equality

1/(x − t) = x

⁻¹

X

∞ i=0

(x

⁻¹

t)

ⁱ

, we prove that

Q

^θ_kⁿ

(t) = C

⁽ⁿ⁾

[P

_k^θⁿ

(x) − P

k^θⁿ

(t)]x

⁻¹

n+k−1

X

i=0

(x

⁻¹

t)

ⁱ

. Finally, since c

_i

= 0 for i < θ(0), we obtain the result of the lemma.

5.2. The reciprocal orthogonal polynomials. We consider the reciprocal series g of t

⁻^θ(0)

f defined by t

⁻^θ(0)

f (t)g(t) = 1. We set g(t) = P

∞

i=0

d

_i

t

ⁱ

and we define a functional D

⁽ⁿ⁾

on C[X] by D

⁽ⁿ⁾

(x

ⁱ

) = d

n+i

for i ∈ N. D

⁽ⁿ⁾

is called the reciprocal functional of C

⁽ⁿ⁾

. By convention, we set d

_i

= c

_i

= 0 if i < 0. Let η

_n

be the permutation associated with the functional D

⁽ⁿ⁾

; it is called the reciprocal permutation of θ

n

. We remark that the definition of D

⁽⁰⁾

= D implies η(0) = η

0

(0) = 0. We will find later a relation which gives us the permutation η

_n

from θ

_n

. The complex numbers d

_i

are obtained from the equations

c

θ(0)

d

0

= 1, c

θ(0)

d

_j

+ c

θ(0)+1

d

j−1

+ . . . + c

θ(0)+j

d

0

= 0 for j = 1, 2, . . . An orthogonal polynomial with respect to D

⁽ⁿ⁾

is called reciprocal. We denote by {R

^ηiⁿ

}

i

the family of all these reciprocal orthogonal polynomials.

They are useful for the recursive computation of numerators of Pad´e ap-

(18)

proximants. In the following theorem, we study the connection between the polynomials of the two families {P

i^θⁿ

}

i,n

and {R

^ηiⁿ

}

i,n

.

Theorem 5.1. If one of the polynomials P

_k^θ^θ(0)+n+1

and R

^η_n+k⁻ⁿ⁺¹

is regular and orthogonal, then so is the other. The same holds for P

_n+k^θ^θ^(0)−n+1

and R

_k^ηⁿ⁺¹

. If P

_k^θ^θ(0)+n+1

and P

_n+k^θ^θ^(0)−n+1

are regular and orthogonal, then

S

_n+k^η⁻ⁿ⁺¹

= d

0

P

_k^θ^θ(0)+n+1

, Q

^θ_n+k^θ(0)−n+1

= c

θ(0)

R

^η_kⁿ⁺¹

, n = 1, 2, . . . ,

 

 

 

 



c

_θ(0)

R

^η_n+k⁻ⁿ⁺¹

= P

_k^θ^θ(0)+n+1

X

n i=0

c

_θ(0)+i

x

ⁿ⁻ⁱ

+ Q

^θ_k^θ(0)+n+1

,

d

0

P

_n+k^θ^θ(0)−n+1

= R

^η_kⁿ⁺¹

X

n

i=0

d

_i

x

ⁿ⁻ⁱ

+ S

_k^ηⁿ⁺¹

,

n = 0, 1, . . .

P r o o f. It is sufficient to remark that f

θ(0)

is the reciprocal series of g (this means that C

^(θ(0))

is the reciprocal functional of D) and then use the results of [5, 16]. When θ(0) = 0, the proof given in [5, 4] of the equalities of this theorem is long. It consists in transforming the determinants of the explicit expressions of the orthogonal polynomials. A simple proof is obtained by using only Lemma 5.1 (see the proof of Theorem 5.2).

From this theorem, it is clear that, for a fixed integer n, R

_n+k^η⁻ⁿ⁺¹

, S

_n+k^η⁻ⁿ⁺¹

, P

_k^θ^θ(0)+n+1

and Q

^θ_k^θ(0)+n+1

satisfy the same recurrence relations with different initializations. The same holds for P

_n+k^θ^θ^(0)−n+1

, Q

^θ_n+k^θ^(0)−n+1

, R

^η_kⁿ⁺¹

and S

_k^ηⁿ⁺¹

. If we set, for every n, k ∈ N, N

k^ηⁿ⁺²

= c

_θ(0)

R

^η_k⁻ⁿ

and N

_k^η⁻ⁿ⁺¹

= c

θ(0)

R

^η_kⁿ⁺¹

, then the Pad´e approximant [p/q]

_f

can be written as [p/q]

_f

= N e

p^η^p−q+1

/ e P

^θq^{θ(0)+p−q+1}

whenever P

q^θ^{θ(0)+p−q+1}

is regular (see [5]).

We deduce from the preceding results that whether or not there are blocks in the Pad´e table, the numerator of each Pad´e approximant can be computed recursively by using the recurrence relations satisfied by the denominators.

Corollary 5.1. The permutations η

_n

are connected to θ

_n

by the following relations:

η

−n+1

(i) = θ

_θ(0)+n+1

(i − n) + n,

θ

_θ(0)−n+1

(i) = η

_n+1

(i − n) + n for i ≥ n, n ≥ 1,

θ

θ(0)−n+1

(i) = η

−n+1

(i) = n − 1 − i for i = 0, 1, . . . , n − 1.

P r o o f. The knowledge of the degrees of all the regular orthogonal

polynomials implies that of η

_n

and θ

_n

, see Theorems 3.1 and 3.2. So, from

the definition of the permutations θ

n

, η

n

and by using Theorem 5.1, we get

the assertion.

(19)

The quantities Q

^θ_n+k^θ(0)−n+1

and P

_k^θ^θ(0)+n+1

P

n

i=0

c

θ(0)+i

x

ⁿ⁻ⁱ

+ Q

^θ_k^θ(0)+n+1

intervene in the recursive computation of the numerators of the Pad´e approximants. These quantities do not satisfy the equalities of Theorem 5.1 when P

_k^θ^θ(0)+n+1

and P

_n+k^θ^θ^(0)−n+1

are not orthogonal. For this reason, we give some properties of them below. For every n ∈ Z and k ∈ N, we consider the monic polynomials R

^′_k^ηⁿ

defined by

(18) D

⁽ⁿ⁾

(R

^′_k^ηⁿ

t

^ηⁿ^(j)

) + α

n,k

d

_η_n(j)−ηn(k)

= 0, j = 0, . . . , k − 1, where α

_n,k

is a constant such that the solution R

_k^′^ηⁿ

of (18) is monic. The role of these polynomials is to replace the polynomials R

^η_kⁿ

in the equalities of Theorem 5.1. By substituting R

^′_k^ηⁿ

for R

^η_kⁿ

, the results of Theorem 5.1 are true even if P

_k^θ^θ(0)+n+1

and P

_n+k^θ^θ(0)−n+1

are not orthogonal. Clearly, the family {R

^′k^ηⁿ

}

n,k

is built in such a way that it contains all the regular orthogonal polynomials with respect to the functional D

⁽ⁿ⁾

. From the definition of R

_k^′ηⁿ

, we can easily see that the condition for their existence and unicity is the same as for the polynomials R

^η_kⁿ

. Therefore, for every k and n, the polynomial R

^′_k^ηⁿ

exists, is unique and has degree k.

Theorem 5.2. We have

Q

^θ_n+k^θ^(0)−n+1

= c

_θ(0)

R

^′_k^ηⁿ⁺¹

, S

_n+k^′^η⁻ⁿ⁺¹

= d

0

P

_k^θ^θ(0)+n+1

, n = 1, 2, . . . ,

 

 

 

 



c

_θ(0)

R

^′_n+k^η⁻ⁿ⁺¹

= P

_k^θ^θ(0)+n+1

X

n i=0

c

_θ(0)+i

x

ⁿ⁻ⁱ

+ Q

^θ_k^θ(0)+n+1

,

d

0

P

_n+k^θ^θ(0)−n+1

= R

^′_k^ηⁿ⁺¹

X

n

i=0

d

_i

x

ⁿ⁻ⁱ

+ S

_k^′^ηⁿ⁺¹

,

n = 0, 1, . . .

P r o o f. We want to prove Q

^θ_n+k^θ^(0)−n+1

= c

_θ(0)

R

^′_k^ηⁿ⁺¹

. First assume that θ(0) = 0. In this case, thanks to Lemma 5.1, we have, for j = 0, 1, . . . , k − 1,

D

⁽ⁿ⁺¹⁾

[Q

^θ_n+k⁻ⁿ⁺¹

(t)t

^ηⁿ⁺¹^(j)

] = D

⁽ⁿ⁺¹⁾

h X

^k

i=0

t

^i+ηⁿ⁺¹^(j)

C

⁽⁻ⁿ⁻ⁱ⁾

(P

_n+k^θ⁻ⁿ⁺¹

(x)) i

=

n+k

X

l=0

a

_l

X

k

i=0

d

_i+η_n+1_(j)+n+1

c

−n−i+l

where D

⁽ⁿ⁺¹⁾

acts on t, C

⁽⁻ⁿ⁻ⁱ⁾

acts on x and P

_n+k^θ⁻ⁿ⁺¹

(x) = P

n+k l=0

a

l

x

^l

. Since θ(0) is zero, Corollary 5.1 implies θ

−n+1

(j + n) = η

n+1

(j) + n, and we conclude that

D

⁽ⁿ⁺¹⁾

[Q

^θ_n+k⁻ⁿ⁺¹

(t)t

^ηⁿ⁺¹^(j)

] =

n+k

X

l=0

a

_l

X

l−n i=0

d

_i+θ

−n+1(j+n)+1

c

−n−i+l