Biconjugate gradient stabilized method

In numerical linear algebra, the biconjugate gradient stabilized method, often abbreviated as BiCGSTAB, is an iterative method developed by H. A. van der Vorst for the numerical solution of nonsymmetric linear systems. It is a variant of the biconjugate gradient method (BiCG) and has faster and smoother convergence than the original BiCG as well as other variants such as the conjugate gradient squared method (CGS). It is a Krylov subspace method.

Algorithmic steps

Unpreconditioned BiCGSTAB

To solve a linear system $Ax = b$, BiCGSTAB starts with an initial guess $x_0$ and proceeds as follows:

$r_0 = b - Ax_0$
Choose an arbitrary vector $\hat{r}_0$ such that $(\hat{r}_0, r_0) \neq 0$, e.g., $\hat{r}_0 = r_0$
$\rho_0 = \alpha = \omega_0 = 1$
$v_0 = p_0 = 0$
For $i = 1, 2, 3, \dots$
1. $\rho_i = (\hat{r}_0, r_{i-1})$
2. $\beta = (\rho_i / \rho_{i-1})(\alpha / \omega_{i-1})$
3. $p_i = r_{i-1} + \beta (p_{i-1} - \omega_{i-1} v_{i-1})$
4. $v_i = Ap_i$
5. $\alpha = \rho_i / (\hat{r}_0, v_i)$
6. $s = r_{i-1} - \alpha v_i$
7. If $\|s\|$ is sufficiently small, then set $x_i = x_{i-1} + \alpha p_i$ and quit
8. $t = As$
9. $\omega_i = (t, s)/(t, t)$
10. $x_i = x_{i-1} + \alpha p_i + \omega_i s$
11. If $x_i$ is accurate enough, then quit
12. $r_i = s - \omega_i t$
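The steps above can be sketched in a few lines of pure Python. This is a minimal illustration rather than a reference implementation: the helper names (`dot`, `matvec`, `axpy`), the dense list-of-lists matrix storage, the relative-residual stopping test, and the default tolerance are all assumptions made for the example. "Accurate enough" in step 11 is interpreted here as a small residual norm, so the residual of step 12 is computed before that test.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def matvec(A, x):
    return [dot(row, x) for row in A]

def axpy(a, x, y):
    """Return a*x + y for scalar a and vectors x, y."""
    return [a * xi + yi for xi, yi in zip(x, y)]

def bicgstab(A, b, x0=None, tol=1e-10, max_iter=200):
    """Unpreconditioned BiCGSTAB sketch; returns (x, iterations used)."""
    n = len(b)
    x = list(x0) if x0 is not None else [0.0] * n
    r = axpy(-1.0, matvec(A, x), b)            # r_0 = b - A x_0
    r_hat = list(r)                            # shadow residual, r_hat_0 = r_0
    rho_old = alpha = omega = 1.0
    v = [0.0] * n
    p = [0.0] * n
    b_norm = math.sqrt(dot(b, b)) or 1.0
    if math.sqrt(dot(r, r)) <= tol * b_norm:
        return x, 0
    for i in range(1, max_iter + 1):
        rho = dot(r_hat, r)                            # step 1
        if rho == 0.0 or omega == 0.0:
            raise ArithmeticError("BiCGSTAB breakdown")
        beta = (rho / rho_old) * (alpha / omega)       # step 2
        p = axpy(beta, axpy(-omega, v, p), r)          # step 3
        v = matvec(A, p)                               # step 4
        alpha = rho / dot(r_hat, v)                    # step 5
        s = axpy(-alpha, v, r)                         # step 6
        if math.sqrt(dot(s, s)) <= tol * b_norm:       # step 7
            return axpy(alpha, p, x), i
        t = matvec(A, s)                               # step 8
        omega = dot(t, s) / dot(t, t)                  # step 9
        x = axpy(omega, s, axpy(alpha, p, x))          # step 10
        r = axpy(-omega, t, s)                         # step 12
        if math.sqrt(dot(r, r)) <= tol * b_norm:       # step 11 (residual test)
            return x, i
        rho_old = rho
    return x, max_iter

# Small nonsymmetric example; b is chosen so that the solution is (1, 2, 3).
A = [[4.0, 1.0, 0.0], [2.0, 5.0, 1.0], [0.0, 1.0, 3.0]]
b = [6.0, 15.0, 11.0]
x, iterations = bicgstab(A, b)
```

Only two matrix-vector products with $A$ are needed per iteration, and no products with $A^T$, which is visible in the single calls to `matvec` at steps 4 and 8.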

Preconditioned BiCGSTAB

Preconditioners are usually used to accelerate convergence of iterative methods. To solve a linear system $Ax = b$ with a preconditioner $K = K_1 K_2 \approx A$, preconditioned BiCGSTAB starts with an initial guess $x_0$ and proceeds as follows:

$r_0 = b - Ax_0$
Choose an arbitrary vector $\hat{r}_0$ such that $(\hat{r}_0, r_0) \neq 0$, e.g., $\hat{r}_0 = r_0$
$\rho_0 = \alpha = \omega_0 = 1$
$v_0 = p_0 = 0$
For $i = 1, 2, 3, \dots$
1. $\rho_i = (\hat{r}_0, r_{i-1})$
2. $\beta = (\rho_i / \rho_{i-1})(\alpha / \omega_{i-1})$
3. $p_i = r_{i-1} + \beta (p_{i-1} - \omega_{i-1} v_{i-1})$
4. $y = K^{-1} p_i$
5. $v_i = Ay$
6. $\alpha = \rho_i / (\hat{r}_0, v_i)$
7. $s = r_{i-1} - \alpha v_i$
8. If $\|s\|$ is sufficiently small, then set $x_i = x_{i-1} + \alpha y$ and quit (since $s = r_{i-1} - \alpha A y$, the iterate whose residual is $s$ is $x_{i-1} + \alpha y$)
9. $z = K^{-1} s$
10. $t = Az$
11. $\omega_i = (K_1^{-1} t, K_1^{-1} s)/(K_1^{-1} t, K_1^{-1} t)$
12. $x_i = x_{i-1} + \alpha y + \omega_i z$
13. If $x_i$ is accurate enough, then quit
14. $r_i = s - \omega_i t$

This formulation is equivalent to applying unpreconditioned BiCGSTAB to the explicitly preconditioned system

$\tilde{A}\tilde{x} = \tilde{b}$

with $\tilde{A} = K_1^{-1} A K_2^{-1}$, $\tilde{x} = K_2 x$ and $\tilde{b} = K_1^{-1} b$. In other words, both left- and right-preconditioning are possible with this formulation.
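As an illustration, the preconditioned iteration can be sketched with a Jacobi (diagonal) preconditioner, taking $K_1 = \mathrm{diag}(A)$ and $K_2 = I$ so that $K^{-1}$ and $K_1^{-1}$ coincide. The function and helper names, the callables `apply_K_inv` and `apply_K1_inv`, the zero initial guess, and the residual-based stopping test are assumptions made for this sketch, not part of the method's definition.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def matvec(A, x):
    return [dot(row, x) for row in A]

def axpy(a, x, y):
    """Return a*x + y for scalar a and vectors x, y."""
    return [a * xi + yi for xi, yi in zip(x, y)]

def norm(u):
    return math.sqrt(dot(u, u))

def bicgstab_prec(A, b, apply_K_inv, apply_K1_inv, tol=1e-10, max_iter=200):
    """Preconditioned BiCGSTAB sketch with x_0 = 0; returns (x, iterations)."""
    n = len(b)
    x = [0.0] * n
    r = list(b)                                   # r_0 = b - A x_0 with x_0 = 0
    r_hat = list(r)                               # r_hat_0 = r_0
    rho_old = alpha = omega = 1.0
    v = [0.0] * n
    p = [0.0] * n
    b_norm = norm(b)
    if b_norm == 0.0:
        return x, 0                               # b = 0 is solved by x = 0
    for i in range(1, max_iter + 1):
        rho = dot(r_hat, r)
        beta = (rho / rho_old) * (alpha / omega)
        p = axpy(beta, axpy(-omega, v, p), r)
        y = apply_K_inv(p)                        # y = K^{-1} p_i
        v = matvec(A, y)                          # v_i = A y
        alpha = rho / dot(r_hat, v)
        s = axpy(-alpha, v, r)
        if norm(s) <= tol * b_norm:
            # s = r - alpha*A*y, so the matching iterate is x + alpha*y
            return axpy(alpha, y, x), i
        z = apply_K_inv(s)                        # z = K^{-1} s
        t = matvec(A, z)                          # t = A z
        k1t, k1s = apply_K1_inv(t), apply_K1_inv(s)
        omega = dot(k1t, k1s) / dot(k1t, k1t)
        x = axpy(omega, z, axpy(alpha, y, x))
        r = axpy(-omega, t, s)
        if norm(r) <= tol * b_norm:
            return x, i
        rho_old = rho
    return x, max_iter

A = [[4.0, 1.0, 0.0], [2.0, 5.0, 1.0], [0.0, 1.0, 3.0]]
b = [6.0, 15.0, 11.0]

def jacobi(vec):
    """Apply diag(A)^{-1}; here K_1 = diag(A) and K_2 = I."""
    return [vi / A[k][k] for k, vi in enumerate(vec)]

x, iterations = bicgstab_prec(A, b, jacobi, jacobi)
```

Passing the preconditioner as callables keeps the sketch independent of how $K^{-1}$ is applied; in practice this would typically be a triangular solve or an incomplete factorization rather than an explicit inverse.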

Derivation

BiCG in polynomial form

In BiCG, the search directions $p_i$ and $\hat{p}_i$ and the residuals $r_i$ and $\hat{r}_i$ are updated using the following recurrence relations:

$p_i = r_{i-1} + \beta_i p_{i-1}$,

$\hat{p}_i = \hat{r}_{i-1} + \beta_i \hat{p}_{i-1}$,

$r_i = r_{i-1} - \alpha_i A p_i$,

$\hat{r}_i = \hat{r}_{i-1} - \alpha_i A^T \hat{p}_i$.

The constants $\alpha_i$ and $\beta_i$ are chosen to be

$\alpha_i = \rho_i / (\hat{p}_i, A p_i)$,

$\beta_i = \rho_i / \rho_{i-1}$

where $\rho_i = (\hat{r}_{i-1}, r_{i-1})$ so that the residuals and the search directions satisfy biorthogonality and biconjugacy, respectively, i.e., for $i \neq j$,

$(\hat{r}_i, r_j) = 0$,

$(\hat{p}_i, A p_j) = 0$.
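The biorthogonality property can be checked numerically on a small example. The sketch below runs two BiCG steps using exactly the recurrences above and collects the residuals; the function name `bicg_residuals`, the zero initial guess, and the choice $\hat{r}_0 = r_0$ are assumptions for the illustration.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def matvec(A, x):
    return [dot(row, x) for row in A]

def axpy(a, x, y):
    """Return a*x + y for scalar a and vectors x, y."""
    return [a * xi + yi for xi, yi in zip(x, y)]

def bicg_residuals(A, b, steps):
    """Run `steps` BiCG iterations from x_0 = 0; return residual histories."""
    n = len(b)
    At = [list(col) for col in zip(*A)]       # explicit transpose of A
    r, r_hat = list(b), list(b)               # r_0 = b, r_hat_0 = r_0
    p, p_hat = [0.0] * n, [0.0] * n           # p_0 = p_hat_0 = 0
    rho_old = 1.0
    rs, r_hats = [r], [r_hat]
    for _ in range(steps):
        rho = dot(r_hat, r)                   # rho_i = (r_hat_{i-1}, r_{i-1})
        beta = rho / rho_old                  # beta_i (irrelevant for i = 1)
        p = axpy(beta, p, r)                  # p_i = r_{i-1} + beta_i p_{i-1}
        p_hat = axpy(beta, p_hat, r_hat)
        Ap = matvec(A, p)
        alpha = rho / dot(p_hat, Ap)          # alpha_i = rho_i / (p_hat_i, A p_i)
        r = axpy(-alpha, Ap, r)               # r_i = r_{i-1} - alpha_i A p_i
        r_hat = axpy(-alpha, matvec(At, p_hat), r_hat)
        rs.append(r)
        r_hats.append(r_hat)
        rho_old = rho
    return rs, r_hats

A = [[4.0, 1.0, 0.0], [2.0, 5.0, 1.0], [0.0, 1.0, 3.0]]
b = [6.0, 15.0, 11.0]
rs, r_hats = bicg_residuals(A, b, steps=2)

# Largest off-diagonal inner product (r_hat_i, r_j), i != j; this should be
# at rounding-error level if biorthogonality holds.
worst = max(abs(dot(r_hats[i], rs[j]))
            for i in range(3) for j in range(3) if i != j)
```

Note that, unlike BiCGSTAB, this iteration needs products with $A^T$ to advance the shadow sequence.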

It is straightforward to show that

$r_i = P_i(A) r_0$,

$\hat{r}_i = P_i(A^T) \hat{r}_0$,

$p_{i+1} = T_i(A) r_0$,

$\hat{p}_{i+1} = T_i(A^T) \hat{r}_0$

where $P_i(A)$ and $T_i(A)$ are $i$th-degree polynomials in $A$. These polynomials satisfy the following recurrence relations:

$P_i(A) = P_{i-1}(A) - \alpha_i A T_{i-1}(A)$,

$T_i(A) = P_i(A) + \beta_{i+1} T_{i-1}(A)$.

The second relation follows from $p_{i+1} = r_i + \beta_{i+1} p_i$, hence the plus sign.

Derivation of BiCGSTAB from BiCG

It is unnecessary to explicitly keep track of the residuals and search directions of BiCG. In other words, the BiCG iterations can be performed implicitly. In BiCGSTAB, one wishes to have recurrence relations for

$\tilde{r}_i = Q_i(A) P_i(A) r_0$

where $Q_i(A) = (I - \omega_1 A)(I - \omega_2 A) \cdots (I - \omega_i A)$ with suitable constants $\omega_j$, instead of $r_i = P_i(A) r_0$, in the hope that $Q_i(A)$ will enable faster and smoother convergence in $\tilde{r}_i$ than in $r_i$.

It follows from the recurrence relations for $P_i(A)$ and $T_i(A)$ and the definition of $Q_i(A)$ that

$Q_i(A) P_i(A) r_0 = (I - \omega_i A)(Q_{i-1}(A) P_{i-1}(A) r_0 - \alpha_i A Q_{i-1}(A) T_{i-1}(A) r_0)$,

which entails the necessity of a recurrence relation for $Q_i(A) T_i(A) r_0$. This can also be derived from the BiCG relations:

$Q_i(A) T_i(A) r_0 = Q_i(A) P_i(A) r_0 + \beta_{i+1} (I - \omega_i A) Q_{i-1}(A) T_{i-1}(A) r_0$.

Similarly to defining $\tilde{r}_i$, BiCGSTAB defines

$\tilde{p}_{i+1} = Q_i(A) T_i(A) r_0$.

Written in vector form, the recurrence relations for $\tilde{p}_i$ and $\tilde{r}_i$ are

$\tilde{p}_i = \tilde{r}_{i-1} + \beta_i (I - \omega_{i-1} A) \tilde{p}_{i-1}$,

$\tilde{r}_i = (I - \omega_i A)(\tilde{r}_{i-1} - \alpha_i A \tilde{p}_i)$.

To derive a recurrence relation for $x_i$, define

$s_i = \tilde{r}_{i-1} - \alpha_i A \tilde{p}_i$.

The recurrence relation for $\tilde{r}_i$ can then be written as

$\tilde{r}_i = \tilde{r}_{i-1} - \alpha_i A \tilde{p}_i - \omega_i A s_i$,

which corresponds to

$x_i = x_{i-1} + \alpha_i \tilde{p}_i + \omega_i s_i$.
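The correspondence between the residual and iterate updates can be spelled out in one short step, assuming (as is usual, though not stated explicitly above) that each residual is defined by $\tilde{r}_i = b - A x_i$:

```latex
\begin{align*}
\tilde{r}_i - \tilde{r}_{i-1}
  &= (b - A x_i) - (b - A x_{i-1}) = -A\,(x_i - x_{i-1}), \\
\tilde{r}_i - \tilde{r}_{i-1}
  &= -A\,(\alpha_i \tilde{p}_i + \omega_i s_i)
  \quad\text{(from the recurrence for } \tilde{r}_i\text{)}.
\end{align*}
```

Comparing the two right-hand sides, the choice $x_i - x_{i-1} = \alpha_i \tilde{p}_i + \omega_i s_i$ is consistent with the residual recurrence, and it is the only such choice when $A$ is nonsingular.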

Determination of BiCGSTAB constants

Now it remains to determine the BiCG constants $\alpha_i$ and $\beta_i$ and choose a suitable $\omega_i$.

In BiCG, $\beta_i = \rho_i / \rho_{i-1}$ with

$\rho_i = (\hat{r}_{i-1}, r_{i-1}) = (P_{i-1}(A^T) \hat{r}_0, P_{i-1}(A) r_0)$.

Since BiCGSTAB does not explicitly keep track of $\hat{r}_i$ or $r_i$, $\rho_i$ is not immediately computable from this formula. However, it can be related to the scalar

$\tilde{\rho}_i = (Q_{i-1}(A^T) \hat{r}_0, P_{i-1}(A) r_0) = (\hat{r}_0, Q_{i-1}(A) P_{i-1}(A) r_0) = (\hat{r}_0, \tilde{r}_{i-1})$.

Due to biorthogonality, $r_{i-1} = P_{i-1}(A) r_0$ is orthogonal to $U_{i-2}(A^T) \hat{r}_0$ where $U_{i-2}(A^T)$ is any polynomial of degree $i - 2$ in $A^T$. Hence, only the highest-order terms of $P_{i-1}(A^T)$ and $Q_{i-1}(A^T)$ matter in the dot products $(P_{i-1}(A^T) \hat{r}_0, P_{i-1}(A) r_0)$ and $(Q_{i-1}(A^T) \hat{r}_0, P_{i-1}(A) r_0)$. The leading coefficients of $P_{i-1}(A^T)$ and $Q_{i-1}(A^T)$ are $(-1)^{i-1} \alpha_1 \alpha_2 \cdots \alpha_{i-1}$ and $(-1)^{i-1} \omega_1 \omega_2 \cdots \omega_{i-1}$, respectively. It follows that

$\rho_i = (\alpha_1 / \omega_1)(\alpha_2 / \omega_2) \cdots (\alpha_{i-1} / \omega_{i-1}) \tilde{\rho}_i$,

and thus

$\beta_i = \rho_i / \rho_{i-1} = (\tilde{\rho}_i / \tilde{\rho}_{i-1})(\alpha_{i-1} / \omega_{i-1})$.

A simple formula for $\alpha_i$ can be similarly derived. In BiCG,

$\alpha_i = \rho_i / (\hat{p}_i, A p_i) = (P_{i-1}(A^T) \hat{r}_0, P_{i-1}(A) r_0) / (T_{i-1}(A^T) \hat{r}_0, A T_{i-1}(A) r_0)$.

Similarly to the case above, only the highest-order terms of $P_{i-1}(A^T)$ and $T_{i-1}(A^T)$ matter in the dot products thanks to biorthogonality and biconjugacy. It happens that $P_{i-1}(A^T)$ and $T_{i-1}(A^T)$ have the same leading coefficient. Thus, they can be replaced simultaneously with $Q_{i-1}(A^T)$ in the formula, which leads to

$\alpha_i = (Q_{i-1}(A^T) \hat{r}_0, P_{i-1}(A) r_0) / (Q_{i-1}(A^T) \hat{r}_0, A T_{i-1}(A) r_0) = \tilde{\rho}_i / (\hat{r}_0, A Q_{i-1}(A) T_{i-1}(A) r_0) = \tilde{\rho}_i / (\hat{r}_0, A \tilde{p}_i)$.

Finally, BiCGSTAB selects $\omega_i$ to minimize $\tilde{r}_i = (I - \omega_i A) s_i$ in the $2$-norm as a function of $\omega_i$. This is achieved when

$((I - \omega_i A) s_i, A s_i) = 0$,

giving the optimal value

$\omega_i = (A s_i, s_i) / (A s_i, A s_i)$.
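The minimizing property of $\omega_i$ is a one-dimensional least-squares problem and is easy to check numerically. The following sketch uses hypothetical helper names (`optimal_omega`, `stabilized_norm`) and an arbitrary test vector `s`; it compares the residual norm at the optimal value with the norm at nearby values and evaluates the orthogonality condition.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def matvec(A, x):
    return [dot(row, x) for row in A]

def axpy(a, x, y):
    """Return a*x + y for scalar a and vectors x, y."""
    return [a * xi + yi for xi, yi in zip(x, y)]

def optimal_omega(A, s):
    """omega = (As, s) / (As, As), the minimizer of ||(I - omega*A) s||_2."""
    t = matvec(A, s)
    return dot(t, s) / dot(t, t)

def stabilized_norm(A, s, omega):
    """2-norm of (I - omega*A) s."""
    r = axpy(-omega, matvec(A, s), s)
    return math.sqrt(dot(r, r))

A = [[4.0, 1.0, 0.0], [2.0, 5.0, 1.0], [0.0, 1.0, 3.0]]
s = [1.0, -2.0, 0.5]                     # arbitrary vector for illustration

omega = optimal_omega(A, s)
best = stabilized_norm(A, s, omega)
nearby = [stabilized_norm(A, s, omega + d) for d in (-0.1, -0.01, 0.01, 0.1)]

# Orthogonality condition ((I - omega*A) s, A s); expected to be ~0.
t = matvec(A, s)
orthogonality = dot(axpy(-omega, t, s), t)
```

Every entry of `nearby` should be at least as large as `best`, since the squared norm is a convex quadratic in $\omega_i$ with its vertex at the optimal value.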

Generalization

BiCGSTAB can be viewed as a combination of BiCG and GMRES in which each BiCG step is followed by a GMRES($1$) step (i.e., GMRES restarted at each step). The GMRES($1$) step is intended to repair the irregular convergence behavior of CGS, the method that BiCGSTAB was developed to improve upon. However, due to the use of degree-one minimum residual polynomials, such repair may not be effective if the matrix $A$ has large complex eigenpairs. In such cases, BiCGSTAB is likely to stagnate, as confirmed by numerical experiments.

One may expect that higher-degree minimum residual polynomials handle this situation better. This gives rise to algorithms including BiCGSTAB2^[1] and the more general BiCGSTAB($l$)^[2]. In BiCGSTAB($l$), a GMRES($l$) step follows every $l$ BiCG steps. BiCGSTAB2 is equivalent to BiCGSTAB($l$) with $l = 2$.

References

[1]

[2]
