Skip to main content
  • Research article
  • Open access
  • Published:

A posteriori error analysis of stabilised FEM for degenerate convex minimisation problems under weak regularity assumptions

Abstract

Background

The discretisation of degenerate convex minimisation problems experiences numerical difficulties with a singular or nearly singular Hessian matrix.

Methods

Some discrete analog of the surface energy in microstrucures is added to the energy functional to define a stabilisation technique.

Results

This paper proves (a) strong convergence of the stress even without any smoothness assumption for a class of stabilised degenerate convex minimisation problems. Given the limitted a priori error control in those cases, the sharp a posteriori error control is of even higher relevance. This paper derives (b) guaranteed a posteriori error control via some equilibration technique which does not rely on the strict Galerkin orthogonality of the unperturbed problem. In the presence of L2 control in the original minimisation problem, some realistic model scenario with piecewise smooth exact solution allows for strong convergence of the gradients plus refined a posteriori error estimates. This paper presents (c) an improved a posteriori error control in this interface problem and so narrows the efficiency reliability gap.

Conclusions

Numerical experiments illustrate the theoretical convergence rates for uniform and adaptive mesh-refinements and the improved a posteriori error control for four benchmark examples in the computational microstructures.

Background

Infimising sequences of variational problems with non-quasiconvex energy densities, in general, develop finer and finer oscillations with no classical limit in Sobolev function spaces called microstructure [16]. Those oscillations cause difficulty to numerical methods because fine grids are necessary to resolve such oscillations which results in ineffective and tricky mesh-depending computations. Strong convergence of gradients of infimising sequences of the non-quasiconvex problem is impossible.

Relaxation techniques replace the nonconvex energy density by its (semi-)convex hull and lead to a macroscopic model. Since the convexified energy density obtained by this method, in general, lacks strict convexity, numerical algorithms might encounter situations where the Hessian matrix is singular. For instance, the Newton minimisation algorithm fails on the convexified three-well problem of Subsection ‘Three-well benchmark’ below. Applications of relaxation techniques include models in computational microstructure [57], some optimal design problems [8, 9], the nonlinear Laplacian [10] (where the Hessian can become arbitrarily ill-conditioned in spite of its strict convexity) and elastoplasticity [1].

Stabilisation techniques regularise the energy term by an additional positive semidefinite stabilisation function. The paper [11] discusses several choices of such stabilisation functions for P1 conforming finite elements and quasiuniform meshes. It turns out that stabilisation can ensure strong convergence of the strain approximations under particular circumstances. A particular stabilisation in [12] leads to strong convergence even on unstructured grids but is still restricted to unrealistically smooth solutions. This paper studies the stabilisation technique of [12] and addresses the question of convergence (i) without extra regularity assumptions, (ii) in a realistic scenario called model interface problem, and (iii) establishes an a posteriori error control.

The stabilisation leads to improved condition numbers of the Hessian matrix and to reduced errors if the numerical solvers fail without stabilisation. Figure 1 shows the convergence of the discrete stress σ of the three-well benchmark corresponding to the discrete minimisers of the energy E ( v )=E( v )+C/2 | v | 2 . The errors are plotted for computations with uniform mesh refinements with various solver tolerances in the discrete minimisation procedure at a fixed triangulation and values of C, cf. Section ‘Numerical experiments’ for details on the MATLAB implementation. Without stabilisation, the convergence stagnates with a moderate tolerance of 10-5 which becomes visible as a “plateau” in Figure 1. The Newton solver even aborts prematurely due to the singular Hessian. In conclusion, stabilisation enables higher accuracies in numerical examples.

Figure 1
figure 1

Impact of the stabilisation on the error. Error of the stabilised stress σ with coefficients C = 0, 10-4,1 of the stabilisation and various tolerances tol = 10-5, 10-6, 10-8 of the Newton solver.

For β0 the convex energy functional assumes the form

E(v):= Ω W ( D v ( x ) ) + β | v ( x ) - g ( x ) | 2 - f ( x ) · v ( x ) dx.
(1.1)

Assume that W is convex with quadratic growth so that there exist minimisers u H 0 1 (Ω); below p-th order growth is included while p = 2 throughout this simplifying introduction. Given a sequence of shape-regular triangulations ( T ) N 0 [13], let u minimise the stabilised discrete energy

E ( v ) : = E ( v ) + 1 2 | v | 2 with | v | 2 : = H 2 F F ( Ω ) h F - 1 || D v F || L 2 ( F ) 2

amongst all conforming P1 finite element functions v on T , where [Dv ] F is the jump of the gradient Dv along the interior side F, written F F (Ω), and H  := maxT h T is the maximal diameter h T of all simplices T T .

Section ‘Global convergence’ verifies the strong convergence of the discrete solution u and its stress σ  := DW(Du ) to their respective continuous conterparts,

|| σ - σ | | L 2 ( Ω ) 2 + β || u - u | | L 2 ( Ω ) 2 + | u | 2 0 as ∞.

Section ‘A posteriori error estimates’ presents a novel application of [1417] to nonlinear problems. For the L2 projection Π onto the space of piecewise P0 functions, any Raviart-Thomas function τ RT 0 ( T ) satisfies

|| σ - σ | | L 2 ( Ω ) 2 || σ - τ || L 2 ( Ω ) + || Π Λ + div τ || L 2 ( Ω ) + osc , 2 ( Λ ) || u - u || H 1 ( Ω ) .

This error bound holds for any discrete displacement u that satisfies the boundary conditions; the point is that inexact solve is included — there is no Galerkin orthogonality required. The drawback is to minimise the expression on the right-hand side with respect to τ in order to obtain a sharp error bound. This is a particular selection: degenerate convex minimisation problems do not allow for a control of ||u- u | | H 1 ( Ω ) and may even face multiple exact or discrete solutions while the discrete minimum of E is unique. However, in some results of this paper, either W or the lower-order terms lead to some control over ||u- u | | H 2 ( Ω ) and the selection via stabilisation is correct.

Phase transition problems motivate the investigation of scenarios with a smooth solution u up to a one-dimensional interface Γ Ω ¯ [18]. Section ‘Refined analysis for an interface model problem’ proves that such problems allow even for strong convergence of the gradients for any unique solution u in W1,(Ω)∩H2(ΩΓ) [19]. This result also leads to an improvement of the a posteriori error control of the discrete stresses and narrows the efficiency-reliability gap; the efficiency-reliability gap is the difference of the convergence rates of the guaranteed upper a posteriori error bound and the guaranteed lower a posteriori error bound.

Section ‘Numerical experiments’ complements the theoretical findings with numerical experiments to provide empirical evidence of the improved error control. The stabilisation technique competes in four benchmark examples, with and without known exact solution, for uniform and two different mesh-refining algorithms for the explicit residual-based error estimator of [7] and with an averaging-type error estimator of ([18], (1.11)).

Standard notation on Lebesgue and Sobolev spaces is employed throughout this paper and ab abbreviates a ≤ C b with some generic constant 0 < C <  independent of crucial parameters (like the mesh-size on level ); a ≈ b means aba. Furthermore, A:B abbreviates the matrix inner product that corresponds to the Frobenius norm.

Methods: Discretisation and Stabilisation

Based on the convergence results for unstructured grids, this paper will develop reliable error estimators for a class of stabilised convex minimisation problems described in the sequel. Let Ω R n be a bounded Lipshitz domain with polygonal boundary for n = 2 or 3. Given a continuous convex energy density W: R m × n R, g,f L 2 (Ω; R m ), β0, and v W 1 , p (Ω; R m ) with 2 ≤ p <  and m = 1, …, n, the energy is given by (1.1).

Throughout this paper, the energy density W C 1 ( R m × n ;R) satisfies (2.1)–(2.2) for parameters 1 < r ≤ 2, 0 ≤ s <  and s + r + p ≤ r p. The two-sided growth condition reads

|F | p -1W(F)|F | p +1for allF R m × n .
(2.1)

The convexity control assumption reads, for all F 1 , F 2 R m × n ,

|DW( F 1 )-DW( F 2 ) | r 1 + | F 1 | s + | F 2 | s D W ( F 1 ) - D W ( F 2 ) : F 1 - F 2 .
(2.2)

The proof of Theorem 2 in [7] shows that (2.2) is crucial for the uniqueness of the stress tensor DW(Du).

Given Dirichlet data u D W 2 , p (Ω; R m ) H 2 (∂Ω; R m ) for the set of admissible functions A:= u D +V:= u D + W 0 1 , p (Ω; R m ), the continuous (convex) model problem reads

minimise E ( v ) within v A .
(2.3)

A finite element approximation of (2.3) is based on a family of regular triangulations ( T ) N 0 of the domain Ω into simplices in the sense of Ciarlet [13] (e.g., for n = 2, two non-disjoint triangles of T share either a common edge or a common node). The set of sides F consists of edges (for n = 2) or faces (for n = 3) of T and is split into the union of the sets of all interiour sides F (Ω) and of all boundary sides F (∂Ω).

For latter reference, define the diameter h T :=diamT of a triangle (or tetrahedron) T T and the size h F  := diamF of a side F F . The mesh size function h :Ω R > 0 is given by

h ( x ) : = h T for x int T T , min h F : F F and x F otherwise.

The global mesh size will be abbreviated by H := h L ( Ω ) . We presume the family ( T ) N 0 to be shape-regular so that h F  ≈ h T for all T T , F F and FT.

The space of T -piecewise polynomials of degree k N 0 is P k ( T ). The nodal interpolation I w P 1 ( T )C( Ω ¯ ) of wC( Ω ¯ ) is given by I w(z) = w(z) for all nodes z. Let furthermore Π w be the L2 projection of wL2(Ω) onto P 0 ( T ), and osc , q (w):= h (id- Π )w L q ( Ω ) be the oscillation of wLq(Ω) for 2 ≤ q ≤  with respect to the triangulation T . The symbol id denotes the identity operator. Let uD, = I u D , and

A : = u D , + V with V : = V P 1 ( T ; R m ) C ( Ω ¯ ) .

Given a function v on Ω which is possibly discontinuous along some side F F (Ω) shared by the two elements T± such that there exist traces from either sides, the jump of v along F reads

[ v ] ( x ) = [ v ] F ( x ) : = lim T + y x v ( y ) - lim T - y x v ( y ) for x F .

The stabilisation of [12] will be used throughout this paper with -1 < γ <  and

a (v,w):= F F ( Ω ) H 1 + γ h F F [ D v ] F : [ D w ] F dsand | v | 2 := a (v,v).
(2.4)

The stabilised discrete problem reads

minimise E (v):=E(v)+ 1 2 a (v,v)amongstv A .
(2.5)

Convergence of gradients with a guaranteed convergence rate is shown in [12] under unrealistically high regularity assumptions. A comprehensive collection of the results in [12] is summarised in the following theorem.

Theorem 2.1.

([12]) LetuA H 3 / 2 + ε (Ω; R m )be some solution of (2.3) for some ε > 0; let pand rbe the Hölder conjugate of p and r, -1 < γ < 3, and set

ζ : = min 1 + γ , r for β > 0 and ζ : = min 1 + γ , 2 for β = 0 .

Then the discrete solution u A of (2.5) and the continuous and discrete stress σ:=DW(Du) L p (Ω; R m × n ) and σ :=DW(D u ) P 0 ( T ; R m × n ) satisfy

σ - σ L p ( Ω ) r + u - u L 2 ( Ω ) 2 + | | | u | | | 2 + H ( 1 + γ ) / 2 D ( u - u ) L 2 ( Ω ) 2 H ζ .

Proof

This combines Lemma 3.5 and 4.1 – 4.2 plus Theorem 3.8 and 4.4 in [12].□

Global convergence

This section is devoted to the proof of a general convergence result without higher regularity assumptions. Let uA and u A solve the minimisation problem (2.3) and (2.5) and set σ := DW(Du) and σ  := DW(Du ). For the unstabilised approximation, the a priori error estimates of [7] plus a density argument prove convergence of

σ - σ L p ( Ω ) r + β u - u L 2 ( Ω ) 2 0 as H 0 .

Note that β = 0 is permitted. Then, however, uniqueness of u and convergence of u- u L 2 ( Ω ) 2 are guaranteed. The point in the following result is that the stabilised approximation converges as well as |||u |||  → 0 even for non-smooth or non-unique minimisers. Under special circumstances, uniqueness of u and the convergence u- u L 2 ( Ω ) 0 can be shown even for β = 0, e.g., in Example 3.3.

Theorem 3.1.

( Global Convergence) Provided lim H =0 it holds

σ - σ L p ( Ω ) r + β u - u L 2 ( Ω ) 2 + | | | u | | | 2 0 as ∞.

The proof is based on the following lemma.

Lemma 3.2.

The errors δ  := σ - σ and e  := u - u satisfy, for all v vV , that

δ L p ( Ω ) r + β e L 2 ( Ω ) 2 | e - v | W 1 , p ( Ω ) r + β e - v | L 2 ( Ω ) 2 + a ( u , v ) .

Proof

The minimisation problems (2.3) and (2.5) are equivalent to their respective Euler-Lagrange equations, namely for vV and v V ,

Ω σ ( x ) : D v ( x ) + 2 β ( u ( x ) - g ( x ) ) · v ( x ) - f ( x ) · v ( x ) dx=0;
(3.1)
Ω σ ( x ) : D v ( x ) + 2 β ( u ( x ) - g ( x ) ) · v ( x ) - f ( x ) · v ( x ) dx+ a ( u , v )=0.
(3.2)

Algebraic transformations of the difference of these two equations lead to

Ω δ : D e d x + 2 β e L 2 ( Ω ) 2 = Ω δ : D ( e - v ) + 2 β e · ( e - v ) d x + a ( u , v ) .

It is shown in ([12], Lemma 3.5) that

δ L p ( Ω ) r Ω δ :D e dx.
(3.3)

Two Hölder inequalities on the right-hand side and absorbtions of δ L p ( Ω ) and e L 2 ( Ω ) eventually conclude the proof. Further details are dropped for brevity.□

Proof of Theorem 3.1

Given any positive ε, the density of smooth functions in W 0 1 , p (Ω; R m ) leads to some v ε D(Ω; R m ) such that u- u D - v ε W 1 , p ( Ω ) ε. Hence v  := I (v ε  + u D ) - u V satisfies

e - v = ( u - u D - v ε ) + ( id - I ) ( v ε + u D ) .

Note that the nodal interpolation I (v ε  + u D ) is well-defined since v ε and u D are assumed to be smooth. With ([12], Lemma 3.1 – 3.2) it follows that

( id - I ) ( v ε + u D ) W 1 , p ( Ω ) H 0 and | | | I ( v ε + u D ) | | | 2 = | | | ( id - I ) ( v ε + u D ) | | | 2 H 1 + γ 0 as ∞.

Since · L 2 ( Ω ) · W 1 , p ( Ω ) , this yields some 0 N such that

| e - v | W 1 , p ( Ω ) r + β e - v L 2 2 ( Ω ) + | | | I ( v ε + u D ) | 2 ε for all 0 .

A Cauchy inequality applied to the stabilisation norm proves

a ( u , v ) = - | | | u | | | 2 + a ( u , I ( v ε + u D ) ) - 1 2 | u | 2 + 1 2 | I ( v ε + u D ) | 2 .

Substitute a (u , v ) in Lemma 3.2 and add 1 2 ||| u || | 2 on both sides. This leads to

δ L p ( Ω ) r + β e L 2 ( Ω ) 2 + | | | u | | | 2 ε for all 0 .

Example 3.3.

The two-well example from the computational benchmark [18] allows an estimate on e L 2 ( Ω ) even for β = 0. Let n = 2, let F 1 :=- F 2 :=(3,2)/ 13 , and let the energy density W be the convex hull of F |F - F1|2|F - F2|2. That is

W(F)= max 0 , | F | 2 - 1 2 +4 | F | 2 - ( 3 F ( 1 ) + 2 F ( 2 ) ) 2 / 13 .
(3.4)

Then ([11], Lemma 9.1) proves, for all v V , that

e L 2 ( Ω ) 2 Ω δ : D e d x + e - v H 1 ( Ω ) 2 .

Therefore, the arguments of Lemma 3.2 lead to

δ L p ( Ω ) r + e L 2 ( Ω ) 2 | e - v | W 1 , p ( Ω ) r + e - v H 1 ( Ω ) 2 + a ( u , v ) .

This result can be used in the proof of Theorem 3.1 in order to obtain

σ - σ L p ( Ω ) r + u - u L 2 ( Ω ) 2 + | | | u | | | 2 0 as ∞.

A posteriori error estimates

Beyond the a posteriori error analysis of [7], the additional stabilisation term in the discretisation of this paper causes an additional difficulty in that the Galerkin orthogonality does not hold for the natural residual. Inspired from novell developments in the a posteriori error control of elliptic PDEs motivated by inexact solve [1417], this section presents some guaranteed upper error bound for the discretisation at hand for any approximation u which does not necessarily satisfy (3.2) exactly. Thereby inexact solve is included.

Let uA solve (2.3) and let u A be arbitrary. It is not assumed that u solves the discrete problem (2.5); the following theorem holds regardless of this. Recall the definitions of osc,q(·) and Π from Section ‘Methods: Discretisation and Stabilisation’ and given σ := DW(Du) and σ  := DW(Du ), abbreviate

Λ : = - 2 β ( u - g ) + f , e : = u - u and δ : = σ - σ .

Theorem 4.1.

Given any w W 1 , p (Ω; R m )with w  = u - u on the boundary ∂ Ω, and given anyτH(div,Ω; R m × n ), it holds, for all 2 ≤ q ≤ p and for some constant ϰ known from ([12], Lemma 3.5), that

ϰ / 2 δ L p ( Ω ) r + β e L 2 ( Ω ) 2 r ϰ / 2 1 - r / r | w | W 1 , p ( Ω ) r + β w L 2 ( Ω ) 2 + σ - τ L q ( Ω ) + Π Λ + div τ L q ( Ω ) + osc , q ( Λ ) e - w W 1 , q ( Ω ) .

The constant ϰ depends on problem-specific data such as u W 1 , p ( Ω ) and the size of the domain Ω. Refer to the proof of Lemma 3.5 in [12] for details.

Before the proofs conclude this section, some practical choice of τ in Theorem 4.1 is discussed as some Raviart-Thomas finite element functions in

RT 0 ( T ) : = { τ RT P 1 ( T ) H ( div , Ω ) : T T a , b , c R x T , τ RT ( x ) = ( a , b ) + cx } .

We suggest the computation (or an accurate approximation) of

μ := min τ RT 0 ( T ) m σ - τ L q ( Ω ) + Π Λ + div τ L q ( Ω )
(4.1)

and emphasise that any upper bound is allowed in Theorem 4.1. This leads to

ϰ / 2 δ L p ( Ω ) r + β e L 2 ( Ω ) 2 r ϰ / 2 1 - r / r | w | W 1 , p ( Ω ) r + β w L 2 ( Ω ) 2 + μ + osc , q ( Λ ) e - w W 1 , q ( Ω ) .

The algorithm of ([20], Prop. 4.1) computes some w from (id-I )u D with

w L q ( T ) h T 1 / q ( id - I ) u D L q ( ∂T ∂Ω ) and D w L q ( T ) h T 1 / q - 1 ( id - I ) u D L q ( ∂T ∂Ω ) + h T 1 / q ( id - I ) u D / ∂s L q ( ∂T ∂Ω ) .
(4.2)

(The proof of the second assertion is analogous to that of ([20], Prop. 4.1) and the first is an immediate consequence of the design of w ). This and e - w W 1 , q ( Ω ) 1 for bounded u (i.e. solely u W 1 , p ( Ω ) 1 is assumed) lead to the practical estimate μ as a computable guaranteed upper bound of the left-hand side of Theorem 4.1. Since the minimisation of (4.1) is computationally intensive for q ≠ 2, Section ‘Numerical experiments’ actually computes an approximation of μ , based on q = 2.

The choice τ = σ in Theorem 4.1 shows that the right-hand side is in fact optimal up to oscillations. The reliability-efficiency gap of [18] is visible here in that we have no further estimate on u W 1 , p ( Ω ) [7, 18]. However, additional smoothness assumptions on u may lead to refined estimates on the term e - w W 1 , q ( Ω ) (cf. Section ‘Refined analysis for an interface model problem’). The following result indicates that μ is sharp in the sense that it converges with the correct convergence rate. This theorem employs the Fortin interpolation operator IF, defined for τH(div,Ω) L t (Ω; R n ) with t > 2 by I F , τ RT 0 ( T ) and

F n F · ( id - I F , ) τ d s = 0 for all F F .

Here and in the following, n F denotes a unit normal vector of the side F; the direction of n F arbitrary, but fixed for a given side F. For the improved regularity of stress in the class of degenerate convex minimisation problems at hand, we refer to [3, 21].

Theorem 4.2.

( Efficiency) If the exact stress σ is sufficiently regular such that its Fortin interpolant τ = I F , σ RT 0 ( T ; R m × n )is defined, it holds

σ - τ L q ( Ω ) + Π Λ + div τ L q ( Ω ) δ L q Ω + 2 β e L q ( Ω ) + ( id - I F , ) σ L q ( Ω ) .

It is expected that (id- I F , )σ L q ( Ω ) H . This is shown in ([22], Prop. 3.6) for q = 2 and therefore also holds for q ≤ 2. Hence the right-hand side of the assertion of Theorem 4.2 converges with the (expected) optimal convergence rates.

Proof of Theorem 4.1.

Let ϰ be the reciprocal of c1 in ([12], Lemma 3.5), which is also the multiplicative constant hidden in (3.3). Recall Young’s inequality, which reads ab a r /r+ b r / r for a, b > 0. This, (3.3) and the continuous Euler-Lagrange equation (3.1) show, for v = e  - w V, that

ϰ δ L p ( Ω ) r + 2 β e L 2 ( Ω ) 2 Ω δ : D v + 2 β e · v d x + Ω δ : D w + 2 β e · w d x - Ω σ : D v - Λ · v d x + β e L 2 ( Ω ) 2 + β w L 2 ( Ω ) 2 + ϰ / 2 δ L p ( Ω ) r + r ϰ / 2 1 - r / r | w | W 1 , p ( Ω ) r .

Hence Res (v):=- Ω ( σ :Dv- Λ ·v)dx satisfies

ϰ / 2 δ L p ( Ω ) r + β e L 2 ( Ω ) 2 Res ( v ) + r ϰ / 2 1 - r / r | w | W 1 , p ( Ω ) r + β w L 2 ( Ω ) 2 .

Let C q denote the Poincaré constant of convex domains with respect to the W 1 , q norm. The fundamental theorem of calculus on some one-dimensional arc shows that C  ≤ 1. The paper [23] proves C1 = 1 / 2. Hence, operator-interpolation arguments [24, 25] prove C q ( 1 / 2 ) 1 / q 1. The Poincaré inequality shows, for any 2 ≤ q ≤ p, that

Ω ( id - Π ) Λ · v d x = Ω h ( id - Π ) Λ · 1 h ( id - Π ) v d x h ( id - Π ) Λ L q ( Ω ) D v L q ( Ω ) = osc , q ( Λ ) D v L q ( Ω ) .

For any τH(div,Ω; R m × n ), the Hölder and Poincaré inequalities show

Res ( v ) = - Ω ( ( σ - τ ) : D v - ( Π Λ + div τ ) · v - ( id - Π ) Λ · v ) d x σ - τ L q ( Ω ) + Π Λ + div τ L q ( Ω ) + osc , q ( Λ ) v W 1 , q ( Ω ) .

Proof of Theorem 4.2

The triangle inequality yields

σ - τ L q ( Ω ) ( id - I F , ) σ L q ( Ω ) + δ L q Ω .

Since f = 2β(u - g)-divσ, the commutative property divIF, = Π div (cf. ([22], p. 129)) yields

Π Λ + div τ L q ( Ω ) = 2 β Π e L q ( Ω ) 2 β e L q ( Ω ) .

Refined analysis for an interface model problem

This section is devoted for a model scenario from phase transition problems [18] with some solution u that is smooth outside some one-dimensional interface Γ. Suppose some (possibly non-unique) minimiser u of the continuous problem (2.3) satisfies u W 1 , (Ω; R m ) W 2 , p (ΩΓ; R m ) for some finite union Γ of (n - 1) dimensional Lipschitz surfaces in Ω ¯ . Since Ω has a Lipschitz boundary, this implies Lipschitz continuity of u on Ω. We refer to [19] for sufficient conditions for u W 1 , (Ω; R m ) and conclude that the remaining assumption u W 2 , p (ΩΓ; R m ) is the essential hypothesis expected in many interface problems. Let u A be the (unique) minimiser of the discrete stabilised problem (2.5). In the following, also Γ =  is permitted to extend previous results [12] for highly regular minimisers.

The following theorem leads to a priori convergence rates for the interface model problem. Thereby it recovers the results of [12] for problems with piecewise smooth exact solution.

We will abbreviate the set of all triangles that are touched by Γ as T (Γ):={T T :dist(T,Γ)=0}, its cardinality as T ( Γ ) , its union as Ω Γ , :=int( T (Γ)) with volume |ΩΓ,| and its complement as Ω Γ , C :=Ω Ω Γ , ¯ .

Theorem 5.1.

Provided β > 0, it holds

δ L p ( Ω ) r + e L 2 ( Ω ) 2 + | u | 2 H 1 + γ | u | H 2 ( Ω Γ ) 2 + H 2 | u | W 1 , ( Ω ) 2 + H r / ( r - 1 ) | u | W 2 , p ( Ω Γ , C ) r / ( r - 1 ) + H γ + n - 1 | u | W 1 , ( Ω ) 2 | T ( Γ ) | + | u | W 1 , ( Ω ) r / ( r - 1 ) | Ω Γ , | r / ( ( r - 1 ) p ) .

Remark 5.2.

In the case of uniform mesh refinements we may expect | T (Γ)| H 1 - n and |ΩΓ,| ≈ H and Theorem 5.1 simplifies to

δ L p ( Ω ) r + e L 2 ( Ω ) 2 + | u | 2 H min γ , 2 | u | W 1 , ( Ω ) 2 + H r / ( ( r - 1 ) p ) | u | W 1 , ( Ω ) r / ( r - 1 ) .

Proof.

With w  = (id - I )e  = (id - I )u, a Young inequality, (3.3) and ([12], Theorem 3.8) yield

δ L p ( Ω ) r + e L 2 ( Ω ) 2 + | u | 2 | w | W 1 , p ( Ω ) r / ( r - 1 ) + w L 2 ( Ω ) 2 + | I u | 2 .

Theorem 4.4.4 in [25] shows w L 2 ( Ω ) w L ( Ω ) H |u | W 1 , ( Ω ) and

| w | W 1 , p ( Ω ) p = | w | W 1 , p ( Ω Γ , ) p + | w | W 1 , p ( Ω Γ , C ) p | u | W 1 , ( Ω Γ , ) p | Ω Γ , | + H p | u | W 2 , p ( Ω Γ , C ) p .

Let ω F = T T F T T be the patch of a side F F , and set F (Γ)={F F (Ω): ω F Γ} and F C (Γ)= F (Ω) F (Γ). Note that [Du] F = 0 for F F C (Γ). Then

| I u | 2 = H 1 + γ F F C ( Γ ) h F - 1 [ D w ] F L 2 ( F ) 2 + F F ( Γ ) h F - 1 [ D I u ] F L 2 ( F ) 2 ·

The first sum can be estimated as in the proof of ([12], Lemma 3.2), the second sum with

[ D I u ] F L 2 ( F ) 2 h F n - 1 | I u | W 1 , ( F ) 2 h F n - 1 | u | W 1 , ( F ) 2 .

The observation | F (Γ)|(n+1)| T (Γ)| concludes the proof.□

Together with Theorem 5.1, the subsequent result implies strong convergence of the gradients in the model interface problem as H  → 0.

Theorem 5.3.

Under the aforementioned conditions on the (possibly non-unique) exact minimiseru W 1 , (Ω; R m ) W 2 , p (ΩΓ; R m ), the error e  = u - u of the discrete solution u A of (2.5) satisfies

D e L 2 ( Ω ) e L 2 ( Ω ) 1 / 3 + H 5 / 6 2 u D / s 2 L 2 ( ∂Ω ) 1 / 3 + H ( 1 - γ ) / 2 | u | + H - ( 1 + γ ) / 4 | u | 1 / 2 e L 2 ( Ω ) 1 / 2 + H 5 / 4 2 u D / s 2 L 2 ( Ω ) 1 / 2 .

Proof

The basic idea of gradient control is the generalisation of the interpolation estimate H1(Ω) = [L2(Ω), H2(Ω)]1/2 for a reduced domain ΩΓ; refer to [24, 25] for a detailed analysis of interpolation spaces. Let w be the boundary value interpolation of (id - I )u D as described in ([20], Prop. 4.1), such that w satisfies the inequalities in (4.2). A piecewise integration by parts shows, for v:= e - w W 0 1 , p (Ω; R m ), that

D e L 2 ( Ω ) 2 = Ω D ( u - u ) : D v d x + Ω D e : D w d x Γ v · [ D u ] Γ n Γ d s - Ω Γ v · Δ u d x - F F ( Ω ) F v · [ D u ] F n F d s + D e L 2 ( Ω ) D w L 2 ( Ω ) ,

where nΓ is a unit normal vector of the interface Γ. The Lipschitz continuity of u implies |[Du]ΓnΓ|  1. This and the trace inequality on Γ lead to

Γ v · [ D u ] Γ n Γ d s v L 2 ( Γ ) v L 2 ( Ω ) + v L 2 ( Ω ) 1 / 2 D v L 2 ( Ω ) 1 / 2 .

The case Γ= is contained in ([12], Theorem 4.4). The piecewise Laplacian of u is bounded in L2(Ω) and so (with the generic constant C:=Δu L 2 ( Ω Γ ) hidden in the notation C ≈ 1)

Ω Γ v · Δ u d x v L 2 ( Ω )

The elementwise trace inequality ([25], Theorem 1.6.6, p. 39) for an n-dimensional simplex T and one of its sides F, and f W 1 , q (T; R m ), 1 ≤ q < , reads

f L q ( F ) q h T - 1 f L q ( T ) q + f L q ( T ) q - 1 D f L q ( T ) h T - 1 f L q ( T ) q + h T q - 1 D f L q ( T ) q .

The term F v· [ D u ] F n F ds and the stabilisation |u | are already analysed in the Estimate on C in the proof of ([12], Theorem 4.4). This results in

F F ( Ω ) F v · [ D u ] F n F d s | u | H ( 1 - γ ) / 2 D v L 2 ( Ω ) + H - ( 1 + γ ) / 2 v L 2 ( Ω ) .

The preceding estimates plus the absorbtion of D e L 2 ( Ω ) lead to

D e L 2 ( Ω ) 2 v L 2 ( Ω ) + v L 2 ( Ω ) 1 / 2 D v L 2 ( Ω ) 1 / 2 + D w L 2 ( Ω ) 2 + | u | H ( 1 - γ ) / 2 D v L 2 ( Ω ) + H - ( 1 + γ ) / 2 v L 2 ( Ω ) .

The triangle inequality applied to v = e  - w and some careful elementary analysis to absorb D e L 2 ( Ω ) 1 / 2 eventually lead to

D e L 2 ( Ω ) e L 2 ( Ω ) 1 / 3 + w L 2 ( Ω ) 1 / 3 + | w | H 1 ( Ω ) + H ( 1 - γ ) / 2 | u | + H - ( 1 + γ ) / 4 | u | 1 / 2 e L 2 ( Ω ) + w L 2 ( Ω ) 1 / 2 .

The inequalities (4.2), Poincaré and Friedrichs inequalities on sides F F (∂Ω) and removal of higher-order terms in H conclude the proof.□

The following theorem is an improved a posteriori estimate based on Theorems 4.1 and 5.3.

Theorem 5.4.

Recallu W 1 , (Ω; R m ) W 2 , p (ΩΓ; R m ), the definitions e  := u - u and δ  := σ - σ for σ := DW(Du) and σ  := DW(Du ), and the definition of Λ from Section ‘A posteriori error estimates’. Set

M ( τ ) : = σ - τ L 2 ( Ω ) + Π Λ + div τ L 2 ( Ω ) + osc , 2 ( Λ ) for all τ H ( div , Ω ; R m × n ) .

Provided β>0, it holds

δ L p ( Ω ) r + e L 2 ( Ω ) 2 M ( τ ) 6 / 5 + H - ( 1 + γ ) / 3 M ( τ ) 4 / 3 | u | 2 / 3 + M ( τ ) H ( 1 - γ ) / 2 | u | + H 1 - γ / 4 | u | 1 / 2 + H min 5 , r ( 1 + 1 / p )

and

D e L 2 ( Ω ) 2 M ( τ ) 2 / 5 + H - ( 1 + γ ) / 9 M ( τ ) 4 / 9 | u | 2 / 9 + H min 5 / 3 , r ( 1 + 1 / p ) / 3 + M ( τ ) 1 / 3 H ( 1 - γ ) / 2 | u | + H 1 - γ / 4 | u | 1 / 2 1 / 3 + H 1 - γ | u | 2 + H - ( 1 + γ ) / 2 | u | M ( τ ) 6 / 5 + H - ( 1 + γ ) / 3 M ( τ ) 4 / 3 | u | 2 / 3 + H min 5 , r ( 1 + 1 / p ) 1 / 2 + H - ( 1 + γ ) / 2 | u | M ( τ ) 1 / 2 H ( 1 - γ ) / 2 | u | + H 1 - γ / 4 | u | 1 / 2 1 / 2

The generic constants in Theorem 5.4 depend on problem-specific data such as the shapes of Ω and Γ as well as the generic constant ϰ of Theorem 4.1.

Theorem 5.5.

Theorem 5.4 holds verbatim in Example 3.3 and in the modified two-well problem of Subsection ‘Modified two-well benchmark’, where β = 0.

Remark 5.6.

The assertion of Theorem 5.4 holds for any discrete u uD, + V which may approximate the discrete unique exact solution of (2.5). This allows the inexact SOLVE via an iterative procedure.

Proof of Theorem 5.4

Choose w as in the proof of Theorem 5.3. Then Theorem 4.1 with q = 2 and (4.2) imply

δ L p ( Ω ) r + e L 2 ( Ω ) 2 M ( τ ) e - w H 1 ( Ω ) + | w | W 1 , p ( Ω ) r + w L 2 ( Ω ) 2 M ( τ ) | e | H 1 ( Ω ) + e L 2 ( Ω ) + H 3 / 2 + H min 5 , r ( 1 + 1 / p ) .

Theorem 5.3 provides an estimate of the semi-norm | e | H 1 ( Ω ) . A Young inequality shows H 5 / 6 M(τ) H 5 +M ( τ ) 6 . The absorbtion of e L 2 ( Ω ) then proves the first assertion. The second assertion is an immediate consequence of the first one, Theorem 5.3 and several algebraic transformations.

Numerical experiments

This section illustrates the theoretical estimates and their impact on the reliability-efficiency gap on 2D benchmarks in computational microstructures [18, 26].

Numerical algorithms

The adaptive finite element method (AFEM) and algorithmic details on the implementation in MATLAB in the spirit of [27] concern the state-of-the-art AFEM loop

SOLVE ESTIMATE MARK REFINE

and are explained below together with some notation.

Solve

The stabilised discrete problem (2.5) is solved in a nested iteration on a given triangulation T with MATLAB’s standard-minimiser fminunc with default tolerances. Gradient and Hessian of the discrete energy are available and therefore provided to fminunc. We set γ = 1 in the stabilisation term (2.4) in all our experiments. This is motivated by ([12], Theorem 4.4) which suggest that γ = 1 yields an optimal convergence rate. The discrete solution of the previous AFEM loop iteration serves as a start vector for fminunc; for the first iteration, the initial vector is zero everywhere up to the Dirichlet boundary nodes. Since the Galerkin orthogonality is not required in Theorem 4.1, the termination of an iterative realisation for SOLVE is not a sensitive issue. In the computational PDEs, it is a fundamental issue to involve inexact solve. In this paper, however, the numerical examples are run with the standard settings of MATLAB.

Estimate

The refinement indicator results from the error estimator of Theorem 4.1. As in the work of Repin [28], the computation of the minimiser τ RT 0 ( T ) m of

σ -τ L 2 ( Ω ) + Π Λ +divτ L 2 ( Ω )
(6.1)

runs Algorithm 1 based on the formula

a + b 2 = min s > 0 ( 1 + s ) a 2 + ( 1 + 1 / s ) b 2 for a , b > 0
Algorithm 1 Approximate flux computation

The stopping criterion of Algorithm 1 monitors relative changes and avoids degenerate values of s. Undisplayed experiments have conviced us that a maxmium of three iterations and a stopping tolerance of ε M 0.8 (with the machine precision εM) yield satisfying results. The iteration is stopped whenever s, 1 / s or the relative change of s drops below this tolerance. As an additional precaution, the iteration also stops if the linear system is deemed “nearly singular” by MATLAB. Our experiments convinced us that ignoring this warning causes a breakdown with NaNs. Note that if q ≠ 2, we still minimise the L2 sums in (6.1) to avoid the computational cost of a nonlinear solve. With the computed minimiser τ, Section ‘A posteriori error estimates’ yields the error estimator

η F , q : = σ - τ L q ( Ω ) + Π Λ + div τ L q ( Ω ) + osc , q ( Λ ) .

This will be compared with the well-established residual based a posteriori error estimator[7]

η R , q : = T T h T q Λ L q ( T ) q 1 / q + F F ( Ω ) h F [ σ ] F · n F L q′ ( F ) q 1 / q ,

which is reliable for the original discretisation without stabilisation. Undisplayed experiments computed the averaging error estimator[18], which is founded on the same theoretical background as η R , q and therefore yielded essentially the same convergence rates.

The error estimators in Theorem 5.4 read

η L , 2 : = η F , 2 6 / 5 + H - ( 1 + γ ) / 3 η F , 2 4 / 3 | u | 2 / 3 + η F , 2 H ( 1 - γ ) / 2 | u | + H 1 - γ / 4 | u | 1 / 2 + H min 5 , r ( 1 + 1 / p )
η H , 2 : = η F , 2 2 / 5 + H - ( 1 + γ ) / 9 η F , 2 4 / 9 | u | 2 / 9 + H min 5 / 3 , r ( 1 + 1 / p ) / 3 + η F , 2 1 / 3 H ( 1 - γ ) / 2 | u | + H 1 - γ / 4 | u | 1 / 2 1 / 3 + H 1 - γ | u | 2 + H - ( 1 + γ ) / 2 | u | η F , 2 6 / 5 + H - ( 1 + γ ) / 3 η F , 2 4 / 3 | u | 2 / 3 + H min 5 , r ( 1 + 1 / p ) 1 / 2 + H - ( 1 + γ ) / 2 | u | η F , 2 1 / 2 H ( 1 - γ ) / 2 | u | + H 1 - γ / 4 | u | 1 / 2 1 / 2 .

MARK

For any given T T with its set of faces F(T), ∂T=F(T), and given τ from (6.1), set

η F q ( T ) : = σ - τ L q ( T ) q + Π Λ + div τ L q ( T ) q + h T q ( id - Π ) Λ L q ( T ) q . η R q ( T ) : = | T | q / n Λ L q ( T ) q + | T | 1 / n F F ( Ω ) F ( T ) [ σ ] F · n F L q ( F ) q .

Let η q (T) be one of the refinement indicators η F q (T) and η R q (T). Some greedy algorithm computes M T of (almost) minimal cardinality such that

T M η q ( T ) 1 / 2 T T η q ( T ) .

Refine

This step computes the smallest refinement T + 1 of T with M T T + 1 based on the red-green-blue refinement strategy as illustrated in Figure 2. This refinement involves some closure algorithm to avoid hanging nodes.

Figure 2
figure 2

Possible refinements of a triangle T .

Two-well benchmark

The computational microstructure benchmark of ([18], Section 2) considers two wells with W from (3.4) in Example 3.3. The energy is given by (1.1) on the domain Ω=(0,1)×(0,3/2) R 2 with

g ( x ) : = - 3 t 5 / 128 - t 3 / 3 and u D ( x ) : = g ( x ) for t 0 , t 3 / 24 + t for t 0

for t:=(3( x 1 -1)+2 x 2 )/ 13 ; p = q = 4 and f ≡ 0. The unique minimiser u of min v A E(v) with A= u D + W 0 1 , 4 (Ω) reads u=u D ([18], Theorem 2.1) and β = 1 allows for Theorems 5.1 – 5.4 to hold. An initial triangulation T 0 is given by a criss triangulation of (0,1)×(0,3/2) with 12 congruent triangles and the two interior nodes (1/2,1/2) and (1/2,1). The adaptive algorithm of Subsection ‘Numerical algorithms’ computes a sequence of discrete solutions (u ) and stresses (σ ) , as well as error estimators η F and η R with and without stabilisation for uniform and adaptive meshes and led to Figure 3 with overall observations of Section ‘Conclusions’. The empirical convergence rates for uniform and R- as well as F-adapted mesh-refining are collected in Table 1. Note that the error estimator η L performs better than η F . This is evident from the table for uniform mesh refinements, but a closer look at Figure 3 reveals that even in the adaptive scenarios, η L converges slightly faster than η F . This is in accordance to the theory of Section ‘Refined analysis for an interface model problem’ where η L is derived from η F based on additional smoothness assumptions.

Figure 3
figure 3

Convergence plot of the two-well benchmark. Errors and error estimators of the two-well benchmark of Subsection ‘Two-well benchmark’, plotted against the number of degrees of freedom.

Table 1 Observed convergence rates in Figures 3 , 4 , 6 and 7 for uniform and adaptive mesh refining

Modified two-well benchmark

This subsection concerns a modification of the previous problem with (3.4) and a linear right-hand side for β = 0 and f(x) := -div(DW(Du D (x))) and unique solution u = u D as before. Note that Example 3.3 applies to this problem, and so the proof of Theorem 3.1 yields

σ - σ L p ( Ω ) r + u - u L 2 ( Ω ) 2 + | u | 2 0 as

and Theorems 5.1–5.4 hold as well. The algorithms of Subsection ‘Numerical algorithms’ ran with and without stabilisation for uniform and adaptive meshes with the same initial triangulation as in Subsection ‘Two-well benchmark’ and led to Figure 4 with overall observations of Section ‘Conclusions’. The empirical convergence rates for uniform and R- as well as F-adapted mesh-refining are collected in Table 1 for completeness although they are almost identical with those observed in Subsection ‘Two-well benchmark’.

Figure 4
figure 4

Convergence plot of the modified two-well benchmark. Errors and error estimators of the modified two-well benchmark of Subsection ‘Modified two-well benchmark’, plotted against the number of degrees of freedom.

Three-well benchmark

The energy density W of ([26], Example 5.9.3, p. 72) is the convex hull of min{|F|2, |F - (1, 0)|2, |F - (0, 1)|2} with explicit form in ([26], Example 5.6.4, p. 58). Let furthermore Ω= ( 0 , 1 ) 2 R 2 and u D (x1, x2) := a(x1 - 1/4) + a(x2 - 1/4) with a(t) := t3/6 + t / 8 for t ≤ 0 and a(t) := t5/40 + t3/8 for t0. Then the energy is given by (1.1) with β = 0 and f := -divDW(Du D ). The exact solution u = u D satisfies the interface condition of Section ‘Refined analysis for an interface model problem’ and allows Theorem 5.3 to hold. Theorems 5.1 and 5.4 do not apply because β = 0. We use the grid of Figure 5 as initial triangulation to resolve discontinuities in f.

Figure 5
figure 5

Initial grid for the three-well benchmark. Initial grid for the three-well benchmark of Subsection ‘Three-well benchmark’.

The algorithms of Subsection ‘Numerical algorithms’ ran with and without stabilisation for uniform and adaptive meshes and led to Figure 6 with overall observations of Section ‘Conclusions’. Beyond those general conclusions, this example demonstrates the difficulties with ill-conditioned Hessians. While the unstabilised method reaches 106 degrees of freedom without difficulty on uniform meshes, the adapted algorithms fail without stabilisation beyond 687 324 degrees of freedom (η F -adaptive) and 33 169 degrees of freedom (η R -adaptive). MATLAB’s error message “Input to EIG must not contain NaN or Inf” indicates that a matrix operation returned non-finite numbers let fminunc break down. Undisplayed numerical experiments show condition numbers up to 1010 and beyond. The empirical convergence rates for uniform and R- as well as F-adapted mesh-refining are collected in Table 1. Moreover, Figure 1 in Section ‘Background’ reveals that stabilisation not only remedies ill-conditioned Hessians but thereby indeed allows for reduced errors in the discrete solution.

Figure 6
figure 6

Convergence plot of the three-well benchmark. Errors and error estimators of the three-well benchmark of Subsection ‘Three-well benchmark’, plotted against the number of degrees of freedom.

An optimal design example

The energy density of the topology optimisation problem of [3, 8, 2933] reads

W ( F ) : = ϕ ( | F | ) for F R 2 with ϕ ( t ) : = λ / 2 + t 2 for 0 t λ , 2 λ ( t - λ / 2 ) for λ t 2 λ , t 2 / 2 + λ for t 2 λ .

This leads to problem (2.3) with β = 0, λ = 0.0084,u D  ≡ 0 and f ≡ 1. Since regularity of the solutions is unclear, only the results of Sections ‘Global convergence’, ‘A posteriori error estimates’, ‘Refined analysis for an interface model problem’ and ‘Numerical experiments’ apply. As initial triangulation T 0 , we use the coarsest cross triangulation T 0 ={conv{(0,0),(1,0),(0,1)},conv{(1,0),(0,1),(1,1)}} of Ω = (0, 1)2.

The algorithms of Subsection ‘Numerical algorithms’ ran with and without stabilisation for uniform and adaptive meshes and led to Figure 7 with the overall observations of Section ‘Conclusions’. The empirical convergence rates for uniform and R- as well as F-adapted mesh-refining are collected in Table 1. Undocumented experiments with a modified lower-order term f and known exact solution u led to the same convergence rates of the error estimators and confirm their accuracy.

Figure 7
figure 7

Convergence plot of the optimal design benchmark. Error estimators of the optimal design example of Subsection ‘An optimal design example’, plotted against the number of degrees of freedom.

Discussion of Empirical Convergence Rates

Global convergence without regularity assumptions

Theorem 3.1 asserts that σ- σ L p ( Ω ) , βu- u L 2 ( Ω ) , and |u | all tend to zero as H  → 0. The plain convergence result applies to all examples from Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’ for the uniform mesh-refinements with H+1 = H  / 2. The numerical experiments, however, show empirical convergence rates displayed in the first columns of Table 1. The adaptive algorithms do not reflect the condition H  → 0 explicitly and hence convergence is not guaranteed a priori. Undisplayed investigations show that indeed in the R-adapted version of the three-well example of Subsection ‘Three-well benchmark’, this condition H  → 0 does not appear to be true for more than 4 978 degrees of freedom. In all other experiments we observe convergence rates even for unstabilised discretisations.

Empirical convergence rates for interface model problems

Theorem 5.1 provides an a priori error estimate and an estimate of the stabilisation norm. It applies to the benchmark of Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’ only, because of β > 0 and Example 3.3, and the smoothness conditions imposed upon u from Section ‘Refined analysis for an interface model problem’. Recall the definitions of T (Γ), ΩΓ, and Ω Γ , C from Section ‘Refined analysis for an interface model problem’ and assume u L 2 ( Ω Γ ) 1u W 2 , p ( Ω Γ , C ) , | T (Γ)| H - 1 and |ΩΓ,|≈H in this discussion. This leads to a convergence rate of H 2 / p for the right-hand side of Theorem 5.1. The observed convergence rates of σ- σ L p ( Ω ) and u- u L 2 ( Ω ) for the stabilised benchmark examples in Table 1 show convergence rates beyond those guaranteed in Theorem 5.1.

Theorem 5.3 implies, up to perturbations on the boundary,

D ( u - u ) L 2 ( Ω ) u - u L 2 ( Ω ) 1 / 3 + | u | + H - 1 / 2 | u | 1 / 2 u - u L 2 ( Ω ) 1 / 2 .

Since the exact solutions of Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’ are all smooth up to a one-dimensional interface line, Theorem 5.3 applies to these examples. The experiments shows that the right-hand side of Theorem 5.3 is dominated by H - 1 / 2 | u | 1 / 2 u- u L 2 ( Ω ) 1 / 2 in all examples and that the inequality is satisfied.

Reliability without regularity assumptions

Up to boundary terms, Theorem 4.1 states

σ - σ L p ( Ω ) 2 + β u - u L 2 ( Ω ) 2 η F u - u W 1 , p ( Ω ) .

The convergence rates confirm this assertion for the general and rough estimate u- u W 1 , p ( Ω ) 1 in the sense that the rates for η F are worse than or equal to those of σ- σ L p ( Ω ) 2 and u- u L 2 ( Ω ) 2 . In the numerical examples, u- u H 1 ( Ω ) is computed and displayed in Table 1 and the convergence rates of the product u- u H 1 ( Ω ) η F can be compared with those of σ- σ L p ( Ω ) 2 +u- u L 2 ( Ω ) 2 . This comparison confirms the above a posteriori error estimate. In the examples with p = 2 (of Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’), there holds even equality of the convergence rates which demonstrates the efficiency of the estimate of Theorem 4.1.

Efficiency without regularity assumptions

Up to oscillations and the (possibly) higher-order term (id- I F , )σ L q ( Ω ) , Theorem 4.2 states

η F σ - σ L p ( Ω ) + β u - u L p ( Ω ) .

The displayed convergence rates of Table 1 confirm this estimate.

Reliability of the refined a posteriori error control

Theorem 5.4 applies to the example of Subsection ‘Two-well benchmark’ and states

σ - σ L p ( Ω ) 2 + u - u L 2 ( Ω ) 2 η L and D ( u - u ) L 2 ( Ω ) 2 η H .

Table 1 confirms this estimate and shows that the estimators η L and η H accurately predict the convergence rate of the errors, even with equality of the convergence rates in the case of adaptive mesh refinements in the examples of Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’.

All displayed convergence rates of η L are better or at least equal to those of η F . For instance, for uniform mesh-refining in Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’, the error terms σ- σ L p ( Ω ) 2 +u- u L 2 ( Ω ) 2 converge with the empirical convergence rate 7/5 while the upper bound η F does so with a reduced convergence rate 4/5. The refined error estimator η L is a guaranteed upper bound (via Theorem 5.4) and converges with an empirical convergence rate 1.

Performance of the minimisation algorithm 1

In all numerical experiments of this paper, Algorithm 1 reaches the maximal number 3 of iterations. While this suggests that the optimal s is not found after three iterations, undisplayed experiments with higher iteration counts and hence higher computational efforts result solely in marginal improvements.

Conclusions

Effects of stabilisation

The empirical convergence rates of the error estimators η F , η R and the errors u- u L 2 ( Ω ) and σ- σ L p ( Ω ) for uniform mesh-refinement with and without stabilisation coincide. This indicates that the choice γ = 1 leads to some significant perturbation but maintains the correct convergence rate at the same time. This is different for adaptive mesh refinement with less optimal convergence rates. Our conclusion is that an improved adaptive algorithm has to be developed with balance of local mesh-refinement and global stabilisation parameters in future research. The tested algorithm from Subsection ‘Numerical algorithms’ does neither reflect the effects of stabilisation nor that of inexact solve.

Another important aspect of the stabilisation is the regularisation of the Hessian in the step SOLVE of Subsection ‘Numerical algorithms’. In the three-well problem of Subsection ‘Three-well benchmark’, the unstabilised adaptive algorithms fail.

Adaptive versus uniform mesh-refinement

The overall empirical convergence rates of the errors and estimators of the unstabilised computation for adaptive mesh-refinements are better than those for uniform mesh-refinements. This is in contrast to the stabilised computation, where the true errors σ- σ L p ( Ω ) and u- u L 2 ( Ω ) behave better for uniform compared with the two adaptive mesh-refinments (with the exception in Subsection ‘An optimal design example’ where there is equality). It is observed that adaptivity does not necessarily improve the converegnce rates of the error σ- σ L p ( Ω ) and u- u L 2 ( Ω ) in a stabilised computation. Surprisingly, the convergence of the gradient errors D(u- u ) L 2 ( Ω ) are slightly improved in the instabilised calculation by adaptive mesh-refinements. The adaptive mesh-refinement is expected to reduce the a posteriori error estimators in the first place: cf. [1, 34] for the estimator reduction property. Indeed, the convergence rates of the a posteriori error estimators η R , η F , η L , η H are improved (or optimal) for adaptive mesh-refinements (except for the three-well example of Subsection ‘Three-well benchmark’).

Strong convergence of the gradients

The convergence of the gradient error of the stabilised problem surpasses the expectations of [12] in Subsection ‘An optimal design example’ but fails to do so in Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’. The improved error estimator η H shows the same convergence rate as the error of the gradients in Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’. This holds for uniform and for adapted mesh refinements and suggests that η H is in fact reliable and efficient for β > 0.

Guaranteed error control

The assertion on η F in Theorem 4.1 is reflected in the numerical examples in that the stress approximations converge faster than η F in all cases. This suggests that the estimate u- u W 1 , p ( Ω ) 1 is by far too pessimistic. In fact, the benchmark examples with known exact solution fulfil σ- σ L 2 ( Ω ) 2 η F u- u H 1 ( Ω ) . Similar affirmative conclusions follow for Theorem 4.2 and 5.4.

Reliability-efficiency gap

In comparison with the residual-based error estimator of [7, 18], the new a posteriori error estimators η L and η H of Theorem 5.4 lead to refined error control. The improvement is marginal for uniform meshes without stabilisation but significant for adaptive stabilised computations. η L and η H match the convergence of the errors and so narrow the reliability-efficiency gap.

References

  1. Carstensen C: Convergence of an adaptive fem for a class of degenerate convex minimisation problems. IMA J Numer Anal 2008,28(3):423–439.

    Article  MathSciNet  Google Scholar 

  2. Dacorogna B: Direct methods in the calculus of variations, 2nd Ed. Applied Mathematical Sciences 78. Berlin: Springer; 2008. xii xii

    Google Scholar 

  3. Carstensen C, Müller S: Local stress regularity in scalar non-convex variational problems. SIAM J Math Anal 2002,34(2):495–509. 10.1137/S0036141001396436

    Article  MathSciNet  Google Scholar 

  4. Chipot M: Elements of Nonlinear Analysis. Birkhäuser Advanced Texts. Birkhäuser: Basel; 2000. vi vi

    Book  Google Scholar 

  5. Müller S: Variational models for microstructure and phase transisions. In Calculus of variations and geometric evolution problems. Lectures given at the 2nd session of the, Centro Internazionale Matematico Estivo (CIME), Cetraro, Italy, June 15–22, 1996, Lect. Notes Math., 1713. Edited by: Hildebrandt S. Berlin: Springer; 1999:85–210.

    Google Scholar 

  6. Ball JM, James RD: Proposed experimental tests for the theory of fine microstructures and the two-well problem. Phil Trans R Soc Lond A 1992, 338: 389–450. 10.1098/rsta.1992.0013

    Article  Google Scholar 

  7. Carstensen C, Plecháĉ P: Numerical solution of the scalar double-well problem allowing microstructure. Math Comp 1997,66(219):997–1026. 10.1090/S0025-5718-97-00849-1

    Article  MathSciNet  Google Scholar 

  8. Bartels S, Carstensen C: A convergent adaptive finite element method for an optimal design problem. Numer Math 2007, 108: 359–385.

    Article  MathSciNet  Google Scholar 

  9. Goodman J, Kohn RV, Reyna L: Numerical study of a relaxed variational problem from optimal design. Comput Methods Appl Mech Eng 1986, 57: 107–127. 10.1016/0045-7825(86)90073-3

    Article  MathSciNet  Google Scholar 

  10. Carstensen C, Klose R: Guaranteed a posteriori finite element error control for the p-Laplace problem. SIAM J Sci Comput 2003, 25: 792–814. 10.1137/S1064827502416617

    Article  MathSciNet  Google Scholar 

  11. Bartels S, Carstensen C, Plecháĉ P, Prohl A: Convergence for stabilisation of degenerate convex minimsation problems. IFB 2004,6(2):253–269.

    Google Scholar 

  12. Boiger W, Carstensen C: On the strong convergence of gradients in stabilised degenerate convex minimisation problems. SIAM J Numer Anal 2010,47(6):4569–4580. 10.1137/090746409

    Article  MathSciNet  Google Scholar 

  13. Ciarlet PG: The finite element method for elliptic problems. Philadelphia, PA, USA: Society for Industrial Mathematics; 2002.

    Book  Google Scholar 

  14. El Alaoui L, Ern A, Vohralík M: Guaranteed and robust a posteriori error estimates and balancing discretization and linearization errors for monotone nonlinear problems. Comp Meth Appl Mech Eng 2011,200(37–40):2782–2795.

    Article  Google Scholar 

  15. Ern A, Nicaise S, Vohralík M: An accurate h (div) flux reconstruction for discontinuous galerkin approximations of elliptic problems. C R, Math, Acad Sci Paris 2007,345(12):709–712. 10.1016/j.crma.2007.10.036

    Article  MathSciNet  Google Scholar 

  16. Luce R, Wohlmuth B: A local a posteriori error estimator based on equilibrated fluxes. SIAM J Numer Anal 2004,42(4):1394–1414. 10.1137/S0036142903433790

    Article  MathSciNet  Google Scholar 

  17. Ainsworth M: A synthesis of a posteriori error estimation techniques for conforming, non-conforming and discontinuous galerkin finite element methods. Providence: American Mathematical Society (AMS); 2005.

    Book  Google Scholar 

  18. Carstensen C, Jochimsen K: Adaptive finite element methods for microstructures? Numerical experiments for a Two-well benchmark. Computing 2003, 71: 175–204. 10.1007/s00607-003-0027-1

    Article  MathSciNet  Google Scholar 

  19. Chipot M, Evans LC: Linearisation at infinity and Lipschitz estimates for certain problems in the calculus of variations. Proc Roy Soc Edinburgh Sect A 1986,102(3–4):291–303.

    Article  MathSciNet  Google Scholar 

  20. Bartels S, Carstensen C, Dolzmann G: Inhomogeneous Dirichlet conditions in a priori and a posteriori finite element error anylysis. Numer Math 2004,99(1):1–24. 10.1007/s00211-004-0548-3

    Article  MathSciNet  Google Scholar