A posteriori error analysis of stabilised FEM for degenerate convex minimisation problems under weak regularity assumptions

Boiger, Wolfgang; Carstensen, Carsten

doi:10.1186/2213-7467-1-5

Research article
Open access
Published: 29 January 2014

Advanced Modeling and Simulation in Engineering Sciences volume 1, Article number: 5 (2014)

Abstract

Background

The discretisation of degenerate convex minimisation problems experiences numerical difficulties with a singular or nearly singular Hessian matrix.

Methods

Some discrete analog of the surface energy in microstructures is added to the energy functional to define a stabilisation technique.

Results

This paper proves (a) strong convergence of the stress even without any smoothness assumption for a class of stabilised degenerate convex minimisation problems. Given the limited a priori error control in those cases, the sharp a posteriori error control is of even higher relevance. This paper derives (b) guaranteed a posteriori error control via some equilibration technique which does not rely on the strict Galerkin orthogonality of the unperturbed problem. In the presence of L² control in the original minimisation problem, some realistic model scenario with piecewise smooth exact solution allows for strong convergence of the gradients plus refined a posteriori error estimates. This paper presents (c) an improved a posteriori error control in this interface problem and so narrows the efficiency-reliability gap.

Conclusions

Numerical experiments illustrate the theoretical convergence rates for uniform and adaptive mesh-refinements and the improved a posteriori error control for four benchmark examples in computational microstructures.

Background

Infimising sequences of variational problems with non-quasiconvex energy densities, in general, develop finer and finer oscillations, called microstructure, with no classical limit in Sobolev function spaces [1–6]. Those oscillations cause difficulties for numerical methods because fine grids are necessary to resolve them, which results in ineffective and tricky mesh-dependent computations. Strong convergence of gradients of infimising sequences of the non-quasiconvex problem is impossible.

Relaxation techniques replace the nonconvex energy density by its (semi-)convex hull and lead to a macroscopic model. Since the convexified energy density obtained by this method, in general, lacks strict convexity, numerical algorithms might encounter situations where the Hessian matrix is singular. For instance, the Newton minimisation algorithm fails on the convexified three-well problem of Subsection ‘Three-well benchmark’ below. Applications of relaxation techniques include models in computational microstructure [5–7], some optimal design problems [8, 9], the nonlinear Laplacian [10] (where the Hessian can become arbitrarily ill-conditioned in spite of its strict convexity) and elastoplasticity [1].

Stabilisation techniques regularise the energy term by an additional positive semidefinite stabilisation function. The paper [11] discusses several choices of such stabilisation functions for P₁ conforming finite elements and quasiuniform meshes. It turns out that stabilisation can ensure strong convergence of the strain approximations under particular circumstances. A particular stabilisation in [12] leads to strong convergence even on unstructured grids but is still restricted to unrealistically smooth solutions. This paper studies the stabilisation technique of [12], addresses the question of convergence (i) without extra regularity assumptions and (ii) in a realistic scenario called model interface problem, and (iii) establishes an a posteriori error control.

The stabilisation leads to improved condition numbers of the Hessian matrix and to reduced errors if the numerical solvers fail without stabilisation. Figure 1 shows the convergence of the discrete stress σ_ℓ of the three-well benchmark corresponding to the discrete minimisers of the energy $E_{ℓ} (v_{ℓ}) = E (v_{ℓ}) + C / 2 \, |||v_{ℓ}|||_{ℓ}^{2}$ . The errors are plotted for computations with uniform mesh refinements with various solver tolerances in the discrete minimisation procedure at a fixed triangulation and values of C, cf. Section ‘Numerical experiments’ for details on the MATLAB implementation. Without stabilisation, the convergence stagnates with a moderate tolerance of 10^-5 which becomes visible as a “plateau” in Figure 1. The Newton solver even aborts prematurely due to the singular Hessian. In conclusion, stabilisation enables higher accuracies in numerical examples.

For $β \geq 0$ the convex energy functional assumes the form

E (v) : = \int_{Ω} (W (D v (x)) + β | v (x) - g (x) |^{2} - f (x) \cdot v (x)) d x .

(1.1)

Assume that W is convex with quadratic growth so that there exist minimisers $u \in H_{0}^{1} (Ω)$ ; below p-th order growth is included while p = 2 throughout this simplifying introduction. Given a sequence of shape-regular triangulations ${(T_{ℓ})}_{ℓ \in N_{0}}$ [13], let u_ℓ minimise the stabilised discrete energy

E_{ℓ} (v_{ℓ}) : = E (v_{ℓ}) + \frac{1}{2} |||v_{ℓ}|||_{ℓ}^{2} with |||v_{ℓ}|||_{ℓ}^{2} : = H_{ℓ}^{2} \sum_{F \in F_{ℓ} (Ω)} h_{F}^{- 1} ∥ {[D v_{ℓ}]}_{F} ∥_{L^{2} (F)}^{2}

amongst all conforming P₁ finite element functions v_ℓ on $T_{ℓ}$ , where [Dv_ℓ]_F is the jump of the gradient Dv_ℓ along the interior side F, written $F \in F_{ℓ} (Ω)$ , and H_ℓ := max_T h_T is the maximal diameter h_T of all simplices $T \in T_{ℓ}$ .

Section ‘Global convergence’ verifies the strong convergence of the discrete solution u_ℓ and its stress σ_ℓ := DW(Du_ℓ) to their respective continuous counterparts,

∥ σ - σ_{ℓ} ∥_{L^{2} (Ω)}^{2} + β ∥ u - u_{ℓ} ∥_{L^{2} (Ω)}^{2} + |||u_{ℓ}|||_{ℓ}^{2} \to 0 as ℓ \to ∞.

Section ‘A posteriori error estimates’ presents a novel application of [14–17] to nonlinear problems. For the L² projection Π_ℓ onto the space of piecewise P₀ functions, any Raviart-Thomas function $τ_{ℓ} \in {RT}_{0} (T_{ℓ})$ satisfies

\begin{align} ∥ σ - σ_{ℓ} ∥_{L^{2} (Ω)}^{2} ≲ (∥ σ_{ℓ} - τ_{ℓ} ∥_{L^{2} (Ω)} + ∥ Π_{ℓ} Λ_{ℓ} + div τ_{ℓ} ∥_{L^{2} (Ω)} + {osc}_{ℓ, 2} (Λ_{ℓ})) ∥ u - u_{ℓ} ∥_{H^{1} (Ω)} . \end{align}

This error bound holds for any discrete displacement u_ℓ that satisfies the boundary conditions; the point is that inexact solve is included and no Galerkin orthogonality is required. The drawback is that the expression on the right-hand side has to be minimised with respect to τ_ℓ in order to obtain a sharp error bound. The discrete solution u_ℓ is a particular selection: degenerate convex minimisation problems do not allow for a control of $∥ u - u_{ℓ} ∥_{H^{1} (Ω)}$ and may even face multiple exact or discrete solutions while the discrete minimum of E_ℓ is unique. However, in some results of this paper, either W or the lower-order terms lead to some control over $∥ u - u_{ℓ} ∥_{H^{1} (Ω)}$ and the selection via stabilisation is correct.

Phase transition problems motivate the investigation of scenarios with a smooth solution u up to a one-dimensional interface $Γ \subset \bar{Ω}$ [18]. Section ‘Refined analysis for an interface model problem’ proves that such problems allow even for strong convergence of the gradients for any unique solution u in W^1,∞(Ω)∩H²(Ω∖Γ) [19]. This result also leads to an improvement of the a posteriori error control of the discrete stresses and narrows the efficiency-reliability gap; the efficiency-reliability gap is the difference of the convergence rates of the guaranteed upper a posteriori error bound and the guaranteed lower a posteriori error bound.

Section ‘Numerical experiments’ complements the theoretical findings with numerical experiments to provide empirical evidence of the improved error control. The stabilisation technique competes in four benchmark examples, with and without known exact solution, for uniform and two different mesh-refining algorithms for the explicit residual-based error estimator of [7] and with an averaging-type error estimator of ([18], (1.11)).

Standard notation on Lebesgue and Sobolev spaces is employed throughout this paper and a ≲ b abbreviates a ≤ C b with some generic constant 0 < C < ∞ independent of crucial parameters (like the mesh-size on level ℓ); a ≈ b means a ≲ b ≲ a. Furthermore, A:B abbreviates the matrix inner product that corresponds to the Frobenius norm.

Methods: Discretisation and Stabilisation

Based on the convergence results for unstructured grids, this paper will develop reliable error estimators for a class of stabilised convex minimisation problems described in the sequel. Let $Ω \subset R^{n}$ be a bounded Lipschitz domain with polygonal boundary for n = 2 or 3. Given a continuous convex energy density $W : R^{m \times n} \to R$ , $g, f \in L^{2} (Ω; R^{m})$ , $β \geq 0$ , and $v \in W^{1, p} (Ω; R^{m})$ with 2 ≤ p < ∞ and m = 1, …, n, the energy is given by (1.1).

Throughout this paper, the energy density $W \in C^{1} (R^{m \times n}; R)$ satisfies (2.1)–(2.2) for parameters 1 < r ≤ 2, 0 ≤ s < ∞ and s + r + p ≤ r p. The two-sided growth condition reads

| F |^{p} - 1 ≲ W (F) ≲ | F |^{p} + 1 for all F \in R^{m \times n} .

(2.1)

The convexity control assumption reads, for all $F_{1}, F_{2} \in R^{m \times n}$ ,

| D W (F_{1}) - D W (F_{2}) |^{r} ≲ (1 + | F_{1} |^{s} + | F_{2} |^{s}) (D W (F_{1}) - D W (F_{2})) : (F_{1} - F_{2}) .

(2.2)

The proof of Theorem 2 in [7] shows that (2.2) is crucial for the uniqueness of the stress tensor DW(Du).

Given Dirichlet data $u_{D} \in W^{2, p} (Ω; R^{m}) \cap H^{2} (∂Ω; R^{m})$ for the set of admissible functions $A : = u_{D} + V : = u_{D} + W_{0}^{1, p} (Ω; R^{m})$ , the continuous (convex) model problem reads

\begin{align} minimise E (v) within v \in A . \end{align}

(2.3)

A finite element approximation of (2.3) is based on a family of regular triangulations ${(T_{ℓ})}_{ℓ \in N_{0}}$ of the domain Ω into simplices in the sense of Ciarlet [13] (e.g., for n = 2, two non-disjoint triangles of $T_{ℓ}$ share either a common edge or a common node). The set of sides $F_{ℓ}$ consists of edges (for n = 2) or faces (for n = 3) of $T_{ℓ}$ and is split into the union of the sets of all interior sides $F_{ℓ} (Ω)$ and of all boundary sides $F_{ℓ} (∂Ω)$ .

For later reference, define the diameter h_T := diam T of a triangle (or tetrahedron) $T \in T_{ℓ}$ and the size h_F := diam F of a side $F \in F_{ℓ}$ . The mesh size function $h_{ℓ} : Ω \to R_{> 0}$ is given by

h_{ℓ} (x) : = \begin{cases} h_{T} & for x \in int T with T \in T_{ℓ}, \\ min \{h_{F} : F \in F_{ℓ} and x \in F\} & otherwise. \end{cases}

The global mesh size will be abbreviated by $H_{ℓ} : = ∥ h_{ℓ} ∥_{L^{\infty} (Ω)}$ . We presume the family ${(T_{ℓ})}_{ℓ \in N_{0}}$ to be shape-regular so that h_F ≈ h_T for all $T \in T_{ℓ}$ , $F \in F_{ℓ}$ and F ⊂ T.

The space of $T_{ℓ}$ -piecewise polynomials of degree $\leq k \in N_{0}$ is $P_{k} (T_{ℓ})$ . The nodal interpolation $I_{ℓ} w \in P_{1} (T_{ℓ}) \cap C (\bar{Ω})$ of $w \in C (\bar{Ω})$ is given by I_ℓw(z) = w(z) for all nodes z. Let furthermore Π_ℓw be the L² projection of w ∈ L²(Ω) onto $P_{0} (T_{ℓ})$ , and ${osc}_{ℓ, q} (w) : = ∥ h_{ℓ} (id - Π_{ℓ}) w ∥_{L^{q} (Ω)}$ be the oscillation of w ∈ L^q(Ω) for 2 ≤ q ≤ ∞ with respect to the triangulation $T_{ℓ}$ . The symbol id denotes the identity operator. Let u_D,ℓ = I_ℓu_D, and

A_{ℓ} : = u_{D, ℓ} + V_{ℓ} with V_{ℓ} : = V \cap P_{1} (T_{ℓ}; R^{m}) \cap C (\bar{Ω}) .
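To make the definition of the oscillation concrete, the following Python sketch evaluates osc_{ℓ,2}(w) in a one-dimensional analogue, where the triangulation is a partition of an interval and Π_ℓ is the cell-wise mean. The function name oscillation_l2, the one-dimensional setting and the two-point Gauss quadrature are assumptions made for illustration only; they are not taken from the implementation used in this paper.

```python
import math

# Two-point Gauss nodes on the reference interval [-1, 1] (both weights equal 1).
GAUSS_NODES = (-1.0 / math.sqrt(3.0), 1.0 / math.sqrt(3.0))


def oscillation_l2(w, grid):
    """Approximate || h (id - Pi) w ||_{L2} on a 1D grid.

    w    : callable, the function whose oscillation is measured
    grid : increasing list of cell end points
    Pi w is approximated cell-wise by the quadrature mean of w.
    """
    total = 0.0
    for left, right in zip(grid[:-1], grid[1:]):
        h = right - left
        mid = 0.5 * (left + right)
        values = [w(mid + 0.5 * h * t) for t in GAUSS_NODES]
        mean = 0.5 * sum(values)  # cell average via quadrature
        # Integral of (w - mean)^2 over the cell; each quadrature weight is h/2.
        cell_integral = 0.5 * h * sum((val - mean) ** 2 for val in values)
        total += h * h * cell_integral  # weight with the local mesh size squared
    return math.sqrt(total)
```

For w(x) = x on a uniform partition with mesh size h of the unit interval, the quadrature is exact and the sketch returns h²/√12, so halving h reduces the oscillation by a factor of four.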

Given a function v on Ω which is possibly discontinuous along some side $F \in F_{ℓ} (Ω)$ shared by the two elements T_± such that there exist traces from either side, the jump of v along F reads

[v] (x) = {[v]}_{F} (x) : = lim_{T_{+} ∋ y \to x} v (y) - lim_{T_{-} ∋ y \to x} v (y) for x \in F .

The stabilisation of [12] will be used throughout this paper with -1 < γ < ∞ and

a_{ℓ} (v, w) : = \sum_{F \in F_{ℓ} (Ω)} \frac{H_{ℓ}^{1 + γ}}{h_{F}} \int_{F} {[D v]}_{F} : {[D w]}_{F} d s and |||v|||_{ℓ}^{2} : = a_{ℓ} (v, v) .

(2.4)
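The stabilisation (2.4) is cheap to evaluate for P₁ functions because the gradient is piecewise constant, so the jump [Dv]_F is constant along each interior side. The following Python sketch computes |||v|||_ℓ² for a scalar P₁ function (m = 1) on a triangulation with n = 2. The function names, the data layout (a list of node coordinates and a list of vertex-index triples) and the restriction to m = 1 and n = 2 are assumptions of this illustration, not the implementation of this paper.

```python
import math


def p1_gradient(p0, p1, p2, v0, v1, v2):
    """Gradient of the affine function with values v0, v1, v2 at p0, p1, p2."""
    ax, ay = p1[0] - p0[0], p1[1] - p0[1]
    bx, by = p2[0] - p0[0], p2[1] - p0[1]
    det = ax * by - ay * bx  # twice the signed area of the triangle
    d1, d2 = v1 - v0, v2 - v0
    return ((d1 * by - d2 * ay) / det, (ax * d2 - bx * d1) / det)


def edge_length(nodes, a, b):
    return math.hypot(nodes[a][0] - nodes[b][0], nodes[a][1] - nodes[b][1])


def stabilisation_norm_sq(nodes, tris, v, gamma=1.0):
    """H^(1+gamma) * sum over interior sides F of h_F^(-1) ||[Dv]_F||^2_{L2(F)}."""
    grads = [p1_gradient(nodes[a], nodes[b], nodes[c], v[a], v[b], v[c])
             for a, b, c in tris]
    owners = {}  # side (sorted vertex pair) -> indices of adjacent triangles
    for k, tri in enumerate(tris):
        for i in range(3):
            side = tuple(sorted((tri[i], tri[(i + 1) % 3])))
            owners.setdefault(side, []).append(k)
    # For n = 2 the diameter of a triangle is its longest edge.
    H = max(edge_length(nodes, a, b) for a, b in owners)
    total = 0.0
    for (a, b), ks in owners.items():
        if len(ks) != 2:
            continue  # boundary side: it does not enter the sum in (2.4)
        h_F = edge_length(nodes, a, b)
        jx = grads[ks[0]][0] - grads[ks[1]][0]
        jy = grads[ks[0]][1] - grads[ks[1]][1]
        # Constant jump along F: ||[Dv]_F||^2_{L2(F)} = h_F * |jump|^2.
        total += (h_F * (jx * jx + jy * jy)) / h_F
    return H ** (1.0 + gamma) * total
```

On the unit square split into four triangles by its centre, a globally affine function gives the value 0 and the nodal hat function of the centre gives 32 (here H_ℓ = 1, and each of the four interior sides carries a gradient jump of squared length 8). The vanishing on affine functions illustrates that a_ℓ is only positive semidefinite.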

The stabilised discrete problem reads

minimise E_{ℓ} (v) : = E (v) + \frac{1}{2} a_{ℓ} (v, v) amongst v \in A_{ℓ} .

(2.5)

Convergence of gradients with a guaranteed convergence rate is shown in [12] under unrealistically high regularity assumptions. A comprehensive collection of the results in [12] is summarised in the following theorem.

Theorem 2.1.

([12]) Let $u \in A \cap H^{3 / 2 + ε} (Ω; R^{m})$ be some solution of (2.3) for some ε > 0; let p^′ and r^′ be the Hölder conjugates of p and r, -1 < γ < 3, and set

ζ : = min \{1 + γ, r^{'}\} for β > 0 and ζ : = min \{1 + γ, 2\} for β = 0 .

Then the discrete solution $u_{ℓ} \in A_{ℓ}$ of (2.5) and the continuous and discrete stress $σ : = D W (D u) \in L^{p^{'}} (Ω; R^{m \times n})$ and $σ_{ℓ} : = D W (D u_{ℓ}) \in P_{0} (T_{ℓ}; R^{m \times n})$ satisfy

∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + ∥ u - u_{ℓ} ∥_{L^{2} (Ω)}^{2} + |||u_{ℓ}|||_{ℓ}^{2} + H_{ℓ}^{(1 + γ) / 2} ∥ D (u - u_{ℓ}) ∥_{L^{2} (Ω)}^{2} ≲ H_{ℓ}^{ζ} .

Proof

This combines Lemma 3.5 and 4.1 – 4.2 plus Theorem 3.8 and 4.4 in [12].□
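The exponents in Theorem 2.1 are simple to tabulate. The following Python sketch evaluates ζ and the exponent for the gradient error that follows from dividing the estimate by H_ℓ^{(1+γ)/2}. The function names are hypothetical and the sketch only restates the arithmetic of the theorem; it makes no claim beyond it.

```python
def hoelder_conjugate(r):
    """Return r' with 1/r + 1/r' = 1 for 1 < r < infinity."""
    return r / (r - 1.0)


def zeta(gamma, r, beta):
    """Exponent zeta of Theorem 2.1."""
    if not (-1.0 < gamma < 3.0):
        raise ValueError("Theorem 2.1 assumes -1 < gamma < 3")
    if not (1.0 < r <= 2.0):
        raise ValueError("assumption (2.2) requires 1 < r <= 2")
    if beta < 0.0:
        raise ValueError("beta must be non-negative")
    return min(1.0 + gamma, hoelder_conjugate(r) if beta > 0.0 else 2.0)


def gradient_rate(gamma, r, beta):
    """Exponent s with ||D(u - u_l)||_{L2} <~ H^s implied by Theorem 2.1.

    From H^((1+gamma)/2) ||D(u - u_l)||^2 <~ H^zeta it follows that
    ||D(u - u_l)|| <~ H^((zeta - (1+gamma)/2) / 2).
    """
    return 0.5 * (zeta(gamma, r, beta) - 0.5 * (1.0 + gamma))
```

For r = 2 and β > 0, the gradient exponent equals (1 + γ)/4 for γ ≤ 1 and (3 - γ)/4 for γ ≥ 1, so within this estimate γ = 1 gives the largest value 1/2.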

Global convergence

This section is devoted to the proof of a general convergence result without higher regularity assumptions. Let $u \in A$ and $u_{ℓ} \in A_{ℓ}$ solve the minimisation problem (2.3) and (2.5) and set σ := DW(Du) and σ_ℓ := DW(Du_ℓ). For the unstabilised approximation, the a priori error estimates of [7] plus a density argument prove convergence of

∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + β ∥ u - u_{ℓ} ∥_{L^{2} (Ω)}^{2} \to 0 as H_{ℓ} \to 0 .

Note that β = 0 is permitted. Then, however, uniqueness of u and convergence of $∥ u - u_{ℓ} ∥_{L^{2} (Ω)}^{2}$ are not guaranteed. The point of the following result is that the stabilised approximation converges as well, and that |||u_ℓ|||_ℓ → 0, even for non-smooth or non-unique minimisers. Under special circumstances, uniqueness of u and the convergence $∥ u - u_{ℓ} ∥_{L^{2} (Ω)} \to 0$ can be shown even for β = 0, e.g., in Example 3.3.

Theorem 3.1.

( Global Convergence) Provided ${lim}_{ℓ \to \infty} H_{ℓ} = 0$ it holds

∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + β ∥ u - u_{ℓ} ∥_{L^{2} (Ω)}^{2} + |||u_{ℓ}|||_{ℓ}^{2} \to 0 as ℓ \to ∞.

The proof is based on the following lemma.

Lemma 3.2.

The errors δ_ℓ := σ - σ_ℓ and e_ℓ := u - u_ℓ satisfy, for all v_ℓ ∈ V_ℓ, that

∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + β ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} ≲ | e_{ℓ} - v_{ℓ} |_{W^{1, p} (Ω)}^{r^{'}} + β ∥ e_{ℓ} - v_{ℓ} ∥_{L^{2} (Ω)}^{2} + a_{ℓ} (u_{ℓ}, v_{ℓ}) .

Proof

The minimisation problems (2.3) and (2.5) are equivalent to their respective Euler-Lagrange equations, namely for v ∈ V and v_ℓ ∈ V_ℓ,

\int_{Ω} (σ (x) : D v (x) + 2 β (u (x) - g (x)) \cdot v (x) - f (x) \cdot v (x)) d x = 0;

(3.1)

\int_{Ω} (σ_{ℓ} (x) : D v_{ℓ} (x) + 2 β (u_{ℓ} (x) - g (x)) \cdot v_{ℓ} (x) - f (x) \cdot v_{ℓ} (x)) d x + a_{ℓ} (u_{ℓ}, v_{ℓ}) = 0 .

(3.2)

Algebraic transformations of the difference of these two equations lead to

\int_{Ω} δ_{ℓ} : D e_{ℓ} d x + 2 β ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} = \int_{Ω} (δ_{ℓ} : D (e_{ℓ} - v_{ℓ}) + 2 β e_{ℓ} \cdot (e_{ℓ} - v_{ℓ})) d x + a_{ℓ} (u_{ℓ}, v_{ℓ}) .
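The algebraic transformations behind this identity can be spelled out as follows. This is a sketch of the omitted computation, written in standard LaTeX, which uses only (3.1), (3.2), the definitions of δ_ℓ and e_ℓ, and the fact that v_ℓ ∈ V_ℓ ⊂ V is an admissible test function in (3.1).

```latex
% Step 1: subtract (3.2) from (3.1), both tested with v = v_\ell \in V_\ell \subset V.
% Step 2: split e_\ell = (e_\ell - v_\ell) + v_\ell and insert Step 1 for the last integral.
\begin{align*}
\int_\Omega \bigl(\delta_\ell : D v_\ell + 2\beta\, e_\ell \cdot v_\ell\bigr)\,dx
  &= a_\ell(u_\ell, v_\ell), \\
\int_\Omega \delta_\ell : D e_\ell \,dx + 2\beta\, \|e_\ell\|_{L^2(\Omega)}^2
  &= \int_\Omega \bigl(\delta_\ell : D(e_\ell - v_\ell)
     + 2\beta\, e_\ell \cdot (e_\ell - v_\ell)\bigr)\,dx \\
  &\quad + \int_\Omega \bigl(\delta_\ell : D v_\ell
     + 2\beta\, e_\ell \cdot v_\ell\bigr)\,dx .
\end{align*}
```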

It is shown in ([12], Lemma 3.5) that

∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} ≲ \int_{Ω} δ_{ℓ} : D e_{ℓ} d x .

(3.3)

Two Hölder inequalities on the right-hand side and absorptions of $∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}$ and $∥ e_{ℓ} ∥_{L^{2} (Ω)}$ eventually conclude the proof. Further details are dropped for brevity.□

Proof of Theorem 3.1

Given any positive ε, the density of smooth functions in $W_{0}^{1, p} (Ω; R^{m})$ leads to some $v_{ε} \in D (Ω; R^{m})$ such that $∥ u - u_{D} - v_{ε} ∥_{W^{1, p} (Ω)} ≲ ε$ . Hence v_ℓ := I_ℓ(v_ε + u_D) - u_ℓ ∈ V_ℓ satisfies

e_{ℓ} - v_{ℓ} = (u - u_{D} - v_{ε}) + (id - I_{ℓ}) (v_{ε} + u_{D}) .

Note that the nodal interpolation I_ℓ(v_ε + u_D) is well-defined since v_ε and u_D are assumed to be smooth. With ([12], Lemma 3.1 – 3.2) it follows that

\begin{align} ∥ (id - I_{ℓ}) (v_{ε} + u_{D}) ∥_{W^{1, p} (Ω)} & ≲ H_{ℓ} \to 0 and \\ |||I_{ℓ} (v_{ε} + u_{D})|||_{ℓ}^{2} = |||(id - I_{ℓ}) (v_{ε} + u_{D})|||_{ℓ}^{2} & ≲ H_{ℓ}^{1 + γ} \to 0 as ℓ \to ∞. \end{align}

Since $∥ \cdot ∥_{L^{2} (Ω)} ≲ ∥ \cdot ∥_{W^{1, p} (Ω)}$ , this yields some $ℓ_{0} \in N$ such that

| e_{ℓ} - v_{ℓ} |_{W^{1, p} (Ω)}^{r^{'}} + β ∥ e_{ℓ} - v_{ℓ} ∥_{L^{2} (Ω)}^{2} + |||I_{ℓ} (v_{ε} + u_{D})|||_{ℓ}^{2} ≲ ε for all ℓ \geq ℓ_{0} .

A Cauchy inequality applied to the stabilisation norm proves

a_{ℓ} (u_{ℓ}, v_{ℓ}) = - |||u_{ℓ}|||_{ℓ}^{2} + a_{ℓ} (u_{ℓ}, I_{ℓ} (v_{ε} + u_{D})) \leq - \frac{1}{2} |||u_{ℓ}|||_{ℓ}^{2} + \frac{1}{2} |||I_{ℓ} (v_{ε} + u_{D})|||_{ℓ}^{2} .

Substitute a_ℓ(u_ℓ, v_ℓ) in Lemma 3.2 and add $\frac{1}{2} |||u_{ℓ}|||_{ℓ}^{2}$ on both sides. This leads to

∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + β ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} + |||u_{ℓ}|||_{ℓ}^{2} ≲ ε for all ℓ \geq ℓ_{0} .

□

Example 3.3.

The two-well example from the computational benchmark [18] allows an estimate on $∥ e_{ℓ} ∥_{L^{2} (Ω)}$ even for β = 0. Let n = 2, let $F_{1} : = - F_{2} : = (3, 2) / \sqrt{13}$ , and let the energy density W be the convex hull of F ↦ |F - F₁|²|F - F₂|². That is

W (F) = {(max \{0, | F |^{2} - 1\})}^{2} + 4 (| F |^{2} - {(3 F (1) + 2 F (2))}^{2} / 13) .

(3.4)
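The degeneracy of (3.4) can be checked numerically. The following Python sketch implements W and its derivative DW for F ∈ R² (that is, m = 1 and n = 2) and is meant only as an illustration; the names A, W and DW in the code are choices made here, and the formula for DW is obtained by differentiating (3.4) term by term.

```python
import math

# Unit vector F_1 = -F_2 = (3, 2) / sqrt(13) from Example 3.3.
A = (3.0 / math.sqrt(13.0), 2.0 / math.sqrt(13.0))


def W(F):
    """Convexified two-well energy density (3.4) for F in R^2."""
    norm_sq = F[0] ** 2 + F[1] ** 2
    along = A[0] * F[0] + A[1] * F[1]  # equals (3 F(1) + 2 F(2)) / sqrt(13)
    return max(0.0, norm_sq - 1.0) ** 2 + 4.0 * (norm_sq - along ** 2)


def DW(F):
    """Derivative of W, obtained by differentiating (3.4) term by term."""
    norm_sq = F[0] ** 2 + F[1] ** 2
    along = A[0] * F[0] + A[1] * F[1]
    c = 4.0 * max(0.0, norm_sq - 1.0)  # from the squared max-term
    return (c * F[0] + 8.0 * (F[0] - along * A[0]),
            c * F[1] + 8.0 * (F[1] - along * A[1]))


def monotonicity_gap(F, G):
    """(DW(F) - DW(G)) : (F - G), which is non-negative for convex W."""
    dF, dG = DW(F), DW(G)
    return (dF[0] - dG[0]) * (F[0] - G[0]) + (dF[1] - dG[1]) * (F[1] - G[1])
```

Both W and DW vanish (up to rounding) on the whole segment between F₂ and F₁, for instance at F = t F₁ with |t| ≤ 1. The energy density is therefore convex but not strictly convex there, which is the source of the singular Hessian discussed in the Background section.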

Then ([11], Lemma 9.1) proves, for all v_ℓ ∈ V_ℓ, that

∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} ≲ \int_{Ω} δ_{ℓ} : D e_{ℓ} d x + ∥ e_{ℓ} - v_{ℓ} ∥_{H^{1} (Ω)}^{2} .

Therefore, the arguments of Lemma 3.2 lead to

∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} ≲ | e_{ℓ} - v_{ℓ} |_{W^{1, p} (Ω)}^{r^{'}} + ∥ e_{ℓ} - v_{ℓ} ∥_{H^{1} (Ω)}^{2} + a_{ℓ} (u_{ℓ}, v_{ℓ}) .

This result can be used in the proof of Theorem 3.1 in order to obtain

∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + ∥ u - u_{ℓ} ∥_{L^{2} (Ω)}^{2} + |||u_{ℓ}|||_{ℓ}^{2} \to 0 as ℓ \to ∞.

A posteriori error estimates

Beyond the a posteriori error analysis of [7], the stabilisation term in the discretisation of this paper causes an additional difficulty in that the Galerkin orthogonality does not hold for the natural residual. Inspired by novel developments in the a posteriori error control of elliptic PDEs motivated by inexact solve [14–17], this section presents some guaranteed upper error bound for the discretisation at hand for any approximation u_ℓ which does not necessarily satisfy (3.2) exactly. Thereby inexact solve is included.

Let $u \in A$ solve (2.3) and let $u_{ℓ} \in A_{ℓ}$ be arbitrary. It is not assumed that u_ℓ solves the discrete problem (2.5); the following theorem holds regardless of this. Recall the definitions of osc_ℓ,q(·) and Π_ℓ from Section ‘Methods: Discretisation and Stabilisation’ and given σ := DW(Du) and σ_ℓ := DW(Du_ℓ), abbreviate

Λ_{ℓ} : = - 2 β (u_{ℓ} - g) + f, e_{ℓ} : = u - u_{ℓ} and δ_{ℓ} : = σ - σ_{ℓ} .

Theorem 4.1.

Given any $w_{ℓ} \in W^{1, p} (Ω; R^{m})$ with w_ℓ = u - u_ℓ on the boundary ∂Ω, and given any $τ \in H (div, Ω; R^{m \times n})$ , it holds, for all 2 ≤ q ≤ p and for some constant ϰ known from ([12], Lemma 3.5), that

\begin{align} ϰ / 2 ∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + β ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} \leq & {(r ϰ / 2)}^{1 - r^{'}} / r^{'} | w_{ℓ} |_{W^{1, p} (Ω)}^{r^{'}} + β ∥ w_{ℓ} ∥_{L^{2} (Ω)}^{2} \\ & + (∥ σ_{ℓ} - τ ∥_{L^{q^{'}} (Ω)} + ∥ Π_{ℓ} Λ_{ℓ} + div τ ∥_{L^{q^{'}} (Ω)} + {osc}_{ℓ, q^{'}} (Λ_{ℓ})) ∥ e_{ℓ} - w_{ℓ} ∥_{W^{1, q} (Ω)} . \end{align}

The constant ϰ depends on problem-specific data such as $∥ u ∥_{W^{1, p} (Ω)}$ and the size of the domain Ω. Refer to the proof of Lemma 3.5 in [12] for details.

Before the proofs conclude this section, a practical choice of τ in Theorem 4.1 is discussed, namely Raviart-Thomas finite element functions in

{RT}_{0} (T_{ℓ}) : = \{τ_{RT} \in P_{1} (T_{ℓ}) \cap H (div, Ω) : \forall T \in T_{ℓ} \exists a, b, c \in R \forall x \in T, τ_{RT} (x) = (a, b) + c x\} .

We suggest the computation (or an accurate approximation) of

μ_{ℓ} : = min_{τ \in {RT}_{0} {(T_{ℓ})}^{m}} (∥ σ_{ℓ} - τ ∥_{L^{q^{'}} (Ω)} + ∥ Π_{ℓ} Λ_{ℓ} + div τ ∥_{L^{q^{'}} (Ω)})

(4.1)

and emphasise that any upper bound is allowed in Theorem 4.1. This leads to

\begin{align} ϰ / 2 ∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + β ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} \leq & {(r ϰ / 2)}^{1 - r^{'}} / r^{'} | w_{ℓ} |_{W^{1, p} (Ω)}^{r^{'}} + β ∥ w_{ℓ} ∥_{L^{2} (Ω)}^{2} \\ & + (μ_{ℓ} + {osc}_{ℓ, q^{'}} (Λ_{ℓ})) ∥ e_{ℓ} - w_{ℓ} ∥_{W^{1, q} (Ω)} . \end{align}

The algorithm of ([20], Prop. 4.1) computes some w_ℓ from (id-I_ℓ)u_D with

\begin{align} ∥ w_{ℓ} ∥_{L^{q} (T)} & \approx h_{T}^{1 / q} ∥ (id - I_{ℓ}) u_{D} ∥_{L^{q} (∂T \cap ∂Ω)} and \\ ∥ D w_{ℓ} ∥_{L^{q} (T)} & ≲ h_{T}^{1 / q - 1} ∥ (id - I_{ℓ}) u_{D} ∥_{L^{q} (∂T \cap ∂Ω)} + h_{T}^{1 / q} ∥ \partial (id - I_{ℓ}) u_{D} / ∂s ∥_{L^{q} (∂T \cap ∂Ω)} . \end{align}

(4.2)

(The proof of the second assertion is analogous to that of ([20], Prop. 4.1) and the first is an immediate consequence of the design of w_ℓ). This and $∥ e_{ℓ} - w_{ℓ} ∥_{W^{1, q} (Ω)} ≲ 1$ for bounded u_ℓ (i.e. solely $∥ u_{ℓ} ∥_{W^{1, p} (Ω)} ≲ 1$ is assumed) lead to the practical estimate μ_ℓ as a computable guaranteed upper bound of the left-hand side of Theorem 4.1. Since the minimisation of (4.1) is computationally intensive for q ≠ 2, Section ‘Numerical experiments’ actually computes an approximation of μ_ℓ, based on q = 2.

The choice τ = σ in Theorem 4.1 shows that the right-hand side is in fact optimal up to oscillations. The reliability-efficiency gap of [18] is visible here in that we have no further estimate on $∥ u_{ℓ} ∥_{W^{1, p} (Ω)}$ [7, 18]. However, additional smoothness assumptions on u may lead to refined estimates on the term $∥ e_{ℓ} - w_{ℓ} ∥_{W^{1, q} (Ω)}$ (cf. Section ‘Refined analysis for an interface model problem’). The following result indicates that μ_ℓ is sharp in the sense that it converges with the correct convergence rate. This theorem employs the Fortin interpolation operator I_F,ℓ defined for $τ \in H (div, Ω) \cap L^{t} (Ω; R^{n})$ with t > 2 by $I_{F, ℓ} τ \in {RT}_{0} (T_{ℓ})$ and

⨏_{F} n_{F} \cdot (id - I_{F, ℓ}) τ d s = 0 for all F \in F_{ℓ} .

Here and in the following, n_F denotes a unit normal vector of the side F; the direction of n_F is arbitrary, but fixed for a given side F. For the improved regularity of stress in the class of degenerate convex minimisation problems at hand, we refer to [3, 21].

Theorem 4.2.

( Efficiency) If the exact stress σ is sufficiently regular such that its Fortin interpolant $τ_{ℓ} = I_{F, ℓ} σ \in {RT}_{0} (T_{ℓ}; R^{m \times n})$ is defined, it holds

\begin{align} ∥ σ_{ℓ} - τ_{ℓ} ∥_{L^{q^{'}} (Ω)} & + ∥ Π_{ℓ} Λ_{ℓ} + div τ_{ℓ} ∥_{L^{q^{'}} (Ω)} \\ & ≲ ∥ δ_{ℓ} ∥_{L^{q^{'}} (Ω)} + 2 β ∥ e_{ℓ} ∥_{L^{q^{'}} (Ω)} + ∥ (id - I_{F, ℓ}) σ ∥_{L^{q^{'}} (Ω)} . \end{align}

It is expected that $∥ (id - I_{F, ℓ}) σ ∥_{L^{q^{'}} (Ω)} ≲ H_{ℓ}$ . This is shown in ([22], Prop. 3.6) for q^′ = 2 and therefore also holds for q^′ ≤ 2. Hence the right-hand side of the assertion of Theorem 4.2 converges with the (expected) optimal convergence rates.

Proof of Theorem 4.1.

Let ϰ be the reciprocal of c₁ in ([12], Lemma 3.5), which is also the multiplicative constant hidden in (3.3). Recall Young’s inequality, which reads $ab \leq a^{r} / r + b^{r^{'}} / r^{'}$ for a, b > 0. This, (3.3) and the continuous Euler-Lagrange equation (3.1) show, for v = e_ℓ - w_ℓ ∈ V, that

\begin{align} ϰ ∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + 2 β ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} \leq & \int_{Ω} (δ_{ℓ} : D v + 2 β e_{ℓ} \cdot v) d x + \int_{Ω} (δ_{ℓ} : D w_{ℓ} + 2 β e_{ℓ} \cdot w_{ℓ}) d x \\ \leq & - \int_{Ω} (σ_{ℓ} : D v - Λ_{ℓ} \cdot v) d x + β ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} + β ∥ w_{ℓ} ∥_{L^{2} (Ω)}^{2} \\ & + ϰ / 2 ∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + {(r ϰ / 2)}^{1 - r^{'}} / r^{'} | w_{ℓ} |_{W^{1, p} (Ω)}^{r^{'}} . \end{align}

Hence ${Res}_{ℓ} (v) : = - \int_{Ω} (σ_{ℓ} : D v - Λ_{ℓ} \cdot v) d x$ satisfies

ϰ / 2 ∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + β ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} \leq {Res}_{ℓ} (v) + {(r ϰ / 2)}^{1 - r^{'}} / r^{'} | w_{ℓ} |_{W^{1, p} (Ω)}^{r^{'}} + β ∥ w_{ℓ} ∥_{L^{2} (Ω)}^{2} .
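The constant in front of the seminorm of w_ℓ stems from a weighted version of Young's inequality. The following LaTeX sketch records this step; the weight ε is an auxiliary symbol introduced here for the derivation only.

```latex
% Weighted Young inequality with exponents r and r' = r/(r-1) and a weight \epsilon > 0:
%   ab = (\epsilon a)(b/\epsilon) \le \epsilon^{r} a^{r}/r + \epsilon^{-r'} b^{r'}/r'.
% The choice \epsilon^{r} = r\varkappa/2 and the identity r'/r = r' - 1 give
%   \epsilon^{-r'} = (r\varkappa/2)^{-r'/r} = (r\varkappa/2)^{1-r'}.
\begin{align*}
\int_\Omega \delta_\ell : D w_\ell \,dx
  &\le \|\delta_\ell\|_{L^{p'}(\Omega)}\, |w_\ell|_{W^{1,p}(\Omega)}
   \le \frac{\varkappa}{2}\, \|\delta_\ell\|_{L^{p'}(\Omega)}^{r}
     + \frac{(r\varkappa/2)^{1-r'}}{r'}\, |w_\ell|_{W^{1,p}(\Omega)}^{r'} , \\
\int_\Omega 2\beta\, e_\ell \cdot w_\ell \,dx
  &\le \beta\, \|e_\ell\|_{L^2(\Omega)}^2 + \beta\, \|w_\ell\|_{L^2(\Omega)}^2 .
\end{align*}
```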

Let $C_{q^{'}}$ denote the Poincaré constant of convex domains with respect to the $W^{1, q^{'}}$ norm. The fundamental theorem of calculus on some one-dimensional arc shows that C_∞ ≤ 1. The paper [23] proves C₁ = 1 / 2. Hence, operator-interpolation arguments [24, 25] prove $C_{q^{'}} \leq {(1 / 2)}^{1 / q^{'}} \leq 1$ . The Poincaré inequality shows, for any 2 ≤ q ≤ p, that

\begin{align} \int_{Ω} (id - Π_{ℓ}) Λ_{ℓ} \cdot v d x & = \int_{Ω} h_{ℓ} (id - Π_{ℓ}) Λ_{ℓ} \cdot \frac{1}{h_{ℓ}} (id - Π_{ℓ}) v d x \\ & \leq ∥ h_{ℓ} (id - Π_{ℓ}) Λ_{ℓ} ∥_{L^{q^{'}} (Ω)} ∥ D v ∥_{L^{q} (Ω)} = {osc}_{ℓ, q^{'}} (Λ_{ℓ}) ∥ D v ∥_{L^{q} (Ω)} . \end{align}

For any $τ \in H (div, Ω; R^{m \times n})$ , the Hölder and Poincaré inequalities show

\begin{align} {Res}_{ℓ} (v) & = - \int_{Ω} ((σ_{ℓ} - τ) : D v - (Π_{ℓ} Λ_{ℓ} + div τ) \cdot v - (id - Π_{ℓ}) Λ_{ℓ} \cdot v) d x \\ & \leq (∥ σ_{ℓ} - τ ∥_{L^{q^{'}} (Ω)} + ∥ Π_{ℓ} Λ_{ℓ} + div τ ∥_{L^{q^{'}} (Ω)} + {osc}_{ℓ, q^{'}} (Λ_{ℓ})) ∥ v ∥_{W^{1, q} (Ω)} . \end{align}

Proof of Theorem 4.2

The triangle inequality yields

∥ σ_{ℓ} - τ_{ℓ} ∥_{L^{q^{'}} (Ω)} \leq ∥ (id - I_{F, ℓ}) σ ∥_{L^{q^{'}} (Ω)} + ∥ δ_{ℓ} ∥_{L^{q^{'}} (Ω)} .

Since f = 2β(u - g) - div σ, the commutative property div I_F,ℓ = Π_ℓ div (cf. ([22], p. 129)) yields

∥ Π_{ℓ} Λ_{ℓ} + div τ_{ℓ} ∥_{L^{q^{'}} (Ω)} = 2 β ∥ Π_{ℓ} e_{ℓ} ∥_{L^{q^{'}} (Ω)} \leq 2 β ∥ e_{ℓ} ∥_{L^{q^{'}} (Ω)} .

Refined analysis for an interface model problem

This section is devoted to a model scenario from phase transition problems [18] with some solution u that is smooth outside some one-dimensional interface Γ. Suppose some (possibly non-unique) minimiser u of the continuous problem (2.3) satisfies $u \in W^{1, \infty} (Ω; R^{m}) \cap W^{2, p} (Ω ∖ Γ; R^{m})$ for some finite union Γ of (n - 1) dimensional Lipschitz surfaces in $\bar{Ω}$ . Since Ω has a Lipschitz boundary, this implies Lipschitz continuity of u on Ω. We refer to [19] for sufficient conditions for $u \in W^{1, \infty} (Ω; R^{m})$ and conclude that the remaining assumption $u \in W^{2, p} (Ω ∖ Γ; R^{m})$ is the essential hypothesis expected in many interface problems. Let $u_{ℓ} \in A_{ℓ}$ be the (unique) minimiser of the discrete stabilised problem (2.5). In the following, also Γ = ∅ is permitted to extend previous results [12] for highly regular minimisers.

The following theorem leads to a priori convergence rates for the interface model problem. Thereby it recovers the results of [12] for problems with piecewise smooth exact solution.

We will abbreviate the set of all triangles that are touched by Γ as $T_{ℓ} (Γ) : = \{T \in T_{ℓ} : dist (T, Γ) = 0\}$ , its cardinality as $|T_{ℓ} (Γ)|$ , its union as $Ω_{Γ, ℓ} : = int (⋃ T_{ℓ} (Γ))$ with volume |Ω_Γ,ℓ| and its complement as $Ω_{Γ, ℓ}^{C} : = Ω ∖ \bar{Ω_{Γ, ℓ}}$ .

Theorem 5.1.

Provided β > 0, it holds

\begin{align} ∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} + |||u_{ℓ}|||_{ℓ}^{2} ≲ & H_{ℓ}^{1 + γ} | u |_{H^{2} (Ω ∖ Γ)}^{2} + H_{ℓ}^{2} | u |_{W^{1, \infty} (Ω)}^{2} + H_{ℓ}^{r / (r - 1)} | u |_{W^{2, p} (Ω_{Γ, ℓ}^{C})}^{r / (r - 1)} \\ & + H_{ℓ}^{γ + n - 1} | u |_{W^{1, \infty} (Ω)}^{2} | T_{ℓ} (Γ) | + | u |_{W^{1, \infty} (Ω)}^{r / (r - 1)} | Ω_{Γ, ℓ} |^{r / ((r - 1) p)} . \end{align}

Remark 5.2.

In the case of uniform mesh refinements we may expect $| T_{ℓ} (Γ) | \approx H_{ℓ}^{1 - n}$ and |Ω_Γ,ℓ| ≈ H_ℓ and Theorem 5.1 simplifies to

∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} + |||u_{ℓ}|||_{ℓ}^{2} ≲ H_{ℓ}^{min \{γ, 2\}} | u |_{W^{1, \infty} (Ω)}^{2} + H_{ℓ}^{r / ((r - 1) p)} | u |_{W^{1, \infty} (Ω)}^{r / (r - 1)} .
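The simplification in Remark 5.2 amounts to the following bookkeeping of powers of H_ℓ. The LaTeX sketch below assumes H_ℓ ≤ 1 and treats all norms of u as constants hidden in the notation ≲; both are assumptions made for this illustration.

```latex
% Uniform refinement: |T_\ell(\Gamma)| \approx H_\ell^{1-n} and |\Omega_{\Gamma,\ell}| \approx H_\ell,
% with H_\ell \le 1 and norms of u treated as constants.
\begin{align*}
H_\ell^{\gamma+n-1}\, |T_\ell(\Gamma)|
  &\approx H_\ell^{\gamma+n-1}\, H_\ell^{1-n} = H_\ell^{\gamma}, \\
|\Omega_{\Gamma,\ell}|^{r/((r-1)p)}
  &\approx H_\ell^{r/((r-1)p)}, \\
H_\ell^{1+\gamma} + H_\ell^{2} + H_\ell^{\gamma}
  &\lesssim H_\ell^{\min\{\gamma, 2\}}, \\
H_\ell^{r/(r-1)}
  &\le H_\ell^{r/((r-1)p)} \quad \text{since } p \ge 2 .
\end{align*}
```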

Proof.

With w_ℓ = (id - I_ℓ)e_ℓ = (id - I_ℓ)u, a Young inequality, (3.3) and ([12], Theorem 3.8) yield

∥ δ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{r} + ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{2} + |||u_{ℓ}|||_{ℓ}^{2} ≲ | w_{ℓ} |_{W^{1, p} (Ω)}^{r / (r - 1)} + ∥ w_{ℓ} ∥_{L^{2} (Ω)}^{2} + |||I_{ℓ} u|||_{ℓ}^{2} .

Theorem 4.4.4 in [25] shows $∥ w_{ℓ} ∥_{L^{2} (Ω)} ≲ ∥ w_{ℓ} ∥_{L^{\infty} (Ω)} ≲ H_{ℓ} | u |_{W^{1, \infty} (Ω)}$ and

\begin{align} | w_{ℓ} |_{W^{1, p} (Ω)}^{p} & = | w_{ℓ} |_{W^{1, p} (Ω_{Γ, ℓ})}^{p} + | w_{ℓ} |_{W^{1, p} (Ω_{Γ, ℓ}^{C})}^{p} \\ & ≲ | u |_{W^{1, \infty} (Ω_{Γ, ℓ})}^{p} | Ω_{Γ, ℓ} | + H_{ℓ}^{p} | u |_{W^{2, p} (Ω_{Γ, ℓ}^{C})}^{p} . \end{align}

Let $ω_{F} = ⋃ \{T \in T_{ℓ} : F \subset T\}$ be the patch of a side $F \in F_{ℓ}$ , and set $F_{ℓ} (Γ) = \{F \in F_{ℓ} (Ω) : ω_{F} \cap Γ \neq \emptyset\}$ and $F_{ℓ}^{C} (Γ) = F_{ℓ} (Ω) ∖ F_{ℓ} (Γ)$ . Note that [Du]_F = 0 for $F \in F_{ℓ}^{C} (Γ)$ . Then

|||I_{ℓ} u|||_{ℓ}^{2} = H_{ℓ}^{1 + γ} (\sum_{F \in F_{ℓ}^{C} (Γ)} h_{F}^{- 1} ∥ {[D w_{ℓ}]}_{F} ∥_{L^{2} (F)}^{2} + \sum_{F \in F_{ℓ} (Γ)} h_{F}^{- 1} ∥ {[D I_{ℓ} u]}_{F} ∥_{L^{2} (F)}^{2}) .

The first sum can be estimated as in the proof of ([12], Lemma 3.2), the second sum with

∥ {[D I_{ℓ} u]}_{F} ∥_{L^{2} (F)}^{2} ≲ h_{F}^{n - 1} | I_{ℓ} u |_{W^{1, \infty} (F)}^{2} ≲ h_{F}^{n - 1} | u |_{W^{1, \infty} (F)}^{2} .

The observation $| F_{ℓ} (Γ) | \leq (n + 1) | T_{ℓ} (Γ) |$ concludes the proof.□

Together with Theorem 5.1, the subsequent result implies strong convergence of the gradients in the model interface problem as H_ℓ → 0.

Theorem 5.3.

Under the aforementioned conditions on the (possibly non-unique) exact minimiser $u \in W^{1, \infty} (Ω; R^{m}) \cap W^{2, p} (Ω ∖ Γ; R^{m})$ , the error e_ℓ = u - u_ℓ of the discrete solution $u_{ℓ} \in A_{ℓ}$ of (2.5) satisfies

\begin{align} ∥ D e_{ℓ} ∥_{L^{2} (Ω)} ≲ & ∥ e_{ℓ} ∥_{L^{2} (Ω)}^{1 / 3} + H_{ℓ}^{5 / 6} ∥ \partial^{2} u_{D} / \partial s^{2} ∥_{L^{2} (∂Ω)}^{1 / 3} + H_{ℓ}^{(1 - γ) / 2} |||u_{ℓ}|||_{ℓ} \\ & + H_{ℓ}^{- (1 + γ) / 4} |||u_{ℓ}|||_{ℓ}^{1 / 2} (∥ e_{ℓ} ∥_{L^{2} (Ω)}^{1 / 2} + H_{ℓ}^{5 / 4} ∥ \partial^{2} u_{D} / \partial s^{2} ∥_{L^{2} (∂Ω)}^{1 / 2}) . \end{align}

Proof

The basic idea of gradient control is the generalisation of the interpolation estimate H¹(Ω) = [L²(Ω), H²(Ω)]_1/2 for a reduced domain Ω∖Γ; refer to [24, 25] for a detailed analysis of interpolation spaces. Let w_ℓ be the boundary value interpolation of (id - I_ℓ)u_D as described in ([20], Prop. 4.1), such that w_ℓ satisfies the inequalities in (4.2). A piecewise integration by parts shows, for $v : = e_{ℓ} - w_{ℓ} \in W_{0}^{1, p} (Ω; R^{m})$ , that

\begin{align} ∥ D e_{ℓ} ∥_{L^{2} (Ω)}^{2} = & \int_{Ω} D (u - u_{ℓ}) : D v d x + \int_{Ω} D e_{ℓ} : D w_{ℓ} d x \\ \leq & \int_{Γ} v \cdot {[D u]}_{Γ} n_{Γ} d s - \int_{Ω ∖ Γ} v \cdot Δ u d x - \sum_{F \in F_{ℓ} (Ω)} \int_{F} v \cdot {[D u_{ℓ}]}_{F} n_{F} d s \\ & + ∥ D e_{ℓ} ∥_{L^{2} (Ω)} ∥ D w_{ℓ} ∥_{L^{2} (Ω)}, \end{align}

where n_Γ is a unit normal vector of the interface Γ. The Lipschitz continuity of u implies |[Du]_Γn_Γ| ≲ 1. This and the trace inequality on Γ lead to

\int_{Γ} v \cdot {[D u]}_{Γ} n_{Γ} d s ≲ ∥ v ∥_{L^{2} (Γ)} ≲ ∥ v ∥_{L^{2} (Ω)} + ∥ v ∥_{L^{2} (Ω)}^{1 / 2} ∥ D v ∥_{L^{2} (Ω)}^{1 / 2} .

The case $Γ = \emptyset$ is contained in ([12], Theorem 4.4). The piecewise Laplacian of u is bounded in L²(Ω) and so (with the generic constant $C : = ∥ Δ u ∥_{L^{2} (Ω ∖ Γ)}$ hidden in the notation C ≈ 1)

\int_{Ω ∖ Γ} v \cdot Δ u d x ≲ ∥ v ∥_{L^{2} (Ω)} .

The elementwise trace inequality ([25], Theorem 1.6.6, p. 39) for an n-dimensional simplex T and one of its sides F, and $f \in W^{1, q} (T; R^{m})$ , 1 ≤ q < ∞, reads

∥ f ∥_{L^{q} (F)}^{q} ≲ h_{T}^{- 1} ∥ f ∥_{L^{q} (T)}^{q} + ∥ f ∥_{L^{q} (T)}^{q - 1} ∥ D f ∥_{L^{q} (T)} ≲ h_{T}^{- 1} ∥ f ∥_{L^{q} (T)}^{q} + h_{T}^{q - 1} ∥ D f ∥_{L^{q} (T)}^{q} .
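The second inequality in this trace estimate is an application of Young's inequality with a mesh-dependent weight. The following LaTeX sketch records the step for 1 < q < ∞ with the Hölder conjugate q' = q/(q - 1); for q = 1 the two expressions coincide and nothing is to show.

```latex
% Young's inequality ab \le a^{q'}/q' + b^{q}/q, using (q-1)q' = q and q/q' = q-1:
\begin{align*}
\|f\|_{L^q(T)}^{q-1}\, \|Df\|_{L^q(T)}
  &= \bigl(h_T^{-1/q'}\, \|f\|_{L^q(T)}^{q-1}\bigr)\,
     \bigl(h_T^{1/q'}\, \|Df\|_{L^q(T)}\bigr) \\
  &\le \frac{1}{q'}\, h_T^{-1}\, \|f\|_{L^q(T)}^{q}
     + \frac{1}{q}\, h_T^{q-1}\, \|Df\|_{L^q(T)}^{q} .
\end{align*}
```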

The term $\int_{F} v \cdot {[D u_{ℓ}]}_{F} n_{F} d s$ and the stabilisation |||u_ℓ|||_ℓ are already analysed in the Estimate on C in the proof of ([12], Theorem 4.4). This results in

\sum_{F \in F_{ℓ}(Ω)} \int_{F} v \cdot [D u_{ℓ}]_{F} n_{F} \, ds ≲ ∥| u_{ℓ} ∥|_{ℓ} \big( H_{ℓ}^{(1-γ)/2} ∥ D v ∥_{L^{2}(Ω)} + H_{ℓ}^{-(1+γ)/2} ∥ v ∥_{L^{2}(Ω)} \big) .

The preceding estimates plus the absorption of $∥ D e_{ℓ} ∥_{L^{2} (Ω)}$ lead to

\begin{align} ∥ D e_{ℓ} ∥_{L^{2}(Ω)}^{2} ≲ {} & ∥ v ∥_{L^{2}(Ω)} + ∥ v ∥_{L^{2}(Ω)}^{1/2} ∥ D v ∥_{L^{2}(Ω)}^{1/2} + ∥ D w_{ℓ} ∥_{L^{2}(Ω)}^{2} \\ & + ∥| u_{ℓ} ∥|_{ℓ} \big( H_{ℓ}^{(1-γ)/2} ∥ D v ∥_{L^{2}(Ω)} + H_{ℓ}^{-(1+γ)/2} ∥ v ∥_{L^{2}(Ω)} \big) . \end{align}

The triangle inequality applied to v = e_ℓ - w_ℓ and some careful elementary analysis to absorb $∥ D e_{ℓ} ∥_{L^{2} (Ω)}^{1 / 2}$ eventually lead to

\begin{align} ∥ D e_{ℓ} ∥_{L^{2}(Ω)} ≲ {} & ∥ e_{ℓ} ∥_{L^{2}(Ω)}^{1/3} + ∥ w_{ℓ} ∥_{L^{2}(Ω)}^{1/3} + | w_{ℓ} |_{H^{1}(Ω)} + H_{ℓ}^{(1-γ)/2} ∥| u_{ℓ} ∥|_{ℓ} \\ & + H_{ℓ}^{-(1+γ)/4} ∥| u_{ℓ} ∥|_{ℓ}^{1/2} \big( ∥ e_{ℓ} ∥_{L^{2}(Ω)} + ∥ w_{ℓ} ∥_{L^{2}(Ω)} \big)^{1/2} . \end{align}

The inequalities (4.2), Poincaré and Friedrichs inequalities on sides $F \in F_{ℓ} (∂Ω)$ and removal of higher-order terms in H_ℓ conclude the proof.□

The following theorem is an improved a posteriori estimate based on Theorems 4.1 and 5.3.

Theorem 5.4.

Recall $u \in W^{1, \infty} (Ω; R^{m}) \cap W^{2, p} (Ω ∖ Γ; R^{m})$, the definitions e_ℓ := u - u_ℓ and δ_ℓ := σ - σ_ℓ for σ := DW(Du) and σ_ℓ := DW(Du_ℓ), and the definition of Λ_ℓ from Section ‘A posteriori error estimates’. Set

M(τ) := ∥ σ_{ℓ} - τ ∥_{L^{2}(Ω)} + ∥ Π_{ℓ} Λ_{ℓ} + div τ ∥_{L^{2}(Ω)} + osc_{ℓ, 2}(Λ_{ℓ}) \quad \text{for all } τ \in H(div, Ω; R^{m \times n}) .

Provided β > 0, it holds that

\begin{align} ∥ δ_{ℓ} ∥_{L^{p'}(Ω)}^{r} + ∥ e_{ℓ} ∥_{L^{2}(Ω)}^{2} ≲ {} & M(τ)^{6/5} + H_{ℓ}^{-(1+γ)/3} M(τ)^{4/3} ∥| u_{ℓ} ∥|_{ℓ}^{2/3} \\ & + M(τ) \big( H_{ℓ}^{(1-γ)/2} ∥| u_{ℓ} ∥|_{ℓ} + H_{ℓ}^{1-γ/4} ∥| u_{ℓ} ∥|_{ℓ}^{1/2} \big) + H_{ℓ}^{\min\{5, r'(1+1/p)\}} \end{align}

and

\begin{align} ∥ D e_{ℓ} ∥_{L^{2}(Ω)}^{2} ≲ {} & M(τ)^{2/5} + H_{ℓ}^{-(1+γ)/9} M(τ)^{4/9} ∥| u_{ℓ} ∥|_{ℓ}^{2/9} + H_{ℓ}^{\min\{5/3, r'(1+1/p)/3\}} \\ & + M(τ)^{1/3} \big( H_{ℓ}^{(1-γ)/2} ∥| u_{ℓ} ∥|_{ℓ} + H_{ℓ}^{1-γ/4} ∥| u_{ℓ} ∥|_{ℓ}^{1/2} \big)^{1/3} + H_{ℓ}^{1-γ} ∥| u_{ℓ} ∥|_{ℓ}^{2} \\ & + H_{ℓ}^{-(1+γ)/2} ∥| u_{ℓ} ∥|_{ℓ} \big( M(τ)^{6/5} + H_{ℓ}^{-(1+γ)/3} M(τ)^{4/3} ∥| u_{ℓ} ∥|_{ℓ}^{2/3} + H_{ℓ}^{\min\{5, r'(1+1/p)\}} \big)^{1/2} \\ & + H_{ℓ}^{-(1+γ)/2} ∥| u_{ℓ} ∥|_{ℓ} M(τ)^{1/2} \big( H_{ℓ}^{(1-γ)/2} ∥| u_{ℓ} ∥|_{ℓ} + H_{ℓ}^{1-γ/4} ∥| u_{ℓ} ∥|_{ℓ}^{1/2} \big)^{1/2} . \end{align}

The generic constants in Theorem 5.4 depend on problem-specific data such as the shapes of Ω and Γ as well as the generic constant ϰ of Theorem 4.1.

Theorem 5.5.

Theorem 5.4 holds verbatim in Example 3.3 and in the modified two-well problem of Subsection ‘Modified two-well benchmark’, where β = 0.

Remark 5.6.

The assertion of Theorem 5.4 holds for any discrete u_ℓ ∈ u_D,ℓ + V_ℓ, which may merely approximate the unique exact discrete solution of (2.5). This allows for an inexact SOLVE via an iterative procedure.

Proof of Theorem 5.4

Choose w_ℓ as in the proof of Theorem 5.3. Then Theorem 4.1 with q = 2 and (4.2) imply

\begin{align} ∥ δ_{ℓ} ∥_{L^{p'}(Ω)}^{r} + ∥ e_{ℓ} ∥_{L^{2}(Ω)}^{2} & ≲ M(τ) ∥ e_{ℓ} - w_{ℓ} ∥_{H^{1}(Ω)} + | w_{ℓ} |_{W^{1, p}(Ω)}^{r'} + ∥ w_{ℓ} ∥_{L^{2}(Ω)}^{2} \\ & ≲ M(τ) \big( | e_{ℓ} |_{H^{1}(Ω)} + ∥ e_{ℓ} ∥_{L^{2}(Ω)} + H_{ℓ}^{3/2} \big) + H_{ℓ}^{\min\{5, r'(1+1/p)\}} . \end{align}

Theorem 5.3 provides an estimate of the semi-norm $| e_{ℓ} |_{H^{1} (Ω)}$. A Young inequality shows $H_{ℓ}^{5 / 6} M (τ) ≲ H_{ℓ}^{5} + M {(τ)}^{6 / 5}$. The absorption of $∥ e_{ℓ} ∥_{L^{2} (Ω)}$ then proves the first assertion. The second assertion is an immediate consequence of the first one, Theorem 5.3 and several algebraic transformations.
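As a worked illustration of the Young inequality step, the following lines spell out one natural choice of exponents. The conjugate pair 6 and 6/5 and the auxiliary parameter ε > 0 are assumptions inferred from the exponents 6/5 and 5 in Theorem 5.4; they are not quoted from the proof.

\begin{align} H_{ℓ}^{5/6} M(τ) & \leq \frac{1}{6} \big( H_{ℓ}^{5/6} \big)^{6} + \frac{5}{6} M(τ)^{6/5} = \frac{1}{6} H_{ℓ}^{5} + \frac{5}{6} M(τ)^{6/5}, \\ M(τ) ∥ e_{ℓ} ∥_{L^{2}(Ω)}^{1/3} & \leq \frac{ε}{6} ∥ e_{ℓ} ∥_{L^{2}(Ω)}^{2} + \frac{5}{6} ε^{-1/5} M(τ)^{6/5} . \end{align}

With ε sufficiently small, the term with $∥ e_{ℓ} ∥_{L^{2}(Ω)}^{2}$ in the second line can be absorbed on the left-hand side, which is one way to read the absorption argument above.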

Numerical experiments

This section illustrates the theoretical estimates and their impact on the reliability-efficiency gap on 2D benchmarks in computational microstructures [18, 26].

Numerical algorithms

The adaptive finite element method (AFEM) and algorithmic details on the implementation in MATLAB in the spirit of [27] concern the state-of-the-art AFEM loop

SOLVE \to ESTIMATE \to MARK \to REFINE

and are explained below together with some notation.

Solve

The stabilised discrete problem (2.5) is solved in a nested iteration on a given triangulation $T_{ℓ}$ with MATLAB’s standard minimiser fminunc with default tolerances. Gradient and Hessian of the discrete energy are available and therefore provided to fminunc. We set γ = 1 in the stabilisation term (2.4) in all our experiments. This is motivated by ([12], Theorem 4.4), which suggests that γ = 1 yields an optimal convergence rate. The discrete solution of the previous AFEM loop iteration serves as a start vector for fminunc; for the first iteration, the initial vector is zero everywhere except at the Dirichlet boundary nodes. Since the Galerkin orthogonality is not required in Theorem 4.1, the termination of an iterative realisation for SOLVE is not a sensitive issue. In computational PDEs, accounting for inexact solves is a fundamental issue. In this paper, however, the numerical examples are run with the standard settings of MATLAB.

Estimate

The refinement indicator results from the error estimator of Theorem 4.1. As in the work of Repin [28], the computation of the minimiser $τ \in {RT}_{0} {(T_{ℓ})}^{m}$ of

∥ σ_{ℓ} - τ ∥_{L^{2}(Ω)} + ∥ Π_{ℓ} Λ_{ℓ} + div τ ∥_{L^{2}(Ω)} \qquad (6.1)

runs Algorithm 1 based on the formula

{(a + b)}^{2} = min_{s > 0} ((1 + s) a^{2} + (1 + 1 / s) b^{2}) for a, b > 0

Algorithm 1 Approximate flux computation
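The listing of Algorithm 1 is not reproduced in this text. The following minimal Python sketch only illustrates the alternating minimisation that the formula above suggests: for fixed s, a weighted least-squares problem is solved, and then s is updated to b/a. It is not the authors' implementation. The scalar unknown x stands in for the Raviart-Thomas coefficients of τ, and all names (`minimise_sum_of_norms`, `weighted_bound`, the data vectors `p`, `c`, `q`, `d`) are hypothetical. The limit of three iterations and the tolerance $ε_{M}^{0.8}$ follow the description in this subsection.

```python
import math
import sys


def dot(u, v):
    """Euclidean inner product of two equally long sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))


def residual_norm(p, c, x):
    """Euclidean norm of p*x - c for a scalar unknown x."""
    return math.sqrt(sum((pi * x - ci) ** 2 for pi, ci in zip(p, c)))


def weighted_bound(a, b, s):
    """Upper bound (1+s) a^2 + (1+1/s) b^2 of (a+b)^2, sharp at s = b/a."""
    return (1.0 + s) * a * a + (1.0 + 1.0 / s) * b * b


def minimise_sum_of_norms(p, c, q, d, max_iter=3,
                          tol=sys.float_info.epsilon ** 0.8):
    """Approximately minimise |p*x - c| + |q*x - d| over a scalar x.

    Toy stand-in (assumption): a = |p*x - c| plays the role of the first
    norm in (6.1) and b = |q*x - d| that of the second one.
    """
    s = 1.0
    x = 0.0
    for _ in range(max_iter):
        wa, wb = 1.0 + s, 1.0 + 1.0 / s
        # Fixed s: weighted least squares with a scalar normal equation.
        x = (wa * dot(p, c) + wb * dot(q, d)) / (wa * dot(p, p) + wb * dot(q, q))
        a, b = residual_norm(p, c, x), residual_norm(q, d, x)
        if a == 0.0 or b == 0.0:
            break  # the update s = b/a would be degenerate
        s_new = b / a
        rel_change = abs(s_new - s) / s
        s = s_new
        # Stop when s, 1/s or the relative change of s drops below tol.
        if s < tol or 1.0 / s < tol or rel_change < tol:
            break
    return x, s, residual_norm(p, c, x) + residual_norm(q, d, x)
```

Each sweep cannot increase the value of a + b, because both the least-squares step and the update s = b/a minimise the same weighted bound in one of its arguments.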

The stopping criterion of Algorithm 1 monitors relative changes and avoids degenerate values of s. Undisplayed experiments have convinced us that a maximum of three iterations and a stopping tolerance of $ε_{M}^{0.8}$ (with the machine precision ε_M) yield satisfactory results. The iteration is stopped whenever s, 1 / s or the relative change of s drops below this tolerance. As an additional precaution, the iteration also stops if the linear system is deemed “nearly singular” by MATLAB. Our experiments convinced us that ignoring this warning causes a breakdown with NaNs. Note that if q ≠ 2, we still minimise the L² sums in (6.1) to avoid the computational cost of a nonlinear solve. With the computed minimiser τ, Section ‘A posteriori error estimates’ yields the error estimator

η_{F, q'} := ∥ σ_{ℓ} - τ ∥_{L^{q'}(Ω)} + ∥ Π_{ℓ} Λ_{ℓ} + div τ ∥_{L^{q'}(Ω)} + osc_{ℓ, q'}(Λ_{ℓ}) .

This will be compared with the well-established residual-based a posteriori error estimator [7]

η_{R, q'} := \Big( \sum_{T \in T_{ℓ}} h_{T}^{q'} ∥ Λ_{ℓ} ∥_{L^{q'}(T)}^{q'} \Big)^{1/q'} + \Big( \sum_{F \in F_{ℓ}(Ω)} h_{F} ∥ [σ_{ℓ}]_{F} \cdot n_{F} ∥_{L^{q'}(F)}^{q'} \Big)^{1/q'},

which is reliable for the original discretisation without stabilisation. Undisplayed experiments computed the averaging error estimator [18], which is founded on the same theoretical background as $η_{R, q^{'}}$ and therefore yielded essentially the same convergence rates.

The error estimators in Theorem 5.4 read

\begin{align} η_{L, 2} := {} & η_{F, 2}^{6/5} + H_{ℓ}^{-(1+γ)/3} η_{F, 2}^{4/3} ∥| u_{ℓ} ∥|_{ℓ}^{2/3} \\ & + η_{F, 2} \big( H_{ℓ}^{(1-γ)/2} ∥| u_{ℓ} ∥|_{ℓ} + H_{ℓ}^{1-γ/4} ∥| u_{ℓ} ∥|_{ℓ}^{1/2} \big) + H_{ℓ}^{\min\{5, r'(1+1/p)\}} , \end{align}

\begin{align} η_{H, 2} := {} & η_{F, 2}^{2/5} + H_{ℓ}^{-(1+γ)/9} η_{F, 2}^{4/9} ∥| u_{ℓ} ∥|_{ℓ}^{2/9} + H_{ℓ}^{\min\{5/3, r'(1+1/p)/3\}} \\ & + η_{F, 2}^{1/3} \big( H_{ℓ}^{(1-γ)/2} ∥| u_{ℓ} ∥|_{ℓ} + H_{ℓ}^{1-γ/4} ∥| u_{ℓ} ∥|_{ℓ}^{1/2} \big)^{1/3} + H_{ℓ}^{1-γ} ∥| u_{ℓ} ∥|_{ℓ}^{2} \\ & + H_{ℓ}^{-(1+γ)/2} ∥| u_{ℓ} ∥|_{ℓ} \big( η_{F, 2}^{6/5} + H_{ℓ}^{-(1+γ)/3} η_{F, 2}^{4/3} ∥| u_{ℓ} ∥|_{ℓ}^{2/3} + H_{ℓ}^{\min\{5, r'(1+1/p)\}} \big)^{1/2} \\ & + H_{ℓ}^{-(1+γ)/2} ∥| u_{ℓ} ∥|_{ℓ} η_{F, 2}^{1/2} \big( H_{ℓ}^{(1-γ)/2} ∥| u_{ℓ} ∥|_{ℓ} + H_{ℓ}^{1-γ/4} ∥| u_{ℓ} ∥|_{ℓ}^{1/2} \big)^{1/2} . \end{align}
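Once η_{F,2}, the maximal mesh-size H_ℓ and the stabilisation norm are known, the two refined estimators are plain algebraic expressions. The following Python sketch transcribes the two displayed formulas term by term. The function name `refined_estimators` and its argument names are hypothetical, and the exponents r' and p have to be supplied by the caller because their values depend on the example at hand.

```python
def refined_estimators(eta_f, h, stab, r_prime, p, gamma=1.0):
    """Return (eta_L, eta_H) for given eta_F, mesh-size h and stabilisation norm.

    eta_f   : value of the flux estimator eta_{F,2}
    h       : maximal mesh-size H_l
    stab    : stabilisation norm of the discrete solution
    r_prime : exponent r' (assumed to be supplied by the caller)
    p       : growth exponent p
    gamma   : stabilisation parameter (gamma = 1 in the experiments)
    """
    k = min(5.0, r_prime * (1.0 + 1.0 / p))
    # Bracket shared by several terms of both estimators.
    mixed = (h ** ((1.0 - gamma) / 2.0) * stab
             + h ** (1.0 - gamma / 4.0) * stab ** 0.5)
    coupling = (h ** (-(1.0 + gamma) / 3.0)
                * eta_f ** (4.0 / 3.0) * stab ** (2.0 / 3.0))

    eta_l = eta_f ** (6.0 / 5.0) + coupling + eta_f * mixed + h ** k

    core = eta_f ** (6.0 / 5.0) + coupling + h ** k
    weight = h ** (-(1.0 + gamma) / 2.0) * stab
    eta_h = (eta_f ** (2.0 / 5.0)
             + h ** (-(1.0 + gamma) / 9.0)
             * eta_f ** (4.0 / 9.0) * stab ** (2.0 / 9.0)
             + h ** (k / 3.0)
             + eta_f ** (1.0 / 3.0) * mixed ** (1.0 / 3.0)
             + h ** (1.0 - gamma) * stab ** 2
             + weight * core ** 0.5
             + weight * eta_f ** 0.5 * mixed ** 0.5)
    return eta_l, eta_h
```

For a vanishing stabilisation norm, all coupling terms drop out and the expressions reduce to η_F^{6/5} and η_F^{2/5} plus the respective power of H_ℓ.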

Mark

For any given $T \in T_{ℓ}$ with its set of faces $F (T)$ , $∂T = ⋃ F (T)$ , and given τ from (6.1), set

\begin{align} η_{F}^{q'}(T) & := ∥ σ_{ℓ} - τ ∥_{L^{q'}(T)}^{q'} + ∥ Π_{ℓ} Λ_{ℓ} + div τ ∥_{L^{q'}(T)}^{q'} + h_{T}^{q'} ∥ (id - Π_{ℓ}) Λ_{ℓ} ∥_{L^{q'}(T)}^{q'} , \\ η_{R}^{q'}(T) & := |T|^{q'/n} ∥ Λ_{ℓ} ∥_{L^{q'}(T)}^{q'} + |T|^{1/n} \sum_{F \in F_{ℓ}(Ω) \cap F(T)} ∥ [σ_{ℓ}]_{F} \cdot n_{F} ∥_{L^{q'}(F)}^{q'} . \end{align}

Let $η^{q^{'}} (T)$ be one of the refinement indicators $η_{F}^{q^{'}} (T)$ and $η_{R}^{q^{'}} (T)$ . Some greedy algorithm computes $M_{ℓ} \subset T_{ℓ}$ of (almost) minimal cardinality such that

\sum_{T \in M_{ℓ}} η^{q'}(T) \geq \frac{1}{2} \sum_{T \in T_{ℓ}} η^{q'}(T) .
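The greedy choice can be sketched in a few lines of Python. The sketch assumes that the local indicators η^{q'}(T) are given as a list of non-negative numbers, one per element; the function name `mark_bulk` and the parameter name `theta` are hypothetical, with `theta = 0.5` matching the factor 1/2 above.

```python
def mark_bulk(indicators, theta=0.5):
    """Greedy bulk marking: pick the largest indicators until their sum
    reaches theta times the total sum. Returns the marked element indices."""
    total = sum(indicators)
    # Visit elements in order of decreasing indicator.
    order = sorted(range(len(indicators)),
                   key=lambda i: indicators[i], reverse=True)
    marked = []
    accumulated = 0.0
    for i in order:
        if accumulated >= theta * total:
            break
        marked.append(i)
        accumulated += indicators[i]
    return marked
```

Sorting costs O(N log N) for N elements. The phrase "(almost) minimal cardinality" in the text leaves room for cheaper approximate sorting, which this sketch does not attempt.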

Refine

This step computes the smallest refinement $T_{ℓ + 1}$ of $T_{ℓ}$ with $M_{ℓ} \subset T_{ℓ} ∖ T_{ℓ + 1}$ based on the red-green-blue refinement strategy as illustrated in Figure 2. This refinement involves some closure algorithm to avoid hanging nodes.

Two-well benchmark

The computational microstructure benchmark of ([18], Section 2) considers two wells with W from (3.4) in Example 3.3. The energy is given by (1.1) on the domain $Ω = (0, 1) \times (0, 3 / 2) \subset R^{2}$ with

g(x) := - 3 t^{5} / 128 - t^{3} / 3 \quad \text{and} \quad u_{D}(x) := \begin{cases} g(x) & \text{for } t \leq 0, \\ t^{3} / 24 + t & \text{for } t \geq 0 \end{cases}

for $t : = (3 (x_{1} - 1) + 2 x_{2}) / \sqrt{13}$; p = q = 4 and f ≡ 0. The unique minimiser u of $min_{v \in A} E (v)$ with $A = u_{D} + W_{0}^{1, 4} (Ω)$ reads u = u_D ([18], Theorem 2.1), and β = 1 allows for Theorems 5.1–5.4 to hold. An initial triangulation $T_{0}$ is given by a criss triangulation of (0,1)×(0,3/2) with 12 congruent triangles and the two interior nodes (1/2,1/2) and (1/2,1). The adaptive algorithm of Subsection ‘Numerical algorithms’ computes a sequence of discrete solutions (u_ℓ)_ℓ and stresses (σ_ℓ)_ℓ, as well as the error estimators η_F and η_R, with and without stabilisation for uniform and adaptive meshes; the results are displayed in Figure 3, with the overall observations in Section ‘Conclusions’. The empirical convergence rates for uniform and R- as well as F-adapted mesh-refining are collected in Table 1. Note that the error estimator η_L performs better than η_F. This is evident from the table for uniform mesh refinements, but a closer look at Figure 3 reveals that even in the adaptive scenarios, η_L converges slightly faster than η_F. This is in accordance with the theory of Section ‘Refined analysis for an interface model problem’, where η_L is derived from η_F based on additional smoothness assumptions.
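For readers who want to reproduce the boundary data, the following Python sketch evaluates u_D as defined above. The function names `interface_coordinate`, `profile` and `u_dirichlet` are hypothetical. The sketch also makes visible that the interface t = 0 is the straight line through (1, 0) and (0, 3/2), across which the one-sided slopes of the profile differ.

```python
import math


def interface_coordinate(x1, x2):
    """Signed coordinate t = (3 (x1 - 1) + 2 x2) / sqrt(13)."""
    return (3.0 * (x1 - 1.0) + 2.0 * x2) / math.sqrt(13.0)


def profile(t):
    """One-dimensional profile of u_D as a function of t."""
    if t <= 0.0:
        return -3.0 * t ** 5 / 128.0 - t ** 3 / 3.0
    return t ** 3 / 24.0 + t


def u_dirichlet(x1, x2):
    """Dirichlet data (and exact solution) of the two-well benchmark."""
    return profile(interface_coordinate(x1, x2))
```

The profile is continuous at t = 0, while its derivative jumps from 0 (left) to 1 (right), so the gradient of u is discontinuous across the interface.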

Table 1 Observed convergence rates in Figures 3, 4, 6 and 7 for uniform and adaptive mesh refining


Modified two-well benchmark

This subsection concerns a modification of the previous problem with (3.4) and a linear right-hand side for β = 0 and f(x) := -div(DW(Du_D(x))) and unique solution u = u_D as before. Note that Example 3.3 applies to this problem, and so the proof of Theorem 3.1 yields

∥ σ - σ_{ℓ} ∥_{L^{p'}(Ω)}^{r} + ∥ u - u_{ℓ} ∥_{L^{2}(Ω)}^{2} + ∥| u_{ℓ} ∥|_{ℓ}^{2} \to 0 \quad \text{as } ℓ \to \infty

and Theorems 5.1–5.4 hold as well. The algorithms of Subsection ‘Numerical algorithms’ ran with and without stabilisation for uniform and adaptive meshes with the same initial triangulation as in Subsection ‘Two-well benchmark’ and led to Figure 4 with overall observations of Section ‘Conclusions’. The empirical convergence rates for uniform and R- as well as F-adapted mesh-refining are collected in Table 1 for completeness although they are almost identical with those observed in Subsection ‘Two-well benchmark’.

Three-well benchmark

The energy density W of ([26], Example 5.9.3, p. 72) is the convex hull of min{|F|², |F - (1, 0)|², |F - (0, 1)|²} with explicit form in ([26], Example 5.6.4, p. 58). Let furthermore $Ω = {(0, 1)}^{2} \subset R^{2}$ and u_D(x₁, x₂) := a(x₁ - 1/4) + a(x₂ - 1/4) with a(t) := t³/6 + t / 8 for t ≤ 0 and a(t) := t⁵/40 + t³/8 for $t \geq 0$ . Then the energy is given by (1.1) with β = 0 and f := -divDW(Du_D). The exact solution u = u_D satisfies the interface condition of Section ‘Refined analysis for an interface model problem’ and allows Theorem 5.3 to hold. Theorems 5.1 and 5.4 do not apply because β = 0. We use the grid of Figure 5 as initial triangulation to resolve discontinuities in ∇f.

The algorithms of Subsection ‘Numerical algorithms’ ran with and without stabilisation for uniform and adaptive meshes and led to Figure 6 with overall observations of Section ‘Conclusions’. Beyond those general conclusions, this example demonstrates the difficulties with ill-conditioned Hessians. While the unstabilised method reaches 10⁶ degrees of freedom without difficulty on uniform meshes, the adapted algorithms fail without stabilisation beyond 687 324 degrees of freedom (η_F-adaptive) and 33 169 degrees of freedom (η_R-adaptive). MATLAB’s error message “Input to EIG must not contain NaN or Inf” indicates that a matrix operation returned non-finite numbers and caused fminunc to break down. Undisplayed numerical experiments show condition numbers up to 10¹⁰ and beyond. The empirical convergence rates for uniform and R- as well as F-adapted mesh-refining are collected in Table 1. Moreover, Figure 1 in Section ‘Background’ reveals that stabilisation not only remedies ill-conditioned Hessians but thereby indeed allows for reduced errors in the discrete solution.

An optimal design example

The energy density of the topology optimisation problem of [3, 8, 29–33] reads

W(F) := ϕ(|F|) \quad \text{for } F \in R^{2}, \quad \text{with} \quad ϕ(t) := λ / 2 + \begin{cases} t^{2} & \text{for } 0 \leq t \leq \sqrt{λ}, \\ 2 \sqrt{λ} (t - \sqrt{λ} / 2) & \text{for } \sqrt{λ} \leq t \leq 2 \sqrt{λ}, \\ t^{2} / 2 + λ & \text{for } t \geq 2 \sqrt{λ} . \end{cases}

This leads to problem (2.3) with β = 0, λ = 0.0084, u_D ≡ 0 and f ≡ 1. Since regularity of the solutions is unclear, only the results of Sections ‘Global convergence’, ‘A posteriori error estimates’, ‘Refined analysis for an interface model problem’ and ‘Numerical experiments’ apply. As initial triangulation $T_{0}$, we use the coarsest cross triangulation $T_{0} = \{ conv \{(0, 0), (1, 0), (0, 1)\}, conv \{(1, 0), (0, 1), (1, 1)\} \}$ of Ω = (0, 1)².
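The piecewise definition of ϕ can be checked numerically. The following Python sketch evaluates ϕ and its derivative for the quoted value λ = 0.0084; the function names `phi` and `dphi` are hypothetical, and the derivative is obtained by differentiating each branch by hand.

```python
import math

LAMBDA = 0.0084  # value quoted for the optimal design example


def phi(t, lam=LAMBDA):
    """Energy profile phi(t) for t >= 0, branch by branch."""
    root = math.sqrt(lam)
    if t <= root:
        branch = t * t
    elif t <= 2.0 * root:
        branch = 2.0 * root * (t - root / 2.0)
    else:
        branch = t * t / 2.0 + lam
    return lam / 2.0 + branch


def dphi(t, lam=LAMBDA):
    """Derivative of phi; constant on the middle interval."""
    root = math.sqrt(lam)
    if t <= root:
        return 2.0 * t
    if t <= 2.0 * root:
        return 2.0 * root
    return t
```

Both ϕ and ϕ' are continuous at the two break points √λ and 2√λ. On the interval between them ϕ is affine, so the energy is convex but not strictly convex there, which is the degenerate situation addressed by the stabilisation.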

The algorithms of Subsection ‘Numerical algorithms’ ran with and without stabilisation for uniform and adaptive meshes and led to Figure 7 with the overall observations of Section ‘Conclusions’. The empirical convergence rates for uniform and R- as well as F-adapted mesh-refining are collected in Table 1. Undocumented experiments with a modified lower-order term f and known exact solution u led to the same convergence rates of the error estimators and confirm their accuracy.

Discussion of Empirical Convergence Rates

Global convergence without regularity assumptions

Theorem 3.1 asserts that $∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}$ , $β ∥ u - u_{ℓ} ∥_{L^{2} (Ω)}$ , and ∥ |u_ℓ∥ |_ℓ all tend to zero as H_ℓ → 0. The plain convergence result applies to all examples from Subsections ‘Two-well benchmark’, ‘Modified two-well benchmark’, ‘Three-well benchmark’, and ‘An optimal design example’ for the uniform mesh-refinements with H_ℓ+1 = H_ℓ / 2. The numerical experiments, however, show empirical convergence rates displayed in the first columns of Table 1. The adaptive algorithms do not reflect the condition H_ℓ → 0 explicitly and hence convergence is not guaranteed a priori. Undisplayed investigations show that indeed in the R-adapted version of the three-well example of Subsection ‘Three-well benchmark’, this condition H_ℓ → 0 does not appear to be true for more than 4 978 degrees of freedom. In all other experiments we observe convergence rates even for unstabilised discretisations.
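How the empirical rates of Table 1 are extracted is not spelled out in this section. A common convention, assumed here, is the slope between two successive levels in a log-log plot of the error against the mesh-size. The Python sketch below uses that convention; the function name `empirical_rates` is hypothetical, and rates with respect to the number of degrees of freedom would require a different abscissa.

```python
import math


def empirical_rates(mesh_sizes, errors):
    """Slopes log(e_k / e_{k+1}) / log(H_k / H_{k+1}) between successive levels."""
    return [
        math.log(errors[k] / errors[k + 1])
        / math.log(mesh_sizes[k] / mesh_sizes[k + 1])
        for k in range(len(errors) - 1)
    ]
```

For uniform refinement with H_ℓ+1 = H_ℓ / 2, the denominator is log 2 on every level, so the rate is simply the base-2 logarithm of the error reduction factor.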

Empirical convergence rates for interface model problems

Theorem 5.1 provides an a priori error estimate and an estimate of the stabilisation norm. It applies to the benchmarks of Subsections ‘Two-well benchmark’ and ‘Modified two-well benchmark’ only, because of β > 0 and Example 3.3, and the smoothness conditions imposed upon u from Section ‘Refined analysis for an interface model problem’. Recall the definitions of $T_{ℓ} (Γ)$ , Ω_Γ,ℓ and $Ω_{Γ, ℓ}^{C}$ from Section ‘Refined analysis for an interface model problem’ and assume $∥ u ∥_{L^{2} (Ω ∖ Γ)} \approx 1 \approx ∥ u ∥_{W^{2, p} (Ω_{Γ, ℓ}^{C})}$ , $| T_{ℓ} (Γ) | \approx H_{ℓ}^{- 1}$ and |Ω_Γ,ℓ|≈H_ℓ in this discussion. This leads to a convergence rate of $H_{ℓ}^{2 / p}$ for the right-hand side of Theorem 5.1. The observed convergence rates of $∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}$ and $∥ u - u_{ℓ} ∥_{L^{2} (Ω)}$ for the stabilised benchmark examples in Table 1 show convergence rates beyond those guaranteed in Theorem 5.1.

Theorem 5.3 implies, up to perturbations on the boundary,

∥ D (u - u_{ℓ}) ∥_{L^{2}(Ω)} ≲ ∥ u - u_{ℓ} ∥_{L^{2}(Ω)}^{1/3} + ∥| u_{ℓ} ∥|_{ℓ} + H_{ℓ}^{-1/2} ∥| u_{ℓ} ∥|_{ℓ}^{1/2} ∥ u - u_{ℓ} ∥_{L^{2}(Ω)}^{1/2} .

Since the exact solutions of Subsections ‘Two-well benchmark’, ‘Modified two-well benchmark’, and ‘Three-well benchmark’ are all smooth up to a one-dimensional interface line, Theorem 5.3 applies to these examples. The experiments show that the right-hand side of Theorem 5.3 is dominated by $H_{ℓ}^{-1/2} ∥| u_{ℓ} ∥|_{ℓ}^{1/2} ∥ u - u_{ℓ} ∥_{L^{2}(Ω)}^{1/2}$ in all examples and that the inequality is satisfied.

Reliability without regularity assumptions

Up to boundary terms, Theorem 4.1 states

∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{2} + β ∥ u - u_{ℓ} ∥_{L^{2} (Ω)}^{2} ≲ η_{F} ∥ u - u_{ℓ} ∥_{W^{1, p} (Ω)} .

The convergence rates confirm this assertion for the general and rough estimate $∥ u - u_{ℓ} ∥_{W^{1, p} (Ω)} ≲ 1$ in the sense that the rates for η_F are worse than or equal to those of $∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{2}$ and $∥ u - u_{ℓ} ∥_{L^{2} (Ω)}^{2}$ . In the numerical examples, $∥ u - u_{ℓ} ∥_{H^{1} (Ω)}$ is computed and displayed in Table 1 and the convergence rates of the product $∥ u - u_{ℓ} ∥_{H^{1} (Ω)} η_{F}$ can be compared with those of $∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{2} + ∥ u - u_{ℓ} ∥_{L^{2} (Ω)}^{2}$ . This comparison confirms the above a posteriori error estimate. In the examples with p = 2 (of Subsections ‘Three-well benchmark’ and ‘An optimal design example’), the convergence rates are even equal, which demonstrates the efficiency of the estimate of Theorem 4.1.

Efficiency without regularity assumptions

Up to oscillations and the (possibly) higher-order term $∥ (id - I_{F, ℓ}) σ ∥_{L^{q^{'}} (Ω)}$ , Theorem 4.2 states

η_{F} ≲ ∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)} + β ∥ u - u_{ℓ} ∥_{L^{p^{'}} (Ω)} .

The displayed convergence rates of Table 1 confirm this estimate.

Reliability of the refined a posteriori error control

Theorem 5.4 applies to the example of Subsection ‘Two-well benchmark’ and states

∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{2} + ∥ u - u_{ℓ} ∥_{L^{2} (Ω)}^{2} ≲ η_{L} and ∥ D (u - u_{ℓ}) ∥_{L^{2} (Ω)}^{2} ≲ η_{H} .

Table 1 confirms this estimate and shows that the estimators η_L and η_H accurately predict the convergence rate of the errors, even with equality of the convergence rates in the case of adaptive mesh refinements in the examples of Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’.

All displayed convergence rates of η_L are better or at least equal to those of η_F. For instance, for uniform mesh-refining in Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’, the error terms $∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}^{2} + ∥ u - u_{ℓ} ∥_{L^{2} (Ω)}^{2}$ converge with the empirical convergence rate 7/5 while the upper bound η_F does so with a reduced convergence rate 4/5. The refined error estimator η_L is a guaranteed upper bound (via Theorem 5.4) and converges with an empirical convergence rate 1.

Performance of the minimisation algorithm 1

In all numerical experiments of this paper, Algorithm 1 reaches the maximal number 3 of iterations. While this suggests that the optimal s is not found after three iterations, undisplayed experiments with higher iteration counts and hence higher computational efforts result solely in marginal improvements.

Conclusions

Effects of stabilisation

The empirical convergence rates of the error estimators η_F, η_R and the errors $∥ u - u_{ℓ} ∥_{L^{2} (Ω)}$ and $∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}$ for uniform mesh-refinement with and without stabilisation coincide. This indicates that the choice γ = 1 leads to some significant perturbation but maintains the correct convergence rate at the same time. This is different for adaptive mesh refinement with less optimal convergence rates. Our conclusion is that an improved adaptive algorithm, which balances local mesh-refinement and global stabilisation parameters, has to be developed in future research. The tested algorithm from Subsection ‘Numerical algorithms’ reflects neither the effects of stabilisation nor those of an inexact solve.

Another important aspect of the stabilisation is the regularisation of the Hessian in the step SOLVE of Subsection ‘Numerical algorithms’. In the three-well problem of Subsection ‘Three-well benchmark’, the unstabilised adaptive algorithms fail.

Adaptive versus uniform mesh-refinement

The overall empirical convergence rates of the errors and estimators of the unstabilised computation for adaptive mesh-refinements are better than those for uniform mesh-refinements. This is in contrast to the stabilised computation, where the true errors $∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}$ and $∥ u - u_{ℓ} ∥_{L^{2} (Ω)}$ behave better for uniform compared with the two adaptive mesh-refinements (with the exception in Subsection ‘An optimal design example’ where there is equality). It is observed that adaptivity does not necessarily improve the convergence rates of the errors $∥ σ - σ_{ℓ} ∥_{L^{p^{'}} (Ω)}$ and $∥ u - u_{ℓ} ∥_{L^{2} (Ω)}$ in a stabilised computation. Surprisingly, the convergence of the gradient errors $∥ D (u - u_{ℓ}) ∥_{L^{2} (Ω)}$ is slightly improved in the instabilised calculation by adaptive mesh-refinements. The adaptive mesh-refinement is expected to reduce the a posteriori error estimators in the first place: cf. [1, 34] for the estimator reduction property. Indeed, the convergence rates of the a posteriori error estimators η_R, η_F, η_L, η_H are improved (or optimal) for adaptive mesh-refinements (except for the three-well example of Subsection ‘Three-well benchmark’).

Strong convergence of the gradients

The convergence of the gradient error of the stabilised problem surpasses the expectations of [12] in Subsection ‘An optimal design example’ but fails to do so in Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’. The improved error estimator η_H shows the same convergence rate as the error of the gradients in Subsections ‘Two-well benchmark’, ‘An optimal design example’, ‘Three-well benchmark’, and ‘An optimal design example’. This holds for uniform and for adapted mesh refinements and suggests that η_H is in fact reliable and efficient for β > 0.

Guaranteed error control

The assertion on η_F in Theorem 4.1 is reflected in the numerical examples in that the stress approximations converge faster than η_F in all cases. This suggests that the estimate $∥ u - u_{ℓ} ∥_{W^{1, p} (Ω)} ≲ 1$ is by far too pessimistic. In fact, the benchmark examples with known exact solution fulfil $∥ σ - σ_{ℓ} ∥_{L^{2} (Ω)}^{2} ≲ η_{F} ∥ u - u_{ℓ} ∥_{H^{1} (Ω)}$ . Similar affirmative conclusions follow for Theorems 4.2 and 5.4.

Reliability-efficiency gap

In comparison with the residual-based error estimator of [7, 18], the new a posteriori error estimators η_L and η_H of Theorem 5.4 lead to refined error control. The improvement is marginal for uniform meshes without stabilisation but significant for adaptive stabilised computations. η_L and η_H match the convergence of the errors and so narrow the reliability-efficiency gap.

References

[1] Carstensen C: Convergence of an adaptive FEM for a class of degenerate convex minimisation problems. IMA J Numer Anal 2008, 28(3):423–439.
[2] Dacorogna B: Direct methods in the calculus of variations, 2nd Ed. Applied Mathematical Sciences 78. Berlin: Springer; 2008.
[3] Carstensen C, Müller S: Local stress regularity in scalar non-convex variational problems. SIAM J Math Anal 2002, 34(2):495–509. 10.1137/S0036141001396436
[4] Chipot M: Elements of Nonlinear Analysis. Birkhäuser Advanced Texts. Basel: Birkhäuser; 2000.
[5] Müller S: Variational models for microstructure and phase transitions. In Calculus of variations and geometric evolution problems. Lectures given at the 2nd session of the Centro Internazionale Matematico Estivo (CIME), Cetraro, Italy, June 15–22, 1996, Lect. Notes Math., 1713. Edited by: Hildebrandt S. Berlin: Springer; 1999:85–210.
[6] Ball JM, James RD: Proposed experimental tests for the theory of fine microstructures and the two-well problem. Phil Trans R Soc Lond A 1992, 338:389–450. 10.1098/rsta.1992.0013
[7] Carstensen C, Plecháč P: Numerical solution of the scalar double-well problem allowing microstructure. Math Comp 1997, 66(219):997–1026. 10.1090/S0025-5718-97-00849-1
[8] Bartels S, Carstensen C: A convergent adaptive finite element method for an optimal design problem. Numer Math 2007, 108:359–385.
[9] Goodman J, Kohn RV, Reyna L: Numerical study of a relaxed variational problem from optimal design. Comput Methods Appl Mech Eng 1986, 57:107–127. 10.1016/0045-7825(86)90073-3
[10] Carstensen C, Klose R: Guaranteed a posteriori finite element error control for the p-Laplace problem. SIAM J Sci Comput 2003, 25:792–814. 10.1137/S1064827502416617
[11] Bartels S, Carstensen C, Plecháč P, Prohl A: Convergence for stabilisation of degenerate convex minimisation problems. IFB 2004, 6(2):253–269.
[12] Boiger W, Carstensen C: On the strong convergence of gradients in stabilised degenerate convex minimisation problems. SIAM J Numer Anal 2010, 47(6):4569–4580. 10.1137/090746409
[13] Ciarlet PG: The finite element method for elliptic problems. Philadelphia, PA, USA: Society for Industrial Mathematics; 2002.
[14] El Alaoui L, Ern A, Vohralík M: Guaranteed and robust a posteriori error estimates and balancing discretization and linearization errors for monotone nonlinear problems. Comp Meth Appl Mech Eng 2011, 200(37–40):2782–2795.
[15] Ern A, Nicaise S, Vohralík M: An accurate H(div) flux reconstruction for discontinuous Galerkin approximations of elliptic problems. C R, Math, Acad Sci Paris 2007, 345(12):709–712. 10.1016/j.crma.2007.10.036
[16] Luce R, Wohlmuth B: A local a posteriori error estimator based on equilibrated fluxes. SIAM J Numer Anal 2004, 42(4):1394–1414. 10.1137/S0036142903433790
[17] Ainsworth M: A synthesis of a posteriori error estimation techniques for conforming, non-conforming and discontinuous Galerkin finite element methods. Providence: American Mathematical Society (AMS); 2005.
[18] Carstensen C, Jochimsen K: Adaptive finite element methods for microstructures? Numerical experiments for a two-well benchmark. Computing 2003, 71:175–204. 10.1007/s00607-003-0027-1
[19] Chipot M, Evans LC: Linearisation at infinity and Lipschitz estimates for certain problems in the calculus of variations. Proc Roy Soc Edinburgh Sect A 1986, 102(3–4):291–303.
[20] Bartels S, Carstensen C, Dolzmann G: Inhomogeneous Dirichlet conditions in a priori and a posteriori finite element error analysis. Numer Math 2004, 99(1):1–24. 10.1007/s00211-004-0548-3
[21] Knees D: Global stress regularity of convex and some nonconvex variational problems. Ann Mat Pura Appl (4) 2008, 187(1):157–184.
[22] Brezzi F, Fortin M: Mixed and hybrid finite element methods. Springer series in computational mathematics. New York: Springer-Verlag; 1991.
[23] Acosta G, Durán RG: An optimal Poincaré inequality in L¹ for convex domains. Proc Amer Math Soc 2004, 132(1):195–202. 10.1090/S0002-9939-03-07004-7
[24] Bergh J, Löfström J: Interpolation spaces. Berlin: Springer-Verlag; 1976.
[25] Brenner SC, Scott LR: The mathematical theory of finite element methods, 2nd Ed. Texts in Applied Mathematics 15. Berlin: Springer; 2002.
[26] Bartels S: Numerical analysis of some non-convex variational problems. Ph.D. thesis. Germany: Christian-Albrechts Universität zu Kiel, Kiel; 2001. [http://eldiss.uni-kiel.de/macau/receive/dissertation_diss_00000519]
[27] Alberty J, Carstensen C, Funken SA: Remarks around 50 lines of Matlab: short finite element implementation. Numer. Algorithms 1999, 20(2–3):117–137.
[28] Repin SI, Sauter S, Smolianski A: A posteriori error estimation for the Dirichlet problem with account of the error in the approximation of boundary conditions. Computing 2003, 70(3):205–233.
[29] Carstensen C, Günther D, Rabus H: Mixed finite element method for a degenerate convex variational problem from topology optimization. SIAM J Math Anal 2012, 50(2):522–543.
[30] Murat F, Tartar L: Calcul des variations et homogénéisation. In Homogenization methods: theory and applications in physics. Collect Dir Études Rech Élec France, vol. 57. Edited by: Bergman D. Paris, France: Éditions Eyrolles; 1985:319–369.
[31] Kohn RV, Strang G: Optimal design and relaxation of variational problems I–III. Comm Pure Appl Math 1986, 39(1–3):113–137, 139–182, 353–377.
[32] Kawohl B, Stará J, Wittum G: Analysis and numerical studies of a problem of shape design. Arch Rational Mech Anal 1991, 114(4):349–363. 10.1007/BF00376139
[33] Glowinski R, Lions J-L, Trémolières R: Numerical analysis of variational inequalities. Studies in Mathematics and its Applications, vol. 8. Amsterdam: North-Holland Publishing Co.; 1981.
[34] Cascón JM, Kreuzer C, Nochetto RH, Siebert KG: Quasi-optimal convergence rate for an adaptive finite element method. SIAM J Numer Anal 2008, 46(5):2524–2550. 10.1137/07069047X


Author information

Authors and Affiliations

Department of Mathematics, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099, Berlin, Germany
Wolfgang Boiger & Carsten Carstensen
Department of Computational Science and Engineering, Yonsei University, Unter den Linden 6, 120-749, Seoul, Korea
Wolfgang Boiger & Carsten Carstensen


Corresponding author

Correspondence to Carsten Carstensen.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

All authors contributed equally to all parts of this article. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Boiger, W., Carstensen, C. A posteriori error analysis of stabilised FEM for degenerate convex minimisation problems under weak regularity assumptions. Adv. Model. and Simul. in Eng. Sci. 1, 5 (2014). https://doi.org/10.1186/2213-7467-1-5

Received: 29 July 2013
Accepted: 06 December 2013
Published: 29 January 2014
DOI: https://doi.org/10.1186/2213-7467-1-5
