Multivariate moment-matching for model order reduction of quadratic-bilinear systems using error bounds

Khattak, Muhammad Altaf; Ahmad, Mian Ilyas; Feng, Lihong; Benner, Peter

doi:10.1186/s40323-022-00236-6

Research article
Open access
Published: 12 December 2022

Multivariate moment-matching for model order reduction of quadratic-bilinear systems using error bounds

Muhammad Altaf Khattak¹,
Mian Ilyas Ahmad¹,
Lihong Feng² &
…
Peter Benner^2,3

Advanced Modeling and Simulation in Engineering Sciences volume 9, Article number: 23 (2022) Cite this article

1831 Accesses
Metrics details

Abstract

We propose an adaptive moment-matching framework for model order reduction of quadratic-bilinear systems. In this framework, an important issue is the selection of those shift frequencies where moment-matching is to be achieved. So far, the choice often has been random or linked to the linear part of the nonlinear system. In this paper, we extend the use of an existing a posteriori error bound for general linear time invariant systems to quadratic-bilinear systems and develop a greedy-type framework to select a good choice of interpolation points for the construction of the projection matrices. The results are compared with standard quadratic-bilinear projection methods and we observe that the approximations obtained by the proposed method yield high accuracy.

Introduction

There are different applications where the dynamics of the system can be represented by quadratic-bilinear differential algebraic equations (QBDAEs). These include simulation of distribution networks [1], fluid flow problems [2] and nonlinear VLSI circuits [3, 4]. In addition, a large class of nonlinear systems can be written in quadratic-bilinear form by using liftings to higher-dimensional state-spaces [4]. Most of these applications involve a large number of equations i.e., a high-dimensional state-space. This makes simulation, control and optimization computationally inefficient. A remedy to this issue is the use of model order reduction (MOR).

We consider the problem of MOR for a single-input single-output quadratic-bilinear descriptor system of the form:

$$\begin{aligned} \begin{aligned} E\dot{x}(t)&=Ax(t)+Nx(t)u(t)+ Q(x(t)\otimes x(t))+Bu(t),\\ ~~\quad y(t)&=Cx(t), \end{aligned} \end{aligned}$$

(1)

where $E, ~\!A, ~\!N\in \mathbb {R}^{n\times n}$, $Q\in \mathbb {R}^{n\times n^2}$, $B,~\!C^T\in \mathbb {R}^{n}$ are the coefficient matrices and vectors. $x(t)\in \mathbb {R}^n$ is the state vector and $u(t),~\!y(t)\in \mathbb {R}$ are the input and output of the system. The matrix E may or may not be singular but the pencil is assumed to be regular, i.e., $\lambda E-A$ is singular only for finitely many values $\lambda \in \mathbb {C}$ [5].

The goal of MOR is to construct a reduced system of dimension $r\ll n$:

$$\begin{aligned} \begin{aligned} E_r\dot{x}_r(t)&=A_r x_r(t)+N_r x_r(t) u(t)+ Q_r( x_r(t)\otimes x_r(t))+B_r u(t),\\ ~~\quad y_r(t)&=C_r x_r(t), \end{aligned} \end{aligned}$$

(2)

with the output response $y_r(t)$ approximately equal to y(t). In case of linear systems (where Q and N are zero matrices), there are various techniques in the literature to compute reduced-order models (ROMs), cf., [6,7,8]. Among these methods, projection-based moment-matching methods [9, 10] are well used and have been extended to quadratic-bilinear systems [4, 11, 12]. Projection involves approximating the state vector x(t) in an r-dimensional subspace spanned by the column vectors of $V\in \mathbb {R}^{n{\times }r}$, so that the residual in the state equation is orthogonal to another r-dimensional subspace spanned by the column vectors of $W\in \mathbb {R}^{n{\times }r}$. That is, we approximate $x(t)\approx Vx_r(t)$ such that the following Petrov-Galerkin orthogonality condition holds:

$$\begin{aligned} \begin{aligned} \!\!\!&W^T\bigg (\!EV\dot{x}_r(t)\!-\!\big (AVx_r(t)+NVx_r(t)u(t)+Q(Vx_r(t)\otimes Vx_r(t))+Bu(t)\big )\bigg )=0. \end{aligned} \end{aligned}$$

(3)

If $W=V$, the projection is orthogonal and is often called one-sided projection, otherwise it is oblique and is called two-sided projection. The oblique projection framework leads to a set of reduced system matrices of the form:

$$\begin{aligned} \begin{aligned} E_r=W^TEV,~~~A_r=&W^TAV,~~~Q_r=W^TQ(V\otimes V),~~~N_r=W^TNV,\\&B_r =W^TB,~~~C_r = CV. \end{aligned} \end{aligned}$$

(4)

In case of linear systems, a suitable choice of the basis matrices V and W implicitly ensures moment-matching, where moments are the coefficients of the series expansion of the transfer function at some predefined shift frequencies. Thus for projection-based moment-matching, the choice of V and W is related to the transfer function of the system. However, nonlinear systems have no universal input-output representation. For some classes of nonlinear systems, though, including QBDAE systems, it is possible to generalise the transfer function concept by utilising the Volterra theory [13], where the input-output relationship is represented by a set of high-order transfer functions. This makes the concept of moment-matching more complex in the nonlinear case, since the structure of the basis matrices V and W in (4) now depends on multiple high-order transfer functions. To achieve moment-matching, simplifications are often made in the literature [4, 11] for computing the ROMs. For example, [11] constructs V and W such that the reduced system matches the moments of the first- and second-order transfer functions. In [12], simplified forms of high-order transfer functions are derived, which also enable the projection-based techniques to match moments of high-order transfer functions. In addition, all the existing moment-matching/interpolation approaches [4, 11, 12] are based on the simplification that the interpolation points for each frequency variable are the same. We discuss these results further in “Background”.

Recently, a new framework [14] for quadratic-bilinear systems has been proposed that is based on generalized Sylvester-type matrix equations. The approach involves truncated solution of two complex matrix equations to identify a good choice for the basis matrices V and W. Another approach is the extension of the Loewner framework from linear/bilinear systems [15, 16] to quadratic bilinear systems [17]. Also, an indirect approach for MOR of QBDAE systems is proposed in [18], where the basis matrices are constructed from the bilinear part of the quadratic-bilinear system. In [19], the bilinear part of the system is viewed as a linear parametric system and an a posteriori error bound is used to select the interpolation points and construct the basis matrices adaptively. All these techniques are using the first two or three high-order transfer functions and their structure is different from the one identified in [11]. Recently, a new direction has been explored in [20] where the properties of higher order moment matching are analysed so that the nonlinear matrices are not used in the construction of basis matrices. Also, in [21] the idea of signal generator driven system is used so that univariate moment-matching can be utilised for model order reduction. These approaches are comparing their results with [11] and [19] and for some benchmark examples there behaviour is comparable. However, our target is general moment-matching for QBDAEs, so we will mainly focus on the two-sided moment-matching technique from [11].

In this paper, we identify a good choice of interpolation points for quadratic-bilinear systems by utilizing a greedy type framework based on error bounds for quadratic-bilinear systems motivated by the recently proposed error bound for linear parametric systems in [22]. Here, we relax the restriction of using the same interpolation points for different frequency variables. The approach starts from some initial interpolation points that are iteratively updated to identify a set of interpolation points corresponding to the maximal values of certain error bounds. For each choice of interpolation points, we interpolate, not only, the original transfer function and its first derivative but also higher derivatives, so that the quadratic-bilinear system is well approximated. The iteration stops when the approximation error is less than the prescribed tolerance level. Each iteration contributes to constructing a better set of basis matrices V and W, until a given error tolerance is achieved. The main difference from the work in [19] is that the quadratic part of the system is also involved in the basis construction in the proposed framework based on an a posteriori error bound for quadratic-bilinear systems, whereas only the bilinear part is considered for the basis matrix computation in [19]. The error estimator used in [19] only estimates the error of the linear-bilinear part.

The remaining part of the paper is organized as follows. “Background” reviews the existing projection based moment-matching techniques for quadratic-bilinear systems. “Error bound for QBDAEs” presents the error bound expressions for quadratic-bilinear systems and “Interpolation points using error bounds” utilises these error bounds in a greedy-type algorithm to select interpolation points. Finally in “Numerical results”, numerical results are shown for some benchmark examples.

Background

In this section, we briefly review the concept of moment-matching discussed in [11, 12] for quadratic-bilinear systems. Before going into the details of nonlinear moment-matching, we begin with the structure of high-order transfer functions.

Multivariate transfer functions

The input-output representation for single input quadratic-bilinear systems can be expressed by the Volterra series expansion of the output y(t) with quantities analogous to the standard convolution operator. That is,

$$\begin{aligned} \begin{aligned} y(t) = \sum _{k=1}^{\infty }\int _{0}^{t}\!\!\int _{0}^{t_1}\!\!\!\! \cdots \int _{0}^{t_{k-1}}h_k(t_1,\ldots ,t_k)u(t-t_1)\cdots u(t-t_k)dt_k\cdots dt_1, \end{aligned} \end{aligned}$$

(5)

where it is assumed that the input signal is one-sided, i.e., $u(t)=0$ for $t<0$. In addition, each of the generalized impulse responses, $h_k(t_1,\ldots ,t_k)$, also called the k-dimensional kernel of the subsystem, is assumed to be one-sided. In terms of the multivariate Laplace transform, the k-dimensional subsystem can be represented as,

$$\begin{aligned} Y_k(s_1,\ldots ,s_k) = H_k(s_1,\ldots ,s_k)U(s_1)\cdots U(s_k), \end{aligned}$$

(6)

where $H_k(s_1,\ldots ,s_k)$ is the multivariate transfer function of the k-dimensional subsystem. The generalized transfer functions in the output expression (6) are in the so-called triangular form [13]. We denote the k-dimensional triangular form by $H_{tri}^{[k]}(s_1,\ldots ,s_k)$. There are some other useful forms such as the symmetric form and the regular form of the multivariate transfer functions as discussed in [13]. The triangular form is related to the symmetric form by the following expression

$$\begin{aligned} H_{sym}^{[k]}(s_1,\ldots ,s_k)=\frac{1}{n!}\sum _{\pi (\cdot )}H_{tri}^{[k]}(s_{\pi (1)}, \ldots ,s_{\pi (k)}), \end{aligned}$$

(7)

where the summation includes all k! permutations of $s_1,\ldots ,s_k$. Also, the triangular form can be connected to the regular form of the transfer function by using

$$\begin{aligned} H_{tri}^{[k]}(s_1,\ldots ,s_k)=H_{reg}^{[k]}(s_1,s_1+s_2,\ldots ,s_1+s_2+\cdots +s_k). \end{aligned}$$

(8)

According to [13], the structure of the generalized symmetric transfer functions can be identified by the growing exponential approach. The structure of these symmetric transfer functions for the first two subsystems of the quadratic-bilinear system (1) can be written as

$$\begin{aligned} \begin{aligned} H_1(s_1)&=C(s_1E-A)^{-1}B,\\ H_2(s_1,s_2)&=C((s_1+s_2)E-A)^{-1}B(s_1,s_2), \end{aligned} \end{aligned}$$

(9)

where

$$\begin{aligned} \begin{aligned}&B(s_1,s_2) = :Q(x_1(s_1)\otimes x_1(s_2))+\frac{1}{2!}N(x_1(s_1)+x_1(s_2)), \end{aligned} \end{aligned}$$

(10)

in which $x_1(s):=(sE-A)^{-1}B$. Defining $x_2(s_1,s_2):= ((s_1+s_2)E-A)^{-1}B(s_1,s_2)$, the first two (first- and second-order) symmetric transfer functions can be written as

$$\begin{aligned} \begin{aligned} H_1(s_1)&=Cx_1(s_1),\\ H_2(s_1,s_2)&=Cx_2(s_1,s_2). \end{aligned} \end{aligned}$$

(11)

Before going into the partial differentiation of these multivariate transfer functions, we introduce the concept of matricization. The process of reshaping a tensor into a matrix is called matricization. In [11], the matrix $Q\in \mathbb {R}^{n\times n^2}$ is considered as the mode-1 matricization of a 3 dimensional tensor $\mathcal {Q}\in \mathbb {R}^{n\times n\times n}$. The $n\times n$ components of Q are the frontal slices $\mathcal {Q}_i \in \mathbb {R}^{n\times n}$, $i=1,\ldots ,n$ of the tensor $\mathcal {Q}$, i.e. $Q = \begin{bmatrix} \mathcal {Q}_1&\cdots&\mathcal {Q}_n\end{bmatrix}$. The mode-2 and mode-3 matricizations can be defined as

$$\begin{aligned} \begin{aligned} Q^{(2)}&= \begin{bmatrix} \mathcal {Q}_1^T&\cdots&\mathcal {Q}_n^T\end{bmatrix},\\ Q^{(3)}&= \begin{bmatrix} vec(\mathcal {Q}_1)&\cdots&vec(\mathcal {Q}_n)\end{bmatrix}^T. \end{aligned} \end{aligned}$$

Note that the concept of matricization allows us to symmetrize Q to $\tilde{Q}$ so that $Q(x\otimes x) = \tilde{Q}(x\otimes x)$ holds and the commutativity property $\tilde{Q}(u\otimes v)=\tilde{Q}(v\otimes u)$ for arbitrary choices of $u,v\in \mathbb {R}^n$ is enforced. In addition, the property

$$\begin{aligned} w^TQ(u\otimes v)=u^TQ^{(2)}(v\otimes w), \end{aligned}$$

(12)

also holds, where $w,u,v\in \mathbb {R}^{n}$ are arbitrary and Q is assumed to be in the symmetrized form, see [23]. Let $G(s):= sE-A$, then by using

$$\begin{aligned} \frac{\partial G(s)^{-1} }{\partial s}= -G(s)^{-1}\frac{\partial G(s)}{\partial s} G(s)^{-1}, \end{aligned}$$

and (12), we have

$$\begin{aligned} \begin{aligned} \frac{\partial H_2(s_1,s_2) }{\partial s_1}= -y_1(s_1+s_2)^{T}Ex_2(s_1,s_2)-x_1(s_1)^TE^Ty_2(s_1,s_2), \end{aligned} \end{aligned}$$

(13)

where $y_1(s):= (sE-A)^{-T}C^T$ and $y_2(s_1, s_2):= (s_1E-A)^{-T}C(s_1,s_2)^T$ in which

$$\begin{aligned} C(s_1,s_2) = Q^{(2)}\big (x_1(s_2)\otimes y_1(s_1+s_2)\big )+\frac{1}{2!}N^Ty_1(s_1+s_2). \end{aligned}$$

Similarly

$$\begin{aligned} \begin{aligned} \frac{\partial H_2(s_1,s_2) }{\partial s_2}= -y_1(s_1+s_2)^{T}Ex_2(s_1,s_2)-x_1(s_2)^TE^Ty_2(s_2,s_1). \end{aligned} \end{aligned}$$

(14)

Notice that when $s_1=s_2=\sigma $, the two partial differentiations are the same. This condition on interpolation points is assumed in [11] to show the moment-matching properties of the ROM. In the following, we show moment-matching in the multivariate settings when $s_1\ne s_2$ ($s_1=\sigma _{1i}$ and $s_2=\sigma _{2i}$).

Moment-matching for QBDAE

The goal of a moment-matching based reduction approach is to ensure that the high-order transfer functions are well approximated. In case of symmetric transfer functions, we can represent it as

$$\begin{aligned} H_k(s_1,\ldots ,s_k)\approx \hat{H}_k(s_1,\ldots ,s_k), \quad \text{ for } k=1,\ldots ,K, \end{aligned}$$

(15)

with $\hat{H}_k(s_1,\ldots ,s_k)$ being the k-th order multivariate transfer function of the reduced system (2). With the task in (15) achieved for some K, we can expect that the output y(t) is well approximated by $\hat{y}(t)$. To get recursive relations between vectors for approximation subspaces, it is assumed in [11] that $s_1=s_2=\sigma $. With these settings, the second-order transfer function becomes

$$\begin{aligned} H_2(\sigma ,\sigma )= y(2\sigma )^T\Big ( Q \left( x_1(\sigma )\otimes x_1(\sigma )\right) + Nx_1(\sigma )\Big ). \end{aligned}$$

The following Lemma summarizes the result introduced in [11].

Lemma 1

Let $\sigma _i\in \mathbb {C}$ be the interpolation points and $\sigma _i\notin \{\Lambda (A,E), \Lambda (A_r,E_r)\}$, where $\Lambda (A,E)$ represents the generalized eigenvalues of the matrix pencil $\lambda E-A$. Assume that $\hat{E}=W^TEV$ is nonsingular and $\hat{A}$, $\hat{Q}$, $\hat{N}$, $\hat{B}$, $\hat{C}$ are as in (4) with full rank matrices $V,W\in \mathbb {R}^{n{\times }r}$ such that

$$\begin{aligned} \begin{aligned}&\textrm{span}(V)=\textrm{span}_{i=1,\ldots ,k}\{x_1(\sigma _i), ~x_2(\sigma _i,\sigma _i)\}, \\&\textrm{span}(W)=\textrm{span}_{i=1,\ldots ,k} \{y_1(2\sigma _i),~y_2(\sigma _i,\sigma _i)\}, \end{aligned} \end{aligned}$$

then the reduced QBDAE satisfies the following (Hermite) interpolation conditions:

$$\begin{aligned} \begin{aligned} H_1(\sigma _i)&=\hat{H}_1(\sigma _i), \qquad \quad H_1(2\sigma _i)=\hat{H}_1(2\sigma _i),\\ H_2(\sigma _i,\sigma _i)&=\hat{H}_2(\sigma _i,\sigma _i), \quad \frac{\partial }{\partial s_j}H_2(\sigma _i,\sigma _i)=\frac{\partial }{\partial s_j}\hat{H}_2(\sigma _i,\sigma _i),~~ j=1,2. \end{aligned} \end{aligned}$$

See [11] for a proof. Next, we present moment-matching properties in the multivariable settings, where $s_1\ne s_2$.

Lemma 2

Let $\sigma _{1i},\sigma _{2i}\in \mathbb {C}$ with $\sigma _{1i},\sigma _{2i}\notin \{\Lambda (A,E), \Lambda (A_r,E_r)\}$. Assume that $\hat{E}=W^TEV$ is nonsingular and $\hat{A}$, $\hat{Q}$, $\hat{N}$, $\hat{B}$, $\hat{C}$ are as in (4) with full rank matrices $V,W\in \mathbb {R}^{n{\times }r}$ such that

$$\begin{aligned} \begin{aligned}&\textrm{span}(V)=\textrm{span}_{i=1,\ldots ,k}\{x_1(\sigma _{1i}), ~x_1(\sigma _{2i}),~x_2(\sigma _{1i},\sigma _{2i})\}\\&\textrm{span}(W)=\textrm{span}_{i=1,\ldots ,k} \{y_1(\sigma _{1i}+\sigma _{2i}),~y_2(\sigma _{1i},\sigma _{2i}),~y_2(\sigma _{2i},\sigma _{1i})\}. \end{aligned} \end{aligned}$$

Then the reduced QBDAE satisfies the following (Hermite) interpolation conditions:

$$\begin{aligned} \begin{aligned} H_1(&\sigma _{1i})=\hat{H}_1(\sigma _{1i}), \quad H_1(\sigma _{2i})=\hat{H}_1(\sigma _{2i}), \quad H_1(\sigma _{1i}+\sigma _{2i})=\hat{H}_1(\sigma _{1i}+\sigma _{2i}),\\&H_2(\sigma _{1i},\sigma _{2i})=\hat{H}_2(\sigma _{1i},\sigma _{2i}), \quad \frac{\partial }{\partial s_1}H_2(\sigma _{1i},\sigma _{2i})=\frac{\partial }{\partial s_1}\hat{H}_2(\sigma _{1i},\sigma _{2i}),\\&\qquad \frac{\partial }{\partial s_2}H_2(\sigma _{2i},\sigma _{1i})=\frac{\partial }{\partial s_2}\hat{H}_2(\sigma _{2i},\sigma _{1i}). \end{aligned} \end{aligned}$$

The proof of the statement is similar to Lemma 1 and therefore omitted. Note that the statement in Lemma 2 reduces to Lemma 1, if $\sigma _{1i}=\sigma _{2i}$. In the remaining part of the paper, our goal is to identify a good choice of the interpolation points $\sigma _{1i}$ and $\sigma _{2i}$.

Error bound for QBDAEs

In this section, we show how the error bound expression, derived initially in [22] for parametric linear time invariant systems, can be extended to quadratic-bilinear DAEs. We begin with a brief overview of the error bound for the first subsystem, as in [22] and then discuss the extension to the second subsystem of the QBDAE (1).

Error bound for $H_1(s_1)$

Here the error bound provides an estimate for the error between $H_1(s_1)$ and $\hat{H}_1(s_1)$. To this end, we define the primal and the dual systems as:

$$\begin{aligned} (s_1E-A)x_1(s_1)&=B, \end{aligned}$$

(16)

$$\begin{aligned} (s_1E-A)^Tx_1^{du}(s_1)&=-C^T, \end{aligned}$$

(17)

respectively, where T denotes the transpose of a matrix. The error bound is constructed so that it is based on two residuals, which result from MOR of the primal and the dual system, respectively. The primal system is reduced using the matrix pair $V_{1}$ and $W_{1}$,

where

$$\begin{aligned} \textrm{span}(V_{1})=\textrm{span}_{i=1,\ldots ,k}\{x_1(\sigma _{1i})\}, \quad \textrm{span}(W_{1})=\textrm{span}_{i=1,\ldots ,k}\{x_1^{du}(\sigma _{1i})\}. \end{aligned}$$

(18)

As a result, the reduced primal system is,

$$\begin{aligned} (s_1\hat{E}_1-\hat{A}_1)z_1(s_1)=\hat{B}, \end{aligned}$$

where $\hat{E}_1= W_1^T EV_1$, $\hat{A}_1= W_1^T AV_1$, $\hat{B}_1= W_1^T B$ and $\hat{C}_1= CV_1$. Here $\hat{x}_1(s_1):=V_1z_1(s_1)$ is the approximation of $x_1(s_1)$. Due to the dual relation between (16) and (17), the dual system can be reduced by using $V_1^{du} = W_{1}$ and $W_{1}^{du} = V_1$. The reduced dual system is

$$\begin{aligned} (s_1\tilde{E}_1-\tilde{A}_1)^Tz_1^{du}(s_1)=-\tilde{C}_1^T, \end{aligned}$$

where $\tilde{E}_1= V_1^T EW_1$, $\tilde{A}_1= V_1^T AW_1$, $\tilde{C}_1= W_1^T C^T$. Also $\tilde{x}_1^{du}(s_1):=W_1z_1^{du}(s_1)$ is the approximation of $x_1^{du}(s_1)$. The residuals associated with the reduction of the primal and the dual systems can be written as

$$\begin{aligned} \begin{aligned}&r_1^{pr}(s_1)=B-(s_1E-A)V_1z_1(s_1),\\ {}&r_1^{du}(s_1)=-C^T-(s_1E-A)^TW_1z_1^{du}(s_1). \end{aligned} \end{aligned}$$

(19)

With these quantities, the following result provides an a posteriori upper bound on the approximation error, $|H_1(s_1)-\hat{H}_1(s_1)|$:

Theorem 1

[22] The upper bound on the approximation of the transfer function $H_1(s_1)=C(s_1E-A)^{-1}B$ can be written as $|H_1(s_1)-\hat{H}_1(s_1)| \le \Delta _1(s_1)$, where

$$\begin{aligned} \Delta _1(s_1):=\frac{\Vert r_1^{du}(s_1)\Vert _2\Vert r_1^{pr}(s_1)\Vert _2}{\beta _1(s_1)}, \end{aligned}$$

(20)

in which $\beta _1(s_1)=\sigma _{\min } (G(s_1))$, where $\sigma _{\min }$ indicates the smallest singular value of $G(s_1)$.

Error bound for $H_2(s_1,s_2)$

Analogous to $H_1(s_1)$, we define the primal and dual systems as:

$$\begin{aligned}&G(s_{1}+s_2)x_2(s_{1},s_2)=B(s_1,s_2), \end{aligned}$$

(21)

$$\begin{aligned}&G^T(s_{1}+s_2)x_2^{du}(s_{1},s_2)=-C^T, \end{aligned}$$

(22)

respectively. The interpolation points for $H_1(s_1)$ can be identified through the error bound $\Delta _1(s_1)$ by using a greedy framework as presented in [22]. This means that we can select $\sigma _{1i}$ for $i=1,\ldots ,r$ as the interpolation points corresponding to the maximal values of the error bound at subsequent iterations of the greedy algorithm in [22]. With these interpolation points fixed for $s_1$, we can also express the error bound for the second subsystem. The error bound is constructed based on two residuals, which result from MOR of the primal and the dual systems in (21) (22), respectively. The primal system is reduced using the matrix pair $V_{2}$ and $W_{2}$, where

$$\begin{aligned} \textrm{span}(V_{2})=\textrm{span}_{j=1,\ldots ,k}\{x_2(\sigma _{1i},\sigma _{2j})\}, \quad \textrm{span}(W_{2})=\textrm{span}_{j=1,\ldots ,k}\{x_2^{du}(\sigma _{1i},\sigma _{2j})\}. \end{aligned}$$

(23)

As a result, the reduced primal system is

$$\begin{aligned} ((s_1+s_2)\hat{E}_2-\hat{A}_2)z_2(s_1,s_2)=\hat{B}(s_1,s_2), \end{aligned}$$

where $\hat{E}_2= W_2^T EV_2$, $\hat{A}_2= W_2^T AV_2$, $\hat{B}(s_1,s_2)= W_2^T B(s_1,s_2)$ and $\hat{C}_2= CV_2$. Similarly, the dual system is reduced using the matrix pair $V_{2}^{du}$ and $W_{2}^{du}$,

$$\begin{aligned} \textrm{span}(V_{2}^{du})=\textrm{span}_{i=1,\ldots ,k}\{x_2^{du}(\sigma _{1i},\sigma _{2i})\}, \quad \textrm{span}(W_{2}^{du})=\textrm{span}_{i=1,\ldots ,k}\{x_2(\sigma _{1i},\sigma _{2i})\}. \end{aligned}$$

(24)

The reduced dual system is

$$\begin{aligned} ((s_1+s_2)\tilde{E}_2-\tilde{A}_2)^Tz_2^{du}(s_1,s_2)=-\tilde{C}_2^T, \end{aligned}$$

where $\tilde{E}_2= (W_2^{du})^T EV_2^{du}$, $\tilde{A}_2= (W_2^{du})^T AV_2^{du}$, $\tilde{C}^T_2= (V_2^{du})^T C^T$. The residuals associated with the reduction of the primal and dual systems can be written as

$$\begin{aligned} \begin{aligned}&r_2^{pr}(s_1,s_2)=B(s_1,s_2)-((s_1+s_2)E-A)V_2z_2(s_1,s_2),\\ {}&r_2^{du}(s_1,s_2)=-C^T-((s_1+s_2)E-A)^TV_2^{du}z_2^{du}(s_1,s_2). \end{aligned} \end{aligned}$$

(25)

With these quantities, the following result provides an a posteriori upper bound on the approximation error, $|H_2(s_1,s_2)-\hat{H}_2(s_1,s_2)|$:

Theorem 2

The upper bound on the approximation of

$$\begin{aligned} H_2(s_1,s_2)=C((s_1+s_2)E-A)^{-1}B(s_1,s_2) \end{aligned}$$

can be written as $|H_2(s_1,s_2)-\hat{H}_2(s_1,s_2)| \le \Delta _2(s_1,s_2)$,

where

$$\begin{aligned} \Delta _2(s_1,s_2):=\frac{\Vert r_2^{du}(s_1,s_2)\Vert _2 \Vert r_2^{pr}(s_1,s_2)\Vert _2}{\beta _2(s_1,s_2)}, \end{aligned}$$

(26)

in which $\beta _2(s_1,s_2)=\sigma _{\min } (G(s_1+s_2))$, where $\sigma _{\min }$ indicates the smallest singular value of $G(s_1+s_2)=(s_1+s_2)E-A$.

The proof is similar to Theorem 1 and therefore is omitted.

Interpolation points using error bounds

As discussed in “Background”, the projection matrices V and W defined in Lemma 2 require a good choice of interpolation points $\sigma _{1i}$ and $\sigma _{2i}$ which also serve as interpolation points for MOR of the primal and dual systems in (16)-(17) and (21)-(22). In this section, we show the use of the error bound expressions derived previously to select the interpolation points.

The idea is to identify interpolation points corresponding to the maximal bound $\Delta _1 (s_1)$. Assuming that $\sigma _{1i}$ are the selected interpolation points for $s_1$, the remaining interpolation points for $s_2$ correspond to the maximal bound $\Delta _2(\sigma _{1i},s_2)$ for each value of $\sigma _{1i}$. In this way, the error bound can be used iteratively to select a good choice of interpolation points in a predefined sample space, starting from an initial choice of sigma’s. The sample spaces $S_1$ and $S_2$ can be arbitrarily selected with some fixed size. One possible choice is to use the $\mathcal {H}_2$-(sub)optimal interpolation points obtained from IRKA applied to the linear part of (1) and some other random interpolation points in the complex plane around IRKA points. The selected interpolation points are then used to construct and update the required basis matrices V and W, by using the multimoment-matching technique described before. It is interesting to see that although we need to construct the ROMs for the primal and the dual systems in (16), (17) and (21), (22), the projection matrices for those ROMs are obtained without extra computations, since $V_1, W_1$ and $V_2, W_2$ are part of V, W by definition. Therefore, V, W can be obtained by orthogonalizing $V_1$ with $V_2$ and $W_1$ with $W_2$ as indicated in Step of Algorithm 1, where a greedy framework for selecting interpolation points is presented. For an initial pair of interpolation points, the ROMs of the primal and the dual systems in (16), (17) and (21), (22) are constructed and the error bounds $\Delta _1, \Delta _2$ are computed. A new pair is selected such that the corresponding error bounds $\Delta _1$ and $\Delta _2$ are maximized at these points. With the selected interpolation points, we enrich the projection matrices V, W for MOR of the original quadratic-bilinear system iteratively during the greedy algorithm. Finally, the reduced quadratic bilinear system is constructed using V, W that are derived upon convergence of Algorithm 1. Algorithm 1 stops when $\Delta :=\Delta _1+\Delta _2$ is below the tolerance $\epsilon _{tol}$, where $\Delta $ includes the errors introduced by approximating the first and second transfer functions. Since the interpolation points are selected according to the error bounds $\Delta _1$ and $\Delta _2$, it is important that the error bounds dynamically reflect the decay of the true error with the iteration of the greedy algorithm. Ideally, the error bounds should be very close to the true error. Numerical tests in the next section show that the error bounds really control the true error robustly.

Numerical results

We consider three benchmark examples for our results on MOR of QBDAE systems. The results are compared with the one-sided and two-sided projection methods, where the interpolation points are computed by IRKA, implemented on the linear part of the system. We represent the proposed method by 1s/2s-greedy (one-sided/two-sided projection with greedy based interpolation points) and the method from literature by 1s/2s-IRKA (one-sided/two- sided projection with IRKA interpolation points). The use of IRKA on the linear part of the QBDAE system on convergence results in IRKA interpolation points which in the greedy framework is used to define the initial guess of the optimal points. The Max. True Error in the tables is defined as $\max \limits _{s_1, s_2 \in S_2} |H_1(s_1)-\hat{H}_1(s_1)|+|H_2(s_1,s_2)-\hat{H}_2(s_1,s_2)|$ and the Max. Est. Error is $\max \limits _{s_1, s_2 \in S_2} \Delta (s_1, s_2)$.

Nonlinear RC circuit

The nonlinear RC circuit was first considered in [24] and since then it has been used in many papers for nonlinear MOR [5]. Consider the voltage v and the current function g(v). Then the I-V characteristics can be represented as: $g(v)= e^{40v} + v -1$. The nonlinearity in the current function results in a nonlinear model. All the capacitances are fixed to $C=1$. Figure 1 shows the complete circuit.

It is shown in [4] that the nonlinearity in the RC circuit can be written in quadratic-bilinear form as in (1) by introducing some auxiliary variables. The transformation is exact, but the dimension of the system increases to $n =2\cdot l$, where l represents the number of nodes in Figure 1, and it is also the dimension of the original nonlinear system.

For our results, we set $l=50$, so $n=100$ and use two-sided projection to reduce the system. Table 1 shows the results with tolerance $\epsilon _{tol}=1e^{-5}$ and an initial choice of interpolation points as $\sigma _{1}=\sigma _{20}=119.5642$.

Table 1 Error estimation results for RC circuit

Full size table

The second column of Table 1 shows interpolation points that are identified by the greedy framework and are based on the error bound. It is clear that the error bound tightly catches the true error and can be used as a surrogate of the true error to select the interpolation points. The size of the ROM obtained from both approaches has been kept the same i.e. $r_1 = r_2 = 12$. For the input $u(t) = e^{-t}$, the output of the original model and ROMs along with corresponding relative errors are shown in Fig. 2.

Figure 2a shows the comparison of transient response of the two approaches, while Fig. 2b plots relative errors of the two approaches. It is clearly seen that 1s-greedy and 2s-greedy outperform 1s-IRKA and 2s-IRKA, respectively, in terms of accuracy.

Burgers’ equation

In nonlinear MOR, the 1D Burgers’ equation is commonly used as a benchmark [2, 11]. The mathematical model of the 1D Burgers’ equation with $\Gamma = (0,1) \times (1,T)$ is:

$$\begin{aligned} \begin{aligned} \upsilon _t + \upsilon \upsilon _x&= \nu \cdot \upsilon \upsilon _{xx},{} & {} &\text {in } \Gamma , \\ \alpha \upsilon (0,t) +\beta x(0,t)&= u(t),&\upsilon _x(1,t) = 0,{} & {} t\in (0,T), \\ \upsilon (x,0)&= \upsilon _0(x),&\upsilon _0(x) = 0,{} & {} x\in (0,1). \\ \end{aligned} \end{aligned}$$

(27)

We use it as an example to test our proposed method. We keep the size of the original model as $n = 1000$. Table 2 shows our results with tolerance $\epsilon _{tol}=1e^{-4}$ and an initial choice of interpolation points as $\sigma _{10} = \sigma _{20} = 5.4124$.

Table 2 Error estimation results for Burgers’ equation

Full size table

The second column of the table shows the interpolation points that are based on the error bound and identified by the greedy framework. Similarly, the error bound again tightly bounds the true error and therefore is reliable for choosing the interpolation points in the greedy algorithm. The sizes of the ROMs obtained from both approaches are the same, i.e. $r_1 = r_2 = 16$. The ROMs constructed from IRKA interpolation points and the proposed framework are shown in Fig. 3 for input $u(t) = cos(\pi t)$.

Figure 3a shows the transient responses of the Burgers’ equation computed from simulating the original model and the two different MOR approaches, while Fig. 3b compares the absolute response errors of the ROMs derived using the two approaches. The absolute error of the ROM constructed using the proposed methodology of choosing interpolation points is less than that of the ROM constructed using IRKA interpolation points, especially for the two-sided projection.

FitzHugh–Nagumo system

We use the FitzHugh–Nagumo system as our third example to check our results. The FitzHugh–Nagumo system can be represented as [14]:

$$\begin{aligned} \begin{aligned} \epsilon \upsilon _t(x,t) = \epsilon ^{2}\upsilon _{xx}(x,t) + f(\upsilon (x,t)) - w(x,t) + g , \\ w_t(x,t) = h\upsilon (x,t) -\gamma w(x,t) +g, \\ \end{aligned} \end{aligned}$$

(28)

with $f(\upsilon ) = \upsilon (\upsilon -0.1)(1-\upsilon )$ and boundary conditions

$$\begin{aligned} \begin{aligned} \upsilon (x,0)&= 0,&w(x,0) = 0, \\ \upsilon _x(0,t)&= - i_0(t),&\upsilon _x(1,t) = 0.\\ \end{aligned} \end{aligned}$$

(29)

Here, we choose $\epsilon = 0.015$, $h=0.5$, $\gamma = 0.05$ and $i_0(t) = 5 \times 10^4 t^3 e^{-15t}$. When standard finite difference method is applied to numerically discretize the PDEs in (28), a system of ODEs with cubic non-linearities is obtained. We can get a quadratic-bilinear system by introducing new variables. For an original discretized system with size $\bar{n}$, a quadratic-bilinear system has the size of $n = 3\bar{n}$. We set $\bar{n} = 100$, which gives rise to quadratic-bilinear system of order $n = 300$. Then we choose interpolation points using the proposed greedy framework to construct a ROM of size $r = 26$ and then compare it with the ROM of the same size, which is constructed from the interpolation points using IRKA. Table 3 shows our results with tolerance $\epsilon _{tol}=1e^{-6}$ and the interpolation points $\sigma _{10}=\sigma _{20}=534.69$.

Table 3 Error estimation results for the FitzHugh–Nagumo model

Full size table

Table 3 shows the interpolation points that are selected by the error bound and the decay of the true error and the error bound at each iteration of the greedy algorithm. The error bound once more estimates the true error accurately, implicating that the selected interpolation points indeed nearly correspond to the largest error. The sizes of the ROMs obtained from both approaches are the same, i.e. $r_1 = r_2 = 26$. Figure 4 shows the transient responses of the FitzHugh–Nagumo system computed from simulating the original model and two approaches.

The input signal is $u(t) =50000t^3 e^{-15t}$. It is seen that the 1s-greedy performs better than the 1s-IRKA when the outputs in both cases are compared with that of the original model; however, 2s-greedy and 2s-IRKA produce unstable responses.

Conclusions

In this paper, the proposed methodology of choosing interpolation points for construction of ROM of the first- and second-order transfer functions of quadratic-bilinear systems has been tested for three different models. The results have also been compared with ROMs of the same size constructed using the interpolation points chosen by linear IRKA. In each case, the ROMs constructed using interpolation points from the greedy framework yield better approximation of the output than the ROMs constructed from IRKA.

Data availability statement

The model data of FitzHugh–Nagumo System and RC ladder is available at MOR https://morwiki.mpi-magdeburg.mpg.de/morwiki/index.php/Category:Benchmark.

Abbreviations

MOR:: Model order reduction
ODE:: Ordinary differential equation
IRKA:: Iterative rational krylov algorithm
ROM:: Reduced ordered model
PDE:: Partial differential equation
QBDAE:: Quadratice blinear differential algebraic equation
VLSI:: Very large scale integration

References

Grundel S, Hornung N, Klaassen B, Benner P, Clees T. Computing surrogates for gas network simulation using model order reduction. In: Koziel S, Leifsson L, editors. Surrogate-based modeling and optimization. Berlin: Springer; 2013. p. 189–212.
Chapter Google Scholar
Kunisch K, Volkwein S. Proper orthogonal decomposition for optimality systems. ESAIM Math Model Numer Anal. 2008;42(1):1–23.
Article MathSciNet MATH Google Scholar
Phillips JR. Projection-based approaces for model reduction of weakly nonlinear, time-varying systems. IEEE Trans Comput Aided Des Integr Circuits Syst. 2003;22(2):171–87.
Article Google Scholar
Gu C. QLMOR: a projection-based nonlinear model order reduction approach using quadratic-linear representation of nonlinear systems. IEEE Trans Comput Aided Des Integr Circuits Syst. 2011;30(9):1307–20.
Article Google Scholar
Freund RW. The SPRIM algorithm for structure-preserving order reduction of general RCL circuits. In: Benner P, Hinze M, ter Maten EJW, editors. Model reduction for circuit simulation. Springer: Berlin; 2011. p. 25–52.
Chapter Google Scholar
Antoulas AC. Approximation of large-scale dynamical systems. Philadelphia: SIAM Publications; 2005.
Book MATH Google Scholar
Baur U, Benner P, Feng L. Model order reduction for linear and nonlinear systems: a system-theoretic perspective. Arch Comput Methods Eng. 2014;21(4):331–58.
Article MathSciNet MATH Google Scholar
Benner P, Grivet-Talocia S, Quarteroni A, Rozza G, Schilders W, Silveira LM, editors. Model order reduction. vol. 1: System- and data- driven methods and algorithms, De Gruyter, 2021.
Grimme EJ. Krylov projection methods for model reduction, Phd thesis, Univ. of Illinois at Urbana-Champaign, USA; 1997.
Benner P, Feng L. Model order reduction based on moment-matching. In: Benner P, Grivet-Talocia S, Quarteroni A, Rozza G, Schilders W, Silveira LM, editors. Model order reduction. vol. 1: system- and data-driven methods and algorithms, De Gruyter, 2021, Ch. 3, pp. 57–96.
Benner P, Breiten T. Two-sided projection methods for nonlinear model order reduction. SIAM J Sci Comput. 2015;37(2):B239–60.
Article MathSciNet MATH Google Scholar
Ahmad M, Benner P, Jaimoukha I. Krylov subspace methods for model reduction of quadratic-bilinear systems. IET Control Theory Appl. 2016;10:2010–8.
Article MathSciNet Google Scholar
Rugh RJ. Nonlinear system theory. Baltimore: Johns Hopkins University Press; 1981.
MATH Google Scholar
Benner P, Goyal P, Gugercin S. H2-quasi-optimal model order reduction for quadratic-bilinear control systems. SIAM J Matrix Anal Appl. 2018;39(2):983–1032.
Article MathSciNet MATH Google Scholar
Mayo AJ, Antoulas AC. A framework for the solution of the generalized realization problem. Linear Algebra Appl. 2007;425(2–3):634–62.
Article MathSciNet MATH Google Scholar
Ionita AC, Antoulas AC. Data-driven parametrized model reduction in the loewner framework. SIAM J Sci Comput. 2014;36(3):A984–1007.
Article MathSciNet MATH Google Scholar
Gosea IV, Antoulas AC. Model reduction of linear and nonlinear sys315 tems in the loewner framework: a summary. In: European Control Conference (ECC), IEEE, 2015, pp. 345–349.
Ahmad M, Feng L, Benner P. A new interpolatory model reduction for quadratic-bilinear descriptor systems. Proc Appl Math Mech. 2015;15(1):589–90.
Article Google Scholar
Ahmad MI, Benner P, Feng L. Interpolatory model reduction for quadratic-bilinear systems using error estimators. Eng Comput. 2019;36(1):25–44.
Article Google Scholar
Yang J-M, Jiang Y-L. Krylov subspace approximation for quadraticbilinear differential system. Int J Syst Sci. 2018;49(9):1950–63.
Article MATH Google Scholar
Liljegren-Sailer B, Marheineke N. Input-tailored system-theoretic model order reduction for quadratic-bilinear systems. SIAM J Matrix Anal Appl. 2022;43(1):1–39.
Article MathSciNet MATH Google Scholar
Feng L, Antoulas AC, Benner P. Some a posteriori error bounds for reduced-order modelling of (non-) parametrized linear systems. ESAIM Math Model Numer Anal. 2017;51(6):2127–58.
Article MathSciNet MATH Google Scholar
Kolda TG, Bader BW. Tensor decompositions and applications. SIAM Rev. 2009;51(3):455–75.
Article MathSciNet MATH Google Scholar
Chen Y. Model reduction for nonlinear systems, Master’s thesis, Massachusetts Institute of Technology; 1999.

Download references

Acknowledgements

Muhammad Altaf Khattak and Mian Ilyas Ahmad are supported by HEC Pakistan under NRPU Project ID 10176.

Funding

This research is funded by HEC, Pakistan under NRPU Project ID 10176.

Author information

Authors and Affiliations

School of Interdisciplinary Engineering and Sciences (SINES), National University of Sciences and Technology (NUST), Islamabad, 44000, Pakistan
Muhammad Altaf Khattak & Mian Ilyas Ahmad
Computational Methods for Systems and Control, Max Planck Institute for Dynamics of Complex Technical Systems, Sandtorstrasse 1, Magdeburg, 39106, Germany
Lihong Feng & Peter Benner
Faculty of Mathematics, Otto von Guericke University Magdeburg Universitätsplatz 2, 39106, Magdeburg, Germany
Peter Benner

Authors

Muhammad Altaf Khattak
View author publications
You can also search for this author in PubMed Google Scholar
Mian Ilyas Ahmad
View author publications
You can also search for this author in PubMed Google Scholar
Lihong Feng
View author publications
You can also search for this author in PubMed Google Scholar
Peter Benner
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed equally in this research. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Lihong Feng.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Khattak, M.A., Ahmad, M.I., Feng, L. et al. Multivariate moment-matching for model order reduction of quadratic-bilinear systems using error bounds. Adv. Model. and Simul. in Eng. Sci. 9, 23 (2022). https://doi.org/10.1186/s40323-022-00236-6

Download citation

Received: 17 March 2022
Accepted: 10 October 2022
Published: 12 December 2022
DOI: https://doi.org/10.1186/s40323-022-00236-6

Multivariate moment-matching for model order reduction of quadratic-bilinear systems using error bounds

Abstract

Introduction