On the use of neural networks to evaluate performances of shell models for composites

Petrolo, Marco; Carrera, Erasmo

doi:10.1186/s40323-020-00169-y

Research article
Open access
Published: 07 July 2020

On the use of neural networks to evaluate performances of shell models for composites

Advanced Modeling and Simulation in Engineering Sciences volume 7, Article number: 31 (2020) Cite this article

2529 Accesses
5 Citations
Metrics details

Abstract

This paper presents a novel methodology to assess the accuracy of shell finite elements via neural networks. The proposed framework exploits the synergies among three well-established methods, namely, the Carrera Unified Formulation (CUF), the Finite Element Method (FE), and neural networks (NN). CUF generates the governing equations for any-order shell theories based on polynomial expansions over the thickness. FE provides numerical results feeding the NN for training. Multilayer NN have the generalized displacement variables, and the thickness ratio as inputs, and the target is the maximum transverse displacement. This work investigates the minimum requirements for the NN concerning the number of neurons and hidden layers, and the size of the training set. The results look promising as the NN requires a fraction of FE analyses for training, can evaluate the accuracy of any-order model, and can incorporate physical features, e.g., the thickness ratio, that drive the complexity of the mathematical model. In other words, NN can trigger fast informed decision-making on the structural model to use and the influence of design parameters without the need of modifying, rebuild, or rerun an FE model.

Introduction

Shell finite elements (FE) are standard options to model two-dimensional (2D) curved structures. In commercial codes, shell FE have the assumptions of the classical theories [1,2,3] leading to up to six degrees of freedom (DOF) per node. Such assumptions may be too restrictive in the case of composite structures in which the high transverse deformability and the transverse anisotropy require the proper modeling of shear and normal transverse stresses, and variations of the displacement field at the interface between two layers with different mechanical properties, i.e., the Zig-Zag effect [4]. 3D FE can incorporate such effects but can lead to prohibitive computational costs due to severe aspect ratio constraints. 2D FE remain computationally more efficient and attractive and, over the years, many strategies emerged to extend their capabilities via, for instance, the use of higher-order polynomial thickness expansions leading to increasing DOF per node [5]. This paper presents a new methodology to assess shell FE for linear static analyses of composites, and the following literature survey focuses on this specific area. More comprehensive reviews are in [6,7,8,9].

Concerning the solution schemes, analytical and FE strategies are among the most used. Analytical models received a great deal of interest as they provide very useful exact solutions to, for instance, verify FE modelings. Such exact solutions can take into account the shear deformability [10,11,12,13,14,15,16] or directly provide 3D solutions [17,18,19,20,21,22,23]. Research on refined shell FE focused on higher-order models [24,25,26], the inclusion of transverse stretching and continuity [27,28,29], and the development of solid-shell elements [30,31,32,33,34,35,36,37]. Regardless of the solution scheme, the most important strategies to enhance the capabilities of shell models are either asymptotic or axiomatic. The former exploit asymptotic expansions of most relevant parameter, e.g., the thickness ratio, to build models with a priori known accuracy as compared to 3D models [38,39,40,41]. The latter, on the other hand, build models based on assumptions and, usually, less assumptions lead to more cumbersome models. The axiomatic way has various directories starting from the improvement of classical models [42,43,44,45,46,47,48,49]. As mentioned above, the proper modeling of the transverse behavior of composites is decisive as proved by the efforts of many researchers over the past few years. The focus is on improved modelings of the interlaminar stresses and through-the-thickness continuity [50,51,52,53,54,55], shear correction factors [56], Zig-Zag models [57,58,59,60,61], Layer-Wise (LW) models [62,63,64,65], and mixed formulations [66,67,68] allowing for the a priori modeling of transverse stresses.

Another powerful approach is the Proper Generalized Decomposition (PGD) method [69, 70] in which the construction of the refined model and the solution of the problem take place simultaneously.

From the structural standpoint, the methodology in this paper adopts the Carrera Unified Formulation (CUF) allowing to obtain any-order shell theory without formal changes in the problem matrices [4, 71, 72]. One of the capabilities of CUF is the axiomatic/asymptotic method (AAM) [73, 74] to analyze the relevance of any generalized displacement variable. The systematic use of AAM leads to the definition of the Best Theory Diagram (BTD), i.e., a 2D plot to localize shell models with minimum DOF and maximum accuracies [75, 76]. One of the aims of this paper is to reduce the computational costs to obtain BTD via neural networks (NN). Such networks are mathematical models inspired by biological nervous systems and composed of simple computational units interlinked by a system of connections [77] to learn through training via samples. In this paper, CUF FE provides the samples for the supervised learning of multilayer perceptrons to evaluate the accuracy of refined shell models avoiding FE matrices and analyses. The use of NN in structural and material simulation is increasing due to the superior computational efficiency [78,79,80]. Recent applications for composites concern the prediction of the elastic properties [81], buckling load [82, 83], failure strength [84, 85], natural frequencies [86,87,88], and geometry optimization [89].

In this paper, “Finite element formulation” section provides a brief theoretical description of CUF and its FE formulation. “Best Theory Diagram” section introduces the concept of BTD. “Neural networks and coding” section describes the use of NN to evaluate the accuracy of a shell model. Results and conclusions are in “Results” and “Conclusions” sections, respectively.

Finite element formulation

The CUF displacement field for a 2D model is

$$\begin{aligned} {\mathbf {u}}(\alpha , \beta , z)=F_{\tau }(z){\mathbf {u}}_{\tau }(\alpha , \beta )\qquad \tau =1, \dots , M \end{aligned}$$

(1)

The Einstein notation acts on $\tau $. ${\mathbf {u}}$ is the displacement vector, $({\hbox {u}}_{x}\; {\hbox {u}}_{y}\; {\hbox {u}}_{z})^T$. ${\hbox {F}}_{\tau }$ are the thickness expansion functions. ${\mathbf {u}}_{\tau }$ is the vector of the generalized unknown displacements. M is the number of expansion terms. A fourth-order model, referred to as N = 4, is

$$\begin{aligned} \begin{aligned}&u_{x}=u_{x_{1}}+z\,u_{x_{2}}+z^{2}\,u_{x_{3}}+z^{3}\,u_{x_{4}}+z^{4}\,u_{x_{5}}\\&u_{y}=u_{y_{1}}+z\,u_{y_{2}}+z^{2}\,u_{y_{3}}+z^{3}\,u_{y_{4}}+z^{4}\,u_{y_{5}}\\&u_{z}=u_{z_{1}}+z\,u_{z_{2}}+z^{2}\,u_{z_{3}}+z^{3}\,u_{z_{4}}+z^{4}\,u_{z_{5}}\\ \end{aligned} \end{aligned}$$

(2)

and has 15 nodal DOF. The order and type of expansion is a free parameter; thus, the theory of structure is an input of the analysis. The metric coefficients ${\hbox {H}}^k_\alpha $, ${\hbox {H}}^k_\beta $ and ${\hbox {H}}^k_z$ of the ${\hbox {kth}}$ layer are

$$\begin{aligned} \begin{aligned} H^k_\alpha = A^k (1 + z_k/R^k_\alpha ), \;\;\; H^k_\beta = B^k (1 + z_k/R^k_\beta ), \;\;\; H^k_z = 1\; \end{aligned} \end{aligned}$$

(3)

${\hbox {R}}^k_\alpha $ and ${\hbox {R}}^k_\beta $ are the principal radii of the middle surface of the ${\hbox {kth}}$ layer, ${\hbox {A}}^k$ and ${\hbox {B}}^k$ the coefficients of the first fundamental form of $\Omega _k$, see Fig. 1. This paper focused only on shells with constant radii of curvature with ${\hbox {A}}^k = {\hbox {B}}^k = 1$. The geometrical relations are

$$\begin{aligned} \begin{aligned} {{\varvec{\epsilon }}}^k_p&= \begin{Bmatrix} \epsilon ^k_{\alpha \alpha }, \epsilon ^k_{\beta \beta }, \epsilon ^k_{\alpha \beta } \end{Bmatrix}^T = ({{\varvec{D}}}^k_p + {{\varvec{A}}}^k_p) {{\varvec{u}}}^k \\ {{\varvec{\epsilon }}}^k_n&= \begin{Bmatrix} \epsilon ^k_{\alpha z}, \epsilon ^k_{\beta z}, \epsilon ^k_{zz} \end{Bmatrix}^T = ({{\varvec{D}}}^k_{n\Omega } +{{\varvec{D}}}^k_{nz} - {{\varvec{A}}}^k_n) {{\varvec{u}}}^k \end{aligned} \end{aligned}$$

(4)

where

$$\begin{aligned} {\varvec{D}}^k_p= & {} \left[ \begin{array}{c@{\quad }c@{\quad }c} \frac{\partial _{\alpha }}{H^k_{\alpha }} &{} 0 &{} 0 \\ 0 &{} \frac{\partial _{\beta }}{H^k_{\beta }} &{} 0 \\ \frac{\partial _{\beta }}{H^k_{\beta }} &{} \frac{\partial _{\alpha }}{H^k_{\alpha }} &{} 0 \end{array} \right] \; \quad {\varvec{D}}^k_{n\Omega } = \left[ \begin{array}{c@{\quad }c@{\quad }c} 0 &{} 0 &{} \frac{\partial _{\alpha }}{H^k_{\alpha }} \\ 0 &{} 0 &{} \frac{\partial _{\beta }}{H^k_{\beta }} \\ 0 &{} 0 &{} 0 \end{array} \right] \; \quad {\varvec{D}}^k_{nz} = \left[ \begin{array}{ccc} \partial _z &{} 0 &{} 0 \\ 0 &{} \partial _z &{} 0 \\ 0 &{} 0 &{} \partial _z \end{array} \right] \; \end{aligned}$$

(5)

$$\begin{aligned} {\varvec{A}}^k_{p}= & {} \left[ \begin{array}{c@{\quad }c@{\quad }c} 0 &{} 0 &{} \frac{1}{H^k_{\alpha }R^k_{\alpha }} \\ 0 &{} 0 &{} \frac{1}{H^k_{\beta }R^k_{\beta }} \\ 0 &{} 0 &{} 0 \end{array} \right] \; {\varvec{A}}^k_{n} = \left[ \begin{array}{c@{\quad }c@{\quad }c} \frac{1}{H^k_{\alpha }R^k_{\alpha }} &{} 0 &{} 0 \\ 0 &{} \frac{1}{H^k_{\beta }R^k_{\beta }} &{} 0 \\ 0 &{} 0 &{} 0 \end{array} \right] \; \end{aligned}$$

(6)

The stress–strain relations are

$$\begin{aligned} \begin{aligned} {{\varvec{\sigma }}}_{p}^k&= \begin{Bmatrix} \sigma _{\alpha \alpha }^k, \sigma _{\beta \beta }^k, \sigma _{\alpha \beta }^k \end{Bmatrix}^T = {{\varvec{C}}}_{pp}^k {{\varvec{\epsilon }}}_{p}^k + {{\varvec{C}}}_{pn}^k {{\varvec{\epsilon }}}_{n}^k \\ {{\varvec{\sigma }}}_{n}^k&= \begin{Bmatrix} \sigma _{\alpha z}^k, \sigma _{\beta z}^k, \sigma _{z z}^k \end{Bmatrix}^T = {{\varvec{C}}}_{np}^k {{\varvec{\epsilon }}}_{p}^k + {{\varvec{C}}}_{nn}^k {{\varvec{\epsilon }}}_{n}^k \\ \end{aligned} \end{aligned}$$

(7)

where

$$\begin{aligned} {\begin{matrix} {{\varvec{C}}}_{pp}^k=&{}\left[ \begin{array}{c@{\quad }c@{\quad }c} C_{11}^k &{} C_{12}^k &{} C_{16}^k \\ C_{12}^k &{} C_{22}^k &{} C_{26}^k \\ C_{16}^k &{} C_{26}^k &{} C_{66}^k \end{array} \right] \qquad {{\varvec{C}}}_{pn}^k=\left[ \begin{array}{c@{\quad }c@{\quad }c} 0 &{} 0 &{} C_{13}^k\\ 0 &{} 0 &{} C_{23}^k\\ 0 &{} 0 &{} C_{36}^k \end{array} \right] \\ {{\varvec{C}}}_{np}^k= &{}\left[ \begin{array}{c@{\quad }c@{\quad }c} 0 &{} 0 &{} 0 \\ 0 &{} 0&{} 0\\ C_{13}^k &{} C_{23}^k &{} C_{36}^k \end{array} \right] \qquad {{\varvec{C}}}_{nn}^k=\left[ \begin{array}{c@{\quad }c@{\quad }c} C_{55}^k &{} C_{45}^k &{} 0 \\ C_{45}^k &{} C_{44}^k &{} 0 \\ 0 &{} 0 &{} C_{33}^k \end{array} \right] \end{matrix}} \end{aligned}$$

(8)

The FE formulation uses a nine-node shell element based on the Mixed Interpolation of Tensorial Component (MITC) method [90]. The displacement vector becomes

$$\begin{aligned} \delta {{\varvec{u}}}_{s} = N_j \delta {{\varvec{u}}}_{s j}, \quad \quad {{\varvec{u}}}_{\tau } = N_i {{\varvec{u}}}_{\tau i} \quad \quad i,j = 1,\cdots ,9 \end{aligned}$$

(9)

${{\varvec{u}}}_{\tau i}$ and $\delta {{\varvec{u}}}_{s j}$ are the nodal displacement vector and its virtual variation, respectively. The strain expression becomes

$$\begin{aligned} \begin{aligned} {{\varvec{\epsilon }}}_p&= F_{\tau } ({{\varvec{D}}}_p + {{\varvec{A}}}_p) N_i {{\varvec{u}}}_{\tau i} \\ {{\varvec{\epsilon }}}_n&= F_{\tau } ({{\varvec{D}}}_{n \Omega } - {{\varvec{A}}}_n) N_i {{\varvec{u}}}_{\tau i} + F_{\tau _{,z}} N_i {{\varvec{u}}}_{\tau i} \end{aligned} \end{aligned}$$

(10)

MITC contrasts the membrane and shear locking via a specific interpolation strategy for the strain components on the nine-node shell element, as follows:

$$\begin{aligned} \begin{aligned} {{\varvec{\epsilon }}}_{p}&= \begin{bmatrix} \epsilon _{\alpha \alpha }\\ \epsilon _{\beta \beta }\\ \epsilon _{\alpha \beta } \end{bmatrix} = \begin{bmatrix} N_{m1} &{}0 &{}0 \\ 0 &{}N_{m2} &{}0 \\ 0 &{}0 &{}N_{m3} \end{bmatrix} \begin{bmatrix} \epsilon _{\alpha \alpha _{m1}}\\ \epsilon _{\beta \beta _{m2}}\\ \epsilon _{\alpha \beta _{m3}} \end{bmatrix}\\ {{\varvec{\epsilon }}}_{n}&= \begin{bmatrix} \epsilon _{\alpha z}\\ \epsilon _{\beta z}\\ \epsilon _{zz} \end{bmatrix} = \begin{bmatrix} N_{m1} &{}0 &{}0 \\ 0 &{}N_{m2} &{}0 \\ 0 &{}0 &{}1 \end{bmatrix} \begin{bmatrix} \epsilon _{\alpha z_{m1}}\\ \epsilon _{\beta z_{m2}}\\ \epsilon _{zz_{m3}} \end{bmatrix} \end{aligned} \end{aligned}$$

(11)

Strains $\epsilon _{\alpha \alpha _{m1}}$, $\epsilon _{\beta \beta _{m2}}$, $\epsilon _{\alpha \beta _{m3}}$, $\epsilon _{\alpha z_{m1}}$, and $\epsilon _{\beta z_{m2}}$ stem from 10 and

$$\begin{aligned} \begin{aligned} N_{m1}&= [N_{A1}, N_{B1}, N_{C1}, N_{D1}, N_{E1}, N_{F1} ] \\ N_{m2}&= [N_{A2}, N_{B2}, N_{C2}, N_{D2}, N_{E2}, N_{F2} ] \\ N_{m3}&= [N_{P}, N_{Q}, N_{R}, N_{S}] \end{aligned} \end{aligned}$$

(12)

Subscripts m1, m2 and m3 indicate the point groups (A1,B1,C1,D1,E1,F1), (A2,B2,C2,D2,E2,F2), and (P,Q,R,S), respectively, see Fig. 2. Via Principle of Virtual Displacements (PVD) for the static analysis, the equilibrium equation reads

$$\begin{aligned} \quad {{\varvec{k}}}^{k}_{\tau s i j} {{\varvec{u}}}^{k }_{\tau i} = {{\varvec{p}}}^k_{s j} \end{aligned}$$

(13)

The $3 \times 3$ matrix ${\varvec{k}}^{k}_{\tau s i j}$ is the fundamental mechanical nucleus whose expression is independent of the order of the expansion. ${{\varvec{p}}}^k_{s j}$ is the load vector. More details regarding the finite element formulation are in [72].

Best Theory Diagram

One of the CUF capabilities is the axiomatic/asymptotic method (AAM) to evaluate the relevance of generalized variables and the accuracy of structural theories [73, 74]. The fourth-order, equivalent single layer shell model, is the reference model of this paper and all the theories evaluated stem from the combinations of the full fourth-order expansion, i.e., $2^{15}$ models. The CUF generates the governing equations for the theories considered. In particular, the CUF generates reduced models having combinations of the starting terms as generalized unknowns. Two parameters can identify a theory, namely, the number of active terms and the error or accuracy provided. The Best Theory Diagram (BTD) is the curve composed of all models providing the minimum error with the least number of variables, see Fig. 3. Given the accuracy, models with fewer variables than those on the BTD do not exist. Given the number of variables, models with better accuracy than those on the BTD do not exist. In this paper, the error refers to the maximum transverse displacement,

$$\begin{aligned} Error = 100\times \frac{|u_z - u_z^{N = 4}|}{|u_z^{N = 4}|} \end{aligned}$$

(14)

The combined use of CUF and AAM allows the evaluation of the accuracy of any finite element, as shown in Table 1. Black and white triangles indicate active and inactive generalized displacement variables, respectively, and DOF the nodal degrees of freedom of the element. N = 4 is the full expansion of fourth-order. Other three models, well-known from literature, have incomplete expansions, namely,

The First-Order Shear Deformation Theory (FSDT) with five DOF,
$$\begin{aligned} \begin{aligned}&u_{x}=u_{x_{1}}+z\,u_{x_{2}}\\&u_{y}=u_{y_{1}}+z\,u_{y_{2}}\\&u_{z}=u_{z_{1}} \\ \end{aligned} \end{aligned}$$
(15)
A seven DOF model with parabolic transverse displacement, referred to as PTD,
$$\begin{aligned} \begin{aligned}&u_{x}=u_{x_{1}}+z\,u_{x_{2}}\\&u_{y}=u_{y_{1}}+z\,u_{y_{2}}\\&u_{z}=u_{z_{1}}+z\,u_{z_{2}}+z^{2}\,u_{z_{3}}\\ \end{aligned} \end{aligned}$$
(16)
A nine DOF model with third-order in-plane displacements referred to as TSDT,
$$\begin{aligned} \begin{aligned}&u_{x}=u_{x_{1}}+z\,u_{x_{2}}+z^{2}\,u_{x_{3}}+z^{3}\,u_{x_{4}}\\&u_{y}=u_{y_{1}}+z\,u_{y_{2}}+z^{2}\,u_{y_{3}}+z^{3}\,u_{y_{4}}\\&u_{z}=u_{z_{1}}\\ \end{aligned} \end{aligned}$$
(17)

Table 1 Examples of shell models assessed

On the use of neural networks to evaluate performances of shell models for composites

Abstract

Introduction

Finite element formulation

Best Theory Diagram

Neural networks and coding

Results

0/90/0

0/90/0/90

a/h as a training variable

Conclusions

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords