A Brief Summary of the Finite Element Method for Differential Equations

Mahboub Baccouch

doi:10.5772/intechopen.95423

Abstract

The finite element (FE) method is a numerical technique for computing approximate solutions to complex mathematical problems described by differential equations. The method was developed in the 1950s to solve complicated problems in engineering, notably in elasticity and structural mechanics modeling involving elliptic partial differential equations and complicated geometries. But nowadays the range of applications is quite extensive. In particular, the FE method has been successfully applied to many problems such as fluid–structure interaction, thermomechanical, thermochemical, thermo-chemo-mechanical problems, biomechanics, biomedical engineering, piezoelectric, ferroelectric, electromagnetics, and many others. This chapter contains a summary of the FE method. Since the remaining chapters of this textbook are based on the FE method, we present it in this chapter as a method for approximating solutions of ordinary differential equations (ODEs) and partial differential equations (PDEs).

Keywords

the finite element method
initial-value problems
boundary-value problems
Laplace equation
heat equation
wave equation

Author Information

Mahboub Baccouch*
- Department of Mathematics, University of Nebraska at Omaha, Omaha, NE, USA

*Address all correspondence to: mbaccouch@unomaha.edu

1. Introduction

1.1 An overview of the finite element method

Differential equations arise in many disciplines such as engineering, mathematics, sciences, economics, and many other fields. Unfortunately solutions to differential equations can rarely be expressed by closed formulas and numerical methods are needed to approximate their solutions. There are many numerical methods for approximating the solution to differential equations including the finite difference (FD), finite element (FE), finite volume (FV), spectral, and discontinuous Galerkin (DG) methods. These methods are used when the mathematical equations are too complicated to be solved analytically.

The FE method has become the standard numerical scheme for approximating the solution to many mathematical problems; see [1, 2, 3, 4, 5, 6, 7, 8, 9] and the references therein, to mention just a few. In simple terms, the FE method solves differential equations numerically by discretizing the domain into a finite mesh. A set of differential equations is thereby converted into a set of algebraic equations to be solved for the unknowns at the nodes of the mesh. The FE method originated from the need to solve complex elasticity and structural analysis problems in civil and aeronautical engineering. The first developments can be traced back to the work of Hrennikoff in 1941 [10] and Courant in 1943 [11]. Although these pioneers approached the method from different perspectives, they each identified the one common and essential characteristic: mesh discretization of a continuous domain into a set of discrete sub-domains, usually called elements. Another fundamental mathematical contribution to the FE method is the work of Gilbert Strang and George Fix [12]. Since then, the FE method has been generalized for the numerical modeling of physical systems in many engineering disciplines including electromagnetism, heat transfer, and fluid dynamics.

The advantages of this method can be summarized as follows:

Numerical efficiency: The discretization of the calculation domain with finite elements yields matrices that are in most cases sparse and symmetric. Therefore, the system matrix, which is obtained after spatial and time discretization, is sparse and symmetric too. Both the storage of the system matrix and the solution of the algebraic system of equations can be performed in a very efficient way.
Treatment of nonlinearities: The modeling of nonlinear material behavior is well established for the FE method (e.g., nonlinear curves, hysteresis).
Complex geometry: With the FE method, a complex domain can be discretized by triangular elements in 2D and by tetrahedral elements in 3D.
Applicable to many field problems: The FE method is suited for structural analysis, heat transfer, electrical/magnetic analysis, fluid and acoustic analysis, multi-physics, etc.

COMSOL Multiphysics (known as FEMLAB before 2005) is a commercial FE software package designed to address a wide range of physical phenomena. It is widely used in science and industry for research and development. It excels at modeling almost any multi-physics problem by solving the governing set of PDEs via the FE method. This software package is able to solve one, two and three-dimensional problems. It comes with a modern graphical user interface to set up simulation models and can be scripted from Matlab or via its native Java API.

In this chapter, we introduce the FE method for several one-dimensional and two-dimensional model problems. Although the FE method has been extensively used in the field of structural mechanics, it has been successfully applied to solve several other types of engineering problems, such as heat conduction, fluid dynamics, seepage flow, and electric and magnetic fields. These applications prompted mathematicians to use this technique for the solution of complicated problems. For illustration, we will use simple one-dimensional and two-dimensional model problems to introduce the FE method.

2. The FE method for ODEs

2.1 The FE method for first-order linear IVPs

We first present the FE method as an approximation technique for solving the following first-order initial-value problem (IVP) using piecewise linear polynomials

$$u' = f(x), \quad x \in [a,b], \qquad u(a) = u_0. \tag{1}$$

In order to apply the FE method to solve this problem, we carry out the following process.

Derive a weak form (variational formulation). Multiply the ODE in (1) by a test function $v \in V_0 = \{v \in L^2[a,b] : \|v\|^2 + \|v'\|^2 < \infty,\ v(a) = 0\}$, where $\|v\|^2 = \int_a^b v^2(x)\,dx$, integrate from $a$ to $b$, integrate by parts, and apply $v(a) = 0$ to get
$$\int_a^b f v\,dx = \int_a^b u' v\,dx = -\int_a^b u v'\,dx + u(b)v(b) - u(a)v(a) = -\int_a^b u v'\,dx + u(b)v(b).$$
Generate a triangulation (also called a mesh) of the computational domain $[a,b]$. For a one-dimensional problem, a mesh is a set of points in the interval $[a,b]$, say, $a = x_0 < x_1 < \cdots < x_N = b$. The point $x_i$ is called a node or nodal point. The length of the interval (called an element) $I_i = [x_{i-1}, x_i]$ is $h_i = x_i - x_{i-1}$. Let $h = \max_{1 \le i \le N} h_i$ (called the mesh size, which measures how fine the partition is). If the mesh is uniform, then $x_i = a + ih$, $i = 0, 1, \ldots, N$, where $h = (b-a)/N$.
Define a finite dimensional space over the triangulation: Let the solution $u$ be in the space $V$. For the model problem (1), the solution space is $V = C^1[a,b]$. We wish to construct a finite dimensional space (subspace) $V_h \subset V$ based on the mesh. When the FE space is a subspace of the solution space, the method is called conforming. It is known that in this case, the FE solution converges to the true solution provided the FE space approximates the given space in some sense [3]. Different finite dimensional spaces will generate different FE solutions.
Define the FE space as the set of all continuous piecewise linear polynomials $V_h = \{v \in C^0[a,b] : v|_{I_i} \in P^1(I_i),\ i = 1, 2, \ldots, N,\ v(a) = 0\}$, where $P^1(I_i)$ is the space of polynomials of degree $\le 1$ on $I_i$. Functions in $V_h$ are linear on each $I_i$ and continuous on the whole interval $[a,b]$. An example of such a function is shown in Figure 1.
We remark that any function $v \in V_h$ is uniquely determined by its nodal values $v(x_i)$.
Construct a set of basis functions based on the triangulation. Since $V_h$ has finite dimension, we can find a set of basis functions. A basis for $V_h$ is $\{\phi_j\}_{j=0}^N$, where the $\phi_j \in V_h$ are linearly independent. Then $V_h = \{v_h \in V : v_h(x) = \sum_{j=0}^N c_j \phi_j(x)\}$ is the space spanned by the basis functions $\{\phi_i\}_{i=0}^N$. The simplest finite dimensional space is the space of continuous piecewise linear functions defined over the triangulation,
$$V_h = \{v_h \in V : v_h \text{ is continuous and piecewise linear over } [a,b] \text{ with } v_h(a) = 0\}.$$

Figure 1.
A continuous piecewise linear function v.

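There are infinitely many sets of basis functions. We should choose basis functions that are simple, have compact (minimal) support (that is, they vanish everywhere except on a small region), and meet the regularity requirement, that is, they have to be continuous, and differentiable except at nodal points. The simplest ones are the so-called hat functions satisfying $\phi_i(x_i) = 1$ and $\phi_i(x_j) = 0$ for $i \ne j$. Their analytic form is (see Figure 2)

$$\phi_0(x) = \begin{cases} \dfrac{x_1 - x}{h_1}, & x \in I_1, \\ 0, & \text{else}, \end{cases} \qquad \phi_N(x) = \begin{cases} \dfrac{x - x_{N-1}}{h_N}, & x \in I_N, \\ 0, & \text{else}, \end{cases} \qquad \phi_i(x) = \begin{cases} \dfrac{x - x_{i-1}}{h_i}, & x \in I_i, \\ \dfrac{x_{i+1} - x}{h_{i+1}}, & x \in I_{i+1}, \\ 0, & \text{else}, \end{cases}$$

where $i = 1, 2, \ldots, N-1$ in the last formula.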
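The hat functions can be evaluated directly from their definition. The following is a minimal sketch, assuming NumPy is available; the function name `hat` and its argument layout are illustrative choices, not part of the original text.

```python
import numpy as np

def hat(i, x, nodes):
    """Evaluate the hat function phi_i at the points x (hypothetical helper).

    nodes holds the mesh a = x_0 < x_1 < ... < x_N = b.
    """
    nodes = np.asarray(nodes, dtype=float)
    x = np.atleast_1d(np.asarray(x, dtype=float))
    N = len(nodes) - 1
    y = np.zeros_like(x)
    if i > 0:
        # Rising branch on I_i = [x_{i-1}, x_i]
        m = (x >= nodes[i - 1]) & (x <= nodes[i])
        y[m] = (x[m] - nodes[i - 1]) / (nodes[i] - nodes[i - 1])
    if i < N:
        # Falling branch on I_{i+1} = [x_i, x_{i+1}]
        m = (x >= nodes[i]) & (x <= nodes[i + 1])
        y[m] = (nodes[i + 1] - x[m]) / (nodes[i + 1] - nodes[i])
    return y
```

Evaluating `hat(i, nodes, nodes)` returns the $i$-th unit vector, which is the property $\phi_i(x_j) = \delta_{ij}$, and summing all hat functions at any point of $[a,b]$ gives 1.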

Approximate the exact solution $u$ by a continuous piecewise linear function $u_h(x)$. The FE method consists of finding $u_h \in V_h$ such that

$$-\int_a^b u_h v'\,dx + u_h(b)v(b) = \int_a^b f v\,dx, \quad \forall v \in V_h.$$

This type of FE method (with similar trial and test space) is sometimes called a Galerkin method, named after the famous Russian mathematician and engineer Galerkin.

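Implementation: The FE solution is a linear combination of the basis functions. Writing $u_h(x) = \sum_{j=0}^N c_j \phi_j(x)$, where $c_0, c_1, \ldots, c_N$ are unknowns, and choosing $v = \phi_i$, $i = 1, 2, \ldots, N$, we get

$$-\sum_{j=0}^N c_j \int_a^b \phi_j \phi_i'\,dx + c_N \phi_i(b) = \int_a^b f \phi_i\,dx, \quad i = 1, 2, \ldots, N,$$

since $u_h(b) = c_N$. Note that using the hat functions, we have $u_h(x_0) = c_0 = 0$ and $u_h(x_i) = \sum_{j=0}^N c_j \phi_j(x_i) = c_i \phi_i(x_i) = c_i$ for $i = 1, 2, \ldots, N$. Thus, we get the following linear system

$$-\sum_{j=1}^N c_j \int_a^b \phi_j \phi_i'\,dx + c_N \phi_i(b) = \int_a^b f \phi_i\,dx, \quad i = 1, 2, \ldots, N.$$

Figure 2.
A typical hat function $\phi_i$ on a mesh. Also shown are the half hat functions $\phi_0$ and $\phi_N$.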

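Finally, we solve the linear system for $c_1, \ldots, c_N$. We note that for $i = 1, 2, \ldots, N-1$, we have

$$\int_a^b \phi_i \phi_i'\,dx = \int_{x_{i-1}}^{x_{i+1}} \phi_i \phi_i'\,dx = \frac{1}{h_i}\int_{x_{i-1}}^{x_i} \frac{x - x_{i-1}}{h_i}\,dx - \frac{1}{h_{i+1}}\int_{x_i}^{x_{i+1}} \frac{x_{i+1} - x}{h_{i+1}}\,dx = \frac12 - \frac12 = 0.$$

However, for $i = N$, we have

$$\int_a^b \phi_N \phi_N'\,dx = \int_{x_{N-1}}^{x_N} \frac{x - x_{N-1}}{h_N}\left(\frac{x - x_{N-1}}{h_N}\right)'dx = \frac{1}{h_N}\int_{x_{N-1}}^{x_N} \frac{x - x_{N-1}}{h_N}\,dx = \frac12.$$

Similarly, we have

$$\int_a^b \phi_{i-1} \phi_i'\,dx = \int_{x_{i-1}}^{x_i} \frac{x_i - x}{h_i}\left(\frac{x - x_{i-1}}{h_i}\right)'dx = \frac{1}{h_i}\int_{x_{i-1}}^{x_i} \frac{x_i - x}{h_i}\,dx = \frac12, \quad i = 1, 2, \ldots, N,$$

$$\int_a^b \phi_{i+1} \phi_i'\,dx = \int_{x_i}^{x_{i+1}} \frac{x - x_i}{h_{i+1}}\left(\frac{x_{i+1} - x}{h_{i+1}}\right)'dx = -\frac{1}{h_{i+1}}\int_{x_i}^{x_{i+1}} \frac{x - x_i}{h_{i+1}}\,dx = -\frac12, \quad i = 1, 2, \ldots, N-1.$$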

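We next calculate $\int_a^b f \phi_i\,dx$. Since it depends on $f$, we cannot generally expect to calculate it exactly. However, we can approximate it using a quadrature rule. Using the trapezoidal rule $\int_a^b f(x)\,dx \approx \frac{b-a}{2}\left(f(a) + f(b)\right)$ on each element, together with $\phi_i(x_{i-1}) = \phi_i(x_{i+1}) = 0$ and $\phi_i(x_i) = 1$, we get

$$\int_a^b f \phi_i\,dx = \int_{x_{i-1}}^{x_i} f \phi_i\,dx + \int_{x_i}^{x_{i+1}} f \phi_i\,dx \approx \frac{h_i + h_{i+1}}{2} f(x_i), \quad i = 1, 2, \ldots, N-1,$$

$$\int_a^b f \phi_N\,dx = \int_{x_{N-1}}^{x_N} f \phi_N\,dx \approx \frac{h_N}{2}\left(f(x_{N-1})\phi_N(x_{N-1}) + f(x_N)\phi_N(x_N)\right) = \frac{h_N}{2} f(x_N).$$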

Thus, we obtain the following linear system of equations

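$$\begin{bmatrix} 0 & \tfrac12 & 0 & \cdots & 0 \\ -\tfrac12 & 0 & \tfrac12 & \ddots & 0 \\ \vdots & \ddots & \ddots & \ddots & \vdots \\ 0 & \cdots & -\tfrac12 & 0 & \tfrac12 \\ 0 & \cdots & 0 & -\tfrac12 & \tfrac12 \end{bmatrix} \begin{bmatrix} c_1 \\ c_2 \\ \vdots \\ c_{N-1} \\ c_N \end{bmatrix} = \begin{bmatrix} \frac{h_1 + h_2}{2} f(x_1) \\ \frac{h_2 + h_3}{2} f(x_2) \\ \vdots \\ \frac{h_{N-1} + h_N}{2} f(x_{N-1}) \\ \frac{h_N}{2} f(x_N) \end{bmatrix}.$$

The determinant of the above matrix is $(1/2)^N$. Thus, the system has a unique solution $c_1, c_2, \ldots, c_N$.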

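Remark 2.1 Suppose that $u(a) = u_0$. Then we let $u_h(x) = \sum_{j=0}^N c_j \phi_j(x)$. Since $u_0 = u_h(x_0) = \sum_{j=0}^N c_j \phi_j(x_0) = c_0 \phi_0(x_0) = c_0$, we only need to find $c_1, c_2, \ldots, c_N$. Choosing $v = \phi_i$, $i = 1, 2, \ldots, N$, we get the following linear system

$$-\sum_{j=1}^N c_j \int_a^b \phi_j \phi_i'\,dx + c_N \phi_i(b) = \int_a^b f \phi_i\,dx + u_0 \int_a^b \phi_0 \phi_i'\,dx, \quad i = 1, 2, \ldots, N.$$

Finally, we solve the linear system for $c_1, \ldots, c_N$. We note that $\int_a^b \phi_0 \phi_i'\,dx = 0$ for $i = 2, \ldots, N$ and

$$\int_a^b \phi_0 \phi_1'\,dx = \int_{x_0}^{x_1} \frac{x_1 - x}{h_1}\left(\frac{x - x_0}{h_1}\right)'dx = \frac{1}{h_1}\int_{x_0}^{x_1} \frac{x_1 - x}{h_1}\,dx = \frac12.$$

Following the same steps used for the case $u(a) = 0$, we obtain the following linear system of equations

$$\begin{bmatrix} 0 & \tfrac12 & 0 & \cdots & 0 \\ -\tfrac12 & 0 & \tfrac12 & \ddots & 0 \\ \vdots & \ddots & \ddots & \ddots & \vdots \\ 0 & \cdots & -\tfrac12 & 0 & \tfrac12 \\ 0 & \cdots & 0 & -\tfrac12 & \tfrac12 \end{bmatrix} \begin{bmatrix} c_1 \\ c_2 \\ \vdots \\ c_{N-1} \\ c_N \end{bmatrix} = \begin{bmatrix} \frac{h_1 + h_2}{2} f(x_1) + \frac{u_0}{2} \\ \frac{h_2 + h_3}{2} f(x_2) \\ \vdots \\ \frac{h_{N-1} + h_N}{2} f(x_{N-1}) \\ \frac{h_N}{2} f(x_N) \end{bmatrix}.$$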
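The assembly described in this subsection can be sketched in a few lines. The code below is a minimal illustration, assuming NumPy and a dense solver for simplicity; the function name `fe_linear_ivp` is hypothetical. It builds the matrix with entries $\pm\frac12$, the trapezoidal load vector, and the contribution $u_0/2$ from Remark 2.1.

```python
import numpy as np

def fe_linear_ivp(f, nodes, u0=0.0):
    """P1 finite element solution of u' = f(x), u(a) = u0 (illustrative sketch).

    Returns the nodal values c_0, ..., c_N of u_h on the mesh `nodes`.
    """
    x = np.asarray(nodes, dtype=float)
    N = len(x) - 1
    h = np.diff(x)                       # h[i-1] is h_i = x_i - x_{i-1}

    A = np.zeros((N, N))
    rhs = np.zeros(N)
    for i in range(1, N + 1):            # equation for the test function phi_i
        r = i - 1                        # zero-based row of unknown c_i
        if i > 1:
            A[r, r - 1] = -0.5           # -int phi_{i-1} phi_i' dx
        if i < N:
            A[r, r + 1] = 0.5            # -int phi_{i+1} phi_i' dx
            rhs[r] = 0.5 * (h[i - 1] + h[i]) * f(x[i])
        else:
            A[r, r] = 0.5                # phi_N(b) - int phi_N phi_N' dx
            rhs[r] = 0.5 * h[N - 1] * f(x[N])
    rhs[0] += 0.5 * u0                   # u0 * int phi_0 phi_1' dx
    c = np.linalg.solve(A, rhs)
    return np.concatenate(([u0], c))

# Example: u' = 2x, u(0) = 1 on [0, 1], whose exact solution is 1 + x^2.
x = np.linspace(0.0, 1.0, 101)
u_h = fe_linear_ivp(lambda t: 2.0 * t, x, u0=1.0)
```

A dense matrix is used only to keep the sketch short; for large $N$ a banded or sparse solver would be the natural choice.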

2.2 The FE method for first-order nonlinear IVPs

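Here, we extend the FE method using piecewise linear polynomials to the nonlinear IVP

$$u' = f(x, u), \quad x \in [a,b], \qquad u(a) = u_0. \tag{2}$$

The FE method consists of finding $u_h \in V_h = \{v : v|_{I_i} \in P^1(I_i),\ i = 1, 2, \ldots, N,\ v(a) = 0\}$, such that

$$u_h(b)v(b) - \int_a^b u_h v'\,dx = \int_a^b f(x, u_h)\,v\,dx, \quad \forall v \in V_h.$$

Writing $u_h(x) = \sum_{j=0}^N c_j \phi_j(x)$ and choosing $v = \phi_i$, $i = 1, 2, \ldots, N$, we get

$$c_N\left(\phi_i(b) - \int_a^b \phi_N \phi_i'\,dx\right) - \sum_{j=0}^{N-1} c_j \int_a^b \phi_j \phi_i'\,dx - \int_a^b f\Big(x, \sum_{j=0}^N c_j \phi_j\Big)\phi_i\,dx = 0, \quad i = 1, 2, \ldots, N,$$

where $u_h(x_0) = c_0 = u_0$. Finally, we solve the nonlinear system for $c_1, c_2, \ldots, c_N$ using, e.g., Newton's method for systems of nonlinear equations. The system can be written as $F_i(c_1, c_2, \ldots, c_N) = 0$, $i = 1, 2, \ldots, N$, where

$$F_i = c_N\left(\phi_i(b) - \int_a^b \phi_N \phi_i'\,dx\right) - \sum_{j=0}^{N-1} c_j \int_a^b \phi_j \phi_i'\,dx - \int_a^b f\Big(x, \sum_{j=0}^N c_j \phi_j\Big)\phi_i\,dx, \quad i = 1, 2, \ldots, N.$$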

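Let $\alpha_i = \sum_{j=0}^N c_j \int_a^b \phi_j \phi_i'\,dx$ and $\beta_i = \int_a^b f\big(x, \sum_{j=0}^N c_j \phi_j\big)\phi_i\,dx$. Then, for $i = 1, 2, \ldots, N-1$,

$$\begin{aligned} \alpha_i &= c_{i-1}\int_{x_{i-1}}^{x_i} \phi_{i-1}\phi_i'\,dx + c_i\left(\int_{x_{i-1}}^{x_i} \phi_i \phi_i'\,dx + \int_{x_i}^{x_{i+1}} \phi_i \phi_i'\,dx\right) + c_{i+1}\int_{x_i}^{x_{i+1}} \phi_{i+1}\phi_i'\,dx \\ &= c_{i-1}\int_{x_{i-1}}^{x_i} \frac{x_i - x}{h_i^2}\,dx + c_i\left(\int_{x_{i-1}}^{x_i} \frac{x - x_{i-1}}{h_i^2}\,dx - \int_{x_i}^{x_{i+1}} \frac{x_{i+1} - x}{h_{i+1}^2}\,dx\right) - c_{i+1}\int_{x_i}^{x_{i+1}} \frac{x - x_i}{h_{i+1}^2}\,dx \\ &= \frac12 c_{i-1} + c_i\left(\frac12 - \frac12\right) - \frac12 c_{i+1} = \frac12 c_{i-1} - \frac12 c_{i+1}, \\ \alpha_N &= c_{N-1}\int_{x_{N-1}}^{x_N} \phi_{N-1}\phi_N'\,dx + c_N\int_{x_{N-1}}^{x_N} \phi_N \phi_N'\,dx = c_{N-1}\int_{x_{N-1}}^{x_N} \frac{x_N - x}{h_N^2}\,dx + c_N\int_{x_{N-1}}^{x_N} \frac{x - x_{N-1}}{h_N^2}\,dx = \frac12 c_{N-1} + \frac12 c_N. \end{aligned}$$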

Similarly,

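$$\beta_i = \int_{x_{i-1}}^{x_{i+1}} f\Big(x, \sum_{j=0}^N c_j \phi_j\Big)\phi_i\,dx = \int_{x_{i-1}}^{x_i} f\Big(x, \sum_{j=0}^N c_j \phi_j\Big)\phi_i\,dx + \int_{x_i}^{x_{i+1}} f\Big(x, \sum_{j=0}^N c_j \phi_j\Big)\phi_i\,dx.$$

Using Simpson's rule $\int_a^b f(x)\,dx \approx \frac{b-a}{6}\left(f(a) + 4f\big(\tfrac{a+b}{2}\big) + f(b)\right)$ on each element, together with $\phi_i(x_{i-1}) = \phi_i(x_{i+1}) = 0$, $\phi_i(x_i) = 1$, $\sum_{j=0}^N c_j \phi_j\big(x_{i-1} + \tfrac{h_i}{2}\big) = \tfrac{c_{i-1} + c_i}{2}$, $\phi_i\big(x_{i-1} + \tfrac{h_i}{2}\big) = \tfrac12$, and $\sum_{j=0}^N c_j \phi_j(x_i) = c_i$, we have, for $i = 1, 2, \ldots, N-1$,

$$\beta_i \approx \frac{h_i}{3} f\Big(x_{i-1} + \frac{h_i}{2}, \frac{c_{i-1} + c_i}{2}\Big) + \frac{h_i + h_{i+1}}{6} f(x_i, c_i) + \frac{h_{i+1}}{3} f\Big(x_i + \frac{h_{i+1}}{2}, \frac{c_i + c_{i+1}}{2}\Big).$$

However, for $i = N$, we have

$$\beta_N \approx \frac{h_N}{6}\left(2 f\Big(x_{N-1} + \frac{h_N}{2}, \frac{c_{N-1} + c_N}{2}\Big) + f(x_N, c_N)\right).$$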

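Next, we compute the Jacobian matrix. Differentiating $F_i$ with respect to $c_j$, and denoting $f_u = \partial f/\partial u$, we obtain the entries

$$J_{i,j} = \frac{\partial F_i}{\partial c_j} = \delta_{iN}\delta_{jN} - \int_a^b \phi_j \phi_i'\,dx - \int_a^b f_u\Big(x, \sum_{k=0}^N c_k \phi_k\Big)\phi_j \phi_i\,dx = \delta_{iN}\delta_{jN} - a_{i,j} - b_{i,j}, \quad i, j = 1, 2, \ldots, N,$$

where the term $\delta_{iN}\delta_{jN}$ comes from the boundary contribution $c_N \phi_i(b)$.

We already computed the entries $a_{i,j} = \int_a^b \phi_j \phi_i'\,dx$ as

$$a_{i,i-1} = \frac12, \qquad a_{i,i} = 0, \quad i = 1, 2, \ldots, N-1, \qquad a_{N,N} = \frac12, \qquad a_{i,i+1} = -\frac12.$$

Using Simpson's rule, we get

$$\begin{aligned} b_{i,i-1} &= \int_{x_{i-1}}^{x_i} \phi_{i-1}\phi_i\, f_u\Big(x, \sum_{k=0}^N c_k \phi_k\Big)dx \approx \frac{h_i}{6} f_u\Big(x_{i-1} + \frac{h_i}{2}, \frac{c_{i-1} + c_i}{2}\Big), \\ b_{i,i+1} &= \int_{x_i}^{x_{i+1}} \phi_{i+1}\phi_i\, f_u\Big(x, \sum_{k=0}^N c_k \phi_k\Big)dx \approx \frac{h_{i+1}}{6} f_u\Big(x_i + \frac{h_{i+1}}{2}, \frac{c_i + c_{i+1}}{2}\Big), \\ b_{i,i} &= \int_{x_{i-1}}^{x_{i+1}} \phi_i^2\, f_u\Big(x, \sum_{k=0}^N c_k \phi_k\Big)dx \approx \frac{h_i}{6} f_u\Big(x_{i-1} + \frac{h_i}{2}, \frac{c_{i-1} + c_i}{2}\Big) + \frac{h_i + h_{i+1}}{6} f_u(x_i, c_i) + \frac{h_{i+1}}{6} f_u\Big(x_i + \frac{h_{i+1}}{2}, \frac{c_i + c_{i+1}}{2}\Big), \\ b_{N,N} &= \int_{x_{N-1}}^{x_N} \phi_N^2\, f_u\Big(x, \sum_{k=0}^N c_k \phi_k\Big)dx \approx \frac{h_N}{6}\left(f_u\Big(x_{N-1} + \frac{h_N}{2}, \frac{c_{N-1} + c_N}{2}\Big) + f_u(x_N, c_N)\right). \end{aligned}$$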
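The Newton iteration for this nonlinear system can be sketched as follows. This is a minimal illustration under stated assumptions: NumPy with a dense Jacobian, callables `f(x, u)` and `fu(x, u)` (the partial derivative of $f$ with respect to $u$) that accept arrays, and a constant initial guess; the function name and the stopping rule are hypothetical choices. The Jacobian is taken as the derivative of the quadrature-based residual $F_i = c_N\phi_i(b) - \alpha_i - \beta_i$, so the boundary term enters with a plus sign and the $\alpha_i$, $\beta_i$ contributions with a minus sign.

```python
import numpy as np

def fe_nonlinear_ivp(f, fu, nodes, u0, tol=1e-12, max_iter=50):
    """P1 FE solution of u' = f(x, u), u(a) = u0 via Newton (illustrative sketch)."""
    x = np.asarray(nodes, dtype=float)
    N = len(x) - 1
    h = np.diff(x)                        # h[k-1] is h_k
    xm = 0.5 * (x[:-1] + x[1:])           # element midpoints
    c = np.full(N + 1, float(u0))         # initial guess; c[0] = u0 stays fixed

    for _ in range(max_iter):
        cm = 0.5 * (c[:-1] + c[1:])       # u_h at the midpoints
        fm, fn = f(xm, cm), f(x, c)
        gm, gn = fu(xm, cm), fu(x, c)
        F = np.zeros(N)
        J = np.zeros((N, N))
        for i in range(1, N + 1):
            r = i - 1
            if i < N:
                alpha = 0.5 * c[i - 1] - 0.5 * c[i + 1]
                beta = (h[i - 1] / 3 * fm[i - 1]
                        + (h[i - 1] + h[i]) / 6 * fn[i]
                        + h[i] / 3 * fm[i])
                F[r] = -alpha - beta
                J[r, r] = -(h[i - 1] / 6 * gm[i - 1]
                            + (h[i - 1] + h[i]) / 6 * gn[i]
                            + h[i] / 6 * gm[i])
                J[r, r + 1] = 0.5 - h[i] / 6 * gm[i]
            else:
                alpha = 0.5 * c[N - 1] + 0.5 * c[N]
                beta = h[N - 1] / 6 * (2 * fm[N - 1] + fn[N])
                F[r] = c[N] - alpha - beta
                J[r, r] = 1.0 - 0.5 - h[N - 1] / 6 * (gm[N - 1] + gn[N])
            if i > 1:                     # c_0 is known, so no column for it
                J[r, r - 1] = -0.5 - h[i - 1] / 6 * gm[i - 1]
        if np.max(np.abs(F)) < tol:
            break
        c[1:] -= np.linalg.solve(J, F)    # Newton update
    return c

# Example: u' = -u, u(0) = 1 on [0, 1], whose exact solution is exp(-x).
x = np.linspace(0.0, 1.0, 101)
u_h = fe_nonlinear_ivp(lambda t, u: -u, lambda t, u: -np.ones_like(u), x, 1.0)
```

For a right-hand side that is linear in $u$, as in the example, Newton's method converges after a single update; genuinely nonlinear problems may need a better initial guess than a constant.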

2.3 The FE method for two-point BVPs

Here, we study the derivation and implementation of the FE method for two-point boundary-value problems (BVPs). For ease of presentation, we consider the following model problem: Find $u \in C^2[a,b]$ such that

$$-u'' + q(x)u = f(x), \quad x \in \Omega = (a,b), \qquad u(a) = u(b) = 0, \tag{3}$$

where $u : \bar\Omega = [a,b] \to \mathbb{R}$ is the sought solution, $q(x) \ge 0$ is a continuous function on $[a,b]$, and $f \in L^2[a,b]$. Under these assumptions, (3) has a unique solution $u \in C^2[a,b]$. For general $q(x)$, an explicit form of the solution is usually not available. Therefore, our goal is to obtain a numerical solution via the FE method.

2.3.1 Different mathematical formulations for the 1D model

The model problem (3) can be reformulated into three different forms:

(D)-form: the original differential equation (3).

(V)-form: the variational form or weak form: $\int_a^b u'v'\,dx + \int_a^b quv\,dx = \int_a^b fv\,dx$, for any test function $v$ in the Sobolev space $H_0^1(a,b) = \{v \in L^2(a,b) : \|v\|^2 + \|v'\|^2 < \infty,\ v(a) = v(b) = 0\}$, where $\|v\|^2 = \int_a^b v^2(x)\,dx$. The corresponding FE method is often called the Galerkin method. In other words, a Galerkin FE method is a FE method obtained from the variational form.

(M)-form: the minimization form: $\min_{v \in H_0^1(a,b)} \int_a^b \left(\tfrac12 (v')^2 + \tfrac12 qv^2 - fv\right)dx$. The corresponding FE method is often called the Ritz method.

Under some assumptions, the three different forms are equivalent, that is, they have the same solution, as stated in the following theorem.

Theorem 2.1 (Mathematical equivalences) Suppose that $u''$ exists and is continuous on $[a,b]$. Then we have the following mathematical equivalences.

(D) is equivalent to (V), (V) is equivalent to (M), and (M) is equivalent to (D).

2.3.2 Galerkin method of the problem

To solve (3) using the FE method, we carry out the process described below. A FE method is usually derived from the weak or variational formulation of the problem at hand.

Weak formulation of the problem: The Galerkin FE method starts by rewriting (3) in an equivalent variational formulation. To this end, let us define the vector space $H_0^1 = \{v \in L^2(a,b) : \|v\|^2 + \|v'\|^2 < \infty,\ v(a) = v(b) = 0\}$. Multiplying (3) by a test function $v \in H_0^1$, integrating from $a$ to $b$, and using integration by parts, we get

$$\int_a^b fv\,dx = \int_a^b -u''v\,dx + \int_a^b quv\,dx = \int_a^b u'v'\,dx + \int_a^b quv\,dx,$$

since $v(a) = v(b) = 0$. Hence, the weak (or variational) form of (3) reads: Find $u \in H_0^1$, such that

$$\int_a^b u'v'\,dx + \int_a^b quv\,dx = \int_a^b fv\,dx, \quad \forall v \in H_0^1. \tag{4}$$

We want to find $u \in H_0^1$ that satisfies (4). We note that a solution $u$ of (4) may be less regular than a solution $u$ of (3). Indeed, (4) involves only $u'$ whereas (3) contains $u''$. Furthermore, we can easily verify the following:

If $u$ is a strong solution (i.e., solves (3)), then $u$ is also a weak solution (i.e., solves (4)).
Conversely, if $u$ is a weak solution with $u \in C^2[a,b]$, it is also a strong solution.
Existence and uniqueness of weak solutions follow from the Lax-Milgram theorem.
We can consider solutions with lower regularity using the weak formulation.
The FE method gives an approximation of the weak solution.

From now on, we use the notation $\|v\| = \|v\|_\Omega$, where $\Omega = (a,b)$.

The FE formulation: The FE method is based on the variational form (4). We note that the space $H_0^1$ contains many functions, and it is therefore just as hard to find a function $u \in H_0^1$ which satisfies the variational Eq. (4) as it is to solve the original problem (3). Next, we study in detail a special Galerkin method called the FE method. Let $a = x_0 < x_1 < \cdots < x_N = b$ be a regular partition of $[a,b]$. Suppose that the length of $I_i = [x_{i-1}, x_i]$ is $h_i = x_i - x_{i-1}$. We define $h = \max_{i=1,2,\ldots,N} h_i$ to be the mesh size. We wish to construct a subspace $V_h \subset V = H_0^1$. Since $V_h$ has finite dimension, we can find a set of basis functions $\{\phi_j\}_{j=1}^{N-1}$ for $V_h$, where $\phi_j \in V_h$, $j = 1, 2, \ldots, N-1$, are linearly independent. We remark that $V_h$ is the space spanned by the basis functions, i.e., $V_h = \{v_h : v_h(x) = \sum_{j=1}^{N-1} c_j \phi_j(x)\}$. The FE method consists of choosing a basis for the subspace $V_h$ that satisfies the following properties:

The matrix $A$ of the resulting linear system must be sparse (e.g., a tridiagonal or banded matrix). In this case, iterative methods for solving linear systems can be adapted to obtain an efficient solution.
$u_h$ must converge to the solution $u$ of the original problem as $h \to 0$.

It is natural to obtain an approximation $u_h$ to $u$ as follows: Find $u_h \in V_h$ such that

$$\int_a^b u_h'v'\,dx + \int_a^b qu_hv\,dx = \int_a^b fv\,dx, \quad \forall v \in V_h. \tag{5}$$

We call $u_h$ the FE approximation of $u$. We say that (5) is the Galerkin approximation of (4), and the method used to find $u_h \in V_h$ is called the Galerkin method.

FE approximation using Lagrange $P^1$ elements: The simplest finite dimensional space is the space of continuous piecewise linear functions defined over the triangulation

$$V_{h,0}^1 = \{v_h \in V : v_h \text{ is continuous and piecewise linear over } [a,b] \text{ with } v_h(a) = v_h(b) = 0\}.$$

It is easy to show that $V_{h,0}^1$ has finite dimension even though it contains infinitely many functions. The FE method therefore looks for an approximation $u_h$ within the small (finite dimensional) subspace $V_{h,0}^1 = \{v \in V_h^1 : v(a) = v(b) = 0\}$ of $H_0^1$, consisting of piecewise linear polynomials, where $V_h^1 = \{v \in C^0[a,b] : v|_{I_i} \in P^1(I_i)\}$.

Let $V_{h,0}^1$ be the space of all continuous piecewise linear functions which vanish at the end points $a$ and $b$. There are many types of basis functions $\{\phi_i\}_{i=1}^{N-1}$. The simplest ones are the so-called hat functions satisfying $\phi_i(x_j) = \delta_{ij}$, where $\delta_{ij}$ is the Kronecker symbol. Note especially that there is no need to construct hat functions $\phi_0$ and $\phi_N$, since any function of $V_{h,0}^1$ must vanish at the end points $x_0 = a$ and $x_N = b$.

The explicit expressions for the hat function $\phi_i(x)$ and its derivative $\phi_i'(x)$ are given by

$$\phi_i(x) = \begin{cases} 0, & a \le x \le x_{i-1}, \\ \dfrac{x - x_{i-1}}{h_i}, & x_{i-1} \le x \le x_i, \\ \dfrac{x_{i+1} - x}{h_{i+1}}, & x_i \le x \le x_{i+1}, \\ 0, & x_{i+1} \le x \le b, \end{cases} \qquad \phi_i'(x) = \begin{cases} 0, & a < x < x_{i-1}, \\ \dfrac{1}{h_i}, & x_{i-1} < x < x_i, \\ -\dfrac{1}{h_{i+1}}, & x_i < x < x_{i+1}, \\ 0, & x_{i+1} < x < b, \end{cases}$$

for $i = 1, 2, \ldots, N-1$. The FE approximation of (4) thus reads: Find $u_h \in V_{h,0}^1$, such that

$$\int_a^b u_h'v'\,dx + \int_a^b qu_hv\,dx = \int_a^b fv\,dx, \quad \forall v \in V_{h,0}^1. \tag{6}$$

We call $u_h$ the FE approximation of $u$. We say that (6) is the Galerkin approximation of (4), and the method used to find $u_h \in V_{h,0}^1$ is called the Galerkin method.

It can be shown that (6) is equivalent to the $N-1$ equations

$$\int_a^b u_h'\phi_i'\,dx + \int_a^b qu_h\phi_i\,dx = \int_a^b f\phi_i\,dx, \quad i = 1, 2, \ldots, N-1. \tag{7}$$

Derivation of the discrete system: Since $u_h \in V_{h,0}^1$, we can express it as a linear combination of hat functions, i.e.,

$$u_h = \sum_{j=1}^{N-1} c_j \phi_j(x), \tag{8}$$

where the $c_j$ are real numbers to be determined. We note that the coefficients $c_j$, $j = 1, 2, \ldots, N-1$, are the $N-1$ nodal values of $u_h$. The index runs only from $1$ to $N-1$ because of the zero boundary conditions. We remark that $u_h(a) = u_h(b) = 0$ and $u_h(x_i) = c_i$. So $c_i$ is an approximation to the exact solution at $x = x_i$.

We can use either the weak/variational form (V) or the minimization form (M) to derive a linear system of equations for the coefficients $c_j$.

Substituting (8) into (7) yields

$$\sum_{j=1}^{N-1} c_j \left(\int_a^b \phi_i'\phi_j'\,dx + \int_a^b q\phi_i\phi_j\,dx\right) = \int_a^b f\phi_i\,dx, \quad i = 1, 2, \ldots, N-1. \tag{9}$$

The problem (7) is now equivalent to the following: Find the real numbers $c_1, c_2, \ldots, c_{N-1}$ that satisfy the linear system (9).

We note that the linear system (9) is equivalent to the system in matrix-vector form

$$Ac = b, \tag{10}$$

where $c = [c_1, c_2, \ldots, c_{N-1}]^t \in \mathbb{R}^{N-1}$ is the unknown vector, $A$ is an $(N-1) \times (N-1)$ matrix, the so-called stiffness matrix when $q = 0$, with entries

$$a_{ij} = \int_a^b \left(\phi_i'\phi_j' + q\phi_i\phi_j\right)dx, \quad i, j = 1, 2, \ldots, N-1, \tag{11}$$

and $b \in \mathbb{R}^{N-1}$, the so-called load vector, has entries

$$b_i = \int_a^b f\phi_i\,dx, \quad i = 1, 2, \ldots, N-1. \tag{12}$$

To obtain the approximate solution we need to solve the linear system for the unknown vector $c$. We note that $a_{ij} = a(\phi_i, \phi_j)$ and $b_i = (f, \phi_i)$, where $a(u, v) = \int_a^b (u'v' + quv)\,dx$ is a bilinear form and $(f, v) = \int_a^b fv\,dx$ is a linear form.

2.3.3 Ritz method of the problem

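The Ritz method is one of the earliest FE methods. However, not every problem has a minimization form. The minimization form for the model problem (3) is

$$\min_{v \in H_0^1(a,b)} F(v), \quad \text{where } F(v) = \int_a^b \left(\frac12 (v')^2 + \frac12 qv^2 - fv\right)dx.$$

As before, we look for an approximate solution of the form (8). If we plug this into the functional above, we get

$$F(u_h) = \int_a^b \left(\frac12 \Big(\sum_{j=1}^{N-1} c_j \phi_j'(x)\Big)^2 + \frac12 q \Big(\sum_{j=1}^{N-1} c_j \phi_j(x)\Big)^2 - f \sum_{j=1}^{N-1} c_j \phi_j(x)\right)dx,$$

which is a multi-variable function of $c_1, c_2, \ldots, c_{N-1}$ and can be written as $F(u_h) = F(c_1, c_2, \ldots, c_{N-1})$. The necessary conditions for a global minimum are $\frac{\partial F}{\partial c_i} = 0$, $i = 1, 2, \ldots, N-1$. Taking the partial derivatives directly with respect to $c_i$, we get

$$\int_a^b \left(\phi_i'(x) \sum_{j=1}^{N-1} c_j \phi_j'(x) + q\phi_i(x) \sum_{j=1}^{N-1} c_j \phi_j(x) - f\phi_i(x)\right)dx = 0, \quad i = 1, 2, \ldots, N-1.$$

Exchanging the order of integration and summation, we get

$$\sum_{j=1}^{N-1} c_j \int_a^b \left(\phi_i'(x)\phi_j'(x) + q\phi_i(x)\phi_j(x)\right)dx = \int_a^b f\phi_i(x)\,dx, \quad i = 1, 2, \ldots, N-1,$$

which is exactly the same linear system (9) obtained using the Galerkin method.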
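The equivalence can also be read off in matrix notation. As a short sketch, using the matrix $A$ and load vector $b$ of (10)-(12) and the symmetry $a_{ij} = a_{ji}$, the functional restricted to functions of the form (8) is a quadratic function of $c$:

$$F(c_1, \ldots, c_{N-1}) = \frac12 \sum_{i,j=1}^{N-1} c_i a_{ij} c_j - \sum_{i=1}^{N-1} b_i c_i = \frac12 c^t A c - b^t c, \qquad \nabla F(c) = Ac - b.$$

Setting $\nabla F(c) = 0$ therefore gives $Ac = b$. When $A$ is symmetric positive definite, this stationary point is the unique minimizer of $F$ over the FE space.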

2.3.4 Computer implementation

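It is straightforward to calculate the entries $\hat a_{i,j} = \int_a^b \phi_i'\phi_j'\,dx$. For $|i - j| > 1$, we have $\hat a_{i,j} = 0$, since $\phi_i$ and $\phi_j$ lack overlapping support. However, if $i = j$, then

$$\hat a_{i,i} = \int_a^b (\phi_i')^2\,dx = \int_{x_{i-1}}^{x_i} \frac{1}{h_i^2}\,dx + \int_{x_i}^{x_{i+1}} \left(-\frac{1}{h_{i+1}}\right)^2 dx = \frac{1}{h_i} + \frac{1}{h_{i+1}}, \quad i = 1, 2, \ldots, N-1.$$

Furthermore, if $j = i + 1$, then

$$\hat a_{i,i+1} = \int_a^b \phi_i'\phi_{i+1}'\,dx = \int_{x_i}^{x_{i+1}} \left(-\frac{1}{h_{i+1}}\right)\frac{1}{h_{i+1}}\,dx = -\frac{1}{h_{i+1}}, \quad i = 1, 2, \ldots, N-2. \tag{13}$$

By symmetry, we also have

$$\hat a_{i+1,i} = \int_a^b \phi_{i+1}'\phi_i'\,dx = -\frac{1}{h_{i+1}}, \quad i = 1, 2, \ldots, N-2.$$

To obtain $\tilde a_{i,j} = \int_a^b q\phi_i\phi_j\,dx$ and $b_i = \int_a^b f\phi_i\,dx$, we use the composite trapezoidal rule

$$\int_a^b f(x)\,dx = \sum_{i=1}^N \int_{x_{i-1}}^{x_i} f(x)\,dx \approx \frac12\left(h_1 f(x_0) + \sum_{i=1}^{N-1} (h_i + h_{i+1}) f(x_i) + h_N f(x_N)\right).$$

So, we can easily verify that

$$\tilde a_{i,j} = \int_a^b q\phi_i\phi_j\,dx \approx \begin{cases} \dfrac{q_i}{2}(h_i + h_{i+1}), & i = j, \\ 0, & i \ne j, \end{cases} \qquad b_i = \int_a^b f\phi_i\,dx \approx \frac12 (h_i + h_{i+1}) f_i,$$

where $q_i = q(x_i)$ and $f_i = f(x_i)$. Thus, the matrix $A = (\hat a_{i,j} + \tilde a_{i,j})$ is tridiagonal and has the form

$$A = \begin{bmatrix} \frac{1}{h_1} + \frac{1}{h_2} + \frac{q_1}{2}(h_1 + h_2) & -\frac{1}{h_2} & 0 & \cdots & 0 \\ -\frac{1}{h_2} & \frac{1}{h_2} + \frac{1}{h_3} + \frac{q_2}{2}(h_2 + h_3) & -\frac{1}{h_3} & \ddots & \vdots \\ 0 & -\frac{1}{h_3} & \ddots & \ddots & 0 \\ \vdots & \ddots & \ddots & \ddots & -\frac{1}{h_{N-1}} \\ 0 & \cdots & 0 & -\frac{1}{h_{N-1}} & \frac{1}{h_{N-1}} + \frac{1}{h_N} + \frac{q_{N-1}}{2}(h_{N-1} + h_N) \end{bmatrix}.$$

Finally, we obtain the following system: $c_0 = c_N = 0$ and

$$-\frac{1}{h_i} c_{i-1} + \left(\frac{1}{h_i} + \frac{1}{h_{i+1}}\right) c_i - \frac{1}{h_{i+1}} c_{i+1} + q_i \frac{h_i + h_{i+1}}{2} c_i = \frac12 (h_i + h_{i+1}) f_i, \quad i = 1, 2, \ldots, N-1.$$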

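Remark 2.2 Suppose that the partition is uniform, i.e., $h_i = h = \frac{b-a}{N}$ for all $i = 1, 2, \ldots, N$. Then the stiffness matrix $A$ and the load vector $b$ have the form:

$$A = \begin{bmatrix} \frac{2}{h} + hq_1 & -\frac{1}{h} & 0 & \cdots & 0 \\ -\frac{1}{h} & \frac{2}{h} + hq_2 & -\frac{1}{h} & \ddots & \vdots \\ 0 & -\frac{1}{h} & \ddots & \ddots & 0 \\ \vdots & \ddots & \ddots & \ddots & -\frac{1}{h} \\ 0 & \cdots & 0 & -\frac{1}{h} & \frac{2}{h} + hq_{N-1} \end{bmatrix}, \qquad b = h \begin{bmatrix} f_1 \\ f_2 \\ f_3 \\ \vdots \\ f_{N-1} \end{bmatrix}.$$

Finally, we obtain the following system: $c_0 = c_N = 0$ and

$$\frac{-c_{i-1} + 2c_i - c_{i+1}}{h} + hq_ic_i = hf_i \quad \Longrightarrow \quad -\frac{c_{i-1} - 2c_i + c_{i+1}}{h^2} + q_ic_i = f_i, \quad i = 1, 2, \ldots, N-1,$$

which is the same system obtained using the finite difference method, where $u''$ is approximated using the second-order central difference formula $u''(x_i) \approx \frac{u(x_{i-1}) - 2u(x_i) + u(x_{i+1})}{h^2}$. We conclude that the above FE method using the composite trapezoidal rule is equivalent to the finite difference method of order 2.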
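The tridiagonal system of this subsection can be assembled directly from the formulas for $\hat a_{i,j}$, $\tilde a_{i,j}$, and $b_i$. The following is a minimal sketch, assuming NumPy and a dense matrix for brevity; the function name `fe_bvp_p1` is hypothetical, and `q` and `f` are assumed to be callables evaluated at single nodes.

```python
import numpy as np

def fe_bvp_p1(q, f, nodes):
    """P1 FE solution of -u'' + q u = f, u(a) = u(b) = 0 (illustrative sketch).

    Uses the composite trapezoidal rule for the q- and f-integrals.
    Returns the nodal values c_0, ..., c_N with c_0 = c_N = 0.
    """
    x = np.asarray(nodes, dtype=float)
    N = len(x) - 1
    h = np.diff(x)                        # h[i-1] is h_i
    A = np.zeros((N - 1, N - 1))
    b = np.zeros(N - 1)
    for i in range(1, N):                 # interior nodes only
        r = i - 1
        hl, hr = h[i - 1], h[i]           # h_i and h_{i+1}
        A[r, r] = 1.0 / hl + 1.0 / hr + 0.5 * q(x[i]) * (hl + hr)
        if i > 1:
            A[r, r - 1] = -1.0 / hl
        if i < N - 1:
            A[r, r + 1] = -1.0 / hr
        b[r] = 0.5 * (hl + hr) * f(x[i])
    c = np.linalg.solve(A, b)
    return np.concatenate(([0.0], c, [0.0]))

# Example on a nonuniform mesh: -u'' = 2 with exact solution u = x (1 - x).
nodes = np.array([0.0, 0.1, 0.3, 0.35, 0.7, 1.0])
u_h = fe_bvp_p1(lambda t: 0.0, lambda t: 2.0, nodes)
```

In this particular example the nodal values agree with the exact solution up to rounding, because the discrete equations are satisfied exactly by quadratic polynomials; for general data the nodal error is only of order $h^2$.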

2.3.5 Existence, uniqueness, and basic a priori error estimate

Lemma 2.1 The matrix $A$ with entries $a_{i,j} = \int_a^b \phi_i'\phi_j'\,dx$ is symmetric positive definite, i.e., $a_{i,j} = a_{j,i}$ and

$$x^tAx = \sum_{i,j=1}^{N-1} x_i a_{i,j} x_j > 0, \quad \text{for all nonzero } x = [x_1, \ldots, x_{N-1}]^t \in \mathbb{R}^{N-1}.$$

Theorem 2.2 The linear system (10) obtained using the FE method has a unique solution. Consequently, the FE solution $u_h$ is unique.
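A short argument for the positive definiteness, given here as a sketch: for $x = [x_1, \ldots, x_{N-1}]^t$, set $v_h = \sum_{i=1}^{N-1} x_i \phi_i$. Then

$$x^tAx = \sum_{i,j=1}^{N-1} x_i x_j \int_a^b \phi_i'\phi_j'\,dx = \int_a^b \big(v_h'\big)^2\,dx \ge 0.$$

If $x^tAx = 0$, then $v_h' = 0$ on every element, so the continuous function $v_h$ is constant; since $v_h(a) = 0$, we get $v_h = 0$ and hence $x = 0$. A symmetric positive definite matrix is invertible, which gives the uniqueness stated in Theorem 2.2 for the case $q = 0$; for $q \ge 0$ the additional term $\int_a^b q v_h^2\,dx$ is nonnegative and the same argument applies.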

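Next, we state a general convergence result for the Galerkin method. We first define the following norm and semi-norm: For $v \in H_0^1$, we define

$$\|v\| = \left(\int_a^b v^2(x)\,dx\right)^{1/2}, \qquad |v|_1 = \|v'\| = \left(\int_a^b (v'(x))^2\,dx\right)^{1/2}.$$

Theorem 2.3 Suppose that $q(x) \ge 0$, $\forall x \in [a,b]$. Let $u$ be the solution to (4) and $u_h$ be the solution to (6). Then there exists a constant $C$ such that

$$\|(u - u_h)'\| \le C\|(u - v_h)'\|, \quad \forall v_h \in V_{h,0}^1, \tag{14}$$

where $C$ is given by $C = 1 + \max_{x \in [a,b]} |q(x)|$, which is independent of the choice of $V_{h,0}^1$.

Remark 2.3 From (14), taking the minimum over all $v_h \in V_{h,0}^1$, we get $\|(u - u_h)'\| \le C \min_{v_h \in V_{h,0}^1} \|(u - v_h)'\|$. Thus, $|u - u_h|_1 \le C \min_{v_h \in V_{h,0}^1} |u - v_h|_1$, where $C = 1 + \max_{x \in [a,b]} |q(x)|$.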

Next, we study the convergence of $u_h$ to $u$. Let $u \in H_0^1$. Define the piecewise linear interpolant by

$$\pi u = \sum_{j=1}^{N-1} u(x_j)\phi_j(x) \in V_{h,0}^1, \quad x \in [a,b].$$

Since $\pi u \in V_{h,0}^1$, the estimate (14) gives

$$\|(u - u_h)'\| \le C\|(u - \pi u)'\|.$$

This inequality suggests that the error between $u$ and $u_h$ is controlled by the interpolation error $u - \pi u$ in the $|\cdot|_1$ semi-norm.

Theorem 2.4 (A priori error estimate) Suppose that $q(x) \ge 0$, $\forall x \in [a,b]$. Let $u$ be the solution to (4) and $u_h$ be the solution to (6). Then there exists a constant $C$ such that

$$\|(u - u_h)'\|^2 \le C \sum_{i=1}^N h_i^2 \|u''\|_{I_i}^2,$$

where $C$ is a constant independent of $h$. Consequently, if $h = \max_i h_i$, then

$$\|(u - u_h)'\|^2 \le Ch^2\|u''\|^2.$$

Remark 2.4

If the partition is not uniform, then we obtain the same error estimate with $h = \max_{i=1,2,\ldots,N} (x_i - x_{i-1})$.
The error is expressed in terms of the exact solution $u$, which makes this an a priori estimate. If it is expressed in terms of the computed solution $u_h$, it is an a posteriori error estimate (this yields a computable error bound).
$u_h \to u$ in the $\|v'\|$-norm as $h = \max_i h_i \to 0$. If $\|(u - u_h)'\| = 0$, then $u - u_h$ is constant, but since $u(a) = u_h(a)$ we also have $u - u_h = 0$ and therefore $u_h = u$.
$u_h$ is the best approximation within the space $V_{h,0}^1$ with respect to the $\|v'\|$-norm, up to the constant $C$ in (14).
The norm $\|v'\|$ is referred to as the energy norm and often has a physical meaning.
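The first-order convergence in the energy norm predicted by Theorem 2.4 can be observed numerically. The sketch below is an illustration under stated assumptions: the test problem $-u'' = \pi^2 \sin(\pi x)$ on $[0,1]$ with exact solution $u = \sin(\pi x)$ is chosen for convenience, the mesh is uniform, the load vector uses the trapezoidal rule, and the error integral is approximated by a two-point Gauss rule on each element. The function name `energy_error` is hypothetical.

```python
import numpy as np

def energy_error(N):
    """Return ||(u - u_h)'|| for -u'' = pi^2 sin(pi x), u(0) = u(1) = 0 (sketch)."""
    x = np.linspace(0.0, 1.0, N + 1)
    h = 1.0 / N
    # Stiffness matrix (1/h) * tridiag(-1, 2, -1) and trapezoidal load vector
    A = (2.0 * np.eye(N - 1) - np.eye(N - 1, k=1) - np.eye(N - 1, k=-1)) / h
    b = h * np.pi**2 * np.sin(np.pi * x[1:-1])
    c = np.concatenate(([0.0], np.linalg.solve(A, b), [0.0]))

    slope = np.diff(c) / h                    # u_h' is constant on each element
    mid = 0.5 * (x[:-1] + x[1:])
    g = 0.5 * h / np.sqrt(3.0)                # two-point Gauss offsets
    du = lambda t: np.pi * np.cos(np.pi * t)  # exact derivative u'
    err2 = 0.5 * h * np.sum((du(mid - g) - slope)**2 + (du(mid + g) - slope)**2)
    return np.sqrt(err2)

errors = [energy_error(N) for N in (16, 32, 64)]
rates = [np.log2(errors[k] / errors[k + 1]) for k in range(2)]
```

Halving $h$ should roughly halve the error, so the observed rates are expected to be close to 1.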

2.3.6 Boundary conditions

In problem (3) we considered homogeneous Dirichlet boundary conditions. Here, we extend the FE method to boundary conditions of different types. There are three important types of boundary conditions (BCs):

Dirichlet BCs: $u(a) = \alpha$ and $u(b) = \beta$ for two real numbers $\alpha$ and $\beta$. This type of BC is also known as a strong or essential BC.
Neumann BCs: $u'(a) = \alpha$ and $u'(b) = \beta$ for two real numbers $\alpha$ and $\beta$. This type of BC is also known as a natural BC.
Robin BCs: $u'(a) = \alpha u(a)$ and $u'(b) = \beta u(b)$ for two real numbers $\alpha$ and $\beta$.

Note that any combination is possible at the two boundary points.

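Nonhomogeneous Dirichlet boundary conditions: Let us consider the following two-point BVP: find $u \in C^2[a,b]$ such that

$$-u'' = f(x), \quad x \in (a,b), \qquad u(a) = \alpha, \quad u(b) = \beta, \tag{15}$$

where $\alpha$ and $\beta$ are given constants and $f \in C[a,b]$ is a given function. In this case, the admissible function space $H_0^1 = \{v : \|v\|^2 + \|v'\|^2 < \infty,\ v(a) = v(b) = 0\}$ and the FE space $V_{h,0}^1$ defined earlier remain the same. Multiplying (15) by a test function $v \in H_0^1$ and integrating by parts gives

$$\int_a^b fv\,dx = \int_a^b -u''v\,dx = -u'(b)v(b) + u'(a)v(a) + \int_a^b u'v'\,dx = \int_a^b u'v'\,dx,$$

since $v(a) = v(b) = 0$. Hence, the weak or variational form of (15) reads: Given $u(a) = \alpha$, $u(b) = \beta$, find $u \in H^1 = \{v : \|v\|^2 + \|v'\|^2 < \infty\}$, such that

$$\int_a^b u'v'\,dx = \int_a^b fv\,dx, \quad \forall v \in H_0^1. \tag{16}$$

Let $V_h^1$ and $V_{h,0}^1$, respectively, be the space of all continuous piecewise linear functions and the space of all continuous piecewise linear functions which vanish at the endpoints $a$ and $b$. We also let $a = x_0 < x_1 < \cdots < x_N = b$ be a uniform partition of the interval $[a,b]$. Moreover, let $\{\phi_i\}$ be the set of hat basis functions of $V_h^1$ associated with the $N+1$ nodes $x_j$, $j = 0, 1, \ldots, N$, such that $\phi_i(x_j) = \delta_{ij}$. The FE approximation of (16) thus reads: Find $u_h \in V_h^1$ such that $u_h(a) = \alpha$, $u_h(b) = \beta$, and

$$\int_a^b u_h'v'\,dx = \int_a^b fv\,dx, \quad \forall v \in V_{h,0}^1. \tag{17}$$

It can be shown that (17) is equivalent to the $N-1$ equations

$$\int_a^b u_h'\phi_i'\,dx = \int_a^b f\phi_i\,dx, \quad i = 1, 2, \ldots, N-1. \tag{18}$$

We expand $u_h$ as a linear combination of hat functions

$$u_h = \sum_{j=0}^N c_j \phi_j(x) = \alpha\phi_0(x) + \sum_{j=1}^{N-1} c_j \phi_j(x) + \beta\phi_N(x), \tag{19}$$

where the coefficients $c_j$, $j = 1, 2, \ldots, N-1$, are the $N-1$ nodal values of $u_h$ to be determined.

Substituting (19) into (18) yields

$$\sum_{j=1}^{N-1} c_j \int_a^b \phi_i'\phi_j'\,dx = \int_a^b \left(f\phi_i - \alpha\phi_0'\phi_i' - \beta\phi_N'\phi_i'\right)dx, \quad i = 1, 2, \ldots, N-1,$$

which is an $(N-1) \times (N-1)$ system of equations for the $c_j$. In matrix form we write

$$Ac = b, \tag{20}$$

where $A$ is an $(N-1) \times (N-1)$ matrix, the so-called stiffness matrix, with entries

$$a_{i,j} = \int_a^b \phi_i'\phi_j'\,dx, \quad i, j = 1, 2, \ldots, N-1, \tag{21}$$

$c = [c_1, c_2, \ldots, c_{N-1}]^t$ is a vector of length $N-1$ containing the unknown coefficients $c_j$, $j = 1, 2, \ldots, N-1$, and $b$ is a vector of length $N-1$, the so-called load vector, with entries

$$b_i = \int_a^b \left(f\phi_i - \alpha\phi_0'\phi_i' - \beta\phi_N'\phi_i'\right)dx, \quad i = 1, 2, \ldots, N-1. \tag{22}$$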

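Computer Implementation: The explicit expressions for the hat functions are given by

$$\phi_i(x) = \begin{cases} 0, & a \le x \le x_{i-1}, \\ \dfrac{x - x_{i-1}}{h_i}, & x_{i-1} < x \le x_i, \\ \dfrac{x_{i+1} - x}{h_{i+1}}, & x_i < x \le x_{i+1}, \\ 0, & x_{i+1} < x \le b, \end{cases} \quad i = 1, 2, \ldots, N-1,$$

$$\phi_0(x) = \begin{cases} \dfrac{x_1 - x}{h_1}, & x_0 \le x \le x_1, \\ 0, & x_1 < x \le b, \end{cases} \qquad \phi_N(x) = \begin{cases} 0, & x_0 \le x \le x_{N-1}, \\ \dfrac{x - x_{N-1}}{h_N}, & x_{N-1} < x \le b. \end{cases}$$

For simplicity we assume the partition is uniform so that $h_i = h$ for $i = 1, 2, \ldots, N$. Hence the derivative $\phi_i'(x)$ is either $-\frac{1}{h}$, $\frac{1}{h}$, or $0$ depending on the interval.

It is straightforward to calculate the entries of the stiffness matrix. For $|i - j| > 1$, we have $a_{i,j} = 0$, since $\phi_i$ and $\phi_j$ lack overlapping support. However, if $i = j$, then

$$a_{i,i} = \int_a^b (\phi_i')^2\,dx = \int_{x_{i-1}}^{x_i} \frac{1}{h^2}\,dx + \int_{x_i}^{x_{i+1}} \left(-\frac{1}{h}\right)^2 dx = \frac{2}{h}, \quad i = 1, 2, \ldots, N-1,$$

where we have used that $x_i - x_{i-1} = x_{i+1} - x_i = h$. Furthermore, if $j = i + 1$, then

$$a_{i,i+1} = \int_a^b \phi_i'\phi_{i+1}'\,dx = \int_{x_i}^{x_{i+1}} \left(-\frac{1}{h}\right)\frac{1}{h}\,dx = -\frac{1}{h}, \quad i = 1, 2, \ldots, N-2.$$

Changing $i$ to $i - 1$ we also have

$$a_{i-1,i} = \int_a^b \phi_{i-1}'\phi_i'\,dx = \int_{x_{i-1}}^{x_i} \left(-\frac{1}{h}\right)\frac{1}{h}\,dx = -\frac{1}{h}, \quad i = 2, 3, \ldots, N-1.$$

Thus the stiffness matrix is

$$A = \frac{1}{h}\begin{bmatrix} 2 & -1 & 0 & \cdots & 0 \\ -1 & 2 & -1 & \ddots & \vdots \\ 0 & -1 & 2 & \ddots & 0 \\ \vdots & \ddots & \ddots & \ddots & -1 \\ 0 & \cdots & 0 & -1 & 2 \end{bmatrix}.$$

The entries $b_i$ of the load vector must often be evaluated using quadrature, since they involve the function $f$, which can be hard to integrate analytically. For example, using the trapezoidal rule one obtains the approximate load vector entries

$$\begin{aligned} b_1 &= \int_a^b \left(f\phi_1 - \alpha\phi_0'\phi_1' - \beta\phi_N'\phi_1'\right)dx = \int_{x_0}^{x_1} \left(f\phi_1 - \alpha\Big(-\frac{1}{h}\Big)\frac{1}{h}\right)dx + \int_{x_1}^{x_2} f\phi_1\,dx = \frac{\alpha}{h} + \int_{x_0}^{x_2} f\phi_1\,dx \approx \frac{\alpha}{h} + hf(x_1), \\ b_i &= \int_a^b \left(f\phi_i - \alpha\phi_0'\phi_i' - \beta\phi_N'\phi_i'\right)dx = \int_{x_{i-1}}^{x_{i+1}} f\phi_i\,dx \approx hf(x_i), \quad i = 2, \ldots, N-2, \\ b_{N-1} &= \int_a^b \left(f\phi_{N-1} - \alpha\phi_0'\phi_{N-1}' - \beta\phi_N'\phi_{N-1}'\right)dx = \int_{x_{N-2}}^{x_{N-1}} f\phi_{N-1}\,dx + \int_{x_{N-1}}^{x_N} \left(f\phi_{N-1} - \beta\frac{1}{h}\Big(-\frac{1}{h}\Big)\right)dx = \int_{x_{N-2}}^{x_N} f\phi_{N-1}\,dx + \frac{\beta}{h} \approx hf(x_{N-1}) + \frac{\beta}{h}. \end{aligned}$$

Assembly: We rewrite (20), (21), (22) as

$$\frac{1}{h}\begin{bmatrix} 2 & -1 & 0 & \cdots & 0 \\ -1 & 2 & -1 & \ddots & \vdots \\ 0 & -1 & 2 & \ddots & 0 \\ \vdots & \ddots & \ddots & \ddots & -1 \\ 0 & \cdots & 0 & -1 & 2 \end{bmatrix} \begin{bmatrix} c_1 \\ c_2 \\ c_3 \\ \vdots \\ c_{N-1} \end{bmatrix} = \begin{bmatrix} hf(x_1) + \frac{\alpha}{h} \\ hf(x_2) \\ hf(x_3) \\ \vdots \\ hf(x_{N-1}) + \frac{\beta}{h} \end{bmatrix}.$$

We note that $u_h(a) = \alpha = u(a)$ and $u_h(b) = \beta = u(b)$. We also see that the system matrix $A$ remains the same as in the homogeneous case, and only the first and last entries of the load vector $b$ need to be modified, because only $\phi_1$ and $\phi_{N-1}$ overlap with the boundary basis functions $\phi_0$ and $\phi_N$. An alternative approach is to use all the basis functions $\phi_0, \ldots, \phi_N$ to form a larger system of equations, i.e., an $(N+1) \times (N+1)$ system. The procedure for inserting the boundary conditions into this system is: enter zeros in the first and $(N+1)$-th rows of the system matrix $A$ except for unity in the main diagonal positions of these two rows, and enter $\alpha$ and $\beta$ in the first and $(N+1)$-th rows of the vector $b$, respectively.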
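The reduced $(N-1) \times (N-1)$ system with the modified first and last load entries can be sketched as follows. This is a minimal illustration assuming NumPy, a uniform mesh with $N \ge 2$, a dense solver, and a callable `f` that accepts arrays; the function name `fe_dirichlet` is hypothetical.

```python
import numpy as np

def fe_dirichlet(f, a, b, N, alpha, beta):
    """P1 FE solution of -u'' = f, u(a) = alpha, u(b) = beta on a uniform mesh.

    Illustrative sketch using the trapezoidal load vector; returns (x, u_h).
    """
    x = np.linspace(a, b, N + 1)
    h = (b - a) / N
    # Stiffness matrix (1/h) * tridiag(-1, 2, -1) for the interior nodes
    A = (2.0 * np.eye(N - 1) - np.eye(N - 1, k=1) - np.eye(N - 1, k=-1)) / h
    rhs = h * np.asarray(f(x[1:-1]), dtype=float)
    rhs[0] += alpha / h                   # contribution of alpha * phi_0
    rhs[-1] += beta / h                   # contribution of beta * phi_N
    c = np.linalg.solve(A, rhs)
    return x, np.concatenate(([alpha], c, [beta]))

# Example: -u'' = 2, u(0) = 1, u(1) = 3, with exact solution u = -x^2 + 3x + 1.
x, u_h = fe_dirichlet(lambda t: 2.0 + 0.0 * t, 0.0, 1.0, 20, 1.0, 3.0)
```

The alternative $(N+1) \times (N+1)$ formulation described above gives the same nodal values; the reduced form is used here only because it mirrors the displayed system.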

General boundary conditions: Let us consider the following two-point BVP: find $u \in C^2[a,b]$ such that

$$-u'' = f(x), \quad x \in (a,b), \qquad u(a) = \alpha, \quad \gamma u(b) + u'(b) = \beta, \tag{23}$$

where $\alpha$, $\beta$ and $\gamma$ are given numbers and $f \in C[a,b]$ is a given function. The boundary condition at $x = b$ is called a Robin boundary condition (a combination of $u$ and $u'$ is prescribed at $x = b$). In this case, the admissible function space is modified to

$$H_0^1 = \{v : \|v\|^2 + \|v'\|^2 < \infty,\ v(a) = 0\}.$$

Multiplying (23) by a function $v \in H_0^1$ and integrating by parts gives

$$\int_a^b fv\,dx = \int_a^b -u''v\,dx = -u'(b)v(b) + u'(a)v(a) + \int_a^b u'v'\,dx$$

$$= -\left(\beta - \gamma u(b)\right)v(b) + u'(a)v(a) + \int_a^b u'v'\,dx.$$

Since $v(a) = 0$, we are left with

$$\int_a^b u'v'\,dx + \gamma u(b)v(b) = \int_a^b fv\,dx + \beta v(b).$$

Hence, the weak or variational form of (23) reads: Find $u \in H^1$ with $u(a) = \alpha$, such that

$$\int_a^b u'v'\,dx + \gamma u(b)v(b) = \int_a^b fv\,dx + \beta v(b), \quad \forall v \in H_0^1. \tag{24}$$

The FE test space $V_h^1$ is now the set of all continuous piecewise linear functions which vanish at the end point $a$. The FE approximation of (24) thus reads: Find the continuous piecewise linear approximation $u_h$ to the solution $u$ that satisfies

$$\int_a^b u_h'v'\,dx + \gamma u_h(b)v(b) = \int_a^b fv\,dx + \beta v(b), \quad \forall v \in V_h^1, \tag{25}$$

with $u_h(a) = \alpha$. As before, (25) can be formulated in matrix form.

2.4 Model problem with coefficient and general Robin BCs

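Let us consider the following two-point BVP: find $u \in C^2[a,b]$ such that

$$-\left(p(x)u'\right)' = f(x), \quad x \in I = [a,b], \qquad p(a)u'(a) = \kappa_0\left(u(a) - \alpha\right), \quad p(b)u'(b) = \kappa_1\left(u(b) - \beta\right), \tag{26}$$

where $p = p(x)$ with $p(x) \ge p_0 > 0$, $f \in L^2(I)$, $\kappa_0, \kappa_1 \ge 0$, and $\alpha, \beta$ are given numbers. Let

$$V = \{v \in C^0(I) : \|v\|^2 + \|v'\|^2 < \infty\}.$$

Multiplying (26) by a function $v \in V$ and integrating by parts gives

$$\int_a^b fv\,dx = \int_a^b -(pu')'v\,dx = \int_a^b pu'v'\,dx - p(b)u'(b)v(b) + p(a)u'(a)v(a) = \int_a^b pu'v'\,dx - \kappa_1\left(u(b) - \beta\right)v(b) + \kappa_0\left(u(a) - \alpha\right)v(a).$$

We gather all terms that depend on $u$ on the left and all $u$-independent terms on the right, and obtain

$$\int_a^b pu'v'\,dx - \kappa_1u(b)v(b) + \kappa_0u(a)v(a) = \int_a^b fv\,dx - \kappa_1\beta v(b) + \kappa_0\alpha v(a), \quad \forall v \in V.$$

The FE method consists of finding $u_h \in V_h = \{v \in C^0[a,b] : v|_{I_i} \in P^1(I_i)\}$ such that

$$\int_a^b pu_h'v'\,dx - \kappa_1u_h(b)v(b) + \kappa_0u_h(a)v(a) = \int_a^b fv\,dx - \kappa_1\beta v(b) + \kappa_0\alpha v(a), \quad \forall v \in V_h. \tag{27}$$

Implementation: We need to assemble a stiffness matrix $A$ and a load vector $b$. Substituting $u_h = \sum_{i=0}^N c_i\phi_i$ into (27) and taking $v = \phi_j$ for $j = 0, 1, \ldots, N$ yields

$$\sum_{i=0}^N c_i\left(\int_a^b p\phi_i'\phi_j'\,dx - \kappa_1\phi_i(b)\phi_j(b) + \kappa_0\phi_i(a)\phi_j(a)\right) = \int_a^b f\phi_j\,dx - \kappa_1\beta\phi_j(b) + \kappa_0\alpha\phi_j(a), \quad j = 0, 1, \ldots, N,$$

which is an $(N+1) \times (N+1)$ system of equations for the $c_i$. In matrix form we write $Ac = b$, where $c = [c_0, \ldots, c_N]^t$ is a vector of length $N+1$ containing the unknown coefficients $c_i$, $i = 0, 1, \ldots, N$, $A$ is an $(N+1) \times (N+1)$ matrix with entries

$$a_{i,j} = \int_a^b p\phi_i'\phi_j'\,dx - \kappa_1\phi_i(b)\phi_j(b) + \kappa_0\phi_i(a)\phi_j(a), \quad i, j = 0, 1, \ldots, N,$$

and $b$ is a vector of length $N+1$ with entries

$$b_j = \int_a^b f\phi_j\,dx - \kappa_1\beta\phi_j(b) + \kappa_0\alpha\phi_j(a), \quad j = 0, 1, \ldots, N.$$

For simplicity, let $p = 1$. Then the matrix $A$ and the vector $b$ (when using the trapezoidal rule) are given by

$$A = \begin{bmatrix} \kappa_0 + \frac{1}{h_1} & -\frac{1}{h_1} & 0 & \cdots & 0 \\ -\frac{1}{h_1} & \frac{1}{h_1} + \frac{1}{h_2} & -\frac{1}{h_2} & \ddots & \vdots \\ 0 & -\frac{1}{h_2} & \ddots & \ddots & 0 \\ \vdots & \ddots & \ddots & \ddots & -\frac{1}{h_N} \\ 0 & \cdots & 0 & -\frac{1}{h_N} & \frac{1}{h_N} - \kappa_1 \end{bmatrix}, \qquad b = \begin{bmatrix} \frac{h_1}{2}f_0 + \kappa_0\alpha \\ \frac{h_1 + h_2}{2}f_1 \\ \vdots \\ \frac{h_{N-1} + h_N}{2}f_{N-1} \\ \frac{h_N}{2}f_N - \kappa_1\beta \end{bmatrix}.$$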
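The matrix and load vector above (for $p = 1$) can be assembled element by element. The sketch below is an illustration assuming NumPy, a dense matrix, and a callable `f` evaluated at single nodes; the function name `fe_robin` is hypothetical. Each element $[x_k, x_{k+1}]$ contributes the local stiffness matrix $\frac{1}{h_{k+1}}\begin{bmatrix} 1 & -1 \\ -1 & 1 \end{bmatrix}$ and the trapezoidal load $\frac{h_{k+1}}{2}$ times the nodal values of $f$; the boundary terms are added afterwards.

```python
import numpy as np

def fe_robin(f, nodes, kappa0, kappa1, alpha, beta):
    """P1 FE solution of -u'' = f with the Robin conditions of (26), p = 1.

    Illustrative sketch; returns the nodal values c_0, ..., c_N.
    """
    x = np.asarray(nodes, dtype=float)
    N = len(x) - 1
    h = np.diff(x)
    A = np.zeros((N + 1, N + 1))
    b = np.zeros(N + 1)
    local = np.array([[1.0, -1.0], [-1.0, 1.0]])
    for k in range(N):                    # element [x_k, x_{k+1}]
        A[k:k + 2, k:k + 2] += local / h[k]
        b[k] += 0.5 * h[k] * f(x[k])      # trapezoidal rule, left node
        b[k + 1] += 0.5 * h[k] * f(x[k + 1])
    A[0, 0] += kappa0                     # + kappa_0 phi_0(a) phi_0(a)
    A[N, N] -= kappa1                     # - kappa_1 phi_N(b) phi_N(b)
    b[0] += kappa0 * alpha
    b[N] -= kappa1 * beta
    return np.linalg.solve(A, b)

# Example: -u'' = 2 on [0, 1] with kappa0 = 1, alpha = 0, kappa1 = 2, beta = 3.
# The function u = -x^2 + 2x + 2 satisfies the equation and both conditions.
nodes = np.array([0.0, 0.2, 0.5, 0.6, 1.0])
u_h = fe_robin(lambda t: 2.0, nodes, 1.0, 2.0, 0.0, 3.0)
```

Note that with the sign convention of (26) the matrix is not guaranteed to be positive definite, and it can even be singular for particular values of $\kappa_0$ and $\kappa_1$, so a general-purpose solver is used in this sketch.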

2.5 The FE method using Lagrange P2 elements

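Let $a = x_0 < x_1 < \cdots < x_N = b$ be a regular partition of the interval $[a,b]$. Suppose that the length of $I_i = [x_{i-1}, x_i]$ is $h_i = x_i - x_{i-1}$. Let $P_k = \{ p(x) = \sum_{j=0}^{k} c_j x^j,\ c_j \in \mathbb{R} \}$ denote the vector space of polynomials in one variable of degree less than or equal to $k$. The FE method for Lagrange P2 elements involves the discrete space

$$V_h^2 = \{ v \in C^0[a,b] : v|_{I_i} \in P_2(I_i),\ i = 1, \ldots, N \},$$

and its subspace $V_{0,h}^2 = \{ v \in V_h^2 : v(a) = v(b) = 0 \}$. These spaces are composed of continuous, piecewise parabolic functions (polynomials of degree less than or equal to 2). The P2 FE method consists in applying the internal variational approximation approach to these spaces.

Lemma 2.2 The space $V_h^2$ is a subspace of $H^1(a,b)$ of dimension $2N+1$. Every function $v_h \in V_h^2$ is uniquely defined by its values at the mesh vertices $x_j$, $j = 0, 1, \ldots, N$, and at the midpoints $x_{j+1/2} = \frac{x_j + x_{j+1}}{2} = x_j + \frac{h_{j+1}}{2}$, $j = 0, 1, \ldots, N-1$, where $h_{j+1} = x_{j+1} - x_j$:

$$v_h(x) = \sum_{j=0}^{N} v_h(x_j)\,\phi_j(x) + \sum_{j=0}^{N-1} v_h(x_{j+1/2})\,\phi_{j+1/2}(x), \quad \forall x \in [a,b],$$

where $\{\phi_j\}_{j=0}^{N}$ and $\{\phi_{j+1/2}\}_{j=0}^{N-1}$ are the shape functions defined as:

$$\phi_j(x) = \phi\!\left(\frac{x - x_j}{h_{j+1}}\right), \quad j = 0, 1, \ldots, N, \qquad \phi_{j+1/2}(x) = \psi\!\left(\frac{x - x_{j+1/2}}{h_{j+1}}\right), \quad j = 0, 1, \ldots, N-1,$$

with

$$\phi(\xi) = \begin{cases} (1+\xi)(1+2\xi), & \xi \in [-1,0], \\ (1-\xi)(1-2\xi), & \xi \in [0,1], \\ 0, & |\xi| > 1, \end{cases} \qquad \psi(\xi) = \begin{cases} 1 - 4\xi^2, & |\xi| \le \frac{1}{2}, \\ 0, & |\xi| > \frac{1}{2}. \end{cases} \tag{28}$$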

Figure 3 shows the global shape functions for the space $V_h^2$ and the three quadratic Lagrange P2 shape functions on the reference interval $[-1,1]$.

Figure 3.
(left) Global shape functions for the space $V_h^2$. (right) The three quadratic Lagrange P2 shape functions on the reference interval $[-1,1]$.

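Remark 2.5 Notice that we have:

$$\phi_j(x_i) = \delta_{ij}, \quad \phi_j(x_{i+1/2}) = 0, \quad \phi_{j+1/2}(x_i) = 0, \quad \phi_{j+1/2}(x_{i+1/2}) = \delta_{ij}.$$

Corollary 2.1 The space $V_{0,h}^2$ is a subspace of $H_0^1(a,b)$ of dimension $2N-1$, and every function $v_h \in V_{0,h}^2$ is uniquely defined by its values at the mesh vertices $x_j$, $j = 1, 2, \ldots, N-1$, and at the midpoints $x_{j+1/2}$, $j = 0, 1, \ldots, N-1$:

$$v_h(x) = \sum_{j=1}^{N-1} v_h(x_j)\,\phi_j(x) + \sum_{j=0}^{N-1} v_h(x_{j+1/2})\,\phi_{j+1/2}(x), \quad \forall x \in [a,b],$$

where the shape functions are defined as:

$$\phi_j(x) = \phi\!\left(\frac{x - x_j}{h_{j+1}}\right), \quad j = 0, 1, \ldots, N, \qquad \phi_{j+1/2}(x) = \psi\!\left(\frac{x - x_{j+1/2}}{h_{j+1}}\right), \quad j = 0, 1, \ldots, N-1,$$

and $\phi(\xi)$ and $\psi(\xi)$ are defined by (28).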

2.5.1 Homogeneous boundary conditions

The variational formulation of the internal approximation of the Dirichlet BVP (3) now consists in finding $u_h \in V_{0,h}^2$ such that:

$$\int_a^b u_h' v'\,dx + \int_a^b q\, u_h v\,dx = \int_a^b f v\,dx, \quad \forall v \in V_{0,h}^2.$$

Here, it is convenient to introduce the notation $x_{j/2}$, $j = 1, \ldots, 2N-1$, for the mesh points and $\phi_{j/2}$, $j = 1, \ldots, 2N-1$, for the basis of $V_{0,h}^2$. Using this notation, we have:

$$u_h = \sum_{j=1}^{2N-1} c_{j/2}\, \phi_{j/2}(x),$$

where $c_{j/2} = u_h(x_{j/2}) \approx u(x_{j/2})$ are the unknown coefficients. This formulation leads to a linear system in $\mathbb{R}^{2N-1}$:

$$Ac = b,$$

where $c = (c_{1/2}, c_1, \ldots, c_{N-1/2})^t \in \mathbb{R}^{2N-1}$ is the unknown vector containing the coefficients $c_{j/2}$, $j = 1, 2, \ldots, 2N-1$, $A$ is a $(2N-1) \times (2N-1)$ matrix with entries

$$a_{ij} = \int_a^b \left( \phi_{i/2}' \phi_{j/2}' + q\, \phi_{i/2} \phi_{j/2} \right) dx, \quad i, j = 1, 2, \ldots, 2N-1,$$

and the load vector $b \in \mathbb{R}^{2N-1}$ has entries

$$b_{i/2} = \int_a^b f \phi_{i/2}\,dx, \quad i = 1, 2, \ldots, 2N-1.$$

Since the shape functions have small support, the matrix $A$ is mostly composed of zeros. However, in contrast to the Lagrange P1 FE method, the matrix $A$ is no longer tridiagonal.

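Computer Implementation: The coefficients of the matrix $A$ can be computed more easily by considering the following change of variables, for $\xi \in [-1,1]$:

$$x = \frac{x_j + x_{j-1}}{2} + \frac{x_j - x_{j-1}}{2}\,\xi = x_{j-1/2} + \frac{x_j - x_{j-1}}{2}\,\xi, \quad \forall x \in [x_{j-1}, x_j], \quad j = 1, 2, \ldots, N.$$

Hence, the shape functions can be reduced to only three basic shape functions (Figure 3):

$$\hat\phi_{-1}(\xi) = \frac{\xi(\xi - 1)}{2}, \quad \hat\phi_0(\xi) = (1 - \xi)(1 + \xi), \quad \hat\phi_1(\xi) = \frac{\xi(\xi + 1)}{2}.$$

Their respective derivatives are

$$\frac{d\hat\phi_{-1}(\xi)}{d\xi} = \frac{2\xi - 1}{2}, \quad \frac{d\hat\phi_0(\xi)}{d\xi} = -2\xi, \quad \frac{d\hat\phi_1(\xi)}{d\xi} = \frac{2\xi + 1}{2}.$$

This approach consists in transferring all computations on an interval $I_i = [x_{i-1}, x_i]$ to the reference interval $[-1,1]$. Thus, we have:

$$\frac{d\phi_i(x)}{dx} = \frac{d\phi_i\!\left(x_{i-1/2} + \frac{x_i - x_{i-1}}{2}\xi\right)}{d\xi}\,\frac{d\xi}{dx} = \frac{2}{x_i - x_{i-1}}\,\frac{d\hat\phi_k(\xi)}{d\xi} = \frac{2}{h_i}\,\frac{d\hat\phi_k(\xi)}{d\xi}.$$

In this case, the elementary contributions of the element $I_i$ to the stiffness matrix and to the mass matrix are given by the $3 \times 3$ matrices $K_{I_i}$ and $M_{I_i}$:

$$K_{I_i} = \int_{I_i} \begin{pmatrix} \phi_{i-1}'\phi_{i-1}' & \phi_{i-1}'\phi_{i-1/2}' & \phi_{i-1}'\phi_i' \\ \phi_{i-1/2}'\phi_{i-1}' & \phi_{i-1/2}'\phi_{i-1/2}' & \phi_{i-1/2}'\phi_i' \\ \phi_i'\phi_{i-1}' & \phi_i'\phi_{i-1/2}' & \phi_i'\phi_i' \end{pmatrix} dx = \frac{2}{h_i} \int_{-1}^{1} \begin{pmatrix} \hat\phi_{-1}'\hat\phi_{-1}' & \hat\phi_{-1}'\hat\phi_0' & \hat\phi_{-1}'\hat\phi_1' \\ \hat\phi_0'\hat\phi_{-1}' & \hat\phi_0'\hat\phi_0' & \hat\phi_0'\hat\phi_1' \\ \hat\phi_1'\hat\phi_{-1}' & \hat\phi_1'\hat\phi_0' & \hat\phi_1'\hat\phi_1' \end{pmatrix} d\xi = \frac{1}{3h_i} \begin{pmatrix} 7 & -8 & 1 \\ -8 & 16 & -8 \\ 1 & -8 & 7 \end{pmatrix},$$

$$M_{I_i} = \int_{I_i} \begin{pmatrix} \phi_{i-1}\phi_{i-1} & \phi_{i-1}\phi_{i-1/2} & \phi_{i-1}\phi_i \\ \phi_{i-1/2}\phi_{i-1} & \phi_{i-1/2}\phi_{i-1/2} & \phi_{i-1/2}\phi_i \\ \phi_i\phi_{i-1} & \phi_i\phi_{i-1/2} & \phi_i\phi_i \end{pmatrix} dx = \frac{h_i}{2} \int_{-1}^{1} \begin{pmatrix} \hat\phi_{-1}\hat\phi_{-1} & \hat\phi_{-1}\hat\phi_0 & \hat\phi_{-1}\hat\phi_1 \\ \hat\phi_0\hat\phi_{-1} & \hat\phi_0\hat\phi_0 & \hat\phi_0\hat\phi_1 \\ \hat\phi_1\hat\phi_{-1} & \hat\phi_1\hat\phi_0 & \hat\phi_1\hat\phi_1 \end{pmatrix} d\xi = \frac{h_i}{30} \begin{pmatrix} 4 & 2 & -1 \\ 2 & 16 & 2 \\ -1 & 2 & 4 \end{pmatrix}.$$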
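The two element matrices can be checked numerically by integrating the reference shape functions with Gauss-Legendre quadrature. The following is a minimal sketch; the function names `p2_reference_basis` and `p2_element_matrices` are chosen for this illustration only. A three-point rule is exact for polynomials up to degree 5, which covers both integrands.

```python
import numpy as np

def p2_reference_basis(xi):
    """Values and derivatives of the three P2 shape functions at xi in [-1, 1]."""
    vals = np.array([xi * (xi - 1) / 2, (1 - xi) * (1 + xi), xi * (xi + 1) / 2])
    ders = np.array([(2 * xi - 1) / 2, -2 * xi, (2 * xi + 1) / 2])
    return vals, ders

def p2_element_matrices(h):
    """Element stiffness and mass matrices on an interval of length h."""
    pts, wts = np.polynomial.legendre.leggauss(3)
    K = np.zeros((3, 3))
    M = np.zeros((3, 3))
    for xi, w in zip(pts, wts):
        vals, ders = p2_reference_basis(xi)
        K += w * (2.0 / h) * np.outer(ders, ders)   # factor 2/h from d/dx = (2/h) d/dxi
        M += w * (h / 2.0) * np.outer(vals, vals)   # factor h/2 from dx = (h/2) dxi
    return K, M

K, M = p2_element_matrices(0.5)
```

The computed matrices agree with the closed forms $\frac{1}{3h_i}(\cdot)$ and $\frac{h_i}{30}(\cdot)$ stated above.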

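Coefficients of the right-hand side $b$: Usually, the function $f$ is only known by its values at the mesh points $x_{i/2}$, $i = 0, 1, \ldots, 2N$, and thus we use the decomposition of $f$ in the basis of shape functions $\phi_{i/2}$, $i = 0, 1, \ldots, 2N$, as $f(x) = \sum_{j=0}^{2N} f(x_{j/2})\,\phi_{j/2}$. Each component $b_{i/2}$ of the right-hand side vector is obtained as $b_{i/2} = \sum_{k=1}^{N} \int_{x_{k-1}}^{x_k} f \phi_{i/2}\,dx$. Using the previous decomposition of $f$, we obtain:

$$b_{i/2} = \sum_{k=1}^{N} \int_{x_{k-1}}^{x_k} \left( \sum_{j=0}^{2N} f(x_{j/2})\,\phi_{j/2} \right) \phi_{i/2}\,dx = \sum_{j=0}^{2N} f(x_{j/2}) \left( \sum_{k=1}^{N} \int_{x_{k-1}}^{x_k} \phi_{i/2}\,\phi_{j/2}\,dx \right).$$

Thus, the problem is reduced to computing the integrals $\int_{x_{k-1}}^{x_k} \phi_{i/2}\,\phi_{j/2}\,dx$. It is easy to see that we obtain expressions very similar to those of the mass matrix. More precisely, the element $I_i = [x_{i-1}, x_i]$ contributes to only three components, of indices $i-1$, $i-\frac{1}{2}$ and $i$, as:

$$b_{I_i} = \frac{h_i}{30} \begin{pmatrix} 4 & 2 & -1 \\ 2 & 16 & 2 \\ -1 & 2 & 4 \end{pmatrix} \begin{pmatrix} f(x_{i-1}) \\ f(x_{i-1/2}) \\ f(x_i) \end{pmatrix}.$$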

2.5.2 Nonhomogeneous boundary conditions

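Consider the following two-point BVP: find $u \in C^2[a,b]$ such that

$$-u'' + q(x)u = f(x), \quad x \in [a,b], \qquad u(a) = \alpha, \quad u(b) = \beta, \tag{29}$$

where $\alpha$ and $\beta$ are given constants and $f \in C[a,b]$ is a given function.

Multiplying (29) by a function $v \in H_0^1 = \{ v : \|v\|^2 + \|v'\|^2 < \infty,\ v(a) = v(b) = 0 \}$ and integrating by parts gives

$$\int_a^b f v\,dx = \int_a^b (-u'' + qu)\, v\,dx = -u'(b)v(b) + u'(a)v(a) + \int_a^b (u'v' + quv)\,dx = \int_a^b (u'v' + quv)\,dx.$$

Hence, the weak or variational form of (29) reads: given $u(a) = \alpha$, $u(b) = \beta$, find $u \in H^1 = \{ v : \|v\|^2 + \|v'\|^2 < \infty \}$ such that

$$\int_a^b (u'v' + quv)\,dx = \int_a^b f v\,dx, \quad \forall v \in H_0^1.$$

Let $V_h^2$ and $V_{0,h}^2$, respectively, be the space of all continuous piecewise quadratic functions and the space of all continuous piecewise quadratic functions which vanish at the end points $a$ and $b$, on a uniform partition $a = x_0 < x_1 < \cdots < x_N = b$ of the interval $[a,b]$.

The FE scheme consists of finding $u_h \in V_h^2$ such that:

$$\int_a^b u_h' v'\,dx + \int_a^b q\, u_h v\,dx = \int_a^b f v\,dx, \quad \forall v \in V_{0,h}^2.$$

Introduce the notation $x_{j/2}$, $j = 0, 1, \ldots, 2N$, for the mesh points, $\phi_{j/2}$, $j = 0, 1, \ldots, 2N$, for the basis of $V_h^2$, and $\phi_{j/2}$, $j = 1, \ldots, 2N-1$, for the basis of $V_{0,h}^2$. Using this notation, we have:

$$u_h = \sum_{j=0}^{2N} c_{j/2}\, \phi_{j/2}(x),$$

where $c_{j/2} = u_h(x_{j/2}) \approx u(x_{j/2})$ are the unknown coefficients. We note that $c_0 = u_h(x_0) = \alpha$ and $c_N = u_h(x_N) = \beta$. This formulation leads to a linear system in $\mathbb{R}^{2N-1}$:

$$Ac = b,$$

where $c = (c_{1/2}, c_1, \ldots, c_{N-1/2})^t \in \mathbb{R}^{2N-1}$ is the unknown vector containing the coefficients $c_{j/2}$, $j = 1, 2, \ldots, 2N-1$, $A$ is a $(2N-1) \times (2N-1)$ matrix with entries

$$a_{ij} = \int_a^b \left( \phi_{i/2}' \phi_{j/2}' + q\, \phi_{i/2} \phi_{j/2} \right) dx, \quad i, j = 1, 2, \ldots, 2N-1,$$

and the load vector $b \in \mathbb{R}^{2N-1}$ has entries

$$b_{i/2} = \int_a^b f \phi_{i/2}\,dx - \alpha \int_a^b \left( \phi_{i/2}' \phi_0' + q\, \phi_{i/2} \phi_0 \right) dx - \beta \int_a^b \left( \phi_{i/2}' \phi_N' + q\, \phi_{i/2} \phi_N \right) dx, \quad i = 1, 2, \ldots, 2N-1.$$

Clearly, the only extra terms are given in the vector with entries

$$\tilde b_{i/2} = -\alpha \int_a^b \left( \phi_{i/2}' \phi_0' + q\, \phi_{i/2} \phi_0 \right) dx - \beta \int_a^b \left( \phi_{i/2}' \phi_N' + q\, \phi_{i/2} \phi_N \right) dx, \quad i = 1, 2, \ldots, 2N-1.$$

Suppose $q = 0$. Then, for $N \ge 2$, only the first and the last element contribute, and we have

$$\begin{aligned} \tilde b_{1/2} &= -\alpha \int_a^b \phi_{1/2}' \phi_0'\,dx - \beta \int_a^b \phi_{1/2}' \phi_N'\,dx = -\alpha \int_{x_0}^{x_1} \phi_{1/2}' \phi_0'\,dx = \frac{8\alpha}{3h_1}, \\ \tilde b_{1} &= -\alpha \int_a^b \phi_{1}' \phi_0'\,dx - \beta \int_a^b \phi_{1}' \phi_N'\,dx = -\alpha \int_{x_0}^{x_1} \phi_{1}' \phi_0'\,dx = -\frac{\alpha}{3h_1}, \\ \tilde b_{i/2} &= -\alpha \int_a^b \phi_{i/2}' \phi_0'\,dx - \beta \int_a^b \phi_{i/2}' \phi_N'\,dx = 0, \quad i = 3, \ldots, 2N-3, \\ \tilde b_{N-1} &= -\alpha \int_a^b \phi_{N-1}' \phi_0'\,dx - \beta \int_a^b \phi_{N-1}' \phi_N'\,dx = -\beta \int_{x_{N-1}}^{x_N} \phi_{N-1}' \phi_N'\,dx = -\frac{\beta}{3h_N}, \\ \tilde b_{N-1/2} &= -\alpha \int_a^b \phi_{N-1/2}' \phi_0'\,dx - \beta \int_a^b \phi_{N-1/2}' \phi_N'\,dx = -\beta \int_{x_{N-1}}^{x_N} \phi_{N-1/2}' \phi_N'\,dx = \frac{8\beta}{3h_N}, \end{aligned}$$

where $h_N = h_1$ on the uniform partition.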

3. The FE for elliptic PDEs

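Here, we apply the FE method to a two-dimensional elliptic problem: find $u$ such that

$$-\nabla \cdot (a \nabla u) + bu = f(x), \quad x \in \Omega, \qquad a \nabla u \cdot n = \kappa (g - u), \quad \text{on } \partial\Omega, \tag{30}$$

where $a > 0$, $b \ge 0$, $\kappa \ge 0$, $f \in L^2(\Omega)$ and $g \in C^0(\partial\Omega)$.

3.1 Meshes

Let $\Omega \subset \mathbb{R}^2$ be a bounded domain whose boundary $\partial\Omega$ is assumed to be polygonal. A triangulation $\mathcal{T}_h$ of $\Omega$ is a set of triangles $T$ such that $\Omega = \bigcup_{T \in \mathcal{T}_h} T$, and any two triangles intersect in a common edge, in a common corner, or not at all. Corners will be referred to as nodes. We let $h_T = \operatorname{diam}(T)$ be the length of the longest edge of $T$.

Let $\mathcal{T}_h$ have $N$ nodes and $M$ triangles. The data is stored in two matrices. The matrix $P \in \mathbb{R}^{2 \times N}$ describes the nodes $(x_1, y_1), \ldots, (x_N, y_N)$, and the matrix $K \in \mathbb{R}^{3 \times M}$ describes the triangles, i.e., it describes which nodes (numbered from 1 to $N$) form a triangle $T$ and how it is oriented:

$$P = \begin{pmatrix} x_1 & x_2 & \cdots & x_N \\ y_1 & y_2 & \cdots & y_N \end{pmatrix}, \qquad K = \begin{pmatrix} n_1^\alpha & n_2^\alpha & \cdots & n_M^\alpha \\ n_1^\beta & n_2^\beta & \cdots & n_M^\beta \\ n_1^\gamma & n_2^\gamma & \cdots & n_M^\gamma \end{pmatrix}.$$

This means that triangle $T_i$ is formed by the nodes $n_i^\alpha$, $n_i^\beta$, and $n_i^\gamma$ (enumerated in counter-clockwise direction).

The Delaunay algorithm determines a triangulation with the given points as triangle nodes. Delaunay triangulations are optimal in the sense that they maximize the smallest angle over all triangles.

MATLAB has a built-in toolbox, the PDE Toolbox, which includes a mesh generation algorithm.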

3.2 Piecewise polynomial spaces

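Let $T$ be a triangle with nodes $N_1 = (x_1, y_1)$, $N_2 = (x_2, y_2)$, and $N_3 = (x_3, y_3)$. We define

$$P_1(T) = \{ v \in C^0(T) : v(x,y) = c_1 + c_2 x + c_3 y,\ c_1, c_2, c_3 \in \mathbb{R} \}.$$

Now let $v_i = v(N_i)$ for $i = 1, 2, 3$. Note that $v \in P_1(T)$ is determined by $\{v_i\}_{i=1}^{3}$. Given the $v_i$, we compute the $c_i$ by

$$\begin{pmatrix} 1 & x_1 & y_1 \\ 1 & x_2 & y_2 \\ 1 & x_3 & y_3 \end{pmatrix} \begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix} = \begin{pmatrix} v_1 \\ v_2 \\ v_3 \end{pmatrix}.$$

This is solvable due to

$$\det \begin{pmatrix} 1 & x_1 & y_1 \\ 1 & x_2 & y_2 \\ 1 & x_3 & y_3 \end{pmatrix} = 2|T| \ne 0, \qquad \begin{pmatrix} 1 & x_1 & y_1 \\ 1 & x_2 & y_2 \\ 1 & x_3 & y_3 \end{pmatrix}^{-1} = \frac{1}{2|T|} \begin{pmatrix} x_2 y_3 - x_3 y_2 & x_3 y_1 - x_1 y_3 & x_1 y_2 - x_2 y_1 \\ y_2 - y_3 & y_3 - y_1 & y_1 - y_2 \\ x_3 - x_2 & x_1 - x_3 & x_2 - x_1 \end{pmatrix},$$

where $|T| = \frac{1}{2}\left( x_2 y_3 - x_3 y_2 - x_1 y_3 + x_3 y_1 + x_1 y_2 - x_2 y_1 \right)$, which is $\pm$ the area of the triangle $T$.

Let $\lambda_j \in P_1(T)$ be given by the nodal values $\lambda_j(N_i) = \delta_{ij}$, where $\delta_{ij}$ is the Kronecker symbol. This gives us $v(x,y) = \alpha_1 \lambda_1(x,y) + \alpha_2 \lambda_2(x,y) + \alpha_3 \lambda_3(x,y)$, where $\alpha_i = v(N_i)$ for $i = 1, 2, 3$. We can compute $\lambda_i(x,y)$ as follows: let $\lambda_i(x,y) = a_i + b_i x + c_i y$. Using $\lambda_j(N_i) = \delta_{ij}$, we get

$$\begin{pmatrix} 1 & x_1 & y_1 \\ 1 & x_2 & y_2 \\ 1 & x_3 & y_3 \end{pmatrix} \begin{pmatrix} a_1 \\ b_1 \\ c_1 \end{pmatrix} = \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}, \quad \begin{pmatrix} 1 & x_1 & y_1 \\ 1 & x_2 & y_2 \\ 1 & x_3 & y_3 \end{pmatrix} \begin{pmatrix} a_2 \\ b_2 \\ c_2 \end{pmatrix} = \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}, \quad \begin{pmatrix} 1 & x_1 & y_1 \\ 1 & x_2 & y_2 \\ 1 & x_3 & y_3 \end{pmatrix} \begin{pmatrix} a_3 \\ b_3 \\ c_3 \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix}.$$

Solving the systems, we get

$$\lambda_1(x,y) = \frac{1}{2|T|}\left( x_2 y_3 - x_3 y_2 + (y_2 - y_3)\,x + (x_3 - x_2)\,y \right),$$

$$\lambda_2(x,y) = \frac{1}{2|T|}\left( x_3 y_1 - x_1 y_3 + (y_3 - y_1)\,x + (x_1 - x_3)\,y \right),$$

$$\lambda_3(x,y) = \frac{1}{2|T|}\left( x_1 y_2 - x_2 y_1 + (y_1 - y_2)\,x + (x_2 - x_1)\,y \right).$$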
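The coefficients of the three nodal functions on a triangle can also be obtained numerically by solving the three $3 \times 3$ systems at once. The sketch below is an illustration only; the function name `hat_coefficients` is an assumption of this example.

```python
import numpy as np

def hat_coefficients(nodes):
    """Return (coeffs, signed_area) for a triangle given as three (x, y) nodes.

    Row i of coeffs holds (a_i, b_i, c_i) with lambda_i(x, y) = a_i + b_i*x + c_i*y.
    """
    nodes = np.asarray(nodes, dtype=float)
    B = np.column_stack([np.ones(3), nodes])     # rows [1, x_i, y_i]
    solution = np.linalg.solve(B, np.eye(3))     # column i solves B z = e_i
    signed_area = 0.5 * np.linalg.det(B)         # positive for counter-clockwise nodes
    return solution.T, signed_area

coeffs, area = hat_coefficients([(1.0, 1.0), (4.0, 2.0), (2.0, 5.0)])
```

For the triangle in the usage line, $2|T| = 11$, and the last row of `coeffs` reproduces the closed-form coefficients of $\lambda_3$, including the $y$-coefficient $(x_2 - x_1)/(2|T|) = 3/11$.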

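Let $\mathcal{T}_h$ be a triangulation of $\Omega$. Then we let

$$V_h = \{ v \in C(\Omega) : v|_T \in P_1(T),\ \forall T \in \mathcal{T}_h \}.$$

Functions in $V_h$ are piecewise linear and continuous. We know that $v \in V_h$ is uniquely determined by $\{v(N_i)\}_{i=1}^{N}$. We let $\phi_j(N_i) = \delta_{ij}$ and let $\{\phi_j\}_{j=1}^{N} \subset V_h$ be a basis for $V_h$ (hat functions), i.e.,

$$v(x,y) = \sum_{i=1}^{N} \alpha_i \phi_i(x,y), \quad \alpha_i = v(N_i), \quad i = 1, 2, \ldots, N.$$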

3.3 Interpolation

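Given $u \in C(T)$ on a single triangle with nodes $N_i = (x_i, y_i)$, $i = 1, 2, 3$, we let

$$\pi u(x,y) = \sum_{i=1}^{3} u(N_i)\,\phi_i(x,y),$$

in particular $\pi u(N_i) = u(N_i)$, $i = 1, 2, 3$. We want to estimate the interpolation error $u - \pi u$. Let

$$\|u\|_{L^2(\Omega)}^2 = \int_\Omega u(x)^2\,dx\,dy, \quad \|Du\|_{L^2(\Omega)}^2 = \|u_x\|_{L^2(\Omega)}^2 + \|u_y\|_{L^2(\Omega)}^2, \quad \|D^2u\|_{L^2(\Omega)}^2 = \|u_{xx}\|_{L^2(\Omega)}^2 + 2\|u_{xy}\|_{L^2(\Omega)}^2 + \|u_{yy}\|_{L^2(\Omega)}^2.$$

Theorem 3.1 Suppose that $u \in C^2(T)$. Then the following hold:

$$\|u - \pi u\|_{L^2(T)} \le C h_T^2 \|D^2u\|_{L^2(T)}, \qquad \|D(u - \pi u)\|_{L^2(T)} \le C h_T \|D^2u\|_{L^2(T)},$$

where $C$ is a generic constant independent of $h_T$ and $u$, but dependent on the ratio between the smallest and the largest interior angle of the triangle $T$.

Now, we consider the continuous piecewise linear interpolant $\pi u = \sum_{i=1}^{N} u(N_i)\,\phi_i$.

Theorem 3.2 Suppose that $u \in C^2(T)$ for all $T \in \mathcal{T}_h$. Then the following hold:

$$\|u - \pi u\|_{L^2(\Omega)}^2 \le C \sum_{T \in \mathcal{T}_h} h_T^4 \|D^2u\|_{L^2(T)}^2, \qquad \|D(u - \pi u)\|_{L^2(\Omega)}^2 \le C \sum_{T \in \mathcal{T}_h} h_T^2 \|D^2u\|_{L^2(T)}^2,$$

where $C$ is a generic constant independent of $h$ and $u$, but dependent on the ratio between the smallest and the largest interior angle of the triangles of $\mathcal{T}_h$. Here $\|D(u - \pi u)\|_{L^2(\Omega)}^2 = \sum_{T \in \mathcal{T}_h} \|D(u - \pi u)\|_{L^2(T)}^2$.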

3.4 L2-projection

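Let $\Omega \subset \mathbb{R}^2$. We consider the space $L^2(\Omega) = \{ v : \int_\Omega v^2(x,y)\,dx\,dy < \infty \}$. Let $u \in L^2(\Omega)$. We define the $L^2$-projection $P_h : L^2(\Omega) \to V_h = \{ v \in C^0(\Omega) : v|_T \in P_1(T),\ \forall T \in \mathcal{T}_h \}$ by $P_h u \in V_h$ such that

$$\int_\Omega (u - P_h u)\, v_h\,dx\,dy = 0, \quad \forall v_h \in V_h.$$

The problem of finding $P_h u \in V_h$ is equivalent to solving the following linear system

$$\int_\Omega (u - P_h u)\, \phi_i\,dx\,dy = 0, \quad i = 1, 2, \ldots, N,$$

where $\{\phi_i\}_{i=1}^{N}$ is a basis of $V_h$.

Since $P_h u \in V_h$, we can express it as $P_h u = \sum_{i=1}^{N} c_i \phi_i(x,y)$, where $c_i \in \mathbb{R}$. Therefore, to find $P_h u \in V_h$ we need to find $c_1, c_2, \ldots, c_N \in \mathbb{R}$ such that

$$\sum_{i=1}^{N} c_i \int_\Omega \phi_i \phi_j\,dx\,dy = \int_\Omega u\, \phi_j\,dx\,dy, \quad j = 1, 2, \ldots, N.$$

The problem can be expressed as a linear system of equations $Mc = b$, where $c = (c_1, c_2, \ldots, c_N)^t$ and the entries of the matrix $M \in \mathbb{R}^{N \times N}$ and the vector $b \in \mathbb{R}^N$ are given by

$$m_{ij} = \int_\Omega \phi_i \phi_j\,dx\,dy, \qquad b_j = \int_\Omega u\, \phi_j\,dx\,dy.$$

In general, we use a quadrature rule to approximate integrals. The general form is

$$\int_T f(x,y)\,dx\,dy \approx \sum_{j=1}^{n} \omega_j f(\bar N_j),$$

where the $\omega_j$ denote the weights and the $\bar N_j$ the quadrature points.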

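Lemma 3.1 The mass matrix $M$ with entries $m_{ij} = \int_\Omega \phi_i \phi_j\,dx\,dy$ is symmetric and positive definite.

Theorem 3.3 For any $u \in L^2(\Omega)$ the $L^2$-projection $P_h u$ exists and is unique.

3.5 A priori error estimate

Theorem 3.4 Let $u \in L^2(\Omega)$ and let $P_h u$ be the $L^2$-projection of $u$. Then

$$\|u - P_h u\|_{L^2(\Omega)} \le \|u - v_h\|_{L^2(\Omega)}, \quad \forall v_h \in V_h.$$

Theorem 3.5 Suppose that $u \in C^2(\Omega)$ with $u \in C^2(T)$ for all $T \in \mathcal{T}_h$. Then there exists a constant $C$ such that

$$\|u - P_h u\|_{L^2(\Omega)}^2 \le C \sum_{T \in \mathcal{T}_h} h_T^4 \|D^2u\|_{L^2(T)}^2.$$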

3.6 The FE method for general elliptic problem

The FE method was designed to approximate solutions to complicated equations of elasticity and structural mechanics, usually modeled by elliptic type equations, with complicated geometries. It has been developed for other applications as well.

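Consider the following two-dimensional elliptic problem: find $u$ such that

$$-\nabla \cdot (a \nabla u) + bu = f, \quad \text{in } \Omega, \qquad a \nabla u \cdot n = \kappa (g - u), \quad \text{on } \partial\Omega, \tag{31}$$

where $a > 0$, $b \ge 0$, $\kappa \ge 0$, $f \in L^2(\Omega)$ and $g \in C^0(\partial\Omega)$. We seek a weak solution $u$ in

$$V = H^1(\Omega) = \{ v \in L^2(\Omega) : v \text{ has a weak derivative and } \|v\|_{L^2(\Omega)} + \|\nabla v\|_{L^2(\Omega)} < \infty \}.$$

In order to derive the weak formulation, we multiply (31) by $v \in V$, integrate over $\Omega$ and use Green's formula to obtain

$$\int_\Omega f v\,dx\,dy = -\int_\Omega v\, \nabla \cdot (a \nabla u)\,dx\,dy + \int_\Omega b u v\,dx\,dy = \int_\Omega a \nabla u \cdot \nabla v\,dx\,dy - \int_{\partial\Omega} v\, a \nabla u \cdot n\,ds + \int_\Omega b u v\,dx\,dy = \int_\Omega a \nabla u \cdot \nabla v\,dx\,dy + \int_\Omega b u v\,dx\,dy + \int_{\partial\Omega} \kappa (u - g)\, v\,ds.$$

We obtain the weak form: find $u \in V$ such that

$$\int_\Omega a \nabla u \cdot \nabla v\,dx\,dy + \int_\Omega b u v\,dx\,dy + \int_{\partial\Omega} \kappa u v\,ds = \int_\Omega f v\,dx\,dy + \int_{\partial\Omega} \kappa g v\,ds, \quad \forall v \in V. \tag{32}$$

We can formulate the method as in the 1D case by using the weak formulation (32). The FE method in 2D is defined as follows: find $u_h \in V_h$ such that

$$\int_\Omega a \nabla u_h \cdot \nabla v_h\,dx\,dy + \int_\Omega b u_h v_h\,dx\,dy + \int_{\partial\Omega} \kappa u_h v_h\,ds = \int_\Omega f v_h\,dx\,dy + \int_{\partial\Omega} \kappa g v_h\,ds, \quad \forall v_h \in V_h, \tag{33}$$

where $V_h = \{ v \in V : v|_T \in P_1(T),\ \forall T \in \mathcal{T}_h \}$.

Implementation: Let $a = 1$ and $b = g = 0$. Substituting $u_h = \sum_{j=1}^{N} c_j \phi_j$ into (33) and picking $v_h = \phi_i$, we obtain

$$\sum_{j=1}^{N} c_j \left( \int_\Omega \nabla \phi_j \cdot \nabla \phi_i\,dx\,dy + \int_{\partial\Omega} \kappa\, \phi_j \phi_i\,ds \right) = \int_\Omega f \phi_i\,dx\,dy, \quad i = 1, 2, \ldots, N.$$

This gives us the system $(A + R)c = b$, where $c = (c_1, c_2, \ldots, c_N)^t \in \mathbb{R}^N$ is the unknown vector and the entries of $A \in \mathbb{R}^{N \times N}$, $R \in \mathbb{R}^{N \times N}$, and $b \in \mathbb{R}^N$ are given by

$$a_{ij} = \int_\Omega \nabla \phi_j \cdot \nabla \phi_i\,dx\,dy, \quad r_{ij} = \int_{\partial\Omega} \kappa\, \phi_j \phi_i\,ds, \quad b_i = \int_\Omega f \phi_i\,dx\,dy, \quad i, j = 1, 2, \ldots, N.$$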

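Assembly of the stiffness matrix $A$: We can again identify the local contributions that come from a particular triangle $T$:

$$a_{ij}^T = \int_T \nabla \phi_j \cdot \nabla \phi_i\,dx\,dy, \quad i, j = 1, 2, 3,$$

where $T$ is an arbitrary triangle with vertices $N_i = (x_i, y_i)$ and $\phi_i$ are the hat functions, i.e., $\phi_j(N_i) = \delta_{ij}$. Let $\phi_i(x,y) = \alpha_i + \beta_i x + \gamma_i y$, for $i = 1, 2, 3$. Then, we compute $\alpha_i, \beta_i, \gamma_i$ by

$$\begin{pmatrix} 1 & x_1 & y_1 \\ 1 & x_2 & y_2 \\ 1 & x_3 & y_3 \end{pmatrix} \begin{pmatrix} \alpha_1 \\ \beta_1 \\ \gamma_1 \end{pmatrix} = \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}, \quad \begin{pmatrix} 1 & x_1 & y_1 \\ 1 & x_2 & y_2 \\ 1 & x_3 & y_3 \end{pmatrix} \begin{pmatrix} \alpha_2 \\ \beta_2 \\ \gamma_2 \end{pmatrix} = \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}, \quad \begin{pmatrix} 1 & x_1 & y_1 \\ 1 & x_2 & y_2 \\ 1 & x_3 & y_3 \end{pmatrix} \begin{pmatrix} \alpha_3 \\ \beta_3 \\ \gamma_3 \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix}.$$

In general we have $B \boldsymbol{\alpha}_i = e_i$ for $i = 1, 2, 3$, where

$$B = \begin{pmatrix} 1 & x_1 & y_1 \\ 1 & x_2 & y_2 \\ 1 & x_3 & y_3 \end{pmatrix}, \quad \boldsymbol{\alpha}_i = \begin{pmatrix} \alpha_i \\ \beta_i \\ \gamma_i \end{pmatrix}, \quad e_1 = \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}, \quad e_2 = \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}, \quad e_3 = \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix}.$$

Furthermore, since $\phi_i$ is linear on $T$, we have $\nabla \phi_i = (\beta_i, \gamma_i)^t$, which gives

$$a_{ij}^T = \int_T (\beta_i \beta_j + \gamma_i \gamma_j)\,dx\,dy = (\beta_i \beta_j + \gamma_i \gamma_j)\,|T|, \quad i, j = 1, 2, 3.$$

Assembly of the boundary matrix $R$: Let $\Gamma_h^{out}$ denote the set of boundary edges of the triangulation, i.e., $\Gamma_h^{out} = \{ E : E = T \cap \partial\Omega \text{ for } T \in \mathcal{T}_h \}$. Assume that $\kappa$ is constant on $E$. For an edge $E \in \Gamma_h^{out}$, we define $R^E \in \mathbb{R}^{2 \times 2}$ by the entries

$$r_{ij}^E = \int_E \kappa\, \phi_j \phi_i\,ds = \frac{\kappa}{6}\,(1 + \delta_{ij})\,|E|, \quad i, j = 1, 2,$$

where $|E|$ is the length of $E$ and $\delta_{ij}$ is 1 for $i = j$ and 0 otherwise.

Assembly of the load vector: We use a corner quadrature rule for approximating the integral. For $T \in \mathcal{T}_h$ we obtain

$$b_i^T = \int_T f \phi_i\,dx\,dy \approx \frac{|T|}{3}\, f(N_i), \quad i = 1, 2, 3.$$

Given $A$, $R$ and $b$, we can solve $(A + R)c = b$ and write $u_h = \sum_{j=1}^{N} c_j \phi_j$.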
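The element-by-element assembly of $A$ and $b$ described above can be sketched as follows. This is a minimal illustration with dense matrices, not a production code: the function name `assemble_p1` is an assumption of this example, the node and triangle matrices follow the $P$, $K$ layout of Section 3.1, and node indices are 0-based here because of Python conventions (the text numbers nodes from 1). The boundary matrix $R$ is omitted.

```python
import numpy as np

def assemble_p1(P, K, f):
    """Assemble the P1 stiffness matrix and the corner-quadrature load vector.

    P : 2 x N array of node coordinates
    K : 3 x M integer array, column m lists the nodes of triangle m (0-based)
    f : callable f(x, y) accepting arrays of coordinates
    """
    n_nodes = P.shape[1]
    A = np.zeros((n_nodes, n_nodes))
    b = np.zeros(n_nodes)
    for tri in K.T:
        xy = P[:, tri].T                              # 3 x 2 vertex coordinates
        B = np.column_stack([np.ones(3), xy])         # rows [1, x_i, y_i]
        area = 0.5 * abs(np.linalg.det(B))
        # column i of solve(B, I) is (alpha_i, beta_i, gamma_i); keep the gradients
        grads = np.linalg.solve(B, np.eye(3))[1:]     # row 0: beta_i, row 1: gamma_i
        A[np.ix_(tri, tri)] += area * grads.T @ grads # (beta_i beta_j + gamma_i gamma_j)|T|
        b[tri] += area / 3.0 * f(xy[:, 0], xy[:, 1])  # |T|/3 * f(N_i)
    return A, b

# Usage: unit square split into two triangles
P = np.array([[0.0, 1.0, 1.0, 0.0],
              [0.0, 0.0, 1.0, 1.0]])
K = np.array([[0, 0],
              [1, 2],
              [2, 3]])
A, b = assemble_p1(P, K, lambda x, y: np.ones_like(x))
```

Two quick consistency checks follow from the formulas: every row of $A$ sums to zero (constants have zero gradient), and for $f = 1$ the entries of $b$ sum to the area of the domain.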

3.7 The Dirichlet problem

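Consider the following Dirichlet problem: find $u$ such that

$$-\Delta u = f, \quad \text{in } \Omega, \qquad u = g, \quad \text{on } \partial\Omega, \tag{34}$$

where $f \in L^2(\Omega)$ and $g \in C^0(\partial\Omega)$. We seek a weak solution $u$ in $V_g = \{ v \in V : v|_{\partial\Omega} = g \}$. Multiplying (34) by a test function $v \in V_0$ and integrating over $\Omega$, we get

$$\int_\Omega f v\,dx\,dy = -\int_\Omega v\, \Delta u\,dx\,dy = \int_\Omega \nabla u \cdot \nabla v\,dx\,dy - \int_{\partial\Omega} v\, \nabla u \cdot n\,ds = \int_\Omega \nabla u \cdot \nabla v\,dx\,dy.$$

So the weak problem reads: find $u \in V_g$ such that

$$\int_\Omega \nabla u \cdot \nabla v\,dx\,dy = \int_\Omega f v\,dx\,dy, \quad \forall v \in V_0.$$

Assume that $g$ is piecewise linear on $\partial\Omega$ with respect to the triangulation. Then the FE method in 2D is defined as follows: find $u_h \in V_{h,g} = \{ v \in V_h : v|_{\partial\Omega} = g \}$ such that

$$\int_\Omega \nabla u_h \cdot \nabla v_h\,dx\,dy = \int_\Omega f v_h\,dx\,dy, \quad \forall v_h \in V_{h,0}.$$

Assume that we have $N$ nodes, $J$ of which are boundary nodes. Then the matrix form of the FE problem reads:

$$\begin{pmatrix} A_{0,0} & A_{0,g} \\ A_{g,0} & A_{g,g} \end{pmatrix} \begin{pmatrix} c_0 \\ c_1 \end{pmatrix} = \begin{pmatrix} b_0 \\ b_1 \end{pmatrix},$$

where $A_{0,0} \in \mathbb{R}^{(N-J) \times (N-J)}$, $A_{g,g} \in \mathbb{R}^{J \times J}$, $A_{0,g} \in \mathbb{R}^{(N-J) \times J}$, $A_{g,0} \in \mathbb{R}^{J \times (N-J)}$. Note that $c_1 \in \mathbb{R}^J$ is known (it contains the values of $g$ at the boundary nodes). We can therefore solve the reduced problem: find $c_0 \in \mathbb{R}^{N-J}$ with $A_{0,0}\,c_0 = b_0 - A_{0,g}\,c_1$.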
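The reduction to the interior unknowns can be sketched as below. The function name `solve_dirichlet` and its arguments are assumptions of this example; the matrix in the usage lines is a one-dimensional P1 stiffness matrix, used here only as a small stand-in for an assembled 2D matrix.

```python
import numpy as np

def solve_dirichlet(A, b, boundary, g_values):
    """Solve A00 c0 = b0 - A0g c1 and return the full coefficient vector.

    boundary : indices of the boundary nodes (0-based)
    g_values : prescribed values of g at those nodes, in the same order
    """
    n = A.shape[0]
    boundary = np.asarray(boundary)
    interior = np.setdiff1d(np.arange(n), boundary)
    c = np.zeros(n)
    c[boundary] = g_values                               # known block c1
    A00 = A[np.ix_(interior, interior)]
    A0g = A[np.ix_(interior, boundary)]
    c[interior] = np.linalg.solve(A00, b[interior] - A0g @ c[boundary])
    return c

# Usage with a stand-in stiffness matrix (1D, 5 nodes, h = 0.25) and f = 0
h = 0.25
A = (2 * np.eye(5) - np.eye(5, k=1) - np.eye(5, k=-1)) / h
A[0, 0] = A[4, 4] = 1 / h
c = solve_dirichlet(A, np.zeros(5), [0, 4], [1.0, 3.0])
```

With $f = 0$ the discrete solution interpolates linearly between the two prescribed boundary values, which is what the usage example returns.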

3.8 The Neumann problem

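Consider the following Neumann problem: find $u$ such that

$$-\Delta u = f, \quad \text{in } \Omega, \qquad \nabla u \cdot n = g, \quad \text{on } \partial\Omega, \tag{35}$$

where $f \in L^2(\Omega)$ and $g \in C^0(\partial\Omega)$. Let us try to seek a solution to this problem in the space $V = \{ v : \|v\|_{L^2(\Omega)} + \|\nabla v\|_{L^2(\Omega)} < \infty \}$. Multiplying (35) by a test function $v \in V$, integrating over $\Omega$, and using Green's formula, we get

$$\int_\Omega f v\,dx\,dy = -\int_\Omega v\, \Delta u\,dx\,dy = \int_\Omega \nabla u \cdot \nabla v\,dx\,dy - \int_{\partial\Omega} v\, \nabla u \cdot n\,ds = \int_\Omega \nabla u \cdot \nabla v\,dx\,dy - \int_{\partial\Omega} v g\,ds.$$

Thus, the variational formulation reads: find $u \in V$ such that

$$\int_\Omega \nabla u \cdot \nabla v\,dx\,dy - \int_{\partial\Omega} v g\,ds = \int_\Omega f v\,dx\,dy, \quad \forall v \in V.$$

In order to guarantee solvability, we note that if $v = 1$ then we have

$$0 = \int_\Omega \nabla u \cdot \nabla 1\,dx\,dy = \int_\Omega f\,dx\,dy + \int_{\partial\Omega} g\,ds.$$

Therefore we need to assume the following compatibility condition

$$\int_\Omega f\,dx\,dy + \int_{\partial\Omega} g\,ds = 0,$$

to ensure that a solution can exist. Note that if $u$ exists, it is only determined up to a constant, since $u + c$ is a solution if $u$ is a solution and $c \in \mathbb{R}$. To fix this constant and obtain a unique solution, a common trick is to impose the additional constraint $\int_\Omega u\,dx\,dy = 0$. We therefore define the weak solution space

$$\hat V = \left\{ v \in V : \int_\Omega v\,dx\,dy = 0 \right\},$$

which contains only functions with zero mean value. This is called a quotient space. This space guarantees a unique weak solution (with the weak formulation as usual, with test functions in $V$). So the weak problem reads: find $u \in \hat V$ such that

$$\int_\Omega \nabla u \cdot \nabla v\,dx\,dy - \int_{\partial\Omega} v g\,ds = \int_\Omega f v\,dx\,dy, \quad \forall v \in V.$$

Now, the FE method takes the form: find $u_h \in \hat V_h \subset \hat V$ such that

$$\int_\Omega \nabla u_h \cdot \nabla v_h\,dx\,dy - \int_{\partial\Omega} v_h g\,ds = \int_\Omega f v_h\,dx\,dy, \quad \forall v_h \in \hat V_h,$$

where $\hat V_h$ is the space of all continuous piecewise linear functions with zero mean.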

3.9 Finite elements for mixed Dirichlet-Neumann conditions

Here we describe briefly how Neumann conditions are handled in two-dimensional finite elements. Suppose $\Omega$ is a domain in either $\mathbb{R}^2$ or $\mathbb{R}^3$ and assume that $\partial\Omega$ has been partitioned into two disjoint sets: $\partial\Omega = \Gamma_1 \cup \Gamma_2$. We consider the following BVP:

$$-\nabla \cdot (\kappa(x) \nabla u) = f(x), \quad x \in \Omega, \qquad u = 0, \quad x \in \Gamma_1, \qquad \nabla u \cdot n = 0, \quad x \in \Gamma_2, \tag{36}$$

where $f \in L^2(\Omega)$. As in the 1-D case, Dirichlet conditions are termed essential boundary conditions because they must be explicitly imposed in the FE method, while Neumann conditions are called natural and need not be mentioned. We therefore define the space of test functions by

$$\hat V = \{ v \in C^2(\bar\Omega) : v(x) = 0,\ x \in \Gamma_1 \}.$$

Multiplying (36) by a test function $v \in \hat V$ and integrating over $\Omega$, we get

$$\int_\Omega f v\,dx\,dy = -\int_\Omega v\, \nabla \cdot (\kappa(x) \nabla u)\,dx\,dy = \int_\Omega \kappa(x) \nabla u \cdot \nabla v\,dx\,dy - \int_{\partial\Omega} \kappa(x)\, v\, \nabla u \cdot n\,ds = \int_\Omega \kappa(x) \nabla u \cdot \nabla v\,dx\,dy - \int_{\Gamma_1} \kappa(x)\, v\, \nabla u \cdot n\,ds - \int_{\Gamma_2} \kappa(x)\, v\, \nabla u \cdot n\,ds = \int_\Omega \kappa(x) \nabla u \cdot \nabla v\,dx\,dy,$$

since $v = 0$ on $\Gamma_1$ and $\nabla u \cdot n = 0$ on $\Gamma_2$. Thus the weak form of (36) is: find $u \in \hat V$ such that

$$\int_\Omega \kappa(x) \nabla u \cdot \nabla v\,dx\,dy = \int_\Omega f v\,dx\,dy, \quad \forall v \in \hat V. \tag{37}$$

We now restrict our discussion once more to two-dimensional polygonal domains. To apply the FE method, we must choose an approximating subspace of $\hat V$. Since the boundary conditions are mixed, there are at least two points where the boundary conditions change from Dirichlet to Neumann. We will make the assumption that the mesh is chosen so that all such points are nodes (and that all such nodes belong to $\Gamma_1$, that is, that $\Gamma_1$ includes its "endpoints"). We can then choose the approximating subspace of $\hat V$ as follows:

$$V_h = \{ v \in C(\bar\Omega) : v \text{ is piecewise linear on } \mathcal{T}_h,\ v(z) = 0 \text{ for all nodes } z \in \Gamma_1 \}.$$

A basis for $V_h$ is formed by including all basis functions corresponding to interior nodes and to boundary nodes that do not belong to $\Gamma_1$. If the BVP includes only Neumann conditions, then the stiffness matrix will be singular, reflecting the fact that the BVP either does not have a solution or has infinitely many solutions. Special care must be taken to compute a meaningful solution to the resulting linear system.

3.10 The method of shifting the data

3.10.1 Inhomogeneous Dirichlet conditions on a rectangle

In a two-dimensional problem, inhomogeneous boundary conditions are handled just as in one dimension. Inhomogeneous Dirichlet conditions are addressed via the method of shifting the data (with a specially chosen piecewise linear function), while inhomogeneous Neumann conditions are taken into account directly when deriving the weak form. Both types of boundary conditions lead to a change in the load vector.

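The method of shifting the data can be used to transform an inhomogeneous Dirichlet problem to a homogeneous Dirichlet problem. This technique works just as it did for a one-dimensional problem, although in two dimensions it is more difficult to find a function satisfying the boundary conditions. We consider the BVP

$$-\Delta u = f(x), \quad x \in \Omega = (0,a) \times (0,b), \qquad u(x) = g(x) = \begin{cases} g_1(x), & x \in \Gamma_1, \\ g_2(x), & x \in \Gamma_2, \\ g_3(x), & x \in \Gamma_3, \\ g_4(x), & x \in \Gamma_4, \end{cases} \tag{38}$$

where $\Gamma_1$, $\Gamma_2$, $\Gamma_3$, and $\Gamma_4$ are, respectively, the bottom, right, top, and left boundary edges of the rectangular domain $\Omega = (0,a) \times (0,b)$. We will assume that the boundary data are continuous, so

$$g_1(0) = g_4(0), \quad g_1(a) = g_2(0), \quad g_2(b) = g_3(a), \quad g_3(0) = g_4(b).$$

Suppose we find a function $w$ defined on $\bar\Omega$ and satisfying $w(x) = g(x)$ for all $x \in \partial\Omega$. We then define $v = u - w$ and note that

$$-\Delta v = -\Delta u + \Delta w = f(x) + \Delta w = \hat f(x),$$

and $v(x) = u(x) - w(x) = 0$ for all $x \in \partial\Omega$. We can then solve

$$-\Delta v = \hat f(x), \quad x \in \Omega, \qquad v(x) = 0, \quad x \in \partial\Omega. \tag{39}$$

Finally, the solution $u$ will be given by $u = v + w$.

We now describe a method for computing a function $w$ that satisfies the given Dirichlet conditions. We first note that there is a polynomial of the form $q(x,y) = c_0 + c_1 x + c_2 y + c_3 xy$ which assumes the desired boundary values at the corners:

$$q(0,0) = g_1(0) = g_4(0), \quad q(a,0) = g_1(a) = g_2(0), \quad q(a,b) = g_2(b) = g_3(a), \quad q(0,b) = g_3(0) = g_4(b).$$

A direct calculation shows that

$$c_0 = g_1(0), \quad c_1 = \frac{g_1(a) - g_1(0)}{a}, \quad c_2 = \frac{g_4(b) - g_4(0)}{b}, \quad c_3 = \frac{g_2(b) + g_1(0) - g_1(a) - g_4(b)}{ab}.$$

We then define

$$h(x) = \begin{cases} h_1(x) = g_1(x) - \left( g_1(0) + \dfrac{g_1(a) - g_1(0)}{a}\, x \right), & x \in \Gamma_1, \\[2mm] h_2(y) = g_2(y) - \left( g_2(0) + \dfrac{g_2(b) - g_2(0)}{b}\, y \right), & x \in \Gamma_2, \\[2mm] h_3(x) = g_3(x) - \left( g_3(0) + \dfrac{g_3(a) - g_3(0)}{a}\, x \right), & x \in \Gamma_3, \\[2mm] h_4(y) = g_4(y) - \left( g_4(0) + \dfrac{g_4(b) - g_4(0)}{b}\, y \right), & x \in \Gamma_4. \end{cases}$$

We have thus replaced each $g_i$ by a function $h_i$ which differs from $g_i$ by a linear function, and which has value zero at the two endpoints:

$$h_1(0) = h_1(a) = h_2(0) = h_2(b) = h_3(0) = h_3(a) = h_4(0) = h_4(b) = 0.$$

Finally, we define

$$w(x,y) = \left( c_0 + c_1 x + c_2 y + c_3 xy \right) + \left( h_1(x) + \frac{h_3(x) - h_1(x)}{b}\, y \right) + \left( h_4(y) + \frac{h_2(y) - h_4(y)}{a}\, x \right).$$

The reader should notice how the second term interpolates between the boundary values on $\Gamma_1$ and $\Gamma_3$, while the third term interpolates between the boundary values on $\Gamma_2$ and $\Gamma_4$. In order for these two terms not to interfere with each other, it is necessary that the boundary data be zero at the corners. It was for this reason that we transformed the $g_i$ into the $h_i$. The first term in the formula for $w$ undoes this transformation. It is straightforward to verify that $w$ satisfies the desired boundary conditions.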
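The construction of $w$ can be checked numerically. The sketch below builds $w$ from four edge functions and is meant only as an illustration; the function name `make_shift_function` and the test data $G(x,y) = e^x \cos y + xy$ used to generate compatible boundary values are assumptions of this example.

```python
import numpy as np

def make_shift_function(g1, g2, g3, g4, a, b):
    """Return w(x, y) matching g1..g4 on the bottom, right, top and left edges."""
    c0 = g1(0.0)
    c1 = (g1(a) - g1(0.0)) / a
    c2 = (g4(b) - g4(0.0)) / b
    c3 = (g2(b) + g1(0.0) - g1(a) - g4(b)) / (a * b)

    def remove_linear(g, length):
        # h(s) = g(s) - (g(0) + (g(length) - g(0)) / length * s); zero at both ends
        g_start, g_end = g(0.0), g(length)
        return lambda s: g(s) - (g_start + (g_end - g_start) / length * s)

    h1, h3 = remove_linear(g1, a), remove_linear(g3, a)
    h2, h4 = remove_linear(g2, b), remove_linear(g4, b)

    def w(x, y):
        corner = c0 + c1 * x + c2 * y + c3 * x * y          # bilinear corner interpolant q
        vertical = h1(x) + (h3(x) - h1(x)) * y / b          # blends bottom and top data
        horizontal = h4(y) + (h2(y) - h4(y)) * x / a        # blends left and right data
        return corner + vertical + horizontal

    return w

# Usage: boundary data taken from a smooth function G, so the corner values agree
a, b = 2.0, 1.0
G = lambda x, y: np.exp(x) * np.cos(y) + x * y
w = make_shift_function(lambda x: G(x, 0.0), lambda y: G(a, y),
                        lambda x: G(x, b), lambda y: G(0.0, y), a, b)
```

Evaluating `w` along each edge reproduces the corresponding boundary function, while in the interior `w` generally differs from `G`; only the boundary values matter for the shift.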

3.10.2 Inhomogeneous Neumann conditions on a rectangle

We can also apply the technique of shifting the data to transform a BVP with inhomogeneous Neumann conditions to a related BVP with homogeneous Neumann conditions. However, the details are somewhat more involved than in the Dirichlet case. Consider the following BVP with Neumann conditions

$$-\Delta u = f(x), \quad x \in \Omega = (0,a) \times (0,b), \qquad n \cdot \nabla u(x) = g(x) = \begin{cases} g_1(x), & x \in \Gamma_1, \\ g_2(x), & x \in \Gamma_2, \\ g_3(x), & x \in \Gamma_3, \\ g_4(x), & x \in \Gamma_4, \end{cases} \tag{40}$$

where $\Gamma_1$, $\Gamma_2$, $\Gamma_3$, and $\Gamma_4$ are, respectively, the bottom, right, top, and left boundary edges of the rectangular domain $\Omega = (0,a) \times (0,b)$. We first note that this is equivalent to

$$-u_y(x) = g_1(x), \ x \in \Gamma_1, \qquad u_x(x) = g_2(y), \ x \in \Gamma_2, \qquad u_y(x) = g_3(x), \ x \in \Gamma_3, \qquad -u_x(x) = g_4(y), \ x \in \Gamma_4.$$

We make the following observation: if there is a twice-continuously differentiable function $u$ satisfying the given Neumann conditions, then, since $u_{xy} = u_{yx}$, we have

$$-u_{xy}(x,0) = g_1'(x), \qquad -u_{yx}(0,y) = g_4'(y),$$

which together imply that $g_1'(0) = g_4'(0)$. By similar reasoning at the other three corners, we have all of the following conditions:

$$g_1'(0) = g_4'(0), \quad -g_4'(b) = g_3'(0), \quad -g_1'(a) = g_2'(0), \quad g_2'(b) = g_3'(a). \tag{41}$$

We will assume that (41) holds.

We now explain how to compute a function that satisfies the desired Neumann conditions. The method is similar to that used to shift the data in a Dirichlet problem: we will "interpolate" between the Neumann conditions in each dimension and arrange things so that the two interpolations do not interfere with each other. We use the fact that

$$\psi(x) = -\alpha x + \frac{\alpha + \beta}{2a}\, x^2 \quad \text{satisfies} \quad \psi'(0) = -\alpha, \quad \psi'(a) = \beta. \tag{42}$$

The first step is to transform the boundary data $g_1(x)$ to a function $h_1(x)$ satisfying $h_1'(0) = h_1'(a) = 0$, and similarly for $g_2$, $g_3$, $g_4$ and $h_2$, $h_3$, $h_4$. Since these derivatives of the boundary data at the corners are (plus or minus) the mixed partial derivatives of the desired function at the corners, it suffices to find a function $q(x,y)$ satisfying the conditions

$$q_{xy}(0,0) = -g_1'(0), \quad q_{xy}(a,0) = -g_1'(a), \quad q_{xy}(0,b) = g_3'(0), \quad q_{xy}(a,b) = g_2'(b).$$

We can satisfy these conditions with a function of the form $q(x,y) = c_0 xy + c_1 x^2 y + c_2 x y^2 + c_3 x^2 y^2$, for which $q_{xy} = c_0 + 2c_1 x + 2c_2 y + 4c_3 xy$. The reader can verify that the necessary coefficients are

$$c_0 = -g_1'(0), \quad c_1 = \frac{g_1'(0) - g_1'(a)}{2a}, \quad c_2 = \frac{g_3'(0) + g_1'(0)}{2b}, \quad c_3 = \frac{g_2'(b) + g_1'(a) - g_3'(0) - g_1'(0)}{4ab}.$$

If $w$ is to satisfy the desired Neumann conditions, then $n \cdot \nabla (w - q) = h_i$ on $\Gamma_i$, $i = 1, \ldots, 4$, where

$$h_1(x) = g_1(x) + c_0 x + c_1 x^2, \qquad h_2(y) = g_2(y) - (c_0 + 2a c_1)\, y - (c_2 + 2a c_3)\, y^2,$$

$$h_3(x) = g_3(x) - (c_0 + 2b c_2)\, x - (c_1 + 2b c_3)\, x^2, \qquad h_4(y) = g_4(y) + c_0 y + c_2 y^2.$$

We can now define $w - q$ by the interpolation described by (42):

$$w(x,y) = q(x,y) - h_1(x)\, y + \frac{h_3(x) + h_1(x)}{2b}\, y^2 - h_4(y)\, x + \frac{h_2(y) + h_4(y)}{2a}\, x^2.$$

Then $w$ satisfies the original Neumann conditions, as the interested reader can verify directly.

3.11 Eigenvalue problem

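Consider the following eigenvalue problem: find $\lambda \in \mathbb{R}$ and $u$ such that

$$-\Delta u = \lambda u, \quad \text{in } \Omega, \qquad \nabla u \cdot n = 0, \quad \text{on } \partial\Omega. \tag{43}$$

In order to derive the weak formulation, we multiply (43) by $v \in V$, integrate over $\Omega$ and use Green's formula to obtain

$$\lambda \int_\Omega u v\,dx\,dy = -\int_\Omega v\, \Delta u\,dx\,dy = \int_\Omega \nabla u \cdot \nabla v\,dx\,dy - \int_{\partial\Omega} v\, \nabla u \cdot n\,ds = \int_\Omega \nabla u \cdot \nabla v\,dx\,dy.$$

We obtain the weak form: find $\lambda \in \mathbb{R}$ and $u \in V$ such that

$$\int_\Omega \nabla u \cdot \nabla v\,dx\,dy = \lambda \int_\Omega u v\,dx\,dy, \quad \forall v \in V. \tag{44}$$

The FE method in 2D is defined as follows: find $\lambda_h \in \mathbb{R}$ and $u_h \in V_h$ such that

$$\int_\Omega \nabla u_h \cdot \nabla v_h\,dx\,dy = \lambda_h \int_\Omega u_h v_h\,dx\,dy, \quad \forall v_h \in V_h, \tag{45}$$

where $V_h = \{ v \in V : v|_T \in P_1(T),\ \forall T \in \mathcal{T}_h \}$.

Implementation: Substituting $u_h = \sum_{j=1}^{N} c_j \phi_j$ into (45) and picking $v_h = \phi_i$, we obtain

$$\sum_{j=1}^{N} c_j \left( \int_\Omega \nabla \phi_j \cdot \nabla \phi_i\,dx\,dy - \lambda_h \int_\Omega \phi_i \phi_j\,dx\,dy \right) = 0, \quad i = 1, 2, \ldots, N.$$

This leads to an algebraic system of the form $Ac = \lambda_h M c$, i.e., a generalized algebraic eigenvalue problem for the stiffness matrix $A$ and the mass matrix $M$.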

3.12 Error analysis

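Consider the following model problem: find $u$ such that

$$-\Delta u = f, \quad \text{in } \Omega, \qquad u = 0, \quad \text{on } \partial\Omega.$$

The weak form: find $u \in V_0$ such that

$$\int_\Omega \nabla u \cdot \nabla v\,dx\,dy = \int_\Omega f v\,dx\,dy, \quad \forall v \in V_0.$$

The FE approximation is defined as follows: find $u_h \in V_{h,0}$ such that

$$\int_\Omega \nabla u_h \cdot \nabla v_h\,dx\,dy = \int_\Omega f v_h\,dx\,dy, \quad \forall v_h \in V_{h,0},$$

where $V_h = \{ v \in V : v|_T \in P_1(T),\ \forall T \in \mathcal{T}_h \}$. Expressing $u_h = \sum_{j=1}^{N} c_j \phi_j$ and picking $v_h = \phi_i$, we obtain

$$\sum_{j=1}^{N} c_j \int_\Omega \nabla \phi_j \cdot \nabla \phi_i\,dx\,dy = \int_\Omega f \phi_i\,dx\,dy, \quad i = 1, 2, \ldots, N.$$

This leads to a system of the form $Ac = b$, where the entries of $A \in \mathbb{R}^{N \times N}$ and $b \in \mathbb{R}^N$ are

$$a_{ij} = \int_\Omega \nabla \phi_j \cdot \nabla \phi_i\,dx\,dy, \quad b_i = \int_\Omega f \phi_i\,dx\,dy, \quad i, j = 1, 2, \ldots, N.$$

Theorem 3.6 The stiffness matrix $A$ is symmetric and positive definite.

Theorem 3.7 (Galerkin orthogonality) Let $u \in V_0$ denote the weak solution and $u_h \in V_{h,0}$ the corresponding FE approximation. Then

$$\int_\Omega \nabla (u - u_h) \cdot \nabla v_h\,dx\,dy = 0, \quad \forall v_h \in V_{h,0}.$$

Now, let $|||v|||^2 = \int_\Omega \nabla v \cdot \nabla v\,dx\,dy = \int_\Omega |\nabla v|^2\,dx\,dy$ be the energy norm on $V_0$.

There are two different kinds of error estimates: a priori estimates, where the error is bounded in terms of the exact solution, and a posteriori error estimates, where the error is bounded in terms of the computed solution.

Theorem 3.8 (A priori error bound) Let $u \in V_0$ denote the weak solution and $u_h \in V_{h,0}$ the corresponding FE approximation. Then

$$|||u - u_h||| \le |||u - v_h|||, \quad \forall v_h \in V_{h,0}.$$

Theorem 3.9 Let $u \in V_0$ denote the weak solution and $u_h \in V_{h,0}$ the corresponding FE approximation. If $u \in C^2(\Omega)$, then there exists $C$ independent of $h_T$ and $u$ such that

$$\|u - u_h\|_{L^2(\Omega)}^2 \le C \sum_{T \in \mathcal{T}_h} h_T^2 \|D^2u\|_{L^2(T)}^2.$$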

3.13 The FE method for elliptic problems with a convection term

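Consider the following convection-diffusion problem: find $u$ such that

$$-\nabla \cdot (a \nabla u) + b \cdot \nabla u + cu = f, \quad \text{in } \Omega, \qquad u = 0, \quad \text{on } \partial\Omega. \tag{46}$$

We seek a weak solution $u$ in $V_0 = \{ v \in V : v|_{\partial\Omega} = 0 \}$. In order to derive the weak formulation, we multiply (46) by $v \in V_0$, integrate over $\Omega$ and use Green's formula to obtain

$$\int_\Omega f v\,dx\,dy = -\int_\Omega v\, \nabla \cdot (a \nabla u)\,dx\,dy + \int_\Omega v\, b \cdot \nabla u\,dx\,dy + \int_\Omega c u v\,dx\,dy = \int_\Omega a \nabla u \cdot \nabla v\,dx\,dy - \int_{\partial\Omega} v\, a \nabla u \cdot n\,ds + \int_\Omega v\, b \cdot \nabla u\,dx\,dy + \int_\Omega c u v\,dx\,dy = \int_\Omega a \nabla u \cdot \nabla v\,dx\,dy + \int_\Omega v\, b \cdot \nabla u\,dx\,dy + \int_\Omega c u v\,dx\,dy.$$

Note that there is no need to apply Green's formula to $\int_\Omega v\, b \cdot \nabla u\,dx\,dy$. We obtain the weak form: find $u \in V_0$ such that

$$\int_\Omega a \nabla u \cdot \nabla v\,dx\,dy + \int_\Omega v\, b \cdot \nabla u\,dx\,dy + \int_\Omega c u v\,dx\,dy = \int_\Omega f v\,dx\,dy, \quad \forall v \in V_0.$$

The FE method in 2D is defined as follows: find $u_h \in V_{h,0} = \{ v \in V_h : v|_{\partial\Omega} = 0 \}$ such that

$$\int_\Omega a \nabla u_h \cdot \nabla v_h\,dx\,dy + \int_\Omega v_h\, b \cdot \nabla u_h\,dx\,dy + \int_\Omega c u_h v_h\,dx\,dy = \int_\Omega f v_h\,dx\,dy, \quad \forall v_h \in V_{h,0}, \tag{47}$$

where $V_h = \{ v \in V : v|_T \in P_1(T),\ \forall T \in \mathcal{T}_h \}$.

Implementation: Substituting $u_h = \sum_{j=1}^{N} c_j \phi_j$ into (47) and picking $v_h = \phi_i$, we obtain

$$\sum_{j=1}^{N} c_j \left( \int_\Omega a \nabla \phi_j \cdot \nabla \phi_i\,dx\,dy + \int_\Omega \phi_i\, b \cdot \nabla \phi_j\,dx\,dy + \int_\Omega c\, \phi_i \phi_j\,dx\,dy \right) = \int_\Omega f \phi_i\,dx\,dy, \quad i = 1, 2, \ldots, N.$$

This gives us the system $(A + B + C)c = d$, where $c = (c_1, \ldots, c_N)^t \in \mathbb{R}^N$ is the unknown vector and the entries of $A, B, C \in \mathbb{R}^{N \times N}$ and $d \in \mathbb{R}^N$ are given by

$$a_{ij} = \int_\Omega a \nabla \phi_j \cdot \nabla \phi_i\,dx\,dy, \quad b_{ij} = \int_\Omega \phi_i\, b \cdot \nabla \phi_j\,dx\,dy, \quad c_{ij} = \int_\Omega c\, \phi_i \phi_j\,dx\,dy, \quad d_i = \int_\Omega f \phi_i\,dx\,dy,$$

for $i, j = 1, 2, \ldots, N$. Note that $B$ is in general not symmetric, i.e., $b_{ij} \ne b_{ji}$.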

4. The FE method for the heat equation

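Consider the following heat/diffusion problem: find $u(x,t)$ such that

$$\dot u - \Delta u = f, \quad \text{in } \Omega \subset \mathbb{R}^2, \ t \in (0,T], \tag{48}$$

$$u(\cdot, t) = 0, \quad \text{on } \partial\Omega \text{ and } t \in (0,T], \tag{49}$$

$$u(x,0) = u_0(x), \quad \text{for } x \in \Omega \text{ and } t = 0. \tag{50}$$

We seek a weak solution $u$ in $V_0 = \{ v : \|v\| + \|\nabla v\| < \infty,\ v|_{\partial\Omega} = 0 \}$. In order to derive the weak formulation, we multiply (48) by $v \in V_0$, integrate over $\Omega$ and use Green's formula to obtain, for $t \in (0,T]$,

$$\int_\Omega f v\,dx = \int_\Omega \dot u v\,dx + \int_\Omega \nabla u \cdot \nabla v\,dx - \int_{\partial\Omega} v\, \nabla u \cdot n\,ds = \int_\Omega \dot u v\,dx + \int_\Omega \nabla u \cdot \nabla v\,dx.$$

The weak form therefore reads: find $u(\cdot, t) \in V_0$ such that for $t > 0$

$$\int_\Omega \dot u v\,dx + \int_\Omega \nabla u \cdot \nabla v\,dx = \int_\Omega f v\,dx, \quad \forall v \in V_0. \tag{51}$$

The semi-discrete FE method in 2D is defined as follows: find $u_h(\cdot, t) \in V_{h,0} = \{ v \in V_h : v|_{\partial\Omega} = 0 \}$ such that

$$\int_\Omega \dot u_h v_h\,dx + \int_\Omega \nabla u_h \cdot \nabla v_h\,dx = \int_\Omega f v_h\,dx, \quad \forall v_h \in V_{h,0}, \tag{52}$$

where $V_h = \{ v \in V : v|_T \in P_1(T),\ \forall T \in \mathcal{T}_h \}$.

Implementation: Substituting $u_h(x,t) = \sum_{j=1}^{N} c_j(t)\, \phi_j(x)$ into (52) and choosing $v_h = \phi_i$, we obtain

$$\sum_{j=1}^{N} \dot c_j \int_\Omega \phi_j \phi_i\,dx + \sum_{j=1}^{N} c_j \int_\Omega \nabla \phi_j \cdot \nabla \phi_i\,dx = \int_\Omega f \phi_i\,dx, \quad i = 1, 2, \ldots, N.$$

This gives us the system of ODEs

$$M \dot c(t) + A(t)\, c(t) = b(t), \quad t \in (0,T], \qquad c(0) = c_0,$$

where $c = (c_1, c_2, \ldots, c_N)^t = (u_h(N_1, t), \ldots, u_h(N_N, t))^t \in \mathbb{R}^N$ (here $N_i$ denotes the node that belongs to the basis function $\phi_i$) is the unknown vector and the entries of $M, A \in \mathbb{R}^{N \times N}$ and $b \in \mathbb{R}^N$ are given by

$$m_{ij} = \int_\Omega \phi_i \phi_j\,dx, \quad a_{ij} = \int_\Omega \nabla \phi_j \cdot \nabla \phi_i\,dx, \quad b_i = \int_\Omega f \phi_i\,dx, \quad i, j = 1, 2, \ldots, N.$$

Finally, the system of ODEs can be solved with, e.g., the backward Euler method as follows: let $0 = t_0 < t_1 < \cdots < t_M = T$ be a discretization, let $k_m = t_m - t_{m-1}$ for $m = 1, 2, \ldots, M$ be the time step size and let $c_m \approx c(t_m)$ for $m = 1, 2, \ldots, M$ denote the corresponding approximations. Then, we can compute $c_m$ using

$$(M + k_m A_m)\, c_m = M c_{m-1} + k_m b_m, \quad m = 1, 2, \ldots, M,$$

where $c_0$ is obtained from $u_0(x)$. We can either use $c_0 = (c_1^0, \ldots, c_N^0)^t = (u_0(N_1), \ldots, u_0(N_N))^t$, or we can let $c_0$ be given by the $L^2$-projection of $u_0$. In the latter case we set $u_h^0 = \sum_{j=1}^{N} c_j^0 \phi_j(x)$ and solve for the $c_j^0$ using

$$\sum_{j=1}^{N} c_j^0 \int_\Omega \phi_j \phi_i\,dx = \int_\Omega u_0 \phi_i\,dx, \quad i = 1, 2, \ldots, N.$$

Theorem 4.1 (Stability) The following continuous and discrete stability estimates hold:

$$\|u(\cdot, t)\| \le \|u(\cdot, 0)\| + \int_0^t \|f(\cdot, s)\|\,ds, \qquad \|u_h^m\| \le \|u_h^{m-1}\| + k_m \|f^m\| \le \|u_h^0\| + \sum_{i=1}^{m} k_i \|f^i\|.$$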
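The backward Euler recursion for the semi-discrete heat equation can be sketched as follows. This is a minimal illustration under stated assumptions: the function name `backward_euler` is hypothetical, the stiffness matrix is taken to be independent of time, and the matrices in the usage lines are one-dimensional P1 mass and stiffness matrices used as a small stand-in for assembled 2D matrices.

```python
import numpy as np

def backward_euler(M, A, load, c0, times):
    """Solve M c' + A c = b(t) with (M + k_m A) c_m = M c_{m-1} + k_m b_m.

    load  : callable returning the load vector b(t)
    times : array t_0 < t_1 < ... (steps may be non-uniform)
    """
    c = np.array(c0, dtype=float)
    history = [c.copy()]
    for t_prev, t in zip(times[:-1], times[1:]):
        k = t - t_prev                                   # step size k_m
        c = np.linalg.solve(M + k * A, M @ c + k * load(t))
        history.append(c.copy())
    return np.array(history)

# Usage: 9 interior nodes on (0, 1), zero load, initial data sin(pi x)
n, h = 9, 0.1
off = np.eye(n, k=1) + np.eye(n, k=-1)
M = h / 6.0 * (4 * np.eye(n) + off)                      # 1D P1 mass matrix
A = (2 * np.eye(n) - off) / h                            # 1D P1 stiffness matrix
x = np.linspace(h, 1 - h, n)
history = backward_euler(M, A, lambda t: np.zeros(n), np.sin(np.pi * x),
                         np.linspace(0.0, 0.1, 11))
norms = [np.sqrt(c @ M @ c) for c in history]            # discrete L2 norms
```

With zero load, the discrete $L^2$ norms stored in `norms` are non-increasing from step to step, in line with the discrete stability estimate of Theorem 4.1.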

5. The FE method for the wave equation

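Many physical phenomena exhibit wave characteristics. For instance, light, which is an electromagnetic wave, has the ability to disperse and create diffraction patterns, which is typical of waves.

Consider the following wave problem: find $u(x,t)$ such that

$$\ddot u - \nabla \cdot (\varepsilon \nabla u) = f, \quad \text{in } \Omega \subset \mathbb{R}^2, \ t \in (0,T], \tag{53}$$

$$n \cdot \nabla u(\cdot, t) = 0, \quad \text{on } \partial\Omega \text{ and } t \in (0,T], \tag{54}$$

$$u(x,0) = u_0(x), \quad \dot u(x,0) = v_0(x), \quad \text{for } x \in \Omega \text{ and } t = 0, \tag{55}$$

where $f$ is a given load, $\varepsilon = \varepsilon(x,t)$ is a positive parameter, $u_0$ and $v_0$ are prescribed initial conditions, and $\Omega$ is a bounded domain with boundary $\partial\Omega$ and unit outward normal $n$.

We seek a weak solution $u$ in $V = H^1(\Omega) = \{ v : \|v\| + \|\nabla v\| < \infty \}$. Multiplying the wave Eq. (53) by $v \in V$, integrating over $\Omega$, and using Green's formula, we obtain, for $t \in (0,T]$,

$$\int_\Omega f v\,dx = \int_\Omega \ddot u v\,dx - \int_\Omega v\, \nabla \cdot (\varepsilon \nabla u)\,dx = \int_\Omega \ddot u v\,dx + \int_\Omega \varepsilon \nabla u \cdot \nabla v\,dx - \int_{\partial\Omega} v\, \varepsilon \nabla u \cdot n\,ds = \int_\Omega \ddot u v\,dx + \int_\Omega \varepsilon \nabla u \cdot \nabla v\,dx.$$

The weak form (variational formulation) therefore reads: find $u(\cdot, t) \in V = H^1(\Omega)$ such that for all $t > 0$

$$\int_\Omega \ddot u v\,dx + \int_\Omega \varepsilon \nabla u \cdot \nabla v\,dx = \int_\Omega f v\,dx, \quad \forall v \in V. \tag{56}$$

Let $V_h = \{ v \in V : v|_T \in P_1(T),\ \forall T \in \mathcal{T}_h \} \subset V$ be the space of all continuous piecewise linear functions on a triangular mesh of $\Omega$. The semi-discrete FE method in 2D is defined as follows: find $u_h(\cdot, t) \in V_h$ such that

$$\int_\Omega \ddot u_h v_h\,dx + \int_\Omega \varepsilon \nabla u_h \cdot \nabla v_h\,dx = \int_\Omega f v_h\,dx, \quad \forall v_h \in V_h. \tag{57}$$

Implementation: Substituting $u_h(x,t) = \sum_{j=1}^{N} c_j(t)\, \phi_j(x)$ into (57) and choosing $v_h = \phi_i$, we obtain

$$\sum_{j=1}^{N} \ddot c_j \int_\Omega \phi_j \phi_i\,dx + \sum_{j=1}^{N} c_j \int_\Omega \varepsilon \nabla \phi_j \cdot \nabla \phi_i\,dx = \int_\Omega f \phi_i\,dx, \quad i = 1, 2, \ldots, N.$$

This gives us the system

$$M \ddot c(t) + A(t)\, c(t) = b(t), \quad t \in (0,T], \tag{58}$$

where $c = (c_1, \ldots, c_N)^t = (u_h(N_1, t), \ldots, u_h(N_N, t))^t \in \mathbb{R}^N$ (here $N_i$ denotes the node that belongs to the basis function $\phi_i$) is the unknown vector and the entries of the mass and stiffness matrices $M, A \in \mathbb{R}^{N \times N}$ and the load vector $b \in \mathbb{R}^N$ are given by

$$m_{ij} = \int_\Omega \phi_i \phi_j\,dx, \quad a_{ij} = \int_\Omega \varepsilon \nabla \phi_j \cdot \nabla \phi_i\,dx, \quad b_i = \int_\Omega f \phi_i\,dx, \quad i, j = 1, 2, \ldots, N.$$

Eq. (58) is a semi-discretization of the wave equation in the sense that it does not contain any unknowns with spatial derivatives.

Time discretization: We first transform the system of ODEs into a first-order system. Letting $d(t) = \dot c(t)$, we get the new coupled system

$$M \dot c(t) - M d(t) = 0, \qquad M \dot d(t) + A(t)\, c(t) = b(t), \quad t \in (0,T].$$

Let $w = (c, d)^t$. Then the system is equivalent to $\hat M \dot w(t) + \hat A(t)\, w(t) = \hat b(t)$, $t \in (0,T]$, where

$$\hat M = \begin{pmatrix} M & 0 \\ 0 & M \end{pmatrix}, \quad \hat A = \begin{pmatrix} 0 & -M \\ A & 0 \end{pmatrix}, \quad \hat b = \begin{pmatrix} 0 \\ b \end{pmatrix}.$$

Finally, the system of ODEs can be solved with, e.g., the backward Euler method as follows: let $0 = t_0 < t_1 < \cdots < t_M = T$ be a discretization, let $k_m = t_m - t_{m-1}$ for $m = 1, 2, \ldots, M$ be the time step size and let $w_m \approx w(t_m)$ for $m = 1, 2, \ldots, M$ denote the corresponding approximations. Then, we can compute $w_m$ using

$$(\hat M + k_m \hat A_m)\, w_m = \hat M w_{m-1} + k_m \hat b_m, \quad m = 1, 2, \ldots, M,$$

where $w_0$ is obtained from $u_0(x)$ and $v_0(x)$.

There are several possible choices of initial data. We can either use $w_0 = (w_1^0, \ldots, w_{2N}^0)^t = (u_0(N_1), \ldots, u_0(N_N), v_0(N_1), \ldots, v_0(N_N))^t$, or we can let $w_0 = (w_1^0, w_2^0)^t$, where $w_1^0$ and $w_2^0$ are the $L^2$-projections of $u_0$ and $v_0$, respectively. In the latter case we set $w_{h,1}^0 = \sum_{j=1}^{N} w_{j,1}^0 \phi_j(x)$ and $w_{h,2}^0 = \sum_{j=1}^{N} w_{j,2}^0 \phi_j(x)$ and solve for $w_{j,1}^0$, $w_{j,2}^0$ using

$$\sum_{j=1}^{N} w_{j,1}^0 \int_\Omega \phi_j \phi_i\,dx = \int_\Omega u_0 \phi_i\,dx, \qquad \sum_{j=1}^{N} w_{j,2}^0 \int_\Omega \phi_j \phi_i\,dx = \int_\Omega v_0 \phi_i\,dx, \quad i = 1, 2, \ldots, N.$$

We can also use the Crank–Nicolson scheme

$$\left( \hat M + \frac{k_m}{2} \hat A_m \right) w_m = \left( \hat M - \frac{k_m}{2} \hat A_{m-1} \right) w_{m-1} + \frac{k_m}{2} \left( \hat b_{m-1} + \hat b_m \right) \equiv g_m.$$

Theorem 5.1 (Conservation of energy) If $f = 0$, then

$$\|\dot u_h(\cdot, t)\|_{L^2(\Omega)}^2 + \|\varepsilon \nabla u_h(\cdot, t)\|_{L^2(\Omega)}^2 = \|\dot u(\cdot, 0)\|_{L^2(\Omega)}^2 + \|\varepsilon \nabla u(\cdot, 0)\|_{L^2(\Omega)}^2.$$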
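The Crank–Nicolson scheme for the first-order system can be sketched as follows. The sketch assumes $f = 0$, a constant time step $k$, and time-independent matrices $M$ and $A$; the function names `crank_nicolson_wave` and `discrete_energy` are chosen for this illustration, and the $2 \times 2$ matrices in the usage lines are the mass and stiffness matrices of a single one-dimensional P1 element, used only as a tiny stand-in.

```python
import numpy as np

def crank_nicolson_wave(M, A, c0, d0, k, n_steps):
    """Advance w = (c, d) with (M_hat + k/2 A_hat) w_m = (M_hat - k/2 A_hat) w_{m-1}."""
    n = M.shape[0]
    Z = np.zeros((n, n))
    M_hat = np.block([[M, Z], [Z, M]])
    A_hat = np.block([[Z, -M], [A, Z]])
    lhs = M_hat + 0.5 * k * A_hat
    rhs = M_hat - 0.5 * k * A_hat
    w = np.concatenate([c0, d0]).astype(float)
    trajectory = [w.copy()]
    for _ in range(n_steps):
        w = np.linalg.solve(lhs, rhs @ w)
        trajectory.append(w.copy())
    return np.array(trajectory)

def discrete_energy(M, A, w):
    """Discrete energy d^t M d + c^t A c of the state w = (c, d)."""
    n = M.shape[0]
    c, d = w[:n], w[n:]
    return d @ M @ d + c @ A @ c

# Usage: one P1 element of unit length, initial displacement (1, 0), zero velocity
M = np.array([[2.0, 1.0], [1.0, 2.0]]) / 6.0
A = np.array([[1.0, -1.0], [-1.0, 1.0]])
traj = crank_nicolson_wave(M, A, np.array([1.0, 0.0]), np.zeros(2), 0.05, 200)
energies = np.array([discrete_energy(M, A, w) for w in traj])
```

In this setting the discrete energy $d^t M d + c^t A c$, the algebraic counterpart of $\|\dot u_h\|^2 + \int_\Omega \varepsilon |\nabla u_h|^2\,dx$, stays constant along the Crank–Nicolson iterates up to rounding, whereas backward Euler would damp it.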

6. Conclusion

In this chapter, we introduced the finite element (FE) method for approximating the solutions to ODEs and PDEs. More specifically, the FE method was presented for first-order initial-value problems for ODEs, second-order boundary-value problems for ODEs, second-order elliptic PDEs, and second-order heat and wave equations. The remaining chapters of this textbook are based on the FE method. The derivation of the FE method for other problems is straightforward. In the remaining chapters, the FE method will be developed to solve complicated problems in engineering, notably in elasticity and structural mechanics modeling involving elliptic partial differential equations and complicated geometries. For more details, we refer the reader to [1, 2, 3, 4, 6, 7, 8, 9] and the references therein.

References

1. Ainsworth M. and Oden J. T. A posteriori Error Estimation in Finite Element Analysis. John Wiley, New York, 2000
2. Brenner S. C. and Scott L. R. The Mathematical Theory of Finite Element Methods, second edition. Springer-Verlag, New York, 2002
3. Ciarlet P. G. The finite element method for elliptic problems. North-Holland Pub. Co., Amsterdam-New York-Oxford, 1978
4. Johnson C. Numerical Solution of Partial Differential Equations by the Finite Element Method. Cambridge University Press, New York, 1987
5. Kaltenbacher M. Numerical simulation of mechatronic sensors and actuators: finite elements for computational multiphysics. Springer, Heidelberg, 2015
6. Larson M. The finite element method: theory, implementation, and applications. Springer, Berlin New York, 2013
7. Oden J. T. and Carey G. F. Finite Elements, Mathematical Aspects. Prentice Hall, Englewood Cliffs, 1983
8. Schwab C. p- and hp-Finite Element Methods. Oxford University Press, New York, 1998
9. Szabo B. and Babuška I. Finite element analysis. Wiley, New York, 1991
10. Hrennikoff A. Solution of problems of elasticity by the framework method. Journal of Applied Mechanics, 8(4):169–175, 1941
11. Courant R. Variational methods for the solution of problems of equilibrium and vibrations. Bulletin of the American Mathematical Society, 49:1–23, 1943
12. Strang G. and Fix G. J. An analysis of the finite element method. 1973

[1] 1. Ainsworth M. and Oden J. T. A posteriori Error Estimation in Finite Element Analysis. John Wiley, New York, 2000

[2] 2. Brenner S. C. and Scott L. R. The Mathematical Theory of Finite Element Methods, second edition. Springer-Verlag, New York, 2002

[3] 3. Ciarlet P. G. The finite element method for elliptic problems. North-Holland Pub. Co., Amsterdam-New York-Oxford, 1978

[4] 4. Johnson C. Numerical Solution of Partial Differential Equations by the Finite Element Method. Cambridge University Press, New York, 1987

[5] 5. Kaltenbacher M. Numerical simulation of mechatronic sensors and actuators: finite elements for computational multiphysics. Springer, Heidelberg, 2015

[6] 6. Larson M. The finite element method: theory, implementation, and applications. Springer, Berlin New York, 2013

[7] 7. Oden J. T. and Carey G. F. Finite Elements, Mathematical Aspects. Prentice Hall, Englewood Cliffs, 1983

[8] 8. Schwab C. p−andhp−Finite Element Methods. Oxford University Press, New York, 1998

[9] 9. Szabo B. and Babu I. s⌣ ka. Finite element analysis. Wiley, New York, 1991

[10] 10. Hrennikoff A. Solution of problems of elasticity by the framework method. Journal of Applied Mechanics, 8(4):169–175, 1941

[11] 11. Courant R. Variational methods for the solution of problems of equilibrium and vibrations. bulletin of the american mathematical society. 49: 1–23. doi: 10.1090. Technical report, 1943

[12] 12. Strang G. and Fix G. J. An analysis of the finite element method. 1973
