Efficient and robust Schur complement approximations in the augmented Lagrangian
preconditioner for the incompressible laminar flows
He, Xin; Vuik, Cornelis
DOI
10.1016/j.jcp.2020.109286
Publication date
2020
Document Version
Final published version
Published in
Journal of Computational Physics
Citation (APA)
He, X., & Vuik, C. (2020). Efficient and robust Schur complement approximations in the augmented
Lagrangian preconditioner for the incompressible laminar flows. Journal of Computational Physics, 408,
[109286]. https://doi.org/10.1016/j.jcp.2020.109286
Important note
To cite this publication, please use the final published version (if applicable).
Please check the document version above.
Copyright
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons. Takedown policy
Please contact us and provide details if you believe this document breaches copyrights. We will remove access to the work immediately and investigate your claim.
This work is downloaded from Delft University of Technology.
Green Open Access added to TU Delft Institutional Repository
‘You share, we take care!’ – Taverne project
https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher
is the copyright holder of this work and the author uses the
Dutch legislation to make this work public.
Contents lists available atScienceDirect
Journal
of
Computational
Physics
www.elsevier.com/locate/jcp
Efficient
and
robust
Schur
complement
approximations
in
the
augmented
Lagrangian
preconditioner
for
the
incompressible
laminar
flows
Xin He
a,
Cornelis Vuik
b,
∗
aStateKeyLaboratoryofComputerArchitecture,InstituteofComputingTechnology,ChineseAcademyofSciences,No.6KexueyuanSouth
RoadZhongguancunHaidianDistrictBeijing,100190,P.R.China
bDelftInstituteofAppliedMathematics,DelftUniversityofTechnology,VanMourikBroekmanweg6,2628XE,Delft,theNetherlands
a
r
t
i
c
l
e
i
n
f
o
a
b
s
t
r
a
c
t
Articlehistory:
Received12November2018
Receivedinrevisedform17January2020 Accepted25January2020
Availableonline7February2020 Keywords:
Navier-Stokesequations Stabilizedfiniteelementmethod Blockstructuredpreconditioners Schurcomplementapproximations AugmentedLagrangianpreconditioner
Thispaper introducesthree new Schur complementapproximations for theaugmented Lagrangian preconditioner. The incompressible Navier-Stokes equations discretized by a stabilized finite element method are utilized to evaluate thesenew approximations of the Schur complement. A widerangeof numerical experiments inthe laminar context determines the most efficient Schur complement approximation and investigates the effectof the Reynoldsnumber, meshanisotropy and refinementonthe optimal choice. Furthermore, the advantage over the traditional Schur complement approximation is exhibited.
©2020ElsevierInc.Allrightsreserved.
1. Introduction
Inthispaperweconsiderthenumericalsolutionofthesteady,laminarandincompressibleNavier-Stokes equationsas follows
−
ν
u
+ (
u· ∇)
u+ ∇
p=
f on,
∇ ·
u=
0 on.
(1)Here u is thevelocity, p isthe pressure,the positive coefficient
ν
isthe kinematicviscosityand f isa givenforce field.isa2Dor3Dboundedandconnecteddomainwiththeboundary
∂
.Ontheboundariesofthecomputationaldomain, eithertheDirichletboundarycondition u=
g or Neumannboundaryconditionν
∂u∂n
−
np=
0 isimposed,wheren denotestheoutward-pointingunitnormaltotheboundary.
AfterthePicardlinearizationandFEMdiscretization[1], theincompressibleNavier-Stokesequationsconverttothe fol-lowinglinearsysteminsaddle-pointform
A BT B−
Cu p
=
f g withA :=
A BT B−
C,
(2)*
Correspondingauthor.E-mailaddresses:hexin2016@ict.ac.cn(X. He),c.vuik@tudelft.nl(C. Vuik). URLs:http://english.ict.cas.cn/(X. He),http://www.ewi.tudelft.nl/(C. Vuik).
https://doi.org/10.1016/j.jcp.2020.109286
wherethematricesB andBT correspondtothedivergenceandgradientoperators,respectively.Picardlinearizationleadsto thematrix A inblock diagonalstructure,andeachdiagonalblockcorrespondstotheconvection-diffusionoperator.Dueto thepresenceoftheconvectiveterm, A isnotsymmetric.ForthefiniteelementdiscretizationsatisfyingtheLBB(‘inf-sup’) stability condition [1], no pressure stabilization is required and C
=
0 is taken. When LBB unstable finite elements are applied,thenonzeromatrixC correspondstoastabilizationoperator.Blockstructuredpreconditioners[1–3] areoftenutilized toacceleratetheconvergenceoftheKrylovsubspacesolversfor saddlepointsystemsas(2).Theyarebasedontheblock
LDU
decompositionofthecoefficientmatrixgivenbyA = LDU =
A BT B−
C=
I1 O B A−1 I2A O O S
I1 A−1BT O I2
,
(3)where S
= −(
C+
B A−1BT)
is theso-calledSchur complement. A combinationofthisblock factorizationwitha suitable approximation oftheSchur complementisutilized tosuccessfullydesignthe blockstructured preconditioners,whichare givenasfollowsP
F=
A O B SI1 A−1BT O I2
,
(4)P
L=
A O B S,
P
U=
A BT O S.
(5)Multiplying the
LD
andDU
factors of(3) results inthe block lower- andupper-triangularpreconditionersP
L andP
U,respectively. Preconditioner
P
F isbasedonthe multiplicationoftheLDU
factors.ThetermA−1 denotessomeapproxi-mation oftheinverse actionof A,which isgiveneitherinan explicitformorimplicitlydefinedviaan iterativesolution methodwithaproperstoppingtolerance.
It is notpractical to explicitlyformthe exactSchur complementdueto theaction of A−1, typicallywhen thesize is large.Thisimpliesthatthemostchallenging taskistofindthespectrallyequivalentandnumericallycheapapproximation oftheSchur complement,whichisdenotedby
S in(4) and(5).Thereexistseveralstate-of-the-artapproximationsofthe Schurcomplement,e.g.theleast-squarecommutator(LSC)[4,5],pressureconvection-diffusion(PCD)operator[6,7] andthe approximations fromthe SIMPLE(R)[8–10] andaugmented Lagrangian (AL) preconditioner[11,12] etc. Among them, the ALpreconditionerexhibitsattractivefeatureswithstablefiniteelementmethods(FEM)usedforthediscretization,e.g.the purelyalgebraicandsimpleconstructionoftheSchurcomplementapproximationandrobustnesswithrespecttothemesh refinementandReynoldsnumber,atleastforacademicbenchmarks.Motivatedbytheseadvantages,thefurtherextensionto thecontextoffinitevolumemethod(FVM)[13] andthemodifiedvariant[14] withreducedcomputationalcomplexitiesare promoted.Recently,theauthorsofthispaperproposeanewvariantoftheALpreconditioner[15] fortheReynolds-Averaged Navier-Stokes(RANS)equationsdiscretizedbyastabilizedFVM,whicharewidelyusedtomodelturbulentflowsinindustrial computationalfluiddynamic(CFD)applications.The roleoftheALtermforpreconditioningisverysimple:by varyingparameter
γ
itputs moreweightoneitherthe (1,1)orthe(2,2)blockoftheALpreconditioner.Ifonecannotaffordlargervaluesofγ
,thenfindingasuitable(more compli-cated)preconditionerforSγ becomesimportantagain,whereSγ denotestheSchurcomplementfortheaugmentedsystem. Morediscussions on Sγ andtheinvolvedparameterγ
aregiveninSection 3ofthispaper.Knownrepresentationsfor Sγ[11,14] suggestwaystoutilizeearlierdevelopedpreconditionersforthenon-augmentedproblem.Thepaperisbuiltonthis simpleobservationandtheoriginalideaasgivenin[11].Thisobservationisalreadyexploited,forexample,in[14,16].The challengesencounteredintheturbulentcalculations[17–19] areinevitablefactorswhichcouldcausethebreakdownofthe AL preconditioner,includingthehighReynoldsnumber,high-aspectratiocellsnearthevery thinboundary layerandthe significant variation inthevalue ofviscositydueto thepresenceof theeddy-viscosity.Toovercome thesechallenges, an alternative methodtoapproximatethe SchurcomplementfortheALpreconditionerisintroduced in[15], whichleads to a new variantof theAL preconditioner. Thisnew methodapproximates the Schur complementthrough its inverseform andfacilitatestheutilizationoftheexistingSchurcomplementapproximations.Amongtheavailablecandidates,theSchur complementapproximationfromtheSIMPLEpreconditioner[8,20] ischosenandsubstitutedintotheinverseSchur comple-mentapproximationfortheALpreconditioner.ThischoiceismotivatedfromthenotionthatitreducestoascaledLaplacian matrix[8,20] withtheconsideredFVMandits promisingefficiencyontheturbulentapplicationsofthemaritimeindustry [8,21].Consequently,theso-arisingnewvariantoftheALpreconditionerreducesthenumberofKrylovsubspaceiterations byafactorupto36 comparedtotheoriginalone[15].
Since the new methodto approximate the Schur complementfor theAL preconditioner usethe existing Schur com-plement approximations, the following questions straightforwardly raise. Does the utilization of other existing Schur complement approximationsdeliver a better performance than that fromthe SIMPLEpreconditioner? Ifso, whichSchur complementapproximationisthemostefficientone?Doestheoptimalchoicedependonthetestproblemandparameters arising from the physics and discretization,e.g. the Reynolds number andgrid size? To answer these questions,in this paper we utilizethe existing Schur complementapproximations not onlyfromthe SIMPLE preconditionerbutalso from the LSCand PCD operators to construct the new Schur complement approximation in the AL preconditioner. Moreover, extensivecomparisonsbetweentheconsidered Schurcomplementapproximationsare carriedoutonawide rangeof nu-mericalexperimentstoevaluatetheeffectoftheReynoldsnumber,meshanisotropyandrefinementontheoptimalchoice.
Thesenumericalevaluationsareconsideredinthecontextoflaminarflows,whichismotivatedbytheexpectationthatthe obtainedresultscanprovideafundamentalguidelineforthemorecomplicatedturbulentflowcalculations.
The structure of this paper is given as follows. The utilized stabilization method and a brief survey of the existing approximationsoftheSchurcomplementareintroducedinSection2.Section3illustratesthemethodusingtheseexisting SchurcomplementapproximationstoconstructthenewapproximationoftheSchurcomplementintheALpreconditioner. Section4includesnumericalresultsonvaryinglaminarbenchmarks.ConclusionsandfutureworkareoutlinedinSection5.
2. StabilizationmethodandsurveyofSchurcomplementapproximations 2.1. Stabilization
Inthispaperwe usethemixedFEMwhichdoesnotuniformlysatisfya discreteinf-supcondition [1] todiscretizethe Navier-Stokesequationsgoverninglaminarflows,whichischosenbythefollowingconsiderations.Firstly,theexistingSchur complement approximationsare originally designedwith finite element methods used for discretization. Therefore,it is expectedtoapplythenewSchurcomplementapproximationfortheALpreconditionerintheFEMcontext.Inaddition,this closesagapintheapplicationofthenewSchurcomplementapproximation.Secondly,boththestabilizedFEM[1] andFVM [17] lead tosaddlepoint systemwithanonzero
(
2,
2)
block whicharisesfromthe pressurestabilization. Thanks to this similarity,aminoradaptionisrequiredtoextendthenewvariantoftheALpreconditionerfromthestabilizedFVMtothe stabilizedFEM.Finally,theutilizationofstabilizedFVMdegradesthegeneralitytosomeextentsincetheSchurcomplement approximationintheSIMPLEpreconditionerreducestoaspecialformation[8,20].However,thisspecialformationcannot beobtainedwithotherstabilizationanddiscretizationmethods.UsingstabilizedFEM,allSchurcomplementapproximations consideredinthispaperareexpressedintheirdefinedmanners,includingthatfromtheSIMPLEpreconditioner.Inthisway, aconvincingevaluationofthenovelSchurcomplementapproximationfortheALpreconditionercanbeexpected.Basedontheabovemotivations,inthispaperweusethe Q1- Q1 mixedfiniteelementapproximationwheretheequal
first-orderdiscretevelocitiesandpressurearespecifiedonacommonsetofnodes.Amongtheavailablestabilization meth-ods[22–27] specifiedforthe Q1- Q1discretization,wechoosetheapproachintroducedin[25].Themainmotivationisthat
therearefewstabilizationparametersrequiredinthefollowingoperator
C(proj)
(
ph,
qh)
=
1
ν
(
ph−
0ph,
qh−
0qh),
(6)where
0 is the L2 projectionfrom thepressure approximationspaceinto thespace P0 ofthepiecewise constant basis
function.Thisprojectionisdefinedlocally:
0phisaconstantfunctionineachelement
2
k∈
Th.Itisdeterminedsimplybythefollowinglocalaveraging
0ph
|
2k=
1|2
k|
2k ph,
for all2
k∈
Th,
(7)where
|2
k|
is thearea ofelement k. Due tothe locality asillustrated by equation (7), thestabilizationmatrix C can beassembledfromthecontributionmatricesonmacroelementsinthesamewayasassemblingastandardfiniteelementmass matrix.Takingthe2Drectangulargridasanexample,the4
×
4 macroelementcontributionmatrixC(macro)isgivenbyC(macro)
=
1ν
(
M(macro)
−
qqT|2
k
|),
(8)where M(macro) isthe4
×
4 macroelementmassmatrixforthe bilineardiscretizationandq= [
1/
4,
1/
4,
1/
4,
1/
4]
T isthe localaveragingoperator.The nullspaceofthemacroelementmatrixC(macro) andassembled stabilizationmatrixC consistofconstantvector,see[1,25] formoredetails.
Contrary to other pressure stabilizationmethods [27,28] which utilize the viscosity and velocity fields to derive the scalingparameterinfrontofthestabilizationmatrix,thealternativeemployedinthispaperonlyinvolvestheviscosity co-efficient.Resultsinnumericalexperimentsectiondemonstratethattheutilizedstabilizationmethodresultsinareasonable andsmoothcalculationofthevelocityandpressureunknownsrangingfromthemoderatetolargeReynoldsnumbers.The assessment ofother pressure stabilizationmethods andtheir effects on theproposed preconditioning techniques bythis paperisincludedinfutureresearchplan.
2.2. SurveyofSchurcomplementapproximations
As follows we briefly introduce severalstate-of-the-art Schur complementapproximations which are utilized to con-structthenewapproximation ofthe SchurcomplementfortheALpreconditioner.Werefer formore detailsoftheSchur complementapproximationtothesurveys[2,3,29,30] andthebooks[1,31].
In the following illustration, we use the notation p to indicate the operators definedon the pressure space andthe notationu fortheoperatorsdefinedonthevelocityspace.
(1) Thepressureconvection-diffusionoperator
SP C D.Thisapproximation,denotedby
SP C D,isproposedbyKayetal. [6] anddefinedas SP C D= −
LpA−p1Mp,
(9)where Mp isthe pressure massmatrix, and Ap and Lp are the discrete pressureconvection-diffusion andLaplacian
operators, respectively.AlthoughthePCDSchur complementapproximation(9) isoriginally proposedforstablefinite elementmethods,itisstraightforwardlyapplicableforthediscretizationsneedingastabilizationterm,e.g.the Q1- Q1
pair.Formoredetailsaboutthisextensionwereferto[1].Ontheotherhand,thisapproximationrequiresusersto pro-vide thediscreteoperators Ap andLp andpresetsomeartificialpressureboundaryconditionsonthem.Theboundary
conditionscouldstronglyeffecttheperformancesoappropriateonesshouldbecarefullyselectedbasedontheproblem characteristic[32,33].ApplyingthePCDSchurcomplementapproximationinvolvestheactionofaPoissonsolve,amass matrixsolveandamatrix-vectorproductwiththematrixAp.
(2) Theleast-squarecommutator
SL SC.Elmanetal. [4] originallypropose thismethodforstablefiniteelementdiscretizations andthen extenditto alterna-tives[5] thatrequirestabilization. Forsystem(2) witha nonzerostabilizationoperatorC , theLSCSchurcomplement approximation
SL SC isdefinedas SL SC= −(
BMu−1BT+
C1)(
BM−u1AMu−1BT+
C2)
−1(
BM−u1BT+
C1),
(10)where M
u denotesthe diagonalapproximation ofthe velocity massmatrix Mu, i.e.Mu=
diag(
Mu)
.Giventhestabi-lizationmatrixC assembledfromthemacroelementcontributionmatrixC(macro) (8),thecontributionmatricesC(macro)
1
andC2(macro) fortheassociatedstabilizationmatricesC1 andC2areintroducedby
C(1macro)
=
ν
|2
k|
·
C(macro),
C(2macro)=
ν
2|2
k|
2·
C(macro),
(11)where
ν
denotes theviscosity parameter. Forthe derivationof C1(macro) and C2(macro) we refer to [5]. The implemen-tationoftheLSCSchur complementapproximationdoesnotrequireanyartificialboundarycondition andconsistsof one matrix-vector product withthe middle term in (10) and two solves with the other term. When the LSC Schur complementapproximationisappliedtostablefiniteelementdiscretizations,thematricesC1andC2aresettozeroin(10).
(3) Theapproximation
SS I M P L EfromtheSIMPLEpreconditioner.SIMPLE(Semi-Implicit PressureLinkedEquation)isused byPatanker [18] asan iterativemethodtosolve the Navier-Stokes problem.Theschemebelongstotheclassofbasiciterativemethodsandexhibitsslowconvergence.Vuiketal. [9,10] useSIMPLEasapreconditionerinaKrylovsubspacemethod,achievinginthisway,amuchfasterconvergence. RegardingtheSchurcomplementS
= −(
C+
B A−1BT)
ofsystem(2),theSIMPLEpreconditionerapproximates A byitsdiagonal,i.e.diag
(
A)
,andobtainstheapproximationSS I M P L E as SS I M P L E= −(
C+
Bdiag(
A)
−1BT).
(12)Substituting
SS I M P L E andA−1=
diag(
A)
−1 into (4) leads tothe so-calledSIMPLEpreconditioner. Forstablefiniteel-ement discretizations, C
=
0 is set in system(2) and correspondingly in the Schur complementapproximation (12). The easyimplementationandpromisingperformanceonthe complicatedmaritimeproblems[8,21] maketheSIMPLE preconditioneranditsvariantsattractiveinrealworldapplications.Themaingoalofthispaperistoutilize theabovementionedSchurcomplementapproximationstoconstructanew ap-proximationoftheSchurcomplementintheALpreconditioner,withmoredetailspresentedinthenextsection.Theoretical analysis andnumerical evaluationoftheabove Schur complementapproximationsfallout ofthe scopeofthiswork and we referto[1,3,34] formoreresults.Herewesummarizethekeydifferences.
SP C D requirestheconstructionofadditionalmatrices onthe pressurespacewhile
SL SC andSS I M P L E rely onmatriceswhich could be easily generatedorare readilyavailable. Asseenfrom
SL SC,thestabilizationterms C1(macro) andC(macro)
2 areeasily obtainedbysubstitutingtheavailable
termC(macro)into(11).Ontheotherhand,
SP C D easilyextendstothestabilizedelementsandaminoradaptionisrequired
by
SS I M P L E forthisextension.However,SL SC doesnotimmediatelyapplyandneedsappropriatestabilizationtermsC1and C2.Wefurthernote that boundaryconditionsforthepressure unknowns,which havefewphysicalmeanings, havetobeconsidered forLp and Ap in
SP C D.What boundaryconditionsworkbestwithaspecifictype ofproblemisusually basedonexperimentalknowledge[32,33].
3. AugmentedLagrangianpreconditioner
The focusofthissectionis thenewmethodtoapproximatetheSchur complementintheaugmentedLagrangian (AL) preconditioner.Inthefollowing,wefirstbrieflyrecalltheALpreconditionerandthenintroducethenewmethodfollowed byacomparisonwiththeoldone.
ThemotivationofapplyingtheALpreconditioneristocircumventthechallengeonfindingtheefficientapproximation oftheSchurcomplementS for theoriginal system(2), cf., [11,14].Toapply theALpreconditioner,theoriginalsystem(2) istransformedintoanequivalentonewiththesamesolution[13,14],whichisoftheform
Aγ BγT B−
Cu p
=
fγ g withA
γ:=
Aγ BTγ B−
C,
(13)where Aγ
=
A+
γ
BTW−1B,BTγ=
BT−
γ
BTW−1C andfγ=
f+
γ
BTW−1g.Thistransformationisobtainedbymultiplyingγ
BTW−1 onboth sidesof thesecond row ofsystem(2) andaddingthe resulting equation tothe first one.Clearly, the transformedsystem(13) hasthesamesolutionassystem(2) foranyvalueofγ
andanynon-singularmatrixW .TheSchur complementofthetransformedsystem(13) isSγ= −(
C+
B A−1γ BγT
)
.TheALpreconditionerisappliedfortheequivalentsystem(13),whichistobesolved.Usingtheblock
D
U decompositionof
A
γ ,theidealALpreconditionerP
I AL anditsvariant,i.e.themodifiedALpreconditionerP
M AL,aregivenbyP
I AL=
Aγ BTγ O Sγ andP
M AL=
Aγ BγT O Sγ,
(14)where
Sγ andAγ denotetheapproximationsofSγ and Aγ ,respectively.First we consider the approximation
Aγ . Given the original pivot matrix A=
A1 O O A1
and the divergence matrix
B
=
B1 B2inthe2Dcase,thetransformedpivotmatrixAγ
=
A+
γ
BTW−1B canbewrittenas Aγ=
A1+
γ
BT1W−1B1γ
B1TW−1B2γ
BT2W−1B1 A1+
γ
B2TW−1B2.
Contraryto
P
I AL,P
M AL approximates Aγ byitsblockupper-triangularpart,i.e.Aγ withazero(2,1)block,suchthatthedifficulty ofsolvingthe systemswith Aγ is avoided[14].When applying
P
M AL oneneeds tosolve thesub-systems withthediagonal blocksof Aγ ,i.e. A1
+
γ
B1TW−1B1 and A1+
γ
BT2W−1B2, whichdonot containthe couplingbetweentwocomponentsofthevelocitysothat standardalgebraicmultigridmethodscanbeapplied[34].Thisadvantagemotivatesus tochoose
P
M AL inthispaperdespitetheobservationthattheperformanceofP
M ALisdependentofthevaluesofγ
,whichisseeninthenumericalexperimentsofthispaperandotherrelatedreferences[14].Theaboveadvantagealsomotivesto approximateAγ byitsblocklower-triangularpartwithazero(1,2)block.Numericalexperimentsdemonstratethatdifferent approximationsofAγ slightlyeffecttheperformanceofthemodifiedALpreconditionerfortheconsideredbenchmarks.For thisreason,inthispaperweonlyillustratetheresultsby applyingtheblock upper-triangularapproximationof Aγ inthe modifiedALpreconditioner.RegardingtheidealALpreconditioner
P
I AL,standardmultigridmethodsareineffectivetosolvethesystemswith Aγ .Aspecializedmultigridalgorithmfor Aγ isbuiltin[11] andtheextension tothethreedimensional applicationsis recentlyproposed in[35]. Alternatively,previous work [12] suggests tosolve thesystems with Aγ bythe Krylovsubspacemethods,whichareacceleratedbytheapproximateinversepreconditionerbasedontheShermon-Morrison formula.Intherelatedwork[34],thecomparisonbetweenthemodifiedandidealALpreconditionersisrealizedbyapplying the direct solution method for the involved sub-systems. Although fewer Krylov iterations are needed by the ideal AL preconditioner,removing thedifficultytosolve thesub-systemswith Aγ makesthemodifiedALpreconditionerattractive inpractice.
3.1. NewSchurapproximationintheALpreconditioner
FindinganeffectiveapproximationoftheSchurcomplementSγ isthekeyfortheidealandmodifiedALpreconditioners. Thispaper ismeant tousethe available Schurapproximationsfor theoriginal system(2), asintroduced inSection 2,to constructanewapproximationofSγ .ThenewSchurcomplementapproximationisrealizedbyusingthefollowinglemma.
Lemma3.1.Assumingthatalltherelevantmatricesareinvertible,thentheinverseofSγ isgivenby
S−γ1
=
S−1(
I−
γ
C W−1)
−
γ
W−1,
(15)whereS
= −(
C+
B A−1BT)
denotestheSchurcomplementoftheoriginalsystem(2).Proof. Fortheproofwereferto[13,14].
2
Lemma3.1 is originally revealed by [14] and used to derive the old approximation of Sγ , which is discussed inthe next section. Here, Lemma 3.1is viewed from another side. Since Lemma 3.1 buildsthe connection between the Schur complements Sγ and S, the natural andsimple method to approximate Sγ is substituting the approximation of S into
=
S−1(
I−
γ
C W−1)
−
γ
W−1,
(16)where
S denotestheapproximationofS.ThenovelapproachprovidesaframeworktousetheknownSchurcomplementapproximation
S fortheoriginalsystem (2) toconstructSγ new inthe ALpreconditioner,whichisapplied tothetransformedsystem(13). SubstitutingtheSchurcomplementapproximationsdemonstratedinSection2,i.e.
SP C D,SL SC andSS I M P L E intoexpression(16),threevariantsof Sγ newarederivedas•
S−γ1P C D=
S−P C D1(
I−
γ
C W−1)
−
γ
W−1,•
S−γ1L SC=
S−L SC1(
I−
γ
C W−1)
−
γ
W−1,•
S−γ1S I M P L E=
S−S I M P L E1(
I−
γ
C W−1)
−
γ
W−1.Followingotherrelatedreferences[11,14],inthispaperwechoosethematrixparameterW tothediagonal approxima-tion ofthepressuremassmatrix,i.e.W
=
Mp=
diag(
Mp)
.It istrivialto obtaintheactionof W−1 inthetransformation(13) andthenewSchurcomplementapproximation(16).ApplyingthenewSchurcomplementapproximation
Sγ newcon-verts tosolveasystemwithitandthechoiceof W
=
Mp focusesthecomplexitymainlyonthesolveofS.Thisimpliesalimitedincrease ofthecomplexity whenimplementingthenewSchurcomplementapproximation
Sγ new comparedtoS.Inaddition,theconsiderableeffortstooptimizetheapproximation
S canstraightforwardlyreducethecomputationaltime ofSγ new.WhenapplyingstabilizedFVM,theinverseofSγ isexpressedinasimilarmanner[15] asLemma3.1andthissimilarity facilitates theextension ofthenewSchur complementapproximationfromthestabilized FVMtothe stabilizedFEM. Re-gardingthenewSchurcomplementapproximation,therearetwomaindifferencesbetween[15] andthiswork.Firstly,only
Sγ S I M P L E isconsideredin[15] and inthispaperweintroduce threevariants,i.e.Sγ P C D,Sγ L SC andSγ S I M P L E.Inthisway,thecomparisonbetweenthemisexpectedtoanswerthequestionsraisedintheintroductionsectionandfindoutthe optimalchoice.Secondly,in[15] finitevolumediscretizationstabilizedbythepressure-weightedinterpolationmethod[36] is applied,which leads to
SS I M P L E in areducedform. The generality isdegraded since thisspecialform ofSS I M P L E cannotbeobtainedbyusingotherstabilizationanddiscretizationmethodsingeneral.Inthispaper,theapproximations
SP C D, SL SC andSS I M P L E are expressedintheir definedmannerssothata convincingassessmentofthenewSchurcomplementapproximationcanbeexpected.
Based onthe above approach,it is easy to see that there is no extra requirement on thevalue ofthe parameter
γ
. This advantage of the new Schur complement approximation can be more clearly seen in the next section, where the contradictoryrequirementsonthevaluesofγ
intheoldapproacharepresented.3.2. OriginalSchurapproximationintheALpreconditioner
The starting point to constructthe original approximation ofthe Schur complement inthe AL preconditioner is also Lemma3.1.However,thestrategyistotallydifferent.ChoosingW1
=
γ
C+
Mp andsubstitutingW1intoexpression(15) wehave
S−γ1
=
S−1(
I− (γ
C+
Mp−
Mp)(
γ
C+
Mp)
−1)
−
γ
(
γ
C+
Mp)
−1=
S−1Mp(
γ
C+
Mp)
−1−
γ
(
γ
C+
Mp)
−1= (γ
−1S−1Mp
−
I)(
C+
γ
−1Mp)
−1.
Forlargevaluesof
γ
suchthatγ
−1S−1Mp
1,thetermγ
−1S−1MpcanbeneglectedsothatwehaveSγ origasfollows Sγorig= −(
C+
γ
−1Mp).
(17)As shownabove,thechoice ofW1
=
γ
C+
Mp isusedtoderivetheoriginal SchurcomplementapproximationSγ orig.However, thechoiceof W1
=
γ
C+
Mp isnotpracticalsincetheactionofW1−1 isneededinthetransformedsystem(13).Onepracticalchoiceistoomittheterm
γ
C inW1andreplaceMp byitsdiagonalapproximation,whichleadstoW=
Mp.This modificationis onlyapplied tosimplifythe matrixparameter W and theoriginal Schur complementapproximation
Sγ origremains thesameasgivenin(17).Insummary,thechoiceof W=
Mp andSγ origisusedinthispaperandotherrelatedwork,forinstance[13,14] wherestabilizeddiscretizationsareemployed.
Thecontradictoryrequirementsintheaboveapproximationareshownasfollows.Theapproximation
Sγ origisobtainedifandonlyifW1
=
γ
C+
Mpandlargevaluesofγ
arechosen.However, W=
MpisspectrallyequivalenttoW1=
γ
C+
Mponlywhen
γ
issmall.Thismeansthatitiscontradictorytotunethevalueofγ
sothatW=
Mp andSγ orig couldbesi-multaneously obtained. By contrast,thiscontradictory requirementsare avoided by applyingthe newSchur complement approximation asgiven inSection 3.1.This disadvantage ofthe original Schurcomplement approximation reflectsin the
Table 1
Summaryofthelinearsystemstobesolved,appliedpreconditionersandapproximationsof theSchurcomplementutilizedtherein.
Linear system Preconditioner Schur complement approximations Transformed system withAγ PM AL Sγ P C D,SγL SC,Sγ S I M P L E,Sγorig Original system withA PU SP C D,SL SC,SS I M P L E
Table 2
Pressuresub-system‘mass-p’withSγ inPM ALandS inPU,andthesystemsinvolvedtherein.
‘mass-p’withSγ new ‘mass-p’withS Systems involvedinS
Sγ P C D SP C D LpandMp
Sγ L SC SL SC (BM−u1BT+C1)twice
Sγ S I M P L E SS I M P L E C+Bdiag(A)−1BT
‘mass-p’withSγ orig – Systems involvedinSγorig
Sγ orig – C+γ−1Mp
slowerconvergencerateoftheKrylovsubspacesolvers comparedtothenewSchurcomplementapproximation.This con-clusionismadebasedonthefactthattheperformanceofthemodifiedALpreconditionerisevaluatedbyvaryingtheSchur complementapproximations.Seemoreresultsinthenumericalsection.
The applicationoftheoriginal Schurcomplementapproximation
Sγ orig involvesthesolution ofthesystemwith C+
γ
−1Mp.SincethecontributionstabilizationmatrixC(macro) onmacroelementsconsistsofthemacroelementpressuremass
matrix asillustrated in (8), the presence ofthe assembled pressure mass matrix Mp does not introduce morenon-zero
fill-ininthestabilizationmatrixC .
3.3. SummaryoftheSchurcomplementapproximations
AteachPicarditeration,wesolveeitherthetransformedsystem(13) withthecoefficientmatrix
A
γ ortheoriginalsys-tem(2) withthecoefficientmatrix
A
.WeapplythemodifiedALpreconditionerP
M AL (14) andtheblockupper-triangularpreconditioner
P
U (5) tothetransformedandoriginalsystems,respectively.TheSchurcomplementapproximationsappliedin
P
M AL andP
U aresummarizedinTable1.Duetothesmallsizeoftestproblemsandthelackofcodeoptimization,thecomplexitycomparisonofpreconditioners
P
M AL andP
U is done based on the followingcosts analysisin thispaper, instead of reportingthe computational time.Firstly,we considerthecosts ofusingthemodified ALpreconditioner
P
M AL foraKrylovsubspacemethodthatsolvesthesystemwith
A
γ . Thepreconditioner isapplied ateach Kryloviteration andthe modified ALpreconditioner involvesthesolutionofthe momentumsub-system ‘mom-u’with
Aγ and thepressuresub-system ‘mass-p’withSγ .Furthermore,at eachKryloviterationadditionalcostsareexpressedintheproductofthecoefficientmatrixA
γ withaKrylovresidualvector bres.Thus,thetotalcostsateachKryloviterationare• P
M AL:mom-uwithAγ +mass-pwithSγ +A
γ×
bres.Clearly, the difference ofcosts by applying
P
M AL arisesfromsolving thepressure sub-system ‘mass-p’with differentSchur complement approximations. Ifwe ignore the multiplications in the definitionof the newSchur complement ap-proximation
Sγ new, finding the solution of the pressure sub-system inP
M AL withthree variants derived fromSγ new,i.e.,
Sγ P C D,Sγ L SC andSγ S I M P L E isreducedtosolve thepressuresub-system inP
U withSP C D,SL SC andSS I M P L E,re-spectively. Systems involved in
SP C D,SL SC andSS I M P L E are shownin Table 2. The costs of applying the original Schurcomplementapproximation
Sγ origarealsoincludedinTable2foracomparisonwiththenewSchurcomplementapproxi-mation
Sγ new.Notethatallinvolvedsystemsareofthesamesize.Ifweassumeacomparablecomplexitytosolvedifferentinvolvedsystems,theanalysisinTable2showsthatthecostsofusing
P
M ALwithSγ P C D andSγ L SC areroughlythesameandtwotimesofthatwith
Sγ S I M P L E andSγ orig.Secondly,we considerthecosts ofapplyingtheupperblock-triangularpreconditioner
P
U withdifferentSchurcomple-mentapproximations,whichareusedfortheoriginalsystem.Similartotheanalysisof
P
M AL,weobtainthetotalcosts ateveryKryloviterationas
• P
U:mom-uwith A +mass-pwithS +A
×
bres.Also,varyingSchurcomplementapproximations
S resultsinthedifferenceofcostsby applyingP
U.BasedontheanalysisinTable2andtheassumptionofacomparablesolutioncomplexity forall involvedsystems,wefindout thatthecostsof applying
P
U withSP C D andSL SC areroughlythesameandtwotimesofthatwithSS I M P L E.Lastly,wecomparethecostsbetween
P
M AL andP
U.Asmentionedbefore,solvingthepressuresub-systemwiththenewSchurcomplementapproximation
Sγ newinP
M AL canbereducedtocalculatethesolutionofthepressuresub-systemwith S,whichistheSchurcomplementapproximationusedinP
U.Thus,thedifferenceofcosts betweenP
M ALandP
U focuseson the solutionof themomentum sub-system andtheproduct ofthe coefficient matrixwith theKrylov residualvector. Morenon-zerofill-inin Aγ and
A
γ [13],comparedto A andA
,resultsinaheaviermatrix-vectorproductwhenapplyingP
M AL ateachKryloviteration.However,theheaviercomplexityofP
M AL couldbepaidoffbyareducednumberofKryloviterations. In this paper we obtain a faster convergence ratepreconditioned by
P
M AL with the new Schur complementapproximations, comparedto
P
U used forthe original system. The time advantage ofP
M AL needs a further assessmentwhichisincludedinfutureresearchplan.
4. Numericalexperiments
Inthissection,wecarryoutnumericalexperimentsonthefollowing2Dlaminarbenchmarks:
(1) Flowoverafiniteflatplate(FP)
This example, known asBlasius flow, models a boundary layer flow over a flat plateon the domain
= (−
1,
5)
×
(
−
1,
1)
.Tomodelthisflow,theDirichletboundaryconditionux=
1,
uy=
0 isimposedattheinflowboundary(x= −
1;−
1≤
y≤
1)andalsoonthetopandbottomofthechannel(−
1≤
x≤
5; y= ±
1),representingwallsmovingfromleft torightwithspeed unity.Theplateismodeledby imposingano-flowconditionontheinternalboundary(0≤
x≤
5;y
=
0), andthe Neumanncondition is applied attheoutflowboundary (x=
5;−
1<
y<
1), i.e.,ν
∂u∂n
−
np=
0. TheReynoldsnumberisdefinedby Re
=
U L/
ν
andthereferencevelocityandlengtharechosenasU=
1 andL=
5.Onthe FPflow,weconsiderfourReynoldsnumbersasRe= {
102,
103,
104,
105}
,whichcorrespondtotheviscosityparametersν
= {
5·
10−2,
5·
10−3,
5·
10−4,
5·
10−5}
,respectively.Since stretched grid is typically needed to compute the flow accurately at large Reynolds numbers, stretched grid is generated based on the uniform Cartesian grid with 12
×
2n·
2n cells. The stretching function is applied in they-directionwiththeparameterb
=
1.
01 [cf. [8]]: y=
(
b+
1)
− (
b−
1)
c(
c+
1)
,
c= (
b
+
1 b−
1)
1− ¯y
,
¯
y=
0,
1/
n,
2/
n, ...
1.
(18)(2) Flowoverbackwardfacingstep(BFS)
The L-shapeddomain isknownasthebackward facingstep.APoisseuille flowprofile isimposedon theinflow(x
=
−
1;
0≤
y≤
1).No-slipboundaryconditionsareimposedonthewalls.TheNeumannconditionisappliedattheoutflow (x=
5;
−
1<
y<
1)whichautomaticallysetstheoutflowpressuretozero.UsingthereferencevelocityandlengthU=
1 andL=
2 andtheviscosityparametersν
= {
2·
10−2,
2·
10−3}
,thecorrespondingReynoldsnumbersare Re=
U L/
ν
=
{
102,
103}
.TheBFSflowismorecomplicatedthantheflat-plateflowasitfeaturesseparation,afreeshear-layerandreattachment. OntheBFSflowwedonotconsidertheReynoldsnumberRe
>
103 sincetheincreaseoftheReynoldsnumberbyanorderofmagnitudewilltransferthe flowtobe turbulent.Onthiscasewe onlyconsideruniformCartesian gridwith 11
×
2n·
2n cells.(3) Liddrivencavity(LDC)
Thisproblemsimulatestheflowinasquarecavity
(
−
1,
1)
2 withenclosedboundaryconditions.Alidmovingfromleft torightwithahorizontalvelocityas:ux
=
1−
x4 for−
1≤
x≤
1 y=
1.
Inordertoaccuratelyresolvethesmallrecirculations,weconsiderstretchedgridaroundthefourcorners.Stretchedgrid isgeneratedbasedontheuniformCartesiangridwith2n
·
2n cells.Thestretchingfunctionisappliedinbothdirectionswithparametersa
=
0.
5 andb=
1.
01 [8] x=
(
b+
2a)
c−
b+
2a(
2a+
1)(
1+
c)
,
c= (
b+
1 b−
1)
¯ x−a 1−a,
¯
x=
0,
1/
n,
2/
n, ....,
1.
(19)ThereferencevelocityandlengthU
=
1 andL=
2 andtheviscosityparametersν
= {
2·
10−2,
2·
10−3,
2·
10−4}
resultinthefollowingReynoldsnumbers Re
= {
102,
103,
104}
.ForthesamereasonasBFS,alargerReynoldsnumber Re>
104 isnotconsideredonthiscase.Inordertoexploretheperformanceof
P
M AL andP
U withvaryingSchurcomplementapproximationsassummarizedinTable1andTable2,numericalevaluationsareclassifiedintofourcategoriesasfollows. (C1) OnsmallReynoldsnumberanduniformgrid
InthiscategoryweconsidertheFP,BFSandLDCcasesonthesmallReynoldsnumberRe
=
102 anduniformCartesian(C2) OnmoderateReynoldsnumberanduniformgrid
Inthiscategorywe applythemoderateReynoldsnumberRe
=
103 ontheFP,BFSandLDCcases.Similartothefirst classofexperiments,uniformCartesiangrid isusedheretocheckthe variationofperformance whenincreasingthe Reynoldsnumberbyanorderofmagnitude.(C3) OnmoderateReynoldsnumbersandstretchedgrid
ThiscategorycontainsthetestscarriedoutontheFPandLDCcaseswithstretchedgrid.Thestretchingfunctionsfor theFPandLDCcasesare (18) and (19), respectively.Still,the moderateReynoldsnumber Re
=
103 isemployed forthetwotests.Comparingwiththesecondclassofexperiments,thiscategoryismeanttoinvestigatetheeffectofmesh anisotropy.
(C4) OnlargeReynoldsnumbersandstretchedgrid
The LDCcasewith Re
=
104 andFPcasewith Re= {
104,
105}
are includedinthis classof teststoassess howthe Krylovsubspace solver behaves atrelatively large Reynolds numbers.Here stretched grid is employed to accurately resolvetheproblemcharacteristics.Inthispaperallexperimentsarecarriedoutbasedontheblocks A,B,C ,C1,C2,Ap,Mp,Lp andMu andtheright-hand
side vector rhs,which are obtainedatthe middlestep ofthe wholenonlinear iterations. Numerical experimentsin [13] show that thenumber oflineariterations variesduring thenonlinear procedure.The motivation ofchoosing themiddle step of the nonlinear iterations to export the blocks and vector is that a representative number of linear iteration can be obtained, compared to the averaged numberof linear iterations through the whole nonlinearprocedure. The relative stopping tolerance to solve the linear systemby GMRES ischosen equal to 10−8. The restart functionality ofGMRES is
not usedinthispaper. Sincethepreconditioners
P
M AL andP
U involvevariousmomentum andpressuresub-systems,allthesesub-systems are directlysolved inthis paperto avoidthesensitiveness ofiterative solverson the varyingsolution complexities.
AspointedoutinSection2,theapplicationoftheSchurcomplementapproximation
Sγ P C D needstopresetboundaryconditions forthepressure Laplacian Lp and convection-diffusion Ap operators. Inthis paper,we follow thesuggestions
of [32,33] to use Dirichletboundary conditionsalong inflow boundaries todefine Lp and Ap.This means that the rows
andcolumns of Lp and Ap corresponding tothe pressure nodes on an inflow boundary are treatedas though they are
associatedwithDirichletboundaryconditions.Fortheenclosedflow,wealgebraicallyaddh2I to Lp andAp tomakethem
non-singular,whereh denotes thegrid sizeand I is theidentity matrixofproper size.Such artificialpressureboundary conditionsare onlyimposed on thepreconditioner. Thecoefficient matrix andright-handside vector are notaffected by theseboundarynodemodifications.
4.1. OnsmallReynoldsnumberanduniformgrid
InthissubsectionwecarryoutexperimentsontheFP,BFSandLDCcaseswithuniformCartesiangridandsmallReynolds number Re
=
102. The numberof Krylov iterations to solve the transformed system preconditioned by the modified ALpreconditioner
P
M AL isgiveninTable 3.The SchurcomplementapproximationsSγ P C D,Sγ L SC,Sγ S I M P L E inP
M AL arederived fromthe newmethod
Sγ new (16) andtheapproximationSγ orig corresponds to theoriginal Schur complementapproximation(17).Inthispaper,thereportednumberofKryloviterationspreconditionedby
P
M AL isobtainedbyusingtheoptimalvalue of
γ
,whichresultsinthefastestconvergencerateoftheKrylovsubspacesolver.Thefollowingobservations aremadefromTable3.Except
Sγ S I M P L E,weseethattheotherSchur complementapproximationsresultintheindependenceofKrylovitera-tionsonthemeshrefinementatthethreetestcases.IntermsofthenumberofKryloviterations,
Sγ L SC issuperiortotheotherSchurcomplementapproximationsontheFPandBFScasesbythereducednumberofiterationsandequallyefficient as
Sγ P C D andSγ orig onthe LDC case. Tounderstand thisadvantage, we take the FP caseasan exampleandplot theeigenvalues ofthepreconditionedSchur complementmatrix
S−γ Sγ in1 Fig.1.As canbe seen,Sγ L SC leads tomoreclus-teredeigenvaluesandthesmallesteigenvaluefurtherawayfromzero.Suchadistributionofeigenvaluesisfavorableforthe Krylovsubspacesolverandafasterconvergenceratecanbeexpected. Weknowthattherecanbematriceswherethereis norelationbetweenthespectrumandtheconvergenceofGMRES[37],especiallyifthematrixisstronglynonnormal.We includethespectrumbecauseinourexamplesthepropertiesofthespectrumareinlinewiththeconvergenceproperties ofGMRES.Inaddition,thefield-of-valuestypeestimatesfortheaugmentedLagrangianpreconditionedmatrixareprovided by[38].
As analyzedin Section 3.3,ateach Krylov iterationthe costs ofapplying
P
M AL withSγ L SC are roughly thesame as Sγ P C D and two timesof that usingSγ S I M P L E andSγ orig. Ifwe assume the computational expenseof applyingP
M ALwith
Sγ orig tobe unit ateachiteration,the totalcostsby usingall SchurcomplementapproximationsonthefinestgridarepresentedinTable4andcalculatedbymultiplyingtheexpenseperiterationby thenumberofiterations.Intheother classesofevaluationswealsousethismethodtocalculatethetotalcomputationalcosts.
Results inTable4 show thatthe minimal computationalcosts are achievedby using
Sγ orig inP
M AL.AlthoughfewerTable 3
Re=102and uniformgrid:thenumberofGMRESiterationstosolvethetransformedsystemwithA
γ
precon-ditionedbyPM ALwithdifferentSchurcomplementapproximationsandtheoptimalvalueofγinparentheses.
Sγ P C D SγL SC Sγ S I M P L E Sγ orig FP case:
n=5 26(1.e-1) 17(8.e-2) 43(2.e-1) 38(2.e-1) n=6 25(1.e-1) 25(8.e-2) 67(2.e-1) 38(2.e-1) n=7 25(1.e-1) 26(8.e-2) 100(2.e-1) 38(2.e-1) BFS case:
n=5 34(2.e-2) 17(2.e-2) 42(1.e-1) 36(1.e-1) n=6 42(3.e-2) 21(2.e-2) 60(1.e-1) 36(1.e-1) n=7 45(3.e-2) 22(2.e-2) 87(1.e-1) 36(1.e-1) LDC case:
n=6 17(2.e-2) 17(2.e-2) 34(1.e-1) 19(1.e-1) n=7 18(2.e-2) 20(2.e-2) 48(1.e-1) 19(1.e-1) n=8 18(2.e-2) 22(2.e-2) 63(1.e-1) 19(1.e-1)
Table 4
Re=102and uniformgrid:thetotalcostsofapplyingP
M AL withdifferentSchurcomplementapproximations
onthefinestuniformCartesiangrid.
Sγ P C D Sγ L SC Sγ S I M P L E Sγorig
FP case: 50 52 100 38
BFS case: 90 44 87 36
LDC case: 36 44 63 19
theheaviercostsof
Sγ L SC.Inthisclassofexperiments,itseemsthattheoriginalSchurcomplementapproximationSγ origismoreefficientthantheotherapproximationsduetothefewercomputationalcostsintotal.
4.2. OnmoderateReynoldsnumberanduniformgrid
Inthissubsectionwe choosethemoderateReynoldsnumberRe
=
103 toevaluatethe performanceoftheSchurcom-plement approximationsusedin themodified ALpreconditioner
P
M AL andcomparewiththeevaluations at Re=
102 inSection4.1.BasedonthenumberofKryloviterationspresentedinTable5,weseethattheindependenceofKryloviterations on themeshrefinement isachievedby usingtheSchur complementapproximations
Sγ P C D andSγ L SC inP
M AL,whichis alsoobservedinSection 4.1.Contrarytothe observationsinSection 4.1,the originalSchur complementapproximation
Sγ orig doesnotresultin themeshindependenceof Kryloviterations at Re=
103.WiththeutilizationofSγ S I M P L E thenumberofKryloviterationsisdependentofthegridsizeatboth Re
=
102and103.Results inTable5showthatthesmallestnumberofKryloviterations isobtainedbyusing
Sγ L SC inP
M AL,whichalsoresultsintheminimaltotalcostsinTable6.ThetotalcostsarecalculatedbyusingthesamemethodasSection4.2.Taking themeshindependenceintoaccount,theutilizationof
Sγ L SC willleadtoa furtherreduction oftotalcosts onfinergridsover
Sγ S I M P L E andSγ orig,whichrequiremoreiterationswithmeshrefinement.ComparedtoSγ P C D whichalsoresultsinthemeshindependenceofKryloviterations,theapplicationof
Sγ L SC reducesthetotalcomputationalcostsatleasttwotimes onthe FPandBFScases,andthisreduction factorcanalso be expectedonfinergrids. Onthe LDCcase
Sγ L SC isequallyefficientas
Sγ P C D.For thetestsat Re
=
103 it showsthatSγL SC issuperior to theother Schur complementapproximationsby the
re-ductionofKryloviterations andtotalcomputationalcosts.IntheprevioustestswithRe
=
102,thesuperiorityofSγ origisseen.ThisimpliesthattheoptimalSchurcomplementapproximationdependsontheReynoldsnumber.
4.3. OnmoderateReynoldsnumberandstretchedgrid
This subsectionismeant to investigatetheinfluence ofmesh anisotropyon theperformance of themodified AL pre-conditioner
P
M AL.TocomparewithSection4.2,weapplythestretched gridandmoderateReynoldsnumberRe=
103 ontheFPandLDCcases.ThenumberofKryloviterationsandtotalcomputationalcostsarepresentedinTable7andTable8, respectively.From Table7wenotethatonly
Sγ P C D resultsinthemeshindependenceandtheminimalnumberofKryloviterations.Althoughthetotalcostsofapplying
Sγ P C D aremorethanthatbyusingSγ S I M P L EandSγ origontheconsideredfinestgrid,asseenfromTable8,fewer costsintotalbyusing
Sγ P C D canbeexpectedonfinergridsduetothemeshin-dependence.Therefore,wethinkthat
Sγ P C D issuperiortotheotherSchurcomplementapproximationsonthetestswithRe
=
103 andstretchedgrid.Note that on the FP and LDC cases with stretched grid,
P
M AL withSγ L SC is not mesh independent any more andFig. 1. FP and Re=102: plot of eigenvalues of the preconditioned matricesS−1
γ Sγ at the uniform Cartesian grid with 12×25·25cells.
Table 5
Re=103and uniformgrid:thenumberofGMRESiterationstosolvethetransformedsystemwithA
γ
precon-ditionedbyPM ALwithdifferentSchurcomplementapproximationsandtheoptimalvalueofγinparentheses.
Sγ P C D Sγ L SC Sγ S I M P L E Sγorig FP case:
n=5 54(8.e-3) 29(8.e-3) 34(2.e-2) 76(6.e-2) n=6 55(8.e-3) 18(8.e-3) 51(2.e-2) 90(6.e-2) n=7 56(8.e-3) 17(8.e-3) 99(2.e-2) 95(6.e-2) BFS case:
n=5 66(4.e-3) 45(3.e-3) 49(1.e-2) 71(3.e-2) n=6 63(4.e-3) 27(3.e-3) 77(1.e-2) 76(3.e-2) n=7 65(3.e-3) 29(3.e-3) 142(1.e-2) 84(3.e-2) LDC case:
n=6 30(4.e-3) 54(1.e-3) 66(7.e-3) 36(2.e-2) n=7 28(4.e-3) 29(4.e-3) 52(1.e-2) 42(2.e-2) n=8 29(4.e-3) 29(4.e-3) 85(1.e-2) 48(2.e-2)
FPcaseasanexample,onthefineststretchedgridofn
=
7 thenumberofKryloviterationspreconditionedbyP
M AL with Sγ L SC increasesbyafactorabout7comparedtothefinestuniformgrid.ThiscanbeseenbycomparingthecorrespondingresultsinTable5andTable7.Thelessefficiencyof
P
M AL withSγ L SC arisingfromthemeshanisotropyisalsoseenontheLDCcase.Onthe otherhand,thenumberofKryloviterations preconditionedby
P
M AL withtheotherSchur complementTable 6
Re=103and uniformgrid:thetotalcostsofapplyingP
M AL withdifferentSchurcomplementapproximations
onthefinestuniformCartesiangrid.
Sγ P C D Sγ L SC Sγ S I M P L E Sγorig
FP case: 112 34 99 95
BFS case: 130 58 142 84
LDC case: 58 58 85 48
Table 7
Re=103and stretchedgrid:thenumberofGMRESiterationstosolvethetransformedsystemwithA
γ
precon-ditionedbyPM ALwithdifferentSchurcomplementapproximationsandtheoptimalvalueofγinparentheses.
Sγ P C D Sγ L SC Sγ S I M P L E Sγ orig FP case:
n=5 59(8.e-3) 90(7.e-3) 37(2.e-2) 69(6.e-2) n=6 66(8.e-3) 89(7.e-3) 63(2.e-2) 85(6.e-2) n=7 62(8.e-3) 117(6.e-3) 119(2.e-2) 92(6.e-2) LDC case:
n=6 65(2.e-3) 98(2.e-3) 57(7.e-3) 69(1.e-2) n=7 41(2.e-3) 58(2.e-3) 46(7.e-3) 40(1.e-2) n=8 38(2.e-3) 84(2.e-3) 75(7.e-3) 54(1.e-2)
Table 8
Re=103and stretchedgrid:thetotalcostsofapplyingP
M ALwithdifferentSchurcomplementapproximations
onthefineststretchedgrid.
Sγ P C D Sγ L SC Sγ S I M P L E Sγorig
FP case: 124 234 119 92
LDC case: 76 168 75 54
Fig. 2. FP and Re=103: plot of eigenvalues of the preconditioned matricesS−1
γ L SCSγ at the uniform and stretched grids with 12×25·25cells.
The lessefficiencyof
Sγ L SC onthestretchedgridcan beexplainedbytheresultsinFig.2,whereweconsidertheFPcaseatRe
=
103 andplottheeigenvaluesofthepreconditionedmatrixS−1γ L SCSγ forbothuniformandstretched grids.As
seen fromFig.2,stretching thegrid considerablyspreadsthe distributionofthe eigenvaluesofthe preconditionedSchur complement
S−γ1L SCSγ ,whichmakestheconvergenceoftheKrylovsubspacesolvermoredifficult.4.4. OnlargeReynoldsnumberandstretchedgrid
In thissubsectionwe applylarge Reynoldsnumbers Re
≥
104 andstretched grids ontheLDCandFPcases.Results inTable9andTable10illustratethatthefastestconvergencerateoftheKrylovsubspacesolverandtheminimalcomputational costs intotal areachievedby using
Sγ S I M P L E inP
M AL onthetwo tests. TakingtheFP caseat Re=
104 asan example,fromTable10weseethattheutilizationof
Sγ S I M P L E reducesthetotalcostsatleasttwotimeswithrespecttotheotherTable 9
Re=104and stretchedgrid:thenumberofGMRESiterationstosolvethetransformedsystemwithA
γ
precon-ditionedbyPM ALwithdifferentSchurcomplementapproximationsandtheoptimalvalueofγinparentheses.
Sγ P C D Sγ L SC Sγ S I M P L E Sγ orig FP case:
n=5 363(8.e-4) 369(6.e-4) 35(2.e-3) 93(1.e-2) n=6 334(8.e-4) 336(6.e-4) 53(3.e-3) 128(2.e-2) n=7 346(8.e-4) 374(6.e-4) 83(4.e-3) 192(2.e-2) LDC case:
n=6 113(3.e-4) 97(2.e-4) 34(1.e-3) 46(5.e-3) n=7 143(3.e-4) 235(2.e-4) 45(1.e-3) 65(5.e-3) n=8 159(4.e-4) 309(2.e-4) 80(2.e-3) 106(5.e-3)
Table 10
Re=104and stretchedgrid:thetotalcostsofapplyingP
M AL withdifferentSchurcomplementapproximations
onthefineststretchedgrid.
Sγ P C D Sγ L SC Sγ S I M P L E Sγ orig
FP case: 692 748 83 192
LDC case: 318 618 80 106
Table 11
FP andRe=105:the numberofGMRESiterationsandtotalcoststosolvethetransformedsystemwithA
γ
preconditionedbyPM ALwithdifferentSchurcomplementapproximationsandtheoptimalvalueofγin
paren-theses.Thestretchedgridisapplied.
Sγ P C D SγL SC Sγ S I M P L E Sγ orig iterations: n=5 1000+ 1000+ 26(1.e-4) 136(1.e-3) n=6 1000+ 1000+ 35(2.e-4) 192(2.e-3) n=7 1000+ 1000+ 58(3.e-4) 310(2.e-3) total costs: n=7 2000+ 2000+ 58 310
onthe FPcase, whichis seenfromTable 11.Inthe contextoflarge Reynoldsnumbers, itappears that
Sγ S I M P L E istheoptimalSchurcomplementapproximationinthemodifiedALpreconditioner
P
M AL.Incontrasttotheprevioustests,atlargeReynoldsnumbersnoneoftheconsideredSchurcomplementapproximationsleadtothemeshindependenceof
P
M AL.Theadvantageof
Sγ S I M P L E onfinergridsneedsafurtherassessment,whichisincludedinfutureresearch.To investigatethe effect ofthe Reynolds number, we take the FP caseasan exampleand in Fig.3 plot the number ofKrylov iterations preconditioned by
P
M AL atvarying Reynolds numbers.It appears that onlySγ S I M P L E results intherobustnessof
P
M AL withrespecttotheReynoldsnumber.Tounderstandthereasons,wecomputetheextremaleigenvaluesofthepreconditionedSchurcomplementmatrix
S−γ Sγ and1 presenttheminTable12. Rmin andRmax denotethesmallestandlargestrealpartsoftheeigenvaluesandImax correspondsthelargestimaginarypart.Theseextremalvaluescorrespond
totheboundariesoftherectangulardomaincontainingalleigenvalues.Regarding
S−γ1S I M P L ESγ ,thevaluesofRmin slightlydecreaseandremainthesameorderofmagnitude.Togetherwiththedecreaseof Rmax
/
Rmin andImax,theeigenvaluesarefurtherclustered. However,fewerclusteredeigenvaluesare yieldedby usingtheotherSchurcomplementapproximations. Thisexplainstherobustnessof
P
M AL withSγ S I M P L E withrespecttotheReynoldsnumber.Toinvestigatethe computedsolutionsatlargeReynoldsnumbers,we choosetheFPcase.Intheinviscidlimit Re
→ ∞
thesolutionissimplyux
=
1,uy=
0 and p=
constant.Sincetheshearboundarylayerisofwidthproportionalto√
ν
andwithin the layerthe horizontalvelocity increases rapidlyfromzero tounity, the plateseems “invisible”as Re
→ ∞
[1]. Tocheckthisfeature,inFig.4weillustratethecalculatedpressureandequallyspacedcontours ofthehorizontalvelocity between0 and0.
95 atdifferentReynoldsnumbers.Thestretched gridwith12×
26·
26 cells isutilized.At Re=
103,thecountersofthehorizontalvelocityshowtheevolutionoftheboundarylayerasthefluidpassingfromtheleadingedgeof the platetothe outflow.The parabolicshape ofthe velocity contours seemsconsistent withasymptotic theory[39] and thereportedresultsin[1].WhenincreasingtheReynoldsnumberstoRe
=
105,weseethattheplate“disappears”andthedifferencebetweenthepressurevaluesdecreasesbyoneorderofmagnitudecomparedtothecaseofRe
=
103.Resultsin Fig.4demonstratethatthecomputedsolutions,rangingfromthemoderatetolargeReynoldsnumbers,seemreasonable.4.5. SummaryoftheSchurcomplementapproximationsin
P
M ALBased on the above fourclasses of numericalevaluations, inTable 13 we summarizethe optimal Schur complement approximationinthemodifiedALpreconditioner
P
M AL.ItshowsthattheoptimalSchurcomplementapproximation,whichFig. 3. FP and stretched grid: plot of the number of GMRES iterations preconditioned byPM ALat varying Reynolds numbers.
Ateveryclassofevaluations,theoptimalSchurcomplementapproximationisproblemindependent.Numericalevaluations inthispapershowthat
Sγ origissuitableforthecalculationswithsmallReynoldsnumbersandSγ S I M P L E deliversabetterperformance forlarge Reynolds numbers due to its Reynolds robustness. In the context ofmoderate Reynolds numbers,
Sγ L SC ismoreefficientwithuniformgrids butsensitiveto meshanisotropy.Whenstretched gridsareemployed,Sγ P C Dturns out to be the optimalchoice inthe moderateReynolds numbercontext. Exceptthe calculationsatsmall Reynolds numbers anduniformgrids,theoptimalSchurcomplementapproximationsonother classesoftestsarederived fromthe newmethod
Sγ newproposedinthispaper.Thisdemonstratestheadvantageofthenewapproachoverthetraditionalone Sγ orig.ThemeshindependenceofKryloviterationsisnotachievedbyusingtheoptimalSchurcomplementapproximation only forthe class of tests withlarge Reynolds numbers. The reason and possible improvement on this issueare to be consideredinfutureresearch.4.6. Comparisonbetween
P
M ALandP
U.To apply the modified AL preconditioner
P
M AL,one needs totransform the original system(2) to an equivalent one(13) withthecoefficient matrix
A
γ .Thistransformationconsumesadditionalcosts.Furthermore,ateachKryloviterationextra costs arisefromthe productof
A
γ with aKrylovresidual vectordueto morefill-in inA
γ [13]. Inthissense, theheavier complexities of
P
M AL could be payed offonly by a reducednumberof Kryloviterations, compared to theblockupper-triangularpreconditioner
P
U applied tothe originalsystem. In thissection,we considerthe comparisonsbetweenP
M AL andP
U on theLDCandFP casesatthe largeReynolds number Re=
104 and stretchedgrid whichrepresent stifftestsontheconsideredpreconditioners.
ItisrevealedinSection 4.4that
Sγ S I M P L E turnsouttobethemostefficientSchurcomplementapproximationfortheFig. 4. FP and stretchedgrid:plotofthecalculatedpressureunknown(left)andcontoursofthehorizontalvelocitybetween0 and0.95 (right)atdifferent Reynoldsnumbers.