ABSTRACT Title of thesis: DENOISING THE DESIGN SPACE: DIFFUSION MODELS FOR ACCELERATED AIRFOIL SHAPE OPTIMIZATION Cashen Diniz, Master of Science, 2024 Thesis directed by: Professor Mark D. Fuge Department of Mechanical Engineering Generative models offer the possibility to accelerate and potentially substitute parts of the often expensive traditional design optimization process. We present Aero-DDM, a novel application of a latent denoising diffusion model (DDM) capable of generating airfoil geometries conditioned on flow parameters and an area constraint. Additionally, we create a novel, diverse dataset 1 of optimized airfoil designs that better reflects a realistic design space than has been done in previous work. Aero-DDM is applied to this dataset, and key metrics are assessed both statistically and with an open-source computational fluid dy- namics (CFD) solver to determine the performance of the generated designs. We compare our approach to an optimal transport GAN, and demonstrate that our model can generate designs with superior performance statistically, in aerodynamic benchmarks, and in warm- start scenarios. We also extend our diffusion model approach, and demonstrate that the number of steps required for inference can be reduced by as much as ∼ 86%, compared to an optimized version of the baseline inference process, without meaningful degradation in design quality, simply by using the initial design to start the sampling process. 1https://github.com/IDEALLab/OptimizingDiffusionSciTech2024 DENOISING THE DESIGN SPACE: DIFFUSION MODELS FOR ACCELERATED AIRFOIL SHAPE OPTIMIZATION by Cashen Diniz Thesis submitted to the Faculty of the Graduate School of the University of Maryland, College Park in partial fulfillment of the requirements for the degree of Master of Science 2024 Advisory Committee: Professor Mark D. Fuge, Chair/Advisor Professor Michael Otte Professor James Baeder © Copyright by Cashen Diniz 2024 Foreword Portions of this thesis are taken from and based on the author’s own previous work, ”Optimizing Diffusion to Diffuse Optimal Design”, originally published as a conference paper by the American Institute of Aeronautics and Astronautics [6]. ii 4/18/2024 Re: Previously Published Materials appearing in Thesis or Dissertation Dean of the Graduate School, Cashen Diniz has: NOT INCLUDED any previously published works within their thesis or dissertation. INCLUDED one or more previously published works within their thesis or dissertation. This letter certifies that the examining committee for the student has determined that the student made a substantial contribution to the previously published work. The inclusion of the previously published work has the approval of the thesis or dissertation advisor and the Graduate Director. Sincerely, Sincerely, Peter Sandborn Mark Fuge Director of Graduate Studies Advisor for Cashen Diniz Department of Mechanical Engineering University of Maryland 2181 Glenn L. Martin Hall College Park, MD 20742-3035 TEL: 301-405-2410 http://www.enme.umd.edu Mark Fuge Digitally signed by Mark Fuge Date: 2024.04.18 12:08:51 -04'00' iii Dedication I dedicate this thesis to my dog, Coco, who has been at my side throughout the process of writing this thesis. Thanks for listening to all my crazy ideas and reminding me to have some fun. iv Acknowledgments I am extremely grateful to all those who have helped me get to the point of finishing this thesis. First and foremost, I would like to thank Dr. Mark Fuge, who has been an invaluable source of support, not only as an advisor, but as a colleague and a friend. Without his belief in me, I simply would not have been able to come this far. I would also like to thank Milad, Nathan, Nicholas, Arthur, Quiyi, and Xuiliang at the IDEAL lab for welcoming me, and for all the discussions we’ve had. I also want to thank my best friend Sean for keeping me motivated through and through. Finally, I want to thank my parents, my brother Odin, and everyone else in my family for being there for me at every step along the way. v Table of Contents Foreword ii Dedication iv Acknowledgements v 1 Introduction 1 2 Related Work 4 2.1 Approaches to Aerodynamic Shape Optimization . . . . . . . . . . . . . . 4 2.2 CEBGAN . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.2.1 Bezier Representation . . . . . . . . . . . . . . . . . . . . . . . . . 6 2.3 Diffusion Model Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2.4 Key Metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 2.4.1 Maximum Mean Discrepancy . . . . . . . . . . . . . . . . . . . . 9 2.4.2 Area Constraint Mean Squared Error (Area MSE) . . . . . . . . . . 9 2.4.3 Instantaneous Optimality Gap (IOG) . . . . . . . . . . . . . . . . . 10 2.4.4 Cumulative Optimality Gap (COG) . . . . . . . . . . . . . . . . . 10 2.4.5 Reduction in Cumulative Optimality Gap (RiCOG) . . . . . . . . . 10 3 Optimization Problem Setup 12 3.1 Design Space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 3.2 Problem Formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 3.3 Solver Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 3.4 Geometry Parameterization and Meshing . . . . . . . . . . . . . . . . . . . 15 4 Dataset Results 18 4.1 Optimization Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 4.2 Data Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 vi 5 Generative Model Methodology 27 5.1 CEBGAN Modifications . . . . . . . . . . . . . . . . . . . . . . . . . . . 27 5.2 Aero-DDM Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . 27 5.3 Bezier Autoencoder Validation . . . . . . . . . . . . . . . . . . . . . . . . 29 5.4 Diffusion Parameters and Scheduling . . . . . . . . . . . . . . . . . . . . . 31 5.5 Benchmarking and Warm Start Optimization . . . . . . . . . . . . . . . . . 32 5.6 DDM Inference Efficiency Study . . . . . . . . . . . . . . . . . . . . . . . 33 6 Model Results 35 6.1 Sample Generations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 6.2 Statistical Validation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 6.3 DDM Inference Ablation Study . . . . . . . . . . . . . . . . . . . . . . . . 38 6.4 Aerodynamic Performance Benchmarks . . . . . . . . . . . . . . . . . . . 41 6.5 Warm Start Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . 42 7 Conclusions 46 7.1 Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47 A Other Geometric Parameterization Methods 50 B Example Optimization Trajectory Plots for a Single Case 51 C Effect of Tstart on Designs 53 Bibliography 54 vii Chapter 1: Introduction Airfoil design is an approachable, yet sufficiently difficult and realistic problem. Tradi- tionally, airfoil design problems have been tackled by applying adjoint-based optimization methods, which can be implemented relatively efficiently and robustly [14]. Despite this, these solvers can be difficult for a non-expert to use properly and importantly, still require much in the way of computational resources for any large-scale design space study. In addition such gradient based solvers can easily converge to local minima, as the objective function is non-convex. To address some of these concerns, some authors have presented data-driven approaches to this problem [20, 39]. More recently, generative machine learning models have garnered significant attention in both academic and commercial circles. Generative models, when applied to engineer- ing problems allow for the modeling of diverse, multi-modal distributions, and depending on the model used, can do so in a relatively data-efficient manner. One particular type of generative model, the denoising diffusion model (DDM) has recently surpassed other models such as generative adversarial networks (GANs) in image generation tasks [5]. Notable examples of such models include OpenAI’s DALL-E2 and Stability AI’s Stable Diffusion [30, 33]. Generative models have also demonstrated their efficacy in various engineering optimization design problems, either as a replacement or complement to tra- ditional approaches. This is because engineering problems often exhibit multimodal be- havior, high dimensionality, and non-intuitive design spaces [29, 4]. However, applying generative models to engineering design problems presents unique challenges due to the 1 need to satisfy physical constraints and simulate designs, making the task more complex than simpler image processing or generation tasks. Regenwetter et al. [31] identified four key challenges to applying generative models in engineering design, namely data sparsity, modeling design performance, feasibility, and sample diversity or model creativity. DDMs have shown promise in image generation and processing tasks, outperforming GANs with generally higher quality and more diverse outputs [5]. DDMs are also rela- tively more stable to train than GANs, as they do not rely on an adversarial/game theoretic approach. Although DDMs are relatively new to the field of engineering design, early re- search suggests advantages in topology optimization. For example, in one study, DDMs reduced feasibility error by 11-fold and improved performance by a factor of 8 [26]. Several relevant publications have also applied generative models to design engineering problems. Ghosh et al. presented a probabilistic framework for generating airfoil designs using an invertible neural network trained on data generated by a real-valued non-volume preserving (real NVP) model. In this way, they were able to replace a traditional gradient- based iterative optimization approach. The real NVP model applies a sequence of proba- bilistic transformations to a latent input [9]. However, most publications apply a GAN to the task airfoil shape optimization in favor of a probabilistic approach. For example, Du et al. applied a GAN to efficiently reconstruct the (unconditional) UIUC dataset of airfoil designs [7]. Achour et al. built on the airfoil-generating GAN architecture to add con- ditioning to generations, with an intent to replace aspects of the traditional optimization pipeline [1]. Separately, Morton et al. have applied conditional GANs to predict the flow around circles and half-cylinders [28]. Although this work was not focused on the design optimization process, it demonstrates the broad applicability of generative models to the more general issue of parameterized fluid flow problems. Our work focuses on the problem of airfoil shape optimization, in particular, we follow the conditional generative model work of Chen et al. who used a Conditional GAN to 2 generate airfoil designs based on a given Mach number (Ma), Reynold’s number (Re) and desired coefficient of lift (CL) [2]. In this work we present four main contributions: 1. A diverse, robust dataset of optimized airfoils. We note the following novel contri- butions compared to previous work: (a) An extensive description and analysis of our data and the collection process, which includes the use of a more robust FFD-based framework which has been well validated in literature. This includes the use of a structured, curvilinear meshing algorithm, as opposed to the unstructured meshes used in past work [17, 24, 34]. (b) Higher design diversity in the dataset; previous work has made use of optimized datasets which had a much greater degree of uni modal behaviour [2]. 2. Presentation of a novel Diffusion model application to aero shape optimization, Aero- DDM, which has been tailored to the problem, and can generate designs conditional on both flow parameters, an intial design, and an area constraint. 3. A comparison of our Aero-DDM model to a state of the art optimal transport GAN model. We study statstical metrics, aerodynamic performance, and warm start effi- ciency. 4. An inference efficiency study of our Aero-DDM model, which demonstrates a reduc- tion in the number of sampling steps required by ∼ 86% compared to an optimized version of the baseline inference process, without loss in performance. The method we propose is simple, yet offers immense benefits for rapid design space exploration, considering how traditionally expensive inference can be for diffusion models in gen- eral. 3 Chapter 2: Related Work Although we have chosen to add the context of related works throughout portions of this thesis, in this chapter we will address key areas which help explain our motivation and some of the background behind our work. 2.1 Approaches to Aerodynamic Shape Optimization Aerodynamic shape optimization is the process of tuning a set of geometric design vari- ables, x, typically associated with a lifting body surface, to achieve improvement in some objective function, J subject to constraints. Effectively, this is a constrained minimization problem, and as such, can be solved with a wide array of methods depending on the specific problem. As J is often a performance metric, such as lift or drag, a flow solution, obtained through a computational fluid dynamics solver (CFD) may be required to achieve any level of accuracy. The computational cost of the simulations can be expensive, and therefore rules out a brute force search of the design space to find an optimal design [21]. One of the most efficient methods developed to tackle this problem is the adjoint optimization method [18]. The main benefit the adjoint method, is that the sensitivity of J with respect to the design variables, x can be computed at a cost which scales independently of the number of design variables. The sensitivity information can then be used to inform the change in the current design variables according to a chosen optimization algorithm [16]. Although the adjoint approach may be appealing for a wide range of problems, the 4 computational cost of evaluations can still prohibit it from being used as a tool for rapid, interactive, exploration of designs early on in the engineering process. As a result, many machine learning and surrogate based methods have been developed to fill in the gaps. Surrogate methods, which attempt to approximate after being trained on example data, can provide evaluations of key metrics in fractions of the amount of time it would take to run a full simulation or adjoint optimization. If the surrogate model is differentiable, which is the case for many neural-network based approaches, it can be substituted for the CFD evaluation in a gradient based optimization problem in order to generate new designs [21, 36]. An alternative to surrogate approaches is the use of generative models for design. These models can generate high quality designs without the need to solve a surrogate optimiza- tion problem, as the input to the model can include conditional information about the flow regime or required constraints. Popular generative models include GANs, Variational Au- toencoders, Flow-based models, and diffusion models [13]. As discussed in the introduc- tion, of these approaches, GANs have seen the most use in the field of aerodynamic shape optimization, specifically with application to airfoil shape optimization. 2.2 CEBGAN Much of the work presented in this thesis focuses on a comparison of our implemen- tation to a GAN model developed Chen et. al. The model they developed, the conditional entropic Bezier GAN, CEBGAN, is a discriminator-free GAN which relies on optimal transport based loss to learn the airfoil distribution. We chose this comparison because CEBGAN was shown to outperform traditional GANs in airfoil generation, and in general produced state-of-the-art performance on the task. In addition, the authors of CEBGAN applied a Sinkhorn divergence algorithm to overcome the issue of mode collapse, which 5 is a common issue with traditional GANs where the generator produces outputs with very little variation. In similar diffusion model applications to design tasks, traditional GANs are still often used in comparisons, despite being outclassed by more recent GAN variants [2]. 2.2.1 Bezier Representation A central component of CEBGAN, and an attribute shared with our Aero-DDM model, is the use of a latent Bezier representation for the airfoil coordinates. This idea was origi- nally developed by the same authors in [3]. The general Bezier formulation, describing the coordinates, Y in terms of control points, cp, and weights, w, is as follows; Yj = ∑ N i (n i ) t i j ( 1− t j )n−i cpiwi ∑ N i (n i ) t i j ( 1− t j )n−i wi (2.1) [3]. Where j varies between 0 and the number of points, m. t ∈ [0,1] is a parametric variable which discretizes the curve, and N is the degree of the Bezier polynomial. In their work, Chen. et. al found that Bezier representations offered smoother, more feasible designs in a lower dimensional space compared to a GAN trained on raw coordinates [3]. Finally, we direct the reader to the appendix A for a brief discussion of alternative geometric parame- terizations, which may be useful for future work. 2.3 Diffusion Model Theory Diffusion models are part of a greater family of models, all of which are based on the idea of maximizing the likelihood, p(x), of all known data, x. In practical problems, the ground-truth function describing p(x) is often complex, and x can also be quite high- 6 dimensional. As such, learning p(x) exactly can be computationally infeasible. There- fore, likelihood-maximizing models instead introduce a random latent variable, z, of lower- dimensional, and or lower complexity, which can be used to describe the joint distribution with x, p(x) = ∫ p(x,z)dz. (2.2) (2.3) To realize the benefits of introducing the lower complexity z, a tractable Evidence Lower Bound (ELBO) can be defined to approximate the joint distribution integral; p(x) = ∫ p(x,z)dz (2.4) log p(x) = log ∫ p(x,z)dz (2.5) = log ∫ p(x,z)qθ (z|x) qθ (z|x) (2.6) = log Eqθ (z|x) [ p(x,z) qθ (z|x) ] (2.7) ≥ Eqθ (z|x) [ p(x,z) qθ (z|x) ] (Using Jensen’s Inequality) (2.8) (2.9) Adapted from Luo: [23, 6] Where qθ (z|x) is the variational distribution with learnable model parameters θ [23]. Diffusion models differ from other related likelihood maximizing approaches in that z, has the same cardinality as the data x, but is noised according to a variance schedule 7 parameterized by a hyperparameter, βt . The index t describes the data-to-noise ratio and ranges from 0 to T , with 0 representing the original, un-noised data, and T representing maximally noised data. In the limit as T increases, the data approaches an isotropic Gaus- sian distribution; q(xt|xt−1) = N (xt; √ αtxt−1,1−αt) (2.10) Here, αt = 1−βt . The act of noising the original data according to a parameterized Gaussian is known as the forward process. The reverse process can be computed using the model predictions; pθ (x0:T) = p(xT) T ∏ t=1 pθ (xt−1|xt) (2.11) where, pθ (xt−1|xt) = N (xt−1; µθ (xt , t),σθ (xt , t)) (2.12) [23]. Commonly, σθ is set equal to βt . The final ELBO objective can be written as ELBO = Eq(x0:T|x0) [ p(x0:T) q(x1:T|x0) ] (2.13) In many cases, including this thesis, the MSE between the predicted and actual noise added to the data can be used as a much simpler estimation of the ELBO [15]. 2.4 Key Metrics Throughout this thesis, we will refer to various statistical, and performance measures and related terms which we will briefly discuss here. We specifically chose to use many 8 metrics which were also used by Chen et. al. in [2], as it creates a degree of consistency between both studies. 2.4.1 Maximum Mean Discrepancy Maximum mean discrepancy (MMD) is a metric which attempts to measure the sim- ilarity between two distributions, and is commonly used to determine the performance of generative machine learning models. It is defined as MMD2(P,Q) = Ex,x′∼P[k(x,x ′)]+Ey,y′∼Q[k(y,y ′)]−2Ex∼P,y∼Q[k(x,y)] (2.14) . Where P and Q are the distributions we are interested in, x and y are independent random variables with distributions P, and Q respectively. x′ and y′ are additional independent copies of x and y with the same P and Q distributions. k is a chosen kernel function - in this thesis we use a Gaussian kernel with a σ of 1 [11]. 2.4.2 Area Constraint Mean Squared Error (Area MSE) In parts of this thesis we attempt to measure violation of the area constraint specific to our problem. Our formulation of this metric is simple; prescribing a value of zero unless the area of the generated airfoil is less than the minimum area constraint, in which case the squared error is applied: AError =    0 if Agen ≥ Amin ||Amin −Agen||2 if Agen < Amin (2.15) 9 2.4.3 Instantaneous Optimality Gap (IOG) Instantaneous optimality gap is a metric which attempts to measure the difference in performance between an initial design, x0, with objective function value J0, and the optimal design, x∗, with the objective function value J∗: IOG(J) = |J0 − J∗| J∗ . (2.16) While some formulations in literature choose to use a simple ratio, this can be mis- leading, as high objective ratios may be achieved by violating certain constraints. This is especially true for aerodynamic efficiency quantities such as Cl Cd where induced drag may spike even with a relatively minor change in Cl . Evidence for this can be found in the highly nonlinear drag polars from experiments and simulations in literature [32, 22]. 2.4.4 Cumulative Optimality Gap (COG) Cumulative optimality gap (COG) is a measure of how efficiently an optimizer is ap- proaching an optimal design, x∗, at the current step i in an optimization trajectory, taking into consideration the values of all previous objective function evaluations: COG(J) = N ∑ i J∗− Ji J∗ (2.17) [2]. If i = n, COG measures the total efficiency of an optimization trajectory. 2.4.5 Reduction in Cumulative Optimality Gap (RiCOG) Reduction in cumulative optimality gap (RiCOG) is similar to the COG metric, but instead measures optimizer efficiency relative to a baseline COG trajectory, Jo: 10 RiCOG(J) = COG(Jo)−COG(J) COG(Jo) ×100 (2.18) In this thesis, Jo is always the objective function optimization trajectory of our original adjoint optimized dataset. 11 Chapter 3: Optimization Problem Setup Note: This chapter was taken from the author’s original work as published in [6]. We formulate a diverse, multi-modal, optimization problem by first specifying the flow conditions; Mach number (M) and Reynolds’s number (Re). In addition, we require the final design to attain the specified coefficient of lift (Cl ) while keeping the area above the minimum area (( A Ainit ) min ) constraint. Thickness constraints are imposed based on the initial design—although we note that in practice, none of the adjoint solver iterations activated these constraints. We also impose no-twist leading and trailing edge conditions. After defining the problem, we used Latin-hypercube sampling to generate the input parameter space, within the bounds specified in Table 3.1. 3.1 Design Space Table 3.1: Sampled Parameter Bounds Category Parameter Lower Upper Description Flow Condition M 0.4 0.9 Mach Number Re 1E6 10E6 Reynold’s Number Constraint Ccon l 0.5 1.2 Coefficient of Lift ( A Ainit ) min 0.75 1.0 Minimum Area Fraction 12 The bounds were chosen purposefully to cover part of the transonic regime which is known to be difficult to model with non-RANS approaches, such as panel methods [20]. 3.2 Problem Formulation We define the specific optimization problem below in equation 3.1. min ∆yi,α Cd s.t. Cl =Ccon l −0.025 ≤ ∆yi ≤ 0.025 0.0 ≤ α ≤ 10.0 ( A Ainit ) min ≤ A Ainit ≤ 1.2 0.15 ≤ t tinit ≤ 3.0 tT E = tT E,init yupper T E =−ylower T E yupper LE =−ylower LE (3.1) Where ∆yi is the difference in the y-coordinates of the Free-Form Deformation (FFD) cage from the initial FFD cage. As such, ∆yi, has a large impact on how diverse the resulting design space is, and its value can be thought of as being embedded in the initial airfoil shape. Table 3.2 describes each of the optimization parameters in detail. 13 Table 3.2: Optimization Problem Parameters Category Parameter Quantity Lower Upper Units Description Objective Cd 1 - - Non-Dim./Counts Coefficient of Drag Variable ∆yi 20 -0.025 0.025 m Change from initial FFD cage y value: ∆yi = yi − yinit α 1 0.0 10.0 Degrees Angle of Attack Constraint Cl =Ccon l 1 0.0 0.0 Non-Dim. Coefficient of Lift A Ainit 1 ( A Ainit ) min 1.20 Non-Dim. Area Fraction; Relative to Initial t tinit 100 0.15 3.00 Non-Dim. Thickness Fraction; Relative to Initial tT E = tT E,init 1 0.0 0.0 m Trailing Edge Thickness: Blunted TE Condition yupper T E =−ylower T E 1 0.0 0.0 m Trailing Edge FFD Point Shearing Twist Condition yupper LE =−ylower LE 1 0.0 0.0 m Leading Edge FFD Point Shearing Twist Condition In addition to the scalar optimization parameters, 1400 initial airfoil designs were taken from a random subset of the UIUC dataset [27]. Raw coordinates from the dataset were re- sampled, smoothed, and normalized as part of a pre-processing procedure. Samples from this dataset are shown in Figure 3.1. 0.0 0.2 0.4 0.6 0.8 1.0 x (m) 0.0 0.1 y (m ) Figure 3.1: Processed Samples From UIUC Dataset 14 3.3 Solver Setup We used the MACH-Aero framework for all discussed simulations 1. Within the MACH- Aero framework, structured mesh generation was handled automatically by pyHyp, a hy- perbolic mesh generator [34]. Additional details concerning the mesh setup are described in the next section. pyOptSparse, a sparse numerical optimization suite, was used to ef- ficiently solve the nonlinear optimization problem [37]. Specifically, we use the sequen- tial least squares programming algorithm (SLSQP). pyGeo, a geometry parameterization module, was used to parameterize and deform the FFD cages, with warping of the cages handled by the robust IDWarp framework [12, 35]. The flow simulation, and the corre- sponding adjoint solution, was computed using the open-source ADflow solver [24, 17]. Within the solver, we used the approximate Newton–Krylov (ANK) method for improved convergence and robustness [38]. ADflow was configured to run Reynolds Averaged Navier Stokes (RANS) simulations with the Spalart-Allmaras model for turbulence effects. For the flow solver we required that the L2 norm of the residual decrease to 1e− 8, relative to the initial freestream flow residual. We use the same value for the adjoint resid- ual convergence factor. For the optimization problem, we limit the maximum number of iterations to 500, and set a maximum step size of 1e−6 for each of the design variables. 3.4 Geometry Parameterization and Meshing Since we had to run a diverse set of input flow conditions and also initial airfoils, we decided to use a meshing setup that was able to successfully match the results from the benchmark ADODG RAE2822 airfoil optimization problem. A more detailed description of this benchmark problem can be found in [14]. The mesh used to successfully replicate 1https://github.com/mdolab/MACH-Aero 15 the results from this benchmark problem, as well as the corresponding FFD cage is shown in Figure 3.2. (a) Example Mesh (b) FFD Cage Figure 3.2: Example Setup: RAE2822 We set the far-field distance to 100 chord lengths away from each airfoil, and extrude the mesh using 100 grid levels. For each optimization run, we set the number of FFD points to 20 based on the results in [14], which showed reduced benefit after about 20 points were used. Finally, to improve stability we blunt the trailing edge at the point 99% away from the leading edge. Figure 3.3 shows the meshes for four random samples in our dataset after applying the described setup. 16 (a) Sample a (b) Sample b (c) Sample c (d) Sample d Figure 3.3: Four Random Samples: Meshing Setup 17 Chapter 4: Dataset Results Note: This chapter was taken from the author’s original work as published in [6]. From the starting dataset of 1400 airfoils, we observed 935 successful optimizations, corresponding to a 67% success rate. In general, there were three main ways cases could fail. Firstly, they could fail due to issues with the flow solver. This could include meshing failures, such as the consistent presence of negative mesh volumes or highly irregular air- foil designs that were the result of large FFD cage changes. In these cases the flow solver could fail outright, or the bad flow solutions could make the Hessian ill-conditioned and cause the optimizer not to converge. The second failure mode was observed in the opti- mization process, with the most common error being a positive directional derivative in the linesearch. Although there is probably not one single reason for this, this could indicate that the problem has constraints which make a sufficient reduction in the objective infeasi- ble. Finally, certain cases failed due to technical issues on the HPC system - although most of these cases were automatically restarted, and therefore this failure mode made up only a small portion of all failed cases. The training, testing, and validation sets were split into 748, 140, and 47 airfoils respec- tively. We display a random subset of the resulting optimized airfoil dataset in Figure 4.1. Each optimization case was ran using 16 cores on a AMD EPYC 7763 processor, and took anywhere from 10-30 minutes to complete, depending on how many iterations were required for convergence, and on the specific performance of the specific CPU node at the 18 time. Batches of cases were submitted to be ran in parallel on an HPC system 1. The entire set of simulations could take several days, and a few thousand service hours (∼4k), to complete. This includes the time required to submit and initialize jobs (in their virtual environments) and to post-process the results. We acknowledge some of the scaling issues could be greatly improved upon in the future. 4.1 Optimization Results 0.0 0.2 0.4 0.6 0.8 1.0 x (m) 0.0 0.1 y (m ) Initial Optimized Figure 4.1: Examples of Initial and Optimized Airfoils Of note is the relatively slight change in shape from the initial design. This can be explained by the proportionately small deviation constraint (±0.025 m) from the initial FFD points we enforced on the optimizer. In testing, we found that making the range much larger made it difficult for the adjoint optimizer to converge consistently across the relatively wide range of flow parameters we sampled from. Techniques such as FFD point adaption, as described in [14], may be required for more dramatic changes to the design space. Figure 4.2 presents more detailed results for the four aforementioned random samples in our dataset. In addition, Figure 4.3 shows the resulting pressure contours for the final, optimized designs. We note these samples display design characteristics one might expect for the given design parameters we used. For example, Figure 4.2 (b), corresponds to an 1Zaratan Cluster; https://hpcc.umd.edu/hpcc/zaratan.html 19 optimized design with a relatively sharp, “diamond-shaped”, leading-edge profile typical of high Mach number airfoil designs, as such designs can better take advantage of the shock-expansion patterns relevant to transonic regimes [8]. M∞ =0.40, Cl = 1.15, Re = 1.5×106,( A Ainit ) min = 0.92 M∞ =0.73, Cl = 1.15, Re = 1.3×106,( A Ainit ) min = 0.90 M∞ =0.47, Cl = 0.59, Re = 9.8×106,( A Ainit ) min = 0.78 M∞ =0.64, Cl = 1.11, Re = 2.2×106,( A Ainit ) min = 0.99 0.0 0.2 0.4 0.6 0.8 1.0 x (m) −0.1 0.0 0.1 0.2 y (m ) (a) αfinal = 4.63◦ (b) αfinal = 2.95◦ (c) αfinal = 2.24◦ (d) αfinal = 3.79◦ Figure 4.2: Four Random Samples: Optimized Airfoils 20 (a) Sample a (b) Sample b (c) Sample c (d) Sample d Figure 4.3: Four Random Samples: Cp Contours Darker shades correspond to higher pressure differences relative to free stream conditions. Red and blue shades indicate higher and lower relative pressures respectively. Finally, we present the optimization trajectory for the four random designs in Figure 4.4. 21 (a) Iteration 0 (0%), α = 2.50◦ Iteration 13 (25%), α = 4.11◦ Iteration 26 (50%), α = 4.31◦ Iteration 40 (75%), α = 4.63◦ Iteration 53 (100%), α = 4.63◦ (b) Iteration 0, α = 2.50◦ Iteration 25, α = 2.22◦ Iteration 49, α = 2.70◦ Iteration 72, α = 2.95◦ Iteration 93, α = 2.95◦ (c) Iteration 0, α = 2.50◦ Iteration 10, α = 2.05◦ Iteration 21, α = 1.89◦ Iteration 32, α = 2.25◦ Iteration 42, α = 2.24◦ (d) Iteration 0, α = 2.50◦ Iteration 38, α = 2.60◦ Iteration 76, α = 3.81◦ Iteration 113, α = 3.79◦ Iteration 149, α = 3.79◦ Figure 4.4: Four Random Samples: Optimization Trajectories We make note of the fact that, beyond 25% of the optimization trajectory, the design stays relatively the same, with the latter steps only introducing marginal changes with a correspondingly smaller effect on Cd . 4.2 Data Analysis To interpret all of the variable trajectory-length optimization data, we linearly inter- polated the trajectory data to 1000 samples for all key objective, constraint, and design variables. Figure 4.5 shows the aggregate mean and standard deviation of the difference from the optimal objective, C∗ d , in drag counts (1 drag count equals 0.0001). As was the case visually in Fig. 4.4, Fig. 4.5 confirms that the change in optimal Cd is most affected by changes occurring in the first 20-25% of the design trajectory. We note that all the figures in this section display aggregate quantities computed across all 748 samples in the training set. In this section, σ will always refer to one standard deviation for a normal distribution centered at the mean value at each optimization iteration. Additionally, we direct the reader to appendix B for an example of an individual optimization trajectory for more context. 22 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration −50 0 50 100 150 200 250 300 |C d − C ∗ d| µ σ Figure 4.5: Aggregate Cd Convergence The design variable optimization trajectories present a more or less linear decrease in the average per-iteration change for the majority of the trajectory, as can be seen for both angle of attack and in the summation of over the FFD y-coordinates in Figure 4.6. Where ∆Yn+1 −∆Yn = 1 N f f d ∑ N f f d i |∆yi n+1 −∆yi n| . Note that this is the absolute change from the initial FFD cage, and is not the same as the change in coordinate value, which would produce a mean around zero due to enforcing symmetry in the FFD point movement (see 4.8). In this sense, a linear rate of change is reasonable, as we are not measuring any form of cumulative statistic. 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration 0.0 0.5 1.0 1.5 2.0 2.5 |α n + 1 − α n | µ σ (a) Angle of Attack Step Size Convergence 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration 0.000 0.005 0.010 0.015 0.020 0.025 ∆ Y n + 1 − ∆ Y n µ σ (b) FFD Point Sum Step Size Convergence Figure 4.6: Aggregate Design Variable Step Size Convergence Next, we present the aggregate constraint satisfaction measures for Cl , minimum area fraction, thickness constraints, and blunted thickness constraints in Figure 4.7. 23 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration −0.6 −0.4 −0.2 0.0 0.2 C l − C co n l µ σ (a) Cl Error 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration 1.00 1.05 1.10 1.15 1.20 1.25 A A in it / ( A A in it ) m in µ σ (b) Area Fraction 2.9 3.0 3.1 Constraint Bounds 0.8 0.9 1.0 1.1 ( t t i n it ) µ σ 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration 0.14 0.16 (c) Thickness Fraction 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration 0.90 0.95 1.00 1.05 1.10 1.15 1.20 ( t t i n it ) T E / ( t t i n it ) T E ,c on µ σ (d) Blunted Thickness Fraction Figure 4.7: Aggregate Constraint Convergence All constraints appear to decrease smoothly in the aggregate, and no abnormal behavior was observed. Additionally, we observed the thickness constraints were never violated in any of the optimization cases we ran. We also investigated how individual FFD points change per iteration, and whether they followed any sort of statistical pattern. We plotted the per iteration change in each FFD point against each other interactively as a function of optimization iteration. A static version of these results is presented in Figure 4.8. 24 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration −0.02 −0.01 0.00 0.01 0.02 ∆ y (i ) n + 1 − ∆ y (i ) n 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration 0.000 0.005 0.010 0.015 0.020 0.025 σ (i ) : ∆ y (i ) n + 1 − ∆ y (i ) n ∆y(0) ∆y(15) −0.10 −0.05 0.00 0.05 0.10 ∆y(0) −0.04 −0.02 0.00 0.02 0.04 ∆ y (1 5 ) Samples (0,0) max(δ∆y (i) n ) e(Σ (0) 200, Σ (15) 200 ) (a) First and 11th Points; 20% Along Trajectory 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration −0.02 −0.01 0.00 0.01 0.02 ∆ y (i ) n + 1 − ∆ y (i ) n 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration 0.000 0.005 0.010 0.015 0.020 0.025 σ (i ) : ∆ y (i ) n + 1 − ∆ y (i ) n ∆y(0) ∆y(15) −0.10 −0.05 0.00 0.05 0.10 ∆y(0) −0.04 −0.02 0.00 0.02 0.04 ∆ y (1 5 ) Samples (0,0) max(δ∆y (i) n ) e(Σ (0) 700, Σ (15) 700 ) (b) First and 10th Points; 70% Along Trajectory 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration −0.02 −0.01 0.00 0.01 0.02 ∆ y (i ) n + 1 − ∆ y (i ) n 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration 0.000 0.005 0.010 0.015 0.020 0.025 σ (i ) : ∆ y (i ) n + 1 − ∆ y (i ) n ∆y(0) ∆y(10) −0.10 −0.05 0.00 0.05 0.10 ∆y(0) −0.04 −0.02 0.00 0.02 0.04 ∆ y (1 0 ) Samples (0,0) max(δ∆y (i) n ) e(Σ (0) 200, Σ (10) 200 ) (c) Leading Edge Points; 20% Along Trajectory 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration −0.02 −0.01 0.00 0.01 0.02 ∆ y (i ) n + 1 − ∆ y (i ) n 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration 0.000 0.005 0.010 0.015 0.020 0.025 σ (i ) : ∆ y (i ) n + 1 − ∆ y (i ) n ∆y(19) ∆y(9) −0.10 −0.05 0.00 0.05 0.10 ∆y(9) −0.04 −0.02 0.00 0.02 0.04 ∆ y (1 9 ) Samples (0,0) max(δ∆y (i) n ) e(Σ (9) 200, Σ (19) 200 ) (d) Trailing Edge Points; 20% Along Trajectory Figure 4.8: Per-FFD Point Variation as a Function of Iteration A significant takeaway from Figure 4.8 is that the relative change in FFD points appears to follow a relatively Gaussian-shaped distribution with a linearly decreasing variance and 25 a mean centered around zero. The difference between plots (a) and (b) in Figure 4.8 demon- strates how this Gaussian linearly shrinks as a function of trajectory. Finally, for valida- tion purposes, we plotted the pairs of leading and trailing edge points (Figure 4.8 (c), (d)) which, as expected, follow a direct linear relationship, verifying the no-twist conditions we imposed. Lastly, for reference, we present in more detail the standard deviations of the point changes for all 20 individual FFD points in Figure 4.9. 0.0 0.2 0.4 0.6 0.8 1.0 Non-Dimensional Iteration 0.0000 0.0025 0.0050 0.0075 0.0100 0.0125 0.0150 0.0175 0.0200 σ (i ) : ∆ y (i ) n + 1 − ∆ y (i ) n LE lower TE upper Figure 4.9: Standard Deviations in Variation for All 20 FFD Points Based on these results, it appears the upper surface experiences slightly less variation in FFD point position than the lower surface—however, this difference is not major. We also note how variation in the trailing edge is extremely limited due to the blunted trailing edge thickness constraint we imposed. 26 Chapter 5: Generative Model Methodology This chapter will describe the implementation of both models tested, our approach to running the warm start experiment, and finally our exploratory DDM inference efficiency study. 5.1 CEBGAN Modifications In our work, we keep the majority of the CEBGAN model the same as it was originally presented. However, we do make a change to the architecture in order to accept initial airfoil design, x0, inputs. To do this we apply an auxiliary feedforward network which accepts the flattened x0 input and outputs a set of latent variables which can be concatenated onto the inputs to the airfoil and angle of attack generators. The final model used has around 8 million parameters, and was trained with the same settings as specified in the original CEBGAN paper [2]. 5.2 Aero-DDM Architecture Note: This section is a modified version of the author’s original work as published in [6]. 27 Figure 5.1: U-Net Architecture Our model, Aero-DDM, takes on a standard U-Net architecture, which is commonly used in DDM implementations [15]. Our implementation differs in a few key aspects from the more standard image models. Most importantly, the model was trained in a latent space representing Bezier curve control points and weights (cx, cy, w), rather than in the raw coordinate space, following Chen et al. [4]. This latent space was achieved by pre-training an autoencoder on the training data. In this case, the training data also included all of the airfoils on the optimization trajectories. We use 32 Bezier control points, with 2 of the 32 control points being fixed at the trailing edge. Architecturally, 1-D convolutions with periodic padding were used in place of 2-D convolutions (which are better suited to pixel representations) to match the airfoil coordinate space. Conditioning on the flow parameters, (M, Re, Cl), the minimum area constraint, (Amin), and the initial airfoil design (x0) was handled by a simple concatenation onto the latent space. Additionally, the scalar 28 noise variable for angle of attack, (εα ), was also repeated across the data dimension and concatenated in the same way. We note that since our work in [6], we have modified Aero- DDM to take the time-embedding, temb, (an embedding of t in the diffusion schedule) and concatenate it to the conditional inputs prior to passing it through a feed forward network, φ , which outputs a latent embedding, tlatent : φ(temb,c) = tlatent . The model used was relatively small, with less than 500,000 trainable parameters, not including the pre-trained auto-encoder. Training wall time is about 10 minutes on an NVIDIA A1000 GPU, and sampling time was about 7s for the entire training set. This com- putational efficiency in sampling is on par with other generative models, such as GANs, implemented for similar generative airfoil design problems, but much better than these models in terms of training time required [2]. This output is then added prior to each ResNet concatenation in the U-Net. We found that doing this, along with reducing the learning rate on plateaus led to modest improve- ments in statistical benchmarks. 5.3 Bezier Autoencoder Validation All models presented in this thesis make use of the same latent Bezier encoding of the airfoil coordinate space. In addition, the encoder used is a pre-trained model whose parameters are not changed during the training process of any of the generative models tested. As a result, the upper bound on model performance is decided by how accurately the original data can be reconstructed from the latent space. Therefore, it was important to validate the Bezier autoencoder’s performance. In Figure 5.2 we present the worst 6 reconstructions on the entire validation dataset (N=140) of optimized designs. 29 Figure 5.2: Top 6 Highest MSE Bezier Reconstructions in Validation Set For the optimized dataset, the mean and standard deviation of the reconstruction error were 1.56e-05 and 1.29e-05 respectively. Airfoils shown in Figure 5.2 display the upper end of the variation in reconstruction error. Considering the designs in Figure 5.2 are the highest error cases, yet still visually match ground truth designs, we can be relatively con- fident in the robustness of the autoencoder’s reconstructions. In a sense, the mean recon- struction error of 1.56E-05 represents a loose upper bound on the MSE of any model trained using the encoded control points and weights produced by the Bezier autoencoder model. In other words, all things being equal (and assuming no design space multi-modality), a theoretical generative model with the ability to perfectly predict optimal airfoil designs should expect a maximum achievable MSE between generated designs and the raw opti- mal airfoil coordinate data on the order of 1.56e-05. Figure 5.3 shows the distribution of reconstruction error across the training and validation datasets. 30 Figure 5.3: Distribution of MSE for Bezier Reconstructions Solid blue and red lines represent Gaussian kernel density estimates for the training and validation sets respectively. The shaded regions are histogram bins. The distribution in Figure 5.3 shows a slightly skewed error distribution, with a tail that extends to just less than 1E-4. In addition, the validation data error distribution follows a very similar distribution to the training set, which indicates that overfitting is most likely not an issue. We also note, that to avoid any accidental biases, we only made changes to the Bezier model based on validation data, and reserved the test data for the final diffusion and CEB- GAN model comparisons. Overall, the majority of airfoils in the dataset appear to be able to be encoded into the Bezier space with little loss in quality upon reconstruction. 5.4 Diffusion Parameters and Scheduling We adopted a linear variance (β ) schedule for the latent Bezier points ranging from 1E-4 to 2E-2, with a total of 1000 steps where the 1st step represents the lowest amount 31 of noise, and 1000 is the highest. This range was proposed by Ho et al. in their original diffusion paper [15]. A sample noised by the forward diffusion schedule for Aero-DDM is shown in Figure 5.4. Note we show latent Bezier control points (colored by weight) in the top row and the raw airfoil coordinates in the bottom row. For α we chose a different schedule range of 1E-4 to 1E-1, as we determined higher levels of noise earlier in the schedule were necessary to avoid over-fitting, wherein the model would produce biased α values closer to the statistical mean of the dataset. This is possibly because early on in the schedule, α has little correlation to the actual airfoil shape, and so the loss can be minimized by simply predicting the statistical mean α value if not noised sufficiently—although further investigation may be required to fully validate this hypothesis. Figure 5.4: Forward Diffusion Schedule 5.5 Benchmarking and Warm Start Optimization All models in our study are benchmarked on the test dataset (N=47), and generated 5 passes over this dataset for each model tested (N=235 airfoils generated and benchmarked per model variation). The only exception to this is the set of adjoint optimizations we ran using the original optimized designs as a starting point (discussed below), where we only ran (N=47) airfoils, since the adjoint optimizer is more or less deterministic in the designs generated given the same input conditions. Prior to evaluating any design, we interpolate the airfoil coordinates according to curvature to increase mesh quality and accuracy of 32 integrated quantities. This setup is the same for both the CFD and warm start benchmarks which we describe next. Warm starting refers to the process by which a design is used as the initial state in an optimization process with the intent to accelerate convergence toward an optima. In our study we measure the warm start performance of our Aero-DDM (and its variants) and the CEBGAN model. We also run adjoint optimization cases starting from the original test set of optimized designs, x∗, to set a loose upper bound on the maximum expected perfor- mance. As opposed to the random UIUC designs, x0, used to collect the data, x∗ should be the set of designs closest to the optima (or a local optima, if uni-modal), and therefore should also give the optimizer the largest warm-start efficiency advantage. Although start- ing the optimization with x∗ may be expected to lead to the same x∗ optima, in practice the results may vary slightly if the region around the optima is relatively flat, or the case is multimodal. As a result, comparisons to the optimization from x∗ should partially control for some of this behavior. Important to note here, is that clearly x∗ is a-priori knowledge of the optimal design, and thus this method is not computationally practical - it was only included as a point of comparison to the upper bound. 5.6 DDM Inference Efficiency Study Throughout this thesis we run a study that attempts to determine whether reduced in- ference time, as measured by a reduction in diffusion sampling steps, can be obtained simply by changing the starting distribution from which the inference process is initialized. We tested variations on the baseline Aero-DDM model intialized from 3 different starting distributions; (1) pure Gaussian noise, N (0, I), as is also used to initialize the baseline Aero-DDM model, (2) CEBGAN generated designs, (3) x0, the initial airfoil designs used to generate the optimized test dataset. Here x0 is the same initial (UIUC) airfoil design that 33 the adjoint solver used to start the optimization process. In We conducted an ablation study measuring how many diffusion steps are required for reasonable distributional matching, as measured by airfoil coordinate MMD. The MMD ablation study was ran for 25 time step values between 1 and 350 steps, with each MMD evaluation involving the generation of 10 passes over the test dataset. 34 Chapter 6: Model Results In this section, we present and discuss the following: 1. Qualitative comparisons of CEBGAN and Aero-DDM sample generations. 2. Statistical measures of how well the CEBGAN and Aero-DDM generated designs match the distributions of the dataset. 3. An inference efficiency ablation study, with discussion on the statistical performance of different initializations of the Aero-DDM inference process as a function of step number. 4. Performance of designs generated by CEBGAN, Aero-DDM, and the most promising variant from the inference efficiency ablation study in two areas: (a) CFD-derived aerodynamic performance. (b) Warm-start efficiency as measured by cumulative optimality-gap metrics. 6.1 Sample Generations Input parameters from the held-out test set were used to statistically validate generations produced by each model. A grid of sample generations for Aero-DDM and the modified CEBGAN model is presented in Figure 6.2, along with the corresponding ground truth test data. 35 Figure 6.1: Sample Ground Truth Airfoils (a) CEBGAN (b) Aero-DDM Figure 6.2: Sample Generations Lx = Airfoil MSE 36 Both Aero-DDM and the CEBGAN model produced visually accurate generations. Even so, there are noticeable differences, particularly between for irregular airfoil shapes, whereby the CEBGAN performs worse than Aero-DDM, such as with the design in the lower left corner in Figures 6.2a and 6.2b. The other main observation is that the airfoils generated by CEBGAN also appear to have worse angle of attack predictions compared to Aero-DDM. 6.2 Statistical Validation Model performance, as measured by MSE, MMD, and area constraint satisfaction are collated in Table 6.1. Table 6.1: Statistical Benchmarks Note: values to the right of the ± symbol indicate a 95 % confidence interval. MMD (Airfoil) MMD (α) MSE (Airfoil) MSE (α) Area Constraint MSE Aero-DDM 0.063 ± 4.3E-04 0.060 ± 0.0028 1.55E-05 ± 4.7E-07 0.067 ± 0.01 2.34E-06 ± 4.1E-07 CEBGAN [2] 0.072 ± 3.4E-05 0.106 ± 0.0010 2.16E-05 ± 4.2E-08 0.281 ± 0.003 2.88E-06 ± 5.5E-08 The results in Table 6.1 appear to confirm the qualitative differences observed in Fig- ure 6.2, with Aero-DDM having marginally better performance in MMD, and modestly better performance in angle of attack. The observed difference in angle of attack may have more to do with the fact that the CEBGAN architecture uses a separate network to generate angle of attack predictions that is not internally dependent on the airfoil-generating portion of the network [2]. In contrast, the angle of attack predictions produced by Aero-DDM are coupled to the airfoil latent coordinates prediction, with the result being the average of one of the convolution channel outputs. Another observation is that the Area Constraint MSE is relatively similar for both mod- els, and in general performance on the order of 1E-6 is a strong indicator that both models 37 have learned to generate designs which respect this constraint either through the explicit Amin input, or implicitly through x0. An promising avenue for future work could be the study of how access to Amin or x0 affects area constraint satisfaction. 6.3 DDM Inference Ablation Study After statistically benchmarking the CEBGAN and Aero-DDM, we conducted an abla- tion study on airfoil MMD values for the three different intialized variants of Aero-DDM as described earlier. We briefly address these results here, as they explain our focus on the x0 initialized Aero-DDM variant in all subsequent performance benchmarks. Figure 6.3: Inference Steps Ablation Study Figure 6.3 displays MMD as a function of starting time-step for each scenario. Lower values of Tstart indicate less denoising steps (i.e more efficient inference). In addition, lower values also mean a lower assumed starting variance β . This is a key point, as lower starting variances mean less cumulative change can be effected on the initial design by the model. 38 An example of the effect of Tstart on design quality can be found in appendixC. Results from Figure 6.3 indicate that all initialization approaches tend to converge on the original Aero-DDM MMD value, and that the difference is negligible after inferences with 350 steps or more. This is an interesting result for the N (0,I) started model, as it appears even an unadulterated version of Aero-DDM that works with pure noise can achieve the same level of accuracy as the base model in only 350 steps. The most interesting result however, is that the x0 initialized variant produces a similar MMD to the base model in only 50 steps. This represents a 95 % reduction in steps needed compared to the original 1000 step, N (0,I) started inference process, but considering the N (0,I) started approach also achieved a similar MMD result at 350 steps, a fairer estimate would be a decrease of ∼ 86%. In addition, the x0 case appears to obtain lower MMD values in less steps than even the CEBGAN initialized version of Aero-DDM. This is surprising because the airfoils produced by the CEBGAN share a much higher distributional similarity to the ground-truth optimal airfoils compared to the initial (x0) designs. Figure 6.4 presents some of the different initializations of the Aero-DDM inference process at Tstart = 50. 39 (a) N (0,I) Initialized (b) CEBGAN Initialized (c) x0 Initialized Figure 6.4: Sample Generations for Different Aero-DDM Initializations, Tstart = 50 Lx = Airfoil MSE While the N (0,I) initialized generations in Figure 6.4a are clearly inferior to the other intializations in terms of quality, some more interesting observations can be made about the other two cases. Firstly, while the CEBGAN initialized samples appear to mimic the generations of the original CEBGAN in Figure 6.2a as expected, there also appears to be a slight improvement in angle of attack. What is more surprising, however, is that the 40 angle of attack predictions by the x0 initialized Aero-DDM actually demonstrate a modest improvement compared to even the baseline Aero-DDM (see Figure 6.2b, Tstart = 1000) across all but one sample. Since the predicted airfoil shapes of the x0 initialized Aero- DDM are mostly the same as the baseline, it is reasonable to conclude that any differences in performance observed can be attributed to this angle of attack prediction improvement. We leave a more rigorous study of the mechanism behind this difference as future work. The impact of these results caused us to narrow our focus on the specific case of the x0 initialized Aero-DDM model, which appears to be the most efficient and best performing variation. Throughout the rest of the results, we will present performance benchmarks for the x0 initialized variant for both Tstart = 50 and 100 steps. 6.4 Aerodynamic Performance Benchmarks In addition to our statistical analysis, we benchmarked the generated designs using the same ADflow RANS CFD solver and meshing procedure used to create the dataset. These results are collated in Table 6.2. In general, we observed that most benchmarked cases succeeded, with 233 successes for the CEBGAN model and 211 for Aero-DDM. However, as some of these failures were related to technical issues on the HPC system used, rather than as a result of solver convergence issues, it is difficult to make any conclusions based on failure rate alone. We also note some variation in our results as compared to our previous work in [6], which can be attributed to using a different subset of the test set. In that study, a higher proportion of simulations failed and therefore had to be pruned from the test set as well in order to keep our comparisons to the same subset of airfoils. In addition to the benchmarked models, Table 6.2 includes performance measures of the unoptimized airfoils from our original dataset as a point of reference. 41 Table 6.2: Aerodynamic Performance Benchmarks Note: values to the right of the ± symbol indicate a 95 % confidence interval. |Cd −C∗ d | (Counts) |Cl −Ccon l | IOG ( Cl Cd ) Aero-DDM 9.1 ± 2.6 0.0292 ± 0.0039 5.0E-4 ± 1.0E-4 CEBGAN [2] 18.4 ± 5.2 0.0509 ± 0.0064 6.8E-4 ± 1.4E-4 Aero-DDM (x0;T = 50) 9.1 ± 2.4 0.0260 ± 0.0031 5.4E-4 ± 1.1E-4 Aero-DDM (x0;T = 100) 8.3 ± 2.2 0.0287 ± 0.0040 5.1E-4 ± 1.0E-4 Unoptimized 37.3 ± 21.3 0.2740 ± 0.0570 23.2E-4 ± 4.5E-4 Results reported in Table 6.2 indicate that compared to CEBGAN, Aero-DDM produces a ∼51% lower optimal Cd gap, ∼43% lower Cl constraint satisfaction and ∼26% lower in- stantaneous optimality gap (IOG) for Cl Cd . All models benchmarked exhibit an improvement over the unoptimized data. Taking into consideration the models used in the inference efficiency study, we note that the 50-step x0 initialized Aero-DDM produces a greater reduction in Cd gap and a similar IOG to that of the baseline Aero-DDM. There appears to be little difference between the 50 and 100-step variants of the x0 initialized approach across all metrics. In addition, none of these results indicate significant performance degradation costs associated with the x0 approaches. 6.5 Warm Start Performance Figure 6.5a presents the COG and RiCOG optimality gap metrics as a function of non-dimensionalized iteration. We have also included comparisons to the original adjoint dataset (Adjoint x0 start), as well as the results obtained from running the adjoint solver 42 continuing from the optimal airfoil designs from the original adjoint data (Adjoint x∗ start). (a) Cumulative Optimality Gap (b) Reduction in Cumulative Optimality Gap Figure 6.5: Warm Start Performance Before analyzing the relative performance of the models, we can make some comments on the results of the study as a whole. The first observation is that all models tested—as well as the adjoint continuation cases—experience a sharp decrease in RiCOG (Figure 6.5b) at the beginning of the optimization trajectory. This is most likely due in part to properties of the SLSQP solver that cause it to test the bounds of the constrained optimization problem to guarantee some level of global convergence [19]. In addition, it is important to note 43 that in all warm start cases tested, including the adjoint optimal start case, the final design often produced designs with slightly improved performance over the original adjoint op- timized designs. This was also noticed by Chen et al. in [2], and may indicate that the optimal designs in the original dataset are local optima, and that the act of re-running the optimization allowed for the solver to fine-tune the result within a tighter radius of conver- gence—although this hypothesis needs to be tested in future work. Even so, our inclusion of the adjoint optimized results starting from the optimal airfoil designs should bound the uncertainty related to the local optima issue. Table 6.3 reports the optimality gap metric results from our warm start experiment. Both the COG and RiCOG values listed are the final values at the end of the optimization trajectory. Table 6.3: Warm Start Performance Benchmarks Note: values to the right of the ± symbol indicate a 95 % confidence interval. COG ( Cl Cd ) N RiCOG ( Cl Cd ) N (%) Aero-DDM 4.20 ± 0.66 53.60 ± 6.90 CEBGAN [2] 4.96 ± 0.76 44.84 ± 4.77 Aero-DDM (x0; T=50) 3.62 ± 0.49 59.51 ± 8.87 Aero-DDM (x0; T=100) 3.82 ± 0.53 57.33 ± 8.59 Adjoint (x∗ Start) 3.71 ± 1.14 59.14 ± 18.67 Adjoint (x0 Start) 9.03 ± 2.07 0 (Defines RiCOG) Consistent with our expectations from the CFD benchmark performance, we observe that Aero-DDM produces a lower COG and higher RiCOG than the CEBGAN model. Both of these results imply less wasted optimizer effort when Aero-DDM generations are used 44 in warm start scenarios. Even so, the difference between the two models is relatively subtle (∼20 % for RiCOG, ∼15 % for COG), compared to the difference in IOG. This may indi- cate that the optimizer cannot take full advantage of an airfoil close to the optimal design, or that the airfoil designs produced by the Aero-DDM may have qualities (such as non- optimal Cl satisfaction) that require additional effort to resolve. One of the more surprising results, is that the x0 initialized, T=50 sample efficient Aero-DDM actually shows modest improvement in both RiCOG and COG over the original Aero-DDM. The reasoning behind this is unclear, and is an area for future work, but the result is consistent as a function of Tstart , with the T=100 variant also displaying improvement over the original Aero-DDM. Finally, we note that, as expected, the adjoint x∗ continuation case shows, with exception of the T=50 x0 initialized Aero-DDM, the lowest and highest COG and RiCOG values. Although the T=50 x0 initialized Aero-DDM performs better, we also note the higher un- certainty bounds on the adjoint x∗ continuation case which could easily wipe out this gap, considering the gap in mean value between the two is already relatively small. 45 Chapter 7: Conclusions In this thesis, we have presented several key contributions which will help inform the current use, and future study, of generative models for aerodynamic design purposes. First, we have presented a novel dataset of 935 optimized airfoils obtained from a diversely sam- pled design space. Second, we demonstrated success using our latent denoising diffusion model, Aero-DDM, which features a custom U-Net architecture and simple conditioning on the initial airfoil design and flow parameters. Next, we compared Aero-DDMs perfor- mance to an optimal-transport based GAN model, CEBGAN, and although both models produced reasonably performant designs, we observe that Aero-DDM produces designs which better match the data-distribution, and which produce lower drag values in aggre- gate based on aerodynamic performance benchmarks. We have tested the ability of both models to accelerate convergence of an adjoint solver, when the generated designs are used to warm-start optimizations. Based on these warm start results, we also observe that Aero-DDM produces a higher reduction in cumulative optimality gap, and is therefore is the more efficient choice for these scenarios—although we acknowledge this difference is not major. Finally, considering the high inference cost associated with diffusion models compared to GANs, we experimented with accelerating sampling by having Aero-DDM begin the sampling process with a distribution much closer than pure noise to the opti- mal data distribution. The results of this experiment appear to demonstrate that by simply starting the inference process from the initial design distribution, sampling efficiency can be improved by ∼86% compared to an optimized version of the baseline inference pro- 46 cess, without meaningful degradation in performance. In addition the designs generated through this approach appear to have better warm-start performance than even the baseline Aero-DDM model. 7.1 Future Work This section presents improvements to the methodology in this thesis, as well as various directions for future studies. Here we summarize future work before describing our sugges- tions in more detail. First, improvements can be made to data collection process, including validating mesh quality through a refinement study and comparing the results for different optimizers. In addition, changes to the optimization problem, such as increasing FFD de- formation allowances and modifying constraints, could be made to tackle more complex design problems. Next, investigations into model inputs, such as the current inclusion of the initial design, and the effect of the Bezier latent space could provide useful insight into the performance and generalizability of Aero-DDM. Additionally, measurements of out- of-distribution performance of the models on a larger flight window could be done to test the robustness of the generated designs. Finally, an exploration into the mechanism be- hind the improved sample efficiency we observed when using the initial design distribution would help further optimize model performance, and may provide insight into how widely applicable the method is. In general there are many ways the quality of the dataset could be improved upon if the collection process were to be repeated. Firstly, although we believe the optimized air- foils we generated were of high quality, and the setup we used was sufficient in terms of replicating the general trends in optimal airfoil shape as a function of flow parameters, there would be a great benefit to conducting a rigorous mesh-refinement study at a variety of flow conditions and initializations. This could take the form of a h-refinement (grid-spacing re- 47 finement) volume mesh coarsening study, with the results aggregated across a sparse subset of the design space. Secondly, the SLSQP optimizer we used may not be the best choice for measuring warm start performance, given the global convergence properties it attempts to ensure, which may inadvertently wash out some of the benefits of starting closer to the optimal design. In addition, other nonlinear optimizers, such as SNOPT exhibit much smoother, stable, optimization trajectories, which may make analysis of these trajectories much easier [10]. Finally, future work should explore changes to the optimization problem itself, such as increasing the amount of deformation (∆yi) the optimizer is allowed to make to the FFD cage. This may lead to more multi-modal behaviour and represents a more dif- ficult, realistic, design problem. To do this, robustness improving techniques such as FFD adaptation may have to be used to avoid convergence related failures [14]. Additionally, changing Cl to an inequality constraint from an equality constraint would help make Cl Cd a much more reliable indicator of performance. Certain changes to the model inputs themselves could be interesting explore. For ex- ample, removing x0, the initial design, from the set of inputs could reveal how much of an impact it has on the generated design. A high dependency on x0, may also prompt fur- ther study into what degree the dataset generated by the adjoint optimizer itself is multi modal depending on initial conditions. Secondly, further investigations into the Bezier la- tent space could be conducted. Such investigations may explore the minimum number of control points needed to produce accurate reconstructions, which would be important to un- derstand if the latent model were to be extended to a higher-dimensional problem. It may also be possible to explore how to cluster the control points to create better reconstructions of sharper leading edge profiles, as the current autoencoder implementation struggles with this. Future work could also explore alternatives to Bezier representations. In particular, parameterization using Class-Shape Transformations (CST) may be of interest, since CST design variables may be more interpretable, exclude many infeasible designs, and provide 48 a better leading edge representation. Apart from studying the latent coordinate represen- tation, a sample efficiency study of the model could be undertaken to measure the effect of the number of training samples on generation quality, Such a study could reveal which model deals better in situations with limited data. In this thesis, we used test data that was still inside the LHS sampled flight window. A future study could gather data outside this range and use it to measure model robust- ness. Another approach could be to set the initial design, minimum area constraint, and Cl to a constant value, and see how well the model extrapolates in different Mach and Reynolds regimes, as compared to the adjoint-optimized designs. Based on preliminary results, which are not presented in this thesis, sonic and supersonic performance may be the most difficult regions to extrapolate to, and therefore could be a good starting point for any such future study. Finally, while the inference study we presented led to surprising results, we did attempt to robustly study the underlying mechanism behind the increase in efficiency we observed. Future work could address this by systematically studying the distributional differences in the designs produced with the x0 initialized approach and the baseline Aero-DDM. An additional, straightforward, approach could be to gather the same CFD and warm-start benchmarks, but for a greater number and range of Tstart values, instead of the Tstart = 50, 100 used in this work. Specifically, using Tstart values less than 50 and above 100 may help determine for what value an optimally efficient Tstart exists. It would also be useful to observe whether efficiency can be further improved applying one of the many suggested modifications to diffusion sampling algorithms which exist in literature. If the inference efficiency benefits do not compound with changes to the sampling algorithm, then future work could explore which method produces higher efficiency gains, and the scenarios in which the x0 approach may be more or less appropriate. 49 Appendix A: Other Geometric Parameterization Methods Although this thesis focuses on Bezier parameterizations, further research may benefit from experimenting with using other methods. Here we briefly cover some of the alterna- tives. Generally speaking, there is a tradeoff between how accurately these methods can reconstruct a diverse set of airfoils, and the degree to which infeasible designs are excluded through restrictions to the method itself. For example, highly generalizable methods such as singular value decomposition (SVD) have been shown to provide the most efficient cov- erage of the design space [25]. Despite this, SVD also tends to create many infeasible designs with crossover intersections and high curvature [3]. Other methods construct air- foil geometries using combinations of basis functions, which can be modified to enforce designs with properties such as a minimum thickness, smoothness, and continuity. Some examples include Hicks-Henne bump functions, which add parameterized sine functions to an initial airfoil design, and class shape transformations (CSTs), which modify a ’shape function’ composed of Bernstein polynomials. CST stands out in particular as it does not require an initial design (i.e it is a constructive, not a deformative, method), can be used to create smooth shapes with no crossover, and can be modified to create highly accurate reconstructions of the leading edge [25]. Although the constructive Bezier method we use in this thesis can still suffer from some of these issues, the Bezier method is able to capture a very diverse set of airfoil designs with relatively small amounts of coordinate reconstruc- tion loss. We leave a study of whether the feasibility advantages of methods such as CST can outweigh any generalizability issues they may have, if such tradeoffs exist at all. 50 Appendix B: Example Optimization Trajectory Plots for a Single Case Here we present the optimization history of the objective and constraints for a random individual case in the dataset as a point of comparison to the aggregate quantaties presented in chapter 4. The initial and final designs are presented in Figure B.1, and the histories are presented in Figures B.2 and B.3. Figure B.1: Initial and Optimized Airfoil for Example Case From left to right; unoptimized, optimized Figure B.2: Cd Convergence for Example Case 51 (a) Cl Error (b) Area Fraction (c) Thickness Fraction (d) Blunted Thickness Fraction Figure B.3: Constraint Convergence for Example Case 52 Appendix C: Effect of Tstart on Designs For illustrative purposes, Figure C.1 shows an example of the effect Tstart has on the generated design for the x0 initialized method. Figure C.1: Example Effect of Tstart on x0 Initialized Designs Here, the ground truth is the adjoint optimized design. Figure C.1 demonstrates the main concept of how lower values of Tstart collapse the solution to the original initial design, x0, as for these values of Tstart, the model has much less ability to edit the input. 53 Bibliography [1] Gabriel Achour, Woong Je Sung, Olivia J Pinon-Fischer, and Dimitri N Mavris. De- velopment of a conditional generative adversarial network for airfoil shape optimiza- tion. In AIAA Scitech 2020 Forum, page 2261, 2020. [2] Qiuyi Chen, Jun Wang, Phillip Pope, Wei (Wayne) Chen, and Mark Fuge. Inverse Design of Two-Dimensional Airfoils Using Conditional Generative Models and Sur- rogate Log-Likelihoods. Journal of Mechanical Design, 144(2), 12 2021. 021712. [3] Wei Chen, Kevin Chiu, and Mark D Fuge. Airfoil design parameterization and opti- mization using bézier generative adversarial networks. AIAA journal, 58(11):4723– 4735, 2020. [4] Wei Chen, Kevin Chiu, and Mark D. Fuge. Airfoil design parameterization and opti- mization using bézier generative adversarial networks. AIAA Journal, 58(11):4723– 4735, 2020. [5] Prafulla Dhariwal and Alexander Nichol. Diffusion models beat gans on image syn- thesis. Advances in Neural Information Processing Systems, 34:8780–8794, 2021. [6] Cashen Diniz and Mark Fuge. Optimizing diffusion to diffuse optimal designs. In AIAA SCITECH 2024 Forum, page 2013, 2024. [7] Xiaosong Du, Ping He, and Joaquim RRA Martins. A b-spline-based generative adversarial network model for fast interactive airfoil aerodynamic optimization. In AIAA Scitech 2020 Forum, page 2128, 2020. [8] MHF El-Gohary and MYM Ahmed. Investigating the design parameters of a diamond-shaped supersonic airfoil. In International Conference on Aerospace Sci- ences and Aviation Technology, number AEROSPACE SCIENCES & AVIATION TECHNOLOGY, ASAT-15–May 28-30, 2013, pages 1–18. The Military Technical College, 2013. 54 [9] Sayan Ghosh, Govinda A Padmanabha, Cheng Peng, Steven Atkinson, Valeria An- dreoli, Piyush Pandita, Thomas Vandeputte, Nicholas Zabaras, Liping Wang, and Genghis Khan. Pro-ml ideas: A probabilistic framework for explicit inverse design using invertible neural network. In AIAA Scitech 2021 Forum, page 0465, 2021. [10] Philip E Gill, Walter Murray, and Michael A Saunders. Snopt: An sqp algorithm for large-scale constrained optimization. SIAM review, 47(1):99–131, 2005. [11] Arthur Gretton, Karsten M Borgwardt, Malte J Rasch, Bernhard Schölkopf, and Alexander Smola. A kernel two-sample test. The Journal of Machine Learning Re- search, 13(1):723–773, 2012. [12] Hannah M. Hajdik, Anil Yildirim, Neil Wu, Benjamin J. Brelje, Sabet Seraj, Marco Mangano, Joshua L. Anibal, Eirikur Jonsson, Eytan J. Adler, Charles A. Mader, Gae- tan K. W. Kenway, and Joaquim R. R. A. Martins. pyGeo: A geometry package for multidisciplinary design optimization. Journal of Open Source Software, 8(87):5319, 2023. [13] GM Harshvardhan, Mahendra Kumar Gourisaria, Manjusha Pandey, and Sid- dharth Swarup Rautaray. A comprehensive survey and analysis of generative models in machine learning. Computer Science Review, 38:100285, 2020. [14] Xiaolong He, Jichao Li, Charles A. Mader, Anil Yildirim, and Joaquim R. R. A. Mar- tins. Robust aerodynamic shape optimization—from a circle to an airfoil. Aerospace Science and Technology, 87:48–61, April 2019. [15] Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems, 33:6840–6851, 2020. [16] Antony Jameson. Aerodynamic shape optimization using the adjoint method. Lec- tures at the Von Karman Institute, Brussels, 2003. [17] Gaetan K. W. Kenway, Charles A. Mader, Ping He, and Joaquim R. R. A. Martins. Effective adjoint approaches for computational fluid dynamics. Progress in Aerospace Sciences, 110:100542, October 2019. [18] Gaetan K.W. Kenway, Charles A. Mader, Ping He, and Joaquim R.R.A. Martins. Effective adjoint approaches for computational fluid dynamics. Progress in Aerospace Sciences, 110:100542, 2019. [19] Dieter Kraft. A software package for sequential quadratic programming. Forschungsbericht- Deutsche Forschungs- und Versuchsanstalt fur Luft- und Raum- fahrt, 1988. [20] Jichao Li, Mohamed Amine Bouhlel, and Joaquim R. R. A. Martins. Data-based approach for fast airfoil analysis and optimization. AIAA Journal, 57(2):581–596, 2019. 55 [21] Jichao Li, Xiaosong Du, and Joaquim RRA Martins. Machine learning in aerody- namic shape optimization. Progress in Aerospace Sciences, 134:100849, 2022. [22] Wu Li, Steven Krist, and Richard Campbell. Transonic airfoil shape optimization in preliminary design environment. Journal of aircraft, 43(3):639–651, 2006. [23] Calvin Luo. Understanding diffusion models: A unified perspective. arXiv preprint arXiv:2208.11970, 2022. [24] Charles A. Mader, Gaetan K. W. Kenway, Anil Yildirim, and Joaquim R. R. A. Mar- tins. ADflow—an open-source computational fluid dynamics solver for aerodynamic and multidisciplinary optimization. Journal of Aerospace Information Systems, 2020. [25] Dominic A Masters, Nigel J Taylor, TCS Rendall, Christian B Allen, and Daniel J Poole. Geometric comparison of aerofoil shape parameterization methods. AIAA journal, 55(5):1575–1589, 2017. [26] François Mazé and Faez Ahmed. Diffusion models beat gans on topology optimiza- tion. Proceedings of the AAAI Conference on Artificial Intelligence, 2023. [27] Applied Aerodynamics Group Michael Selig, University of Illinois Urbana- Champaign. Uiuc dataset, 2023. data retrieved from , https://m-selig.ae. illinois.edu/ads/coord_database.html. [28] Jeremy Morton, Mykel J. Kochenderfer, and Freddie D. Witherden. Parameter- conditioned sequential generative modeling of fluid flows. AIAA Journal, 59(3):825– 841, 2021. [29] Jitesh H. Panchal, Mark Fuge, Ying Liu, Samy Missoum, and Conrad Tucker. Special Issue: Machine Learning for Engineering Design. Journal of Mechanical Design, 141(11), 10 2019. 110301. [30] Aditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu, and Mark Chen. Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125, 2022. [31] Lyle Regenwetter, Amin Heyrani Nobari, and Faez Ahmed. Deep Generative Models in Engineering Design: A Review. Journal of Mechanical Design, 144(7), 03 2022. 071704. [32] Melissa Rivers and Ashley Dittberner. Experimental investigations of the nasa com- mon research model in the nasa langley national transonic facility and nasa ames 11-ft transonic wind tunnel. In 49th AIAA aerospace sciences meeting including the new horizons forum and aerospace exposition, page 1126, 2011. 56 https://m-selig.ae.illinois.edu/ads/coord_database.html https://m-selig.ae.illinois.edu/ads/coord_database.html [33] Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Om- mer. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10684–10695, 2022. [34] Ney Secco, Gaetan K. W. Kenway, Ping He, Charles A. Mader, and Joaquim R. R. A. Martins. Efficient mesh generation and deformation for aerodynamic shape optimiza- tion. AIAA Journal, 2021. [35] Ney Secco, Gaetan K. W. Kenway, Ping He, Charles A. Mader, and Joaquim R. R. A. Martins. Efficient mesh generation and deformation for aerodynamic shape optimiza- tion. AIAA Journal, 2021. [36] Shaun N Skinner and Hossein Zare-Behtash. State-of-the-art in aerodynamic shape optimisation methods. Applied Soft Computing, 62:933–962, 2018. [37] Neil Wu, Gaetan Kenway, Charles A. Mader, John Jasa, and Joaquim R. R. A. Mar- tins. pyoptsparse: A python framework for large-scale constrained nonlinear opti- mization of sparse systems. Journal of Open Source Software, 5(54):2564, 2020. [38] Anil Yildirim, Gaetan KW Kenway, Charles A Mader, and Joaquim RRA Martins. A jacobian-free approximate newton–krylov startup strategy for rans simulations. Jour- nal of Computational Physics, 397:108741, 2019. [39] Kazuo Yonekura and Katsuyuki Suzuki. Data-driven design exploration method us- ing conditional variational autoencoder for airfoil design. Structural and Multidisci- plinary Optimization, 64(2):613–624, 2021. 57 Foreword Dedication Acknowledgements Introduction Related Work Approaches to Aerodynamic Shape Optimization CEBGAN Bezier Representation Diffusion Model Theory Key Metrics Maximum Mean Discrepancy Area Constraint Mean Squared Error (Area MSE) Instantaneous Optimality Gap (IOG) Cumulative Optimality Gap (COG) Reduction in Cumulative Optimality Gap (RiCOG) Optimization Problem Setup Design Space Problem Formulation Solver Setup Geometry Parameterization and Meshing Dataset Results Optimization Results Data Analysis Generative Model Methodology CEBGAN Modifications Aero-DDM Architecture Bezier Autoencoder Validation Diffusion Parameters and Scheduling Benchmarking and Warm Start Optimization DDM Inference Efficiency Study Model Results Sample Generations Statistical Validation DDM Inference Ablation Study Aerodynamic Performance Benchmarks Warm Start Performance Conclusions Future Work Other Geometric Parameterization Methods Example Optimization Trajectory Plots for a Single Case Effect of Tstart on Designs Bibliography