Research Papers

A Parallel Reanalysis Method Based on Approximate Inverse Matrix for Complex Engineering Problems

[+] Author and Article Information
Hu Wang, Guangyao Li

State Key Laboratory of Advanced Design and Manufacturing for Vehicle Body,
Hunan University,
Changsha, Hunan 410082, China

Enying Li

Collage of Transportation and Logistics,
Central South University of Forestry and Technology,
Changsha, Hunan 410004, China

Contributed by the Design Automation Committee of ASME for publication in the JOURNAL OF MECHANICAL DESIGN. Manuscript received October 17, 2011; final manuscript received April 4, 2013; published online May 30, 2013. Assoc. Editor: Olivier de Weck.

J. Mech. Des 135(8), 081001 (May 30, 2013) (8 pages) Paper No: MD-11-1432; doi: 10.1115/1.4024368 History: Received October 17, 2011; Revised April 04, 2013

The combined approximations (CA) method is an effective reanalysis approach providing high quality results. The CA method is suitable for a wide range of structural optimization problems including linear reanalysis, nonlinear reanalysis and eigenvalue reanalysis. However, with increasing complexity and scale of engineering problems, the efficiency of the CA method might not be guaranteed. A major bottleneck of the CA is how to obtain reduced basis vectors efficiently. Therefore, a modified CA method, based on approximation of the inverse matrix, is suggested. Based on the symmetric successive over-relaxation (SSOR) and compressed sparse row (CSR), the efficiency of CA method is shown to be much improved and corresponding storage space markedly reduced. In order to further improve the efficiency, the suggested strategy is implemented on a graphic processing unit (GPU) platform. To verify the performance of the suggested method, several case studies are undertaken. Compared with the popular serial CA method, the results demonstrate that the suggested GPU-based CA method is an order of magnitude faster for the same level of accuracy.

Copyright © 2013 by ASME
Your Session has timed out. Please sign back in to continue.


Kirsch, U., 2000, “Combined Approximations—A General Reanalysis Approach for Structural Optimization,” Struct. Optim., 20, pp. 97–106. [CrossRef]
Kirsch, U., and Papalambros, P. Y., 2001, “Exact and Accurate Solutions in the Approximate Reanalysis of Structures,” AIAA J., 39, pp. 2198–2205. [CrossRef]
Kirsch, U., 1999, “Efficient-Accurate Reanalysis for Structural Optimization,” AIAA J., 37, pp. 1663–1669. [CrossRef]
Kirsch, U., 1991, “Reduced Basis Approximations of Structural Displacements for Optimal Design,” AIAA J., 29, pp. 1751–1758 [CrossRef]
Kirsch, U., 1993, Structural Optimizations, Fundamentals and Applications, Springer, Berlin, Heidelberg, New York.
Kirsch, U., 1993, “Approximate Reanalysis for Topological Optimization,” Struct. Optim., 6, pp. 143–150. [CrossRef]
Leu, L.-J., and Huang, C.-W., 1998, “A Reduced Basis Method for Geometric Nonlinear Analysis of Structures,” IASS J., 39, pp. 71–75.
Aktas, A., and Moses, F., 1998, “Reduced Basis Eigenvalue Solutions or Damaged Structures,” Mech. Struct. Mach., 26, pp. 63–79. [CrossRef]
Chen, S. H., and Yang, X. W., 2000, “Extended Kirsch Combined Method for Eigenvalue Reanalysis,” AIAA J., 38, pp. 927–930. [CrossRef]
Chen, S. H., and Yang, Z. J., 2004, “A Universal Method for Structural Static Reanalysis of Topological Modifications,” Int. J. Numer. Methods Eng., 61, pp. 673–686. [CrossRef]
Kirsch, U., and Bogomolni, M., 2004, “Procedures for Approximate Eigenproblem Reanalysis of Structures,” Int. J. Numer. Methods Eng., 60, pp. 1969–198. [CrossRef]
Kirsch, U., Bogomolni, M., and Sheinman, I., 2007, “Efficient Dynamic Reanalysis of Structures,” ASCE J. Struct. Eng., 133, pp. 440–448. [CrossRef]
Kirsch, U., Bogomolni, M., and Sheinman, I., 2006, “Nonlinear Dynamic Reanalysis for Structural Optimization,” Comput. Methods Appl. Mech. Eng., 195, pp. 4420–4432. [CrossRef]
Kirsch, U., 2008, Reanalysis of Structures, Springer, Dordrecht.
Kirsch, U., 2010, “Reanalysis and Sensitivity Reanalysis by Combined Approximations,” Struct. Multidiscip. Optim., 40, pp. 1–15. [CrossRef]
Raz, R., 2003, “On the Complexity of Matrix Product,” SIAM J. Comput., 32(5), pp. 1356–1369. [CrossRef]
Unterkircher, A., and Reissner, J., 2005, “Parallel Assembling and Equation Solving via Graph Algorithms With an Application to the FE Simulation of Metal Extrusion Processes,” Comput. Struct., 83(8–9), pp. 627–638. [CrossRef]
Farhat, C., Crivelli, L., and Roux, F. X., 1994, “A Transient FETI Methodology for Large-Scale Parallel Implicit Computations in Structural Mechanics,” 37(11), pp. 1945–1975.
Komatitsch, D., Micha, D., and Erlebacher, G., 2009, “Porting a High-Order Finite-Element Earthquake Modeling Application to NVIDIA Graphics Cards Using CUDA,” J. Parallel Distrib. Comput., 69(5), pp. 451–460. [CrossRef]
Owens, J. D., Houston, M., Luebke, D., Green, S., Stone, J. E., and Phillips, J. C., 2008, “GPU Computing,” Proc. IEEE, 5, pp. 879–899. [CrossRef]
Cecka, C., Lew, A. J., and Darve, E., 2011, “Assembly of Finite Element Methods on Graphic Processors,” Int. J. Numer. Methods Eng., 85, pp. 640–669. [CrossRef]
Cai, Y., Li, G., Wang, H., Zheng, G., and Lin, S., 2012, “Development of Parallel Explicit Finite Element Sheet Forming Simulation System Based on GPU Architecture,” Adv. Eng. Software, 45(1), pp. 370–379. [CrossRef]
Joldes, G. R., Wittek, A., and Miller, K., 2010, “Real-Time Nonlinear Finite Element Computations on GPU—Application to Neurosurgical Simulation,” Comput. Methods Appl. Mech. Eng., 199(49–52), pp. 3305–3314. [CrossRef] [PubMed]
Krakiwsky, S. E., Turner, L. E., and Okoniewski, M. M., 2004, “Acceleration of Finite-Difference Time-Domain (FDTD) Using Graphics Processor Units(GPU),” Proceedings of the IEEE MTT-S International Microwave Symposium, pp. 1033–1036.
Inman, M. J., and Elsherbeni, A. Z., 2005, “Programming Video Cards for Computational Electromagnetics Applications,” IEEE Antennas Propag. Mag., 47, pp. 71–78. [CrossRef]
Golub, G. H., and Van Loan, C. F., 1996, Matrix Computations, 3rd ed., The Johns Hopkins University Press, Baltimore.
Chen, R. S., Yung, E. K. N., Chan, C. H., and Fang, D. G., “Application of SSOR Preconditioned Conjugate Gradient Algorithm to Edge-FEM for 3-Dimensional Full Wave Electromagnetic Boundary Value Problems,” IEEE Trans. Microwave Theory Tech., 50(4), pp. 1165–1172. [CrossRef]
Fletcher, R., and Reeves, C. M., 1964, “Function Minimization by Conjugate Gradients,” Comput. J., 7, pp. 149–154. [CrossRef]
Langou, J., Luszczek, P., Kurzak, J., Buttari, A., and Dongarra, J., 2006, “Exploiting the Performance of 32 bit Floating Point Arithmetic in Obtaining 64 bit Accuracy (Revisiting Iterative Refinement for Linear Systems),” Proceedings of the ACM/IEEE Super Computing.
Bell, N., and Garland, M., 2008, “Efficient Sparse Matrix–Vector Multiplication on CUDA,” Technical Report No. NVR-2008-04, NVIDIA.
Adams, M. F., 2000, “Parallel Multigrid Solvers for 3D Unstructured Finite Element Problems in Large Deformation Elasticity and Plasticity,” Int. J. Numer. Methods Eng., 48, pp. 1241–62. [CrossRef]


Grahic Jump Location
Fig. 1

Arrays A, JA, and IA comprise the CSR representation of original one

Grahic Jump Location
Fig. 2

Block and thread of CUDA

Grahic Jump Location
Fig. 3

Flowchart of the parallel CA on CUDA platform

Grahic Jump Location
Fig. 4

An illustration of communication graph for a simple mesh

Grahic Jump Location
Fig. 5

An illustration of colored elements

Grahic Jump Location
Fig. 7

CAD model of a door inner and corresponding load

Grahic Jump Location
Fig. 8

Mesh models of the initial and changed structure of example II

Grahic Jump Location
Fig. 9

The initial simple CAD model (unit mm) of example III

Grahic Jump Location
Fig. 10

Sizes of the cross-section of vertical beam (unit mm) of example III

Grahic Jump Location
Fig. 11

Initial FE model and constraints enforcement nodes example III

Grahic Jump Location
Fig. 12

Initial solution by the complete analysis of example III

Grahic Jump Location
Fig. 13

The modified simple CAD model (unit mm) of example III

Grahic Jump Location
Fig. 14

Approximation response by the parallel CA method of example III

Grahic Jump Location
Fig. 15

Response of modified structure by the complete method of example III



Some tools below are only available to our subscribers or users with an online account.

Related Content

Customize your page view by dragging and repositioning the boxes below.

Related Journal Articles
Related eBook Content
Topic Collections

Sorry! You do not have access to this content. For assistance or to subscribe, please contact us:

  • TELEPHONE: 1-800-843-2763 (Toll-free in the USA)
  • EMAIL: asmedigitalcollection@asme.org
Sign In