图书馆订阅: Guest
国际多尺度计算工程期刊

每年出版 6 

ISSN 打印: 1543-1649

ISSN 在线: 1940-4352

The Impact Factor measures the average number of citations received in a particular year by papers published in the journal during the two preceding years. 2017 Journal Citation Reports (Clarivate Analytics, 2018) IF: 1.4 To calculate the five year Impact Factor, citations are counted in 2017 to the previous five years and divided by the source items published in the previous five years. 2017 Journal Citation Reports (Clarivate Analytics, 2018) 5-Year IF: 1.3 The Immediacy Index is the average number of times an article is cited in the year it is published. The journal Immediacy Index indicates how quickly articles in a journal are cited. Immediacy Index: 2.2 The Eigenfactor score, developed by Jevin West and Carl Bergstrom at the University of Washington, is a rating of the total importance of a scientific journal. Journals are rated according to the number of incoming citations, with citations from highly ranked journals weighted to make a larger contribution to the eigenfactor than those from poorly ranked journals. Eigenfactor: 0.00034 The Journal Citation Indicator (JCI) is a single measurement of the field-normalized citation impact of journals in the Web of Science Core Collection across disciplines. The key words here are that the metric is normalized and cross-disciplinary. JCI: 0.46 SJR: 0.333 SNIP: 0.606 CiteScore™:: 3.1 H-Index: 31

Indexed in

METHODOLOGY FOR EFFICIENT PERFORMANCE OF MULTISCALE MODELING METHODS IN HETEROGENEOUS HARDWARE INFRASTRUCTURES

卷 17, 册 3, 2019, pp. 297-315
DOI: 10.1615/IntJMultCompEng.2019028552
Get accessGet access

摘要

The paper presents a novel approach dedicated to analysis and optimization of multiscale methods execution on hetero-geneous hardware including processors, graphic cards and other accelerators. The methodology proposed in this paper is based on the investigation, which proved that efficiency of various numerical methods, used for computing purposes on different devices, often behaves unpredictably. This efficiency was analyzed in details and presented in this paper. On the basis of this assumptions two-step optimization procedure was proposed, i.e., before and during calculations, allowing to verify current efficiency and to propose new decomposition of computing domain. Both steps were described and verified in details by using loosely and fully coupled multiscale approaches. Performed validation with discussion of some unconsidered aspects is presented in the paper as well.

参考文献
  1. Abdulle, A., Analysis of a Heterogeneous Multiscale FEM for Problems in Elasticity, Math. Models Methods Appl. Sci, vol. 16, pp. 615-635,2006.

  2. Acosta, A., Corujo, R., Blanco, V., and Almeida, F., Dynamic Load Balancing on Heterogeneous Multicore/MultiGPU Systems, Int. Conf. on High Performance Computing and Simulation (HPCS), Caen, France, pp. 467-476,2010.

  3. Alahmad, B.N. and Gopalakrishnan, S., Energy Efficient Task Partitioning and Real-Time Scheduling on Heterogeneous Multi-processor Platforms with QoS Requirements, Sustainable Comp.: Inform. Syst., vol. 1, pp. 314-328,2011.

  4. Beckman, P., Nadella, S., Trebon, N., and Beschastnikh, I., SPRUCE: A System for Supporting Urgent High-Performance Computing, in Grid-Based Problem Solving Environments, P. Gaffney and J.T. Pool, Eds., Heidelberg/Dordrecht/London/New York: Springer, pp. 295-311,2007.

  5. Belgacem, M.B., Chopard, B., Bpigdorff, J., Mamonski, M., Rycerz, K., and Harezlak, D., Distributed Multiscale Computations Using the MAPPER Framework, Proc. Comput. Sci., vol.18, pp. 1106-1115,2013.

  6. Bernsdorf, J., Berti, G., Chopard, B., Hegewald, J., Krafczyk, M., Wang, D., Lorenz, E., and Hoekstra, A., Towards Distributed Multiscale Simulation of Biological Processes, 7th Int. Conf. on e-Science Workshops (eScienceW), IEEE, Stockholm, Sweden, pp. 89-96,2011.

  7. Borgdorff, J., Falcone, J.-L., Lorenz, E., Bona-Casas, C., Chopard, B., and Hoekstra, A.G., Foundations of Distributed Multiscale Computing: Formalization, Specification, and Analysis, J. Parallel Distrib. Comp, vol. 73, pp. 465-483,2013.

  8. Borgdorff, J., Mamonski, M., Bosak, B., Kurowski, K., Ben Belgacem, M., Chopard, B., Groen, D., Coveney, P. V., and Hoekstra, A.G., Distributed Multiscale Computing with MUSCLE 2, the Multiscale Coupling Library and Environment, J. Comput. Sci, vol. 5, pp. 719-731,2014.

  9. Brodtkorb, A.R., Dyken, C., Hagen, T.R., Hjelmervik, A.M., and Storaasli, O.O., State-of-the-Art in Heterogeneous Computing, Sci. Program., vol. 18, pp. 1-33,2010.

  10. Cabellos, L., Campos, I., Fernandez-del-Castillo, E., Owsiak, M., Palak, B., andPiociennik,M., Scientific Workflow Orchestration Interoperating HTC and HPC Resources, Comput. Phys. Commun., vol. 182, pp. 890-897,2011.

  11. Castaiie, G.G., Nunez, A., Llopis, P., and Carretero, J., E-mc2: A Formal Framework for Energy Modelling in Cloud Computing, Simul. Model. Pract. and Theory, vol. 39, pp. 56-75,2013.

  12. Cole, D.,Letsios, D., Nugent, M., andPruhs, K., Optimal Energy Trade-Off Schedules, 2012Intl. Green Computing Conf. (IGCC), San Jose, CA, USA, pp. 1-10,2012.

  13. Couturier, R. and Domas, S., Sparse Systems Solving on GPUs with GMRES, J. Supercomput., vol. 59, pp. 1504-1516,2012.

  14. Das, S., Modeling Mixed Microstructures Using a Multi-Level Cellular Automata Finite Element Framework, Comput. Mater. Sci., vol. 47, pp. 705-711,2010.

  15. Davies, C.H.J., The Effect of Neighbourhood on the Kinetics ofaCellular AutomatonRecrystallization Model, Scr. Metall. Mater, vol. 33, pp. 1139-1143,1995.

  16. de Borst, R., Challenges in Computational Materials Science: Multiple Scales, Multi-Physics and Evolving Discontinuities, Comput. Mater. Sci., vol. 43, pp. 1-15,2008.

  17. El Halabi, F., Gonzalez, D., Chico, A., and Doblare, M., FE2 Multiscale in Linear Elasticity based on Parametrized Microscale Models Using Proper Generalized Decomposition, Comp. Methods Appl. Mech. Eng., vol. 257, pp. 183-202,2013.

  18. Falcone, J.-L., Chopard, B., and Hoekstra, A., Towards a Multiscale Modeling Language, Proc. Comput. Sci., vol. 1, pp. 819-826, 2010.

  19. Farber,R., CUDA Application Design and Development, 1st Ed., Burlington, MA: Morgan Kaufmann, 2011.

  20. Feyel, F., A Multilevel Finite Element Method (FE2) to Describe the Response of Highly Non-Linear Structures Using Generalized Continua, Comp. Methods Appl. Mech. Eng., vol. 192, pp. 3233-3244,2003.

  21. Feyel, F. and Chaboche, J.-L., Multiscale Approach for Modelling the Elastoviscoplastic Behaviour of Long Fibre SiC/Ti Composite Materials, vol. 183, pp. 309-330,2000.

  22. Gandin, C.A. and Rappaz, M.,A Coupled Finite Element-Cellular Automaton Model for the Prediction of Dendritic Grain Structures in Solidification Processes, Acta Metall. Mater., vol. 42, pp. 2233-2246,1994.

  23. Hamandi, L.L. and Ozguner, F., Review of Domain-Decomposition Methods for the Implementation of FEM on Massively Parallel Computers, AntennasPropag. Mag., IEEE, vol. 37, pp. 93-98,1995.

  24. Kazandjieva, M., Heller, B., Gnawali, O., Levis, T., and Kozyrakis, C., Measuring and Analyzing the Energy Use of Enterprise Computing Systems, Sustainable Compos.: Inform. Syst., vol. 3, pp. 218-229,2013.

  25. Korkhov, V.V., Krzhizhanovskaya, V.V., and Sloot, P.M.A., A Grid-Based Virtual Reactor: Parallel Performance and Adaptive Load Balancing, J. Parallel Distrib. Comput., vol. 68, pp. 596-608,2008.

  26. Lan, Y.J., Li, D.Z., and Li, Y.Y., Modeling Austenite Decomposition into Ferrite at Different Cooling Rate in Low-Carbon Steel with Cellular Automaton Method, Acta Mater, vol. 52, pp. 1721-1729,2004.

  27. MacLaren, J., HARC: The Highly-Available Resource Co-Allocator, in OTM'07 Proc. of the 2007 OTM Confederated International Conf. on the Move to Meaningful Internet Systems: CoopIS, DOA, ODBASE, GADA, and IS - Volume Part II, pp. 1385-1402, 2007.

  28. Madej, L., Hodgson, P., and Pietrzyk, M., Development of the Multi-Scale Analysis Model to Simulate Strain Localization Occurring During Material Processing, Arch. Comput. Methods Eng., vol. 16, pp. 287-318,2009.

  29. Pal, M., Lamine, S., Lie, K.-A., and Krogstad, S., Validation of the Multiscale Mixed Finite-Element Method, Int. J. Numer. Methods Fluids, vol. 77, no. 4, pp. 1-17,2014.

  30. Pernach, M., Bzowski, K., Rauch, L., and Pietrzyk, M., Analysis of Predictive Capabilities of Multiscale Phase Transformation Models based on the Numerical Solution of Heat Transfer and Diffusion Equations, Int. J. Multiscale Comput. Eng., vol. 15, no. 5, pp. 413-430,2017.

  31. Pilla, L.L., Ribeiro, C.P., Coucheney, P., Broquedis, F., Gaujal, B., Navaux, P.O.A., and Mehaut, J.-F., A Topology-Aware Load Balancing Algorithm for Clustered Hierarchical Multi-Core Machines, Future Gener. Comput. Syst, vol. 30, pp. 191-201,2014.

  32. Rauch, L., Heterogeneous Hardware Implementation of Molecular Static Method for Modelling of Interatomic Behaviour, Proc. Comput. Sci., vol. 18, pp. 1057-1067,2013.

  33. Rauch, L., Bzowski, K., andRodzaj A., OpenCL Implementation of Cellular Automata Finite Element (CAFE) Method, Parallel Proc. Appl. Math, pp. 381-390,2012.

  34. Rupp, K., High-Level Manipulation of OpenCL-Based Subvectors and Submatrices, Proc. Comput. Sci., vol. 9, pp. 1857-1866, 2012.

  35. Rycerz, K., Bubak, M., Ciepiela, E., Pawlik, M., Hoenen, O., Harezlak, C., Wilk, B., Gubala, T., Meizner, J., and Coster, D., Enabling Multiscale Fusion Simulations on Distributed Computing Resources, eScience on Distributed Computing Infrastructure, pp. 195-210,2014.

  36. Shaofan L. and Dong Q., Multiscale Simulations and Mechanics of Biological Materials, Chichester, West Sussex, UK: John Wiley & Sons, Inc., 2013.

  37. Shterenlikht, A. and Howard, I.C., The CAFE Model of Fracture-Application to a TMCR Steel, Fatigue Fracture Eng. Mater. Struct., vol. 29, pp. 770-787,2006.

  38. Tan, I.K.T., Chai, I., and Hoong, P.K., Dynamic Threshold for Imbalance Assessment on Load Balancing for Multicore Systems, Comp. Electrical Eng., vol. 39, pp. 338-348,2013.

  39. Truong, H.-L., Dustdar, S., and Fahringer, T., Performance Metrics and Ontologies for Grid Workflows, Future Gener. Comput. Systems, vol. 23, pp. 760-772, 2007.

  40. Verschoor, M. and Jalba, A.C., Analysis and Performance Estimation of the Conjugate Gradient Method on Multiple GPUs, Parallel Comput, vol. 38, pp. 552-575,2012.

  41. Wu, C.-M., Chang, R.-S., and Chan, H.-Y., A Green Energy-Efficient Scheduling Algorithm Using the DVFS Technique for Cloud Datacenters, Future Generation Comput. Syst, vol. 37, pp. 141-147,2014.

  42. Wu, S.J., Davis, C.L., Shterenlikht, A., and Howard, I.C., Modeling the Ductile-Brittle Transition Behavior in Thermomechanically Controlled Rolled Steels, Metall. Mater. Trans. A, vol. 36, pp. 989-997,2005.

  43. Zasada, S.J. and Coveney, P.V., Virtualizing Access to Scientific Applications with the Application Hosting Environment, Comput. Phys. Commun, vol. 180, pp. 2513-2525,2009.

  44. Zhang, T., Shu, W., and Wu, M.-Y., CUIRRE: An Open-Source Library for Load Balancing and Characterizing Irregular Applications on GPUs, J. Parallel Distrib. Comput, vol. 74, pp. 2951-2966,2014.

  45. Zheng, C., Xiao, N., Li, D., and Li, Y., Microstructure Prediction of the Austenite Recrystallization during Multi-Pass Steel Strip Hot Rolling: A Cellular Automaton Modeling, Comput. Mater. Sci., vol. 44, pp. 507-514,2008.

  46. Zhu, X., Ge, R., Sun, J., and He, C., 3E: Energy-Efficient Elastic Scheduling for Independent Tasks in Heterogeneous Computing Systems, J. Systems Software, vol. 86, pp. 302-314,2013.

Begell Digital Portal Begell 数字图书馆 电子图书 期刊 参考文献及会议录 研究收集 订购及政策 Begell House 联系我们 Language English 中文 Русский Português German French Spain