High Performance Computing in Accelerator Physics

Kwok Ko, Stanford Linear Accelerator Center

High performance computing powered by increased investments in software and hardware infrastructures is beginning to have a significant impact on accelerator design and analysis. The US DOE funded Accelerator Grand Challenge has laid the groundwork for developing a suite of electromagnetic codes that are based on unstructured grids and utilize parallel processing on supercomputers such as the Cray T3E and IBM SP2 as well as PC clusters. We will show how this new capability has enabled some of the most challenging problems in accelerator modeling to be solved in resolution, accuracy and turnaround time previously not possible. We will describe the technologies and resources required to support such large-scale simulations, and will discuss the benefits to present and future accelerator facilities from advancing simulation as the third tool of science. Further code development plans under the newly approved DOE SciDAC Initiative will be presented.

INTRODUCTION

The US Department of Energy promotes High Performance Computing (HPC) to help advance the progress of science in its program offices through advanced computing initiatives that include both hardware and software investments. The DOE Grand Challenge was such a program that funded selected teams from various disciplines to tackle the most difficult computational problems in their respective areas. The High Energy and Nuclear Physics (HENP) program office supported the Computational Accelerator Physics team that consisted of two national laboratories (SLAC, LANL) and two universities (Stanford, UCLA). This team focused on two main topics, electromagnetic modeling and beam dynamics simulation, and developed new parallel software to take advantage of massively parallel supercomputers installed at the DOE’s National Energy Research Scientific Computing (NERSC) center at Berkeley. The success of the Accelerator Grand Challenge led to the formation of a national collaboration involving six national labs, five universities, and one industrial partner. The funding is provided by DOE’s Scientific Discovery through Advanced Computing (SciDAC) initiative whose goal is to foster large multi-institutional, multi-disciplinary teams to develop community codes that run on terascale platforms to help solve the most challenging problems facing the field. The newly approved SciDAC project has been expanded to include a third research area, that of advanced accelerator concepts to study beams under extreme conditions like those found in laser and plasma based accelerators.

Particle accelerators are among the most important and most complex scientific instruments in use, and are critical to research in fields such as high-energy physics, nuclear physics, materials science, chemistry, and the biosciences. They have been proposed for applications that address national needs, and examples include accelerator transmutation of waste, accelerator-driven fission and fusion energy production, accelerator production of tritium, and proton radiography for stockpile stewardship. Smaller scale accelerators have beneficial use in many areas such as irradiation and sterilization of biological hazards, medical isotope production, particle beams for irradiation therapy, ion implantation and beam lithography. Given the great value of particle accelerators it is imperative that the most advanced computing tools and resources be brought to bear on the design and development of these complex facilities and devices. The availability of high performance, large memory parallel supercomputers has made large-scale computing the third tool of scientific discovery complimentary to the traditional approaches of theory and experimentation. Large-scale simulation enables numerical experiments on systems for which physical experimentation would be prohibitively expensive or technologically unfeasible.

This paper will present an overview of the electromagnetic component of the Accelerator Modeling project. Detailed description of codes and results will be covered in several related papers in the session on “High Performance Computing in Accelerator Physics”. That session will also include papers addressing the beam dynamics and advanced accelerators areas.

THE NEED FOR HIGH PERFORMANCE COMPUTING

(i) High Resolution Component Design

Accelerator physicists and engineers are faced with increasingly stringent requirements on electromagnetic components as new and existing facilities continually strive towards higher energy and current, and greater efficiency. In the proposed Next Linear Collider (NLC) [1] scheme, the frequency of the accelerating field must be accurate to within 1 part in 10,000 which is comparable to fabrication tolerance in order to maintain acceleration efficicency. This requirement is to be met in a complex cavity geometry that optimizes the accelerating field gradient while suppressing parasitic wafefields generated by the beam. One design, called the Round Damped Detuned Structure (RDDS) is shown in Fig. 1a. Simulating such geometry is challenging for existing electromagnetics software running on desktop computers because it involves a huge number of degrees of freedom (DOF) to model the many curved surfaces to the desired accuracy. While these standard packages have been used extensively by the accelerator community, it became evident that new simulation tools utilizing parallel processing that harness the large memory available in supercomputers were needed to provide the high resolution required for the computer-aided design of these complex accelerating structures. The RDDS cavity as modeled with the parallel eigenmode solver Omega3P consists of one million DOF’s is shown in Fig. 1b..

There is another obvious advantage for parallel processing besides gaining access to large memory. In the ideal situation, a parallel code built on scalable algorithms would obtain linear speedup in simulation time. Such a huge jump in processing speed could never be reached by improvement in single CPU performance even if Moore’s law continues to prevail. The combination of large memory and scalable computing enables simulation to become a cheaper and faster alternative to the expensive, time-consuming process of repeated fabrication and testing. This has certainly been the case in the design of the RDDS cavity or cell for the NLC. Dimensions generated by simulation were directly used in computer controlled milling machines for fabrication of the cells (Fig. 1c) and subsequent cold–test measurements found frequency accuracy to be close to 0.01 % as predicated by calculation.

(ii) System Scale Analysis

The case for high performance computing in accelerator modeling becomes more compelling when one considers system scale studies of large, heterogeneous structures. For example, the NLC RDDS accelerating section consists of 206 cells each varying from the next by order of microns. This variation in cavity dimensions follows a prescribed distribution so as to detune the dominant higher order modes that constitute the wakefields to reduce their effect. Additional reduction is provided by slot openings in the cavity walls to couple the wakefields out to external manifolds. So far this detuning and damping scheme has only been analyzed using an approximate model that consists of equivalent circuit chains. It is of great interest to model the entire 206-cell section to verify if the desired wakefield suppression can be achieved, and to validate the equivalent circuit model. Such a simulation is estimated to require hundreds of Gigabytes of memory and the calculation of thousands of modes in the long structure. Calculation has been done on structures with fewer cells. Fig. 2a is the Omega3P model of a 47-cell RRDS section and Fig 2b shows for the first time, the actual fields of a localized mode in the structure. Many issues remain to be addressed before full structure simulation can be carried out and they will be described in later sections.

Kwok Ko, Stanford Linear Accelerator Center

(1) Omega3P – a 3D parallel eigenmode solver to find normal modes in lossless rf cavities using linear and quadratic elements,

(2) Tau3P – a 3D parallel time domain solver to calculate the transmission properties of open structures on a modified Yee grid,

(3) Phi3P – a 3D parallel static solver based on a field formulation that uses hybrid elements for improved accuracy and for more exact description of material boundaries ,

(4) Ptrack – a particle tracking module that works Omega3p and Tau3P to study rf breakdown and dark currents, both important issues for high gradient acceleration.