
Transformation of a Nucleon-Nucleon potential operator into its SU(3) tensor form using GPUs
1. Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Trojanova 13, Praha 2, 120 00, Czech Republic
2. Department of Physics and Astronomy, Louisiana State University, Baton Rouge, LA 70803, USA; Nuclear Physics Institute, Czech Academy of Sciences, Řež 25068, Czech Republic
3. Department of Physics and Astronomy, Louisiana State University, Baton Rouge, LA 70803, USA
4. Faculty of Information Technology, Czech Technical University, Prague 16000, Czech Republic; Aerospace Research and Test Establishment, Prague 19905, Czech Republic
5. Department of Physics and Astronomy, Louisiana State University, Baton Rouge, LA 70803, USA
Starting from the matrix elements of a nucleon-nucleon potential operator given in a basis of spherical harmonic oscillator functions, we present an algorithm for expressing the potential operator in terms of irreducible tensors of the SU(3) and SU(2) groups. We then introduce a GPU-based implementation of this algorithm and compare its performance with that of a CPU-based version. We find that the CUDA implementation delivers speedups of 2.27x – 5.93x.


MPI procs. | CPU only: Time [s] | CPU only: Efficiency | CPU+GPU: Time [s] | CPU+GPU: Speed-up |
7+1 | 295.3 | – | 75.3 | 3.92 |
15+1 | 137.6 | 1 | 36.8 | 3.73 |
31+1 | 66.5 | 1 | 17.5 | 3.79 |
63+1 | 39.4 | 0.83 | 9.23 | 4.26 |
127+1 | 32.8 | 0.56 | 6.35 | 5.17 |
255+1 | 31.0 | 0.52 | 5.22 | 5.94 |

7+1 | 2219 | – | 648 | 3.42 |
15+1 | 1034 | 1 | 318 | 3.24 |
31+1 | 499 | 1 | 151 | 3.28 |
63+1 | 248 | 0.99 | 74 | 3.32 |
127+1 | 165 | 0.74 | 43 | 3.75 |
255+1 | 138 | 0.59 | 32 | 4.25 |

7+1 | 13083 | – | 4493 | 2.91 |
15+1 | 6097 | 1 | 2116 | 2.88 |
31+1 | 2943 | 1 | 1054 | 2.79 |
63+1 | 1447 | 1 | 515 | 2.80 |
127+1 | 776 | 0.92 | 269 | 2.88 |
255+1 | 565 | 0.68 | 169 | 3.33 |

5+1 | 64865 | – | 26104 | 2.48 |
15+1 | 30227 | 1 | 12204 | 2.47 |
31+1 | 14596 | 1 | 5944 | 2.45 |
63+1 | 7179 | 1 | 2932 | 2.44 |
127+1 | 3581 | 0.99 | 1461 | 2.45 |
255+1 | 2142 | 0.83 | 838 | 2.55 |
 | CPU only: Time [s] | CPU+GPU: Time [s] | CPU+GPU: Speed-up |
8 | 41.65 | 15.78 | 2.63 |
10 | 274.14 | 97.99 | 2.79 |
12 | 1649.7 | 611.1 | 2.69 |
14 | 7761.9 | 3407.6 | 2.27 |
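
The Speed-up column in the timing tables above matches the ratio of the CPU-only time to the CPU+GPU time. The page does not define the Efficiency column; the sketch below assumes it is the speed-up gained over the preceding row divided by the growth in worker processes (the number before "+1"), which reproduces most tabulated values to within rounding. The function names are hypothetical and are not taken from the authors' code.

```python
def speedup(cpu_time_s, gpu_time_s):
    """Ratio of CPU-only wall time to CPU+GPU wall time."""
    return cpu_time_s / gpu_time_s


def relative_efficiency(workers_prev, time_prev_s, workers_cur, time_cur_s):
    """Assumed definition: time reduction relative to the preceding row,
    divided by the factor by which the worker count grew."""
    return (time_prev_s / time_cur_s) / (workers_cur / workers_prev)


# First block of the MPI table: (worker processes, CPU-only time, CPU+GPU time)
rows = [
    (7, 295.3, 75.3),
    (15, 137.6, 36.8),
    (31, 66.5, 17.5),
    (63, 39.4, 9.23),
]

if __name__ == "__main__":
    previous = None
    for workers, cpu_s, gpu_s in rows:
        line = f"{workers}+1: speed-up {speedup(cpu_s, gpu_s):.2f}"
        if previous is not None:
            eff = relative_efficiency(previous[0], previous[1], workers, cpu_s)
            line += f", efficiency {eff:.2f}"
        print(line)
        previous = (workers, cpu_s, gpu_s)
```

Because the tabulated times are themselves rounded, recomputed ratios can differ from the printed columns in the last digit.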