Publication List

Journal

  1. Masaki Tsukamoto, Yoshiko Hanada, Masahiro Nakao, Keiji Yamamoto. ``Optimization Algorithm with Automatic Adjustment of the Number of Switches in the Order/Radix Problem'', IEICE TRANSACTIONS on Information and Systems, Dec. 2023. doi:10.1587/transinf.2023PAP0004
  2. Masahiro Nakao, Masaki Tsukamoto, Yoshiko Hanada, Keiji Yamamoto. ``Graph optimization algorithm using symmetry and host bias for low-latency indirect network'', Parallel Computing, Oct. 2022. doi:10.1016/j.parco.2022.102983
  3. Masahiro Nakao, Masaki Tsukamoto, Yoshiko Hanada, Keiji Yamamoto. ``Graph optimization algorithm using symmetry and host bias for low-latency indirect network'', Parallel Computing, Oct. 2022.
  4. Masahiro Nakao, Maaki Sakai, Yoshiko Hanada, Hitoshi Murai, Mitsuhisa Sato. ``Graph optimization algorithm for low-latency interconnection networks'', Parallel Computing, Jul. 2021. doi:10.1016/j.parco.2021.102805
  5. Ksander Ejjaaouani, Oivier Aumage, Julien Bigot, Michel Méhrenberger, Hitoshi Murai, Masahiro Nakao, Mitsuhisa Sato. ``InKS: a programming model to decouple algorithm from optimization in HPC codes'', The Journal of Supercomputing, Jul. 2019. doi:10.1007/s11227-019-02950-2.
  6. Masahiro Nakao, Tetsuya Odajima, Hitoshi Murai, Akihiro Tabuchi, Norihisa Fujita, Toshihiro Hanawa, Taisuke Boku, Mitsuhisa Sato. ``Evaluation of XcalableACC with Tightly Coupled Accelerators/InfiniBand Hybrid Communication on Accelerated Cluster'', International Journal of High Performance Computing Applications, Jan. 2019. doi:10.1177/1094342018821163.
  7. Masahiro Nakao, Hitoshi Murai, Hidetoshi Iwashita, Taisuke Boku, Mitsuhisa Sato. ``Implementation and evaluation of the HPC Challenge benchmark in the XcalableMP PGAS language'', International Journal of High Performance Computing Applications, 33(1), 110–123. Mar. 2017. doi:10.1177/1094342017698214.

Conference

Oral

  1. Junya Arai, Masahiro Nakao, Yuto Inoue, Kanto Teranishi, Koji Ueno, Keiichiro Yamamura, Mitsuhisa Sato, Katsuki Fujisawa. ``Doubling Graph Traversal Efficiency to 198 TeraTEPS on the Supercomputer Fugaku'', SC24, Atlanta, USA, Nov. 2024.
  2. Razil Tahir, Jorji Nonaka, Ken Iwata, Taisei Matsushima, Naohisa Sakamoto, Chongke Bi, Masahiro Nakao, Hitoshi Murai. ``Analysis Towards Energy-Aware Image-based In Situ Visualization on the Fugaku'', HPC Asia 2024, Aichi, Japan, Jan. 2024.
  3. Masahiro Nakao, Hidetomo Kaneyama, Masaru Nagaku, Ikki Fujiwara, Atsuko Takefusa, Shin'ichi Miura, Keiji Yamamoto. ``Introducing Open OnDemand to Supercomputer Fugaku'', 10th International Workshop on HPC User Support Tools (HUST2023), Denver, USA, Nov. 2023. [PDF] [Slide]
  4. Taisuke Boku, Ryuta Tsunashima, Ryohei Kobayashi, Nrohisa Fujita, Seyong Lee, Jeffrey S. Vetter, Hitoshi Murai, Masahiro Nakao, Miwako Tsuji, Mitsuhisa Sato. ``OpenACC unified programming environment for multi-hybrid acceleration with GPU and FPGA'', H3 (HPC on Heterogeneous Hardware) ISC2023 Workshop, May, 2023.
  5. Michael Hennecke, Motohiko Matsuda, Masahiro Nakao, Kento Sato. ``Evaluating DAOS Storage on ARM64 Clients.'' IWAHPCE2023, Mar. 2023.
  6. Masahiro Nakao, Koji Ueno, Katsuki Fujisawa, Yuetsu Kodama, Mitsuhisa Sato. ``Performance of the Supercomputer Fugaku for Breadth-First Search in Graph500 Benchmark.'' ISC 2021, Jun. 2021. [Slide]
  7. Ryuta Tsunashima, Ryohei Kobayashi, Norihisa Fujita, Taisuke Boku, Seyong Lee, Jeffrey Vetter, Hitoshi Murai, Masahiro Nakao, Mitsuhisa Sato. ``OpenACC unified programming environment for GPU and FPGA multi-hybrid acceleration,'' 13th International Symposium on High-level Parallel Programming and Applications, Porto, Portugal, July, 2020.
  8. Masahiro Nakao, Hitoshi Murai, Mitsuhisa Sato. ``Parallelization of All-Pairs-Shortest-Path Algorithms in Unweighted Graph,'' HPC Asia 2020, Fukuoka, Japan, Jan. 2020. [Slide]
  9. Masahiro Nakao, Hitoshi Murai, Mitsuhisa Sato. ``A Method for Order/Degree Problem Based on Graph Symmetry and Simulated Annealing with MPI/OpenMP Parallelization'', HPC Asia 2019, Guangzhou, China, Jan. 2019. [Slide]
  10. Masahiro Nakao, Hitoshi Murai, Mitsuhisa Sato. ``Multi-accelerator extension in OpenMP based on PGAS model'', HPC Asia 2019, Guangzhou, China, Jan. 2019. [Slide]
  11. Hitoshi Murai, Mitsuhisa Sato, Masahiro Nakao, Jinpil Lee. ``Metaprogramming Framework for HPC based on the Omni Compiler Infrastructure'', 6th International Workshop on Large-scale HPC Application Modernization, Gifu, Japan, Nov. 2018.
  12. Ksander Ejjaaouani, Olivier Aumage, Julien Bigot, Michel Mehrenberger, Hitoshi Murai, Masahiro Nakao, Mitsuhisa Sato. ``InKS, a Programming Model to Decouple Performance from Semantics in HPC Codes'', The 4th International Workshop on Reengineering for Parallelism in Heterogeneous Parallel Platforms, Turin, Italy, Aug. 2018.
  13. Masahiro Nakao, Hitoshi Murai, Taisuke Boku, Mitsuhisa Sato. ``Linkage of XcalableMP and Python languages for high productivity on HPC cluster system'', Workshop on PGAS programming models: Experiences and Implementations, Tokyo, Japan, Jan. 2018. [Slide]
  14. Akihiro Tabuchi, Masahiro Nakao, Hitoshi Murai, Taisuke Boku, Mitsuhisa Sato. ``Performance Evaluation for a Hydrodynamics Application in XcalableACC PGAS Language for Accelerated Clusters'', Workshop on PGAS programming models: Experiences and Implementations, Tokyo, Japan, Jan. 2018.
  15. Masahiro Nakao, Hitoshi Murai, Taisuke Boku, Mitsuhisa Sato. ``Performance Evaluation for Omni XcalableMP Compiler on Many-core Cluster System based on Knights Landing'', IXPUG Workshop Asia 2018, Tokyo, Japan, Jan. 2018. [Slide]
  16. Hidetoshi Iwashita, Masahiro Nakao, Hitoshi Murai, Mitsuhisa Sato. ``A Source-to-Source Translation of Coarray Fortran with MPI for High Performance'', HPC Asia 2018, Tokyo, Japan, Jan. 2018.
  17. Hitoshi Murai, Masahiro Nakao, Hidetoshi Iwashita, Mitsuhisa Sato. ``Preliminary Performance Evaluation of Coarray-based Implementation of Fiber Miniapp Suite using XcalableMP PGAS Language'', Second Annual PGAS Applications Workshop (PAW), CO, USA, Nov. 2017.
  18. Masahiro Nakao, Hitoshi Murai, Hidetoshi Iwashita, Akihiro Tabuchi, Taisuke Boku, Mitsuhisa Sato. ``Implementing Lattice QCD Application with XcalableACC Language on Accelerated Cluster'', IEEE Cluster 2017, HI, USA, Sep. 2017. (acceptance rate: 21.8%) [Slide]
  19. Akihiro Tabuchi, Masahiro Nakao, Hitoshi Murai, Taisuke Boku and Mitsuhisa Sato. ``Implementation and Evaluation of One-sided PGAS Communication in XcalableACC for Accelerated Clusters'', 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2017). Madrid, Spain, May 2017.
  20. Mitsuhisa Sato, Hitoshi Murai, Masahiro Nakao, Hidetoshi Iwashita, Jinpil Lee, Akihiro Tabuchi. ``Omni Compiler and XcodeML: An Infrastructure for Source-to-Source Transformation'', Platform for Advanced Scientific Computing Conference (PASC16), SwissTech Convention Center EPFL, Switzerland, Jun. 2016.
  21. Hidetoshi Iwashita, Masahiro Nakao, Mitsuhisa Sato. ``Preliminary Implementation of Coarray Fortran Translator Based on Omni XcalableMP'', The 9th International Conference on Partitioned Global Address Space Programming Models (PGAS2015), Washington, D.C. USA, Sep. 2015.
  22. Tetsuya Odajima, Taisuke Boku, Toshihiro Hanawa, Hitoshi Murai, Masahiro Nakao, Akihiro Tabuchi, Mitsuhisa Sato. ``Hybrid Communication with TCA and InfiniBand on A Parallel Programming Language XcalableACC for GPU Clusters'', Workshop Series on Heterogeneous and Unconventional Cluster Architectures and Applications (HUCAA), IL, USA, Sep. 2015.
  23. Masahiro Nakao, Hitoshi Murai, Takenori Shimosaka, Akihiro Tabuchi, Toshihiro Hanawa, Yuetsu Kodama, Taisuke Boku, Mitsuhisa Sato. ``XcalableACC: Extension of XcalableMP PGAS Language using OpenACC for Accelerator Clusters'', Workshop on accelerator programming using directives (WACCPD), New Orleans, LA, USA, Nov. 2014. [Slide]
  24. Masahiro Nakao, Hitoshi Murai, Takenori Shimosaka, Mitsuhisa Sato. ``Productivity and Performance of the HPC Challenge Benchmarks with the XcalableMP PGAS language'', 7th International Conference on PGAS Programming Models, Edinburgh, Scotland, UK, Oct. 2013. [Paper] [Slide]
  25. Akihiro Tabuchi, Masahiro Nakao, Mitsuhisa Sato. ``A Source-to-Source OpenACC compiler for CUDA'', HeteroPar'2013, Aachen, Germany, Aug. 2013.
  26. Masahiro Nakao, Jinpil Lee, Taisuke Boku, Mitsuhisa Sato. ``Productivity and Performance of Global-View Programming with XcalableMP PGAS Language'', CCGrid 2012 - The 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, Ottawa, Canada, May 2012. (acceptance rate: 27.5%)
  27. Michihiro Koibuchi, Takafumi Watanabe, Atsushi Minamihata, Masahiro Nakao, Tomoyuki Hiroyasu, Hiroki Matsutani, Hideharu Amano. ``Performance Evaluation of Power-aware Multi-tree Ethernet for HPC Interconnects'', ICNC 2011, Japan, Nov. 2011. (Best Paper)
  28. Masahiro Nakao, Tomoyuki Hiroyasu, Mitsunori Miki, Hisatake Yokouchi, Masato Yoshimi. ``Real-coded Estimation of Distribution Algorithm by Using Probabilistic Models with Multiple Learning Rates'', 2nd Workshop on Computational Optimization, Modelling and Simulation (COMS 2011), Nanyang Technological University, Singapore, Jun. 2011.
  29. Masahiro Nakao, Jinpil Lee, Taisuke Boku, Mitsuhisa Sato. ``XcalableMP Implementation and Performance of NAS Parallel Benchmarks'', Fourth Conference on Partitioned Global Address Space Programming Model (PGAS10), New York, USA, Oct. 2010.
  30. Tomoyuki Hiroyasu, Yuta Tomatsu, Masahiro Nakao, Mitsunori Miki, Hisatake Yokouchi, Masato Yoshimi. ``Simulated Annealing with Search Mechanisms of Interpolation and Extrapolation Domain'', International Forum on Multimedia and Image Processing (IFMIP), Kobe, Hyogo, Japan, Sep. 2010.
  31. Takafumi Watanabe, Masahiro Nakao, Tomoyuki Hiroyasu, Michihiro Koibuchi, Tomohiro Otsuka. ``The Impact of Topology and Link Aggregation on PC Cluster with Ethernet'', IEEE Cluster 2008, No.CFP08235-CDR(2008) , pp.280-285, Japan, Oct. 2008.

Poster

  1. Jorji Nonaka, Daichi Obinata, Hiroyuki Ito, Atsushi Toyoda, Naohisa Sakamoto, Masahiro Nakao, Hitoshi Murai, Keiji Yamamoto, Masaaki Terai, Tomohiro Kawanabe, Shunji Uno, Naoyuki Fujita, Toshihiko Kai, Fumiyoshi Shoji, Takanori Haga, Seiji Tsutsumi, Manabu Motokawa, Atsuhi Fujino. ``On the Building of a Common In-Situ Visualization Environment for Arm A64FX Supercomputers'', Cluster 2024, Hyogo, Japan, Sep. 2024.
  2. Masahiro Nakao, Koji Ueno, Katsuki Fujisawa, Yuetsu Kodama, Mitsuhisa Sato. ``Graph500 benchmark with automatic performance tuning'', HPC Asia 2024, Aichi, Japan, Jan. 2024.
  3. Masahiro Nakao, Shin'ichi Miura, Keiji Yamamoto. ``Introduction of Open OnDemand to Supercomputer Fugaku'', HPC Asia 2023, Singapore, Mar. 2023. [poster]
  4. Jorji Nonaka, Masaaki Terai, Masahiro Nakao, Keiji Yamamoto, Hitoshi Murai, Fumiyoshi Shoji. ``Towards an Easy-to-Use Visualization Environment on the Fugaku'', HPC Asia 2023, Singapore, Mar. 2023.
  5. Masahiro Nakao, Masaki Tsukamoto, Yoshiko Hanada, Keiji Yamamoto. ``Graph optimization algorithm for low-latency indirect network'', HPC Asia 2022, Online, Jan. 2022.
  6. Masahiro Nakao, Koji Ueno, Katsuki Fujisawa, Yuetsu Kodama, Mitsuhisa Sato. ``Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500,'' IEEE Cluster 2020, Online, Sep. 2020.
  7. Masahiro Nakao, Hitoshi Murai, Mitsuhisa Sato, Yoshimichi Andoh, Susumu Okazaki. ``Performance improvement of MODYLAS using Remote Direct Memory Access on the K computer,'' International Conference on Parallel Processing, Kyoto, Japan, Aug. 2019. [poster]
  8. Masahiro Nakao, Hitoshi Murai, Jinpil Lee, Tetsuya Odajima, Yutaka Watanabe, Taisuke Boku. ``Evaluation of Lattice QCD code using XcalableACC parallel language'', CCS International Symposium 2018, Tsukuba, Ibaraki, Japan, Oct. 2018.
  9. Masahiro Nakao, Hitoshi Murai, Akihiro Tabuchi, Taisuke Boku, Mitsuhisa Sato. ``Performance Evaluation of NICAM-DC-MINI using XcalableACC on Accelerated Cluster'', HPC Asia 2018 Poster, Tokyo, Japan, Jan. 2018.
  10. Hitoshi Murai, Masahiro Nakao, Takehiro Shimosaka, Akihiro Tabuchi, Taisuke Boku, Mitsuhisa Sato. ``XcalableACC - a Directive-based Language Extension for Accelerated Parallel Computing'', SC14 poster, New Orleans, LA, USA, Nov. 2014.

Other

  1. ``Building the Open OnDemand Community at RIKEN R-CCS'', Open OnDemand Booth of SC23, Nov. 2023 [Slide]
  2. ``Fugaku Open OnDemand'', The 12th meeting for application code tuning on A64FX computer systems, online, Oct. 2023 [Slide]
  3. ``Performance of the supercomputer Fugaku for graph500 benchmark'', The 6th RIKEN-IMI-ISM-NUS-ZIB-MODAL-NHR Workshop on Advances in Classical and Quantum Algorithms for Optimization and Machine Learning, Sep. 2022
  4. ``Graph optimization algorithm with symmetry and biased host density for Order/Radix Problem'', Graph Golf Workshop, Nov. 2021 [Slide]
  5. ``Performance of the supercomputer Fugaku for graph500 benchmark'', The 6th RIKEN-IMI-ISM-NUS-ZIB-MODAL-NHR Workshop on Advances in Classical and Quantum Algorithms for Optimization and Machine Learning, Sep. 2022
  6. ``XcalableMP PGAS Programming Language,'' DOI: 10.1007/978-981-15-7683-6, Springer, Nov. 2020
  7. ``Introduction of fast APSP algorithm and optimization algorithms for grid graphs'', Graph Golf Workshop, Nov. 2019 [Slide]
  8. ``A Method for Order/Degree Problem Based on Graph Symmetry and Simulated Annealing'', Graph Golf Workshop, Nov. 2018 [Slide]
  9. SC17 Research Exhibition in Booth #557 PGAS, Denver, Co, USA, Nov. 2017. [Poster]
  10. CEA-RIKEN HPC School, ``XcalableMP Tutorial'', Maison de la Simulation, Saclay, France,Sep. 2017
  11. SC16 Research Exhibition in Booth #537 PGAS, Salt Lake City, Utah, USA, Nov. 2016. [Poster]
  12. SC15 Research Exhibition in Booth #723 PGAS, Austin, TX, USA, Nov. 2015. [Poster]
  13. SC14 Research Exhibition in Booth #2255 PGAS, New Orleans, LA, USA, Nov. 2014. [Poster]
  14. SC14 Research Exhibition in Booth #2231 RIKEN AICS, New Orleans, LA, USA, Nov. 2014. [Leaflet 1 2]
  15. ``Tuning techniques and performance optimization for CGPOP, NICAM and CICE applications on the K computer'', G8ECS meeting, Kobe, japan, Mar. 2014.
  16. SC13 Research Exhibition in Booth #2519 Center for Computational Sciences, University of Tsukuba, Denver, Colorado, USA, Nov. 2013. [Poster]
  17. SC13 Research Exhibition in Booth #432 PGAS, Denver, Colorado, USA, Nov. 2013. [Poster]
  18. SC12 PGAS: The Partitioned Global Address Space Programming Model BoF, Salt Lake City, Utah, USA, Nov. 2012. [Slide]
  19. SC12 Research Exhibition in Booth #2137 PGAS, Salt Lake City, Utah, USA, Nov. 2012. [Poster]
  20. SC11 Research Exhibition in Booth #5007 T2K Open Supercomputer Alliance, Seattle, Washington, USA, Nov. 2011. [Poster 1, 2, 3] [Leaflet] [Slide]
  21. SC10 Research Exhibition in Booth #1321 T2K Open Supercomputer Alliance, NewOrleans, Louisiana, USA, Nov. 2010. [Poster 1, 2] [Leaflet 1, 2, 3, 4] [Slide]
  22. SC2007 Research Exhibition, Reno, Nevada, USA, Nov. 2007.
  23. SC2004 Research Exhibition, Pittsburgh, Pennsylvania, USA, Nov. 2004.
  24. SC2003 Research Exhibition, Phoenix, Arizona, USA, Nov. 2003.