[4] ACP (Advanced Communication Primitives) [1] ACP ACP [2] ACP Tofu UDP [3] HPC InfiniBand InfiniBand ACP 2 ACP, 3 InfiniBand ACP 4 5 ACP 2. ACP ACP

Similar documents
Fujitsu Standard Tool

IPSJ SIG Technical Report Vol.2013-ARC-207 No.23 Vol.2013-HPC-142 No /12/17 1,a) 1,b) 1,c) 1,d) OpenFOAM OpenFOAM A Bottleneck and Cooperation

1 OpenCL OpenCL 1 OpenCL GPU ( ) 1 OpenCL Compute Units Elements OpenCL OpenCL SPMD (Single-Program, Multiple-Data) SPMD OpenCL work-item work-group N

先進的計算基盤システムシンポジウム DMA Tofu 6 MPI RDMA 6 3 (1 ) RDMA (2 ) 3 MPI MPI 3 MPI 127us, 47GB/s 9,216 MPI Bcast 106GB/s 31 MPI 2 MPI 2 Tofu Eager : 6 7 2

23 Fig. 2: hwmodulev2 3. Reconfigurable HPC 3.1 hw/sw hw/sw hw/sw FPGA PC FPGA PC FPGA HPC FPGA FPGA hw/sw hw/sw hw- Module FPGA hwmodule hw/sw FPGA h

1_26.dvi

IPSJ SIG Technical Report Vol.2013-ARC-203 No /2/1 SMYLE OpenCL (NEDO) IT FPGA SMYLEref SMYLE OpenCL SMYLE OpenCL FPGA 1

IPSJ SIG Technical Report Vol.2015-HPC-150 No /8/6 I/O Jianwei Liao 1 Gerofi Balazs 1 1 Guo-Yuan Lien Prototyping F

VXPRO R1400® ご提案資料

& Vol.2 No (Mar. 2012) 1,a) , Bluetooth A Health Management Service by Cell Phones and Its Us

IPSJ SIG Technical Report Secret Tap Secret Tap Secret Flick 1 An Examination of Icon-based User Authentication Method Using Flick Input for

6 2. AUTOSAR 2.1 AUTOSAR AUTOSAR ECU OSEK/VDX 3) OSEK/VDX OS AUTOSAR AUTOSAR ECU AUTOSAR 1 AUTOSAR BSW (Basic Software) (Runtime Environment) Applicat

IPSJ SIG Technical Report Vol.2017-ARC-225 No.12 Vol.2017-SLDM-179 No.12 Vol.2017-EMB-44 No /3/9 1 1 RTOS DefensiveZone DefensiveZone MPU RTOS

Iteration 0 Iteration 1 1 Iteration 2 Iteration 3 N N N! N 1 MOPT(Merge Optimization) 3) MOPT MOP

hpc141_shirahata.pdf

HPC (pay-as-you-go) HPC Web 2

システムソリューションのご紹介

IPSJ SIG Technical Report Vol.2012-HCI-149 No /7/20 1 1,2 1 (HMD: Head Mounted Display) HMD HMD,,,, An Information Presentation Method for Weara

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

PeerPool IP NAT IP UPnP 2) Bonjour 3) PeerPool CPU 4) 2 UPnP Bonjour PeerPool CPU PeerPool PeerPool PPv2 PPv2 2. PeerPool 2.1 PeerPool PeerPool PoolGW

2) 2. DLNA DLNA (Version 1.5) 2 (DMC1) (SSDP) (DMS1, DMS2) (DMR1, DMR2, DMR3) (UDP) DMC1 3 DMS2 DMC1 DMS1 (HTTP) DMS1 DMR2 (RTP) DMR2 3. DLNA 4 DMC1 D

TCP/IP IEEE Bluetooth LAN TCP TCP BEC FEC M T M R M T 2. 2 [5] AODV [4]DSR [3] 1 MS 100m 5 /100m 2 MD 2 c 2009 Information Processing Society of

17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System

fiš„v8.dvi

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

1 4 4 [3] SNS 5 SNS , ,000 [2] c 2013 Information Processing Society of Japan

IPSJ SIG Technical Report Vol.2016-CE-137 No /12/ e β /α α β β / α A judgment method of difficulty of task for a learner using simple

Vol.53 No (July 2012) EV ITS 1,a) , EV 1 EV ITS EV ITS EV EV EV Development and Evaluation of ITS Information Commu

Web ( ) [1] Web Shibboleth SSO Web SSO Web Web Shibboleth SAML IdP(Identity Provider) Web Web (SP:ServiceProvider) ( ) IdP Web Web MRA(Mail Retrieval

Table 1 Table 2

FUJII, M. and KOSAKA, M. 2. J J [7] Fig. 1 J Fig. 2: Motivation and Skill improvement Model of J Orchestra Fig. 1: Motivating factors for a

MDD PBL ET 9) 2) ET ET 2.2 2), 1 2 5) MDD PBL PBL MDD MDD MDD 10) MDD Executable UML 11) Executable UML MDD Executable UML

DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

1 Table 1: Identification by color of voxel Voxel Mode of expression Nothing Other 1 Orange 2 Blue 3 Yellow 4 SSL Humanoid SSL-Vision 3 3 [, 21] 8 325

IPSJ SIG Technical Report Vol.2014-ARC-213 No.24 Vol.2014-HPC-147 No /12/10 GPU 1,a) 1,b) 1,c) 1,d) GPU GPU Structure Of Array Array Of

地球観測衛星データの保存・配布システム

IPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-

IPSJ SIG Technical Report NetMAS NetMAS NetMAS One-dimensional Pedestrian Model for Fast Evacuation Simulator Shunsuke Soeda, 1 Tomohisa Yam


Amazon EC2 IaaS (Infrastructure as a Service) HPCI HPCI ( VM) VM VM HPCI VM OS VM HPCI HPC HPCI RENKEI-PoP 2 HPCI HPCI 1 HPCI HPCI HPC CS

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came

2006 [3] Scratch Squeak PEN [4] PenFlowchart 2 3 PenFlowchart 4 PenFlowchart PEN xdncl PEN [5] PEN xdncl DNCL 1 1 [6] 1 PEN Fig. 1 The PEN

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2013-HPC-139 No /5/29 Gfarm/Pwrake NICT NICT 10TB 100TB CPU I/O HPC I/O NICT Gf

Input image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L

12 PowerEdge PowerEdge Xeon E PowerEdge 11 PowerEdge DIMM Xeon E PowerEdge DIMM DIMM 756GB 12 PowerEdge Xeon E5-

untitled

Introduction Purpose This training course demonstrates the use of the High-performance Embedded Workshop (HEW), a key tool for developing software for

<95DB8C9288E397C389C88A E696E6462>

2. CABAC CABAC CABAC 1 1 CABAC Figure 1 Overview of CABAC 2 DCT 2 0/ /1 CABAC [3] 3. 2 値化部 コンテキスト計算部 2 値算術符号化部 CABAC CABAC

〈論文〉中部圏構想の断章と高速自動車道の建設

IPSJ SIG Technical Report Vol.2011-IOT-12 No /3/ , 6 Construction and Operation of Large Scale Web Contents Distribution Platfo

IPSJ SIG Technical Report Vol.2012-ARC-202 No.13 Vol.2012-HPC-137 No /12/13 Tightly Coupled Accelerators 1,a) 1,b) 1,c) 1,d) GPU HA-PACS


GPGPU

Dual Stack Virtual Network Dual Stack Network RS DC Real Network 一般端末 GN NTM 端末 C NTM 端末 B IPv4 Private Network IPv4 Global Network NTM 端末 A NTM 端末 B

IPSJ SIG Technical Report Vol.2014-CE-123 No /2/8 Bebras 1,a) Bebras,,, Evaluation and Possibility of the Questions for Bebras Contest Abs

06’ÓŠ¹/ŒØŒì

MAC root Linux 1 OS Linux 2.6 Linux Security Modules LSM [1] Security-Enhanced Linux SELinux [2] AppArmor[3] OS OS OS LSM LSM Performance Monitor LSMP


untitled

IPSJ SIG Technical Report Vol.2017-CLE-21 No /3/21 e 1,2 1,2 1 1,2 1 Sakai e e e Sakai e Current Status and Challenges on e-learning T

IPSJ SIG Technical Report An Evaluation Method for the Degree of Strain of an Action Scene Mao Kuroda, 1 Takeshi Takai 1 and Takashi Matsuyama 1

IPSJ SIG Technical Report Vol.2014-IOT-27 No.14 Vol.2014-SPT-11 No /10/10 1,a) 2 zabbix Consideration of a system to support understanding of f

Microsoft Word - Dolphin Expressによる10Gbpソケット通信.docx

HP cafe HP of A A B of C C Map on N th Floor coupon A cafe coupon B Poster A Poster A Poster B Poster B Case 1 Show HP of each company on a user scree

The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). The material has been made available on the website

HAJIMENI_56803.pdf

Vol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information

29 jjencode JavaScript

GPGPUクラスタの性能評価

4.1 % 7.5 %

IPSJ SIG Technical Report Vol.2014-CE-127 No /12/7 1,a) 2,3 2,3 3 Development of the ethological recording application for the understanding of

fl™‹ä1.eps

PC Development of Distributed PC Grid System,,,, Junji Umemoto, Hiroyuki Ebara, Katsumi Onishi, Hiroaki Morikawa, and Bunryu U PC WAN PC PC WAN PC 1 P

スライド 1

LTE移動通信システムのフィールドトライアル

ActionScript Flash Player 8 ActionScript3.0 ActionScript Flash Video ActionScript.swf swf FlashPlayer AVM(Actionscript Virtual Machine) Windows

untitled

untitled

ID 3) 9 4) 5) ID 2 ID 2 ID 2 Bluetooth ID 2 SRCid1 DSTid2 2 id1 id2 ID SRC DST SRC 2 2 ID 2 2 QR 6) 8) 6) QR QR QR QR

Microsoft Word - HOKUSAI_system_overview_ja.docx

IPSJ SIG Technical Report Vol.2010-GN-74 No /1/ , 3 Disaster Training Supporting System Based on Electronic Triage HIROAKI KOJIMA, 1 KU

untitled

Microsoft Word ●MPI性能検証_志田_ _更新__ doc

IPSJ SIG Technical Report Vol.2019-IOT-44 No /3/8 CRIU HTTP 1,a) 2 PC SNS Web FastContainer HTTP HTTP Web HTTP Web Web Low-cost and High Perfor

IPSJ SIG Technical Report Vol.2015-HPC-148 No /3/3 I/O 1 2 Gerofi Balazs 2 Guo-Yuan Lien netcdf API 2 File I/O Arbitr

DEIM Forum 2009 E

Vol.57 No (Mar. 2016) 1,a) , L3 CG VDI VDI A Migration to a Cloud-based Information Infrastructure to Support

第62巻 第1号 平成24年4月/石こうを用いた木材ペレット

AP_12_15_yonezawa.indd

セレンディピティによる 独創的加工技術開発

(速報) Xeon E 系モデル 新プロセッサ性能について

Vol. 48 No. 3 Mar PM PM PMBOK PM PM PM PM PM A Proposal and Its Demonstration of Developing System for Project Managers through University-Indus

Table 1. Reluctance equalization design. Fig. 2. Voltage vector of LSynRM. Fig. 4. Analytical model. Table 2. Specifications of analytical models. Fig

Vol.53 No (Mar. 2012) 1, 1,a) 1, 2 1 1, , Musical Interaction System Based on Stage Metaphor Seiko Myojin 1, 1,a

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

STSNJ NL

Transcription:

InfiniBand ACP 1,5,a) 1,5,b) 2,5 1,5 4,5 3,5 2,5 ACE (Advanced Communication for Exa) ACP (Advanced Communication Primitives) HPC InfiniBand ACP InfiniBand ACP ACP InfiniBand Open MPI 20% InfiniBand Implementation and Evaluation of ACP Basic layer Yoshiyuki Morie 1,5,a) takeshi nanri 1,5,b) yuichiro ajima 2,5 honda hiroaki 1,5 takeshi soga 4,5 taizo kobayashi 3,5 shinji sumimoto 2,5 Abstract: ACE (Advanced Communication for Exa) project is developing ACP (Advanced Communication Primitives) that is a low level communication library. In this paper, ACP basic layer is implemented on InfiniBand that is important on HPC area. This paper reports the method of implementation, its memory consumption and its communication performance. The connection resource is most memory consumption in this implementation. The performance of ACPbl is nearly equal to Open MPI. Maximum performance improvement is about 20 % in midium of message size. Problem of performance in small message size also was reported. Keywords: Exa scale, Low level communication library, InfiniBand 1 Kyushu university, Reserch Institute for information techonology, 6-10-1, higashi-ku, fukuoka, Japan 2 Fujitu Limited., Next Generation Technical Computing Unit 3 Teikyo Uniersity 4 Institute of Systems, Information Technologies and Nanotechnologies (ISIT) 5 Japan Science and Technology Agency (JST), Core Research for Evolutionary Science and Technology (CREST) a) morie.yoshiyuki.404@m.kyushu-u.ac.jp b) nanri@cc.kyushu-u.ac.jp 1. ACE (Advanced Communication for Exa) 1

[4] ACP (Advanced Communication Primitives) [1] ACP ACP [2] ACP Tofu UDP [3] HPC InfiniBand InfiniBand ACP 2 ACP, 3 InfiniBand ACP 4 5 ACP 2. ACP ACP ACP ACP ACP 3. InfiniBand ACP 3.1 InfiniBand ACP RDMA (Remote Direct Memory Access) RC (Reliable Connection) RDMA ACP QP (Queue Pair) ACP ACP Tofu UDP 2 ACP ACP ACP ACP InfiniBand ACP Fig. 1 1 Implementation of remote to remote copy 2

1 4096 QP 2 ACP 1 Table 1 Command format (Bytes) (4) (4) (8) (8) WR ID(8)) (8) (8) (8) (8) (8) (8) (8) (16) COPY (104) (8) CAS4 (104) (4), (4) CAS8 (104) (8), (8) Atomi4 (104) (4) Atomi8 (104)) (8) 64 2 ACP Fig. 2 Basci Archtecture 3 Fig. 3 Global memory address and address key 3.2 1 4096 1 120 B 480 KB 3.3 ACP 3 2 InfiniBand ACP 3

2 Table 2 Global address. 21 1 8 34 ACP ACP ACP InifiniBand 3.4 InfiniBand ACP RDMA InfiniBand 64 rkey 32 rkey 1024 4. InfiniBand ACP 100 3 1 InfiniBand QP 160 MB InfiniBand RC QP QP 1 160 B rkey 12 MB 1MB QP 1 10 GB 1024 10 MB 4

情報処理学会研究報告 表 3 メモリ消費量 Table 3 Memory consumption. 5.3 実験結果 まず 図 4 にそれぞれの実効通信帯域幅を示す これよ り 中メッセージサイズにおいて ACP 基本層の方が高速 用途 容量 コマンドキュー 480KB に動作していることが分かった 特に 8 KB のところで性 リモートコマンド受信バッファ 480KB 能比が最大となり 約 20 %の性能向上を示した しかし 登録メモリ領域情報テーブル 約 10KB (40B * 255 個) 登録メモリ領域情報テーブル 10MB (10KB * 1024 テーブル) 128 MB 以降の大メッセージの領域で Open MPI の方が のキャッシュ 相手登録メモリ領域情報 約 10 %実効通信帯域幅が高いことが分かった テーブルのキャッシュの 情報元ランク配列 テーブルのキャッシュの 所持先ランク配列 テーブルのキャッシュ の破棄要求配列 自登録メモリ領域情報 テーブルのキャッシュ の破棄完了通知配列 QP 160MB (10KB * 1M ランク) CQ 128B IB のメモリリージョン 約 12KB(48B * 255) 相手登録メモリ領域情報 自登録メモリ領域情報 登録テーブル スターターメモリ用 図 4 12MB (12 * 1M) 通信帯域幅 Fig. 4 Communication bandwidth リモートアドレス + rkey テーブル 次に図 5 に通信遅延を示す メッセージサイズが 1 K ま では Open MPI の方が高速であった 特に 64 B 以下の 5. 性能評価実験 今後の改良を計画するため 現在の ACP 基本層の実装 での性能評価を実施した メッセージサイズでは ACP 基本層の方が 50%前後実行 時間が長いことが分かった これは 通信スレッドにおい てコマンドキューや通信デバイスの状態の監視等を通信の 発行と平行して処理を実効することが影響するものと考え ている 5.1 実験概要 本実験では 性能評価の比較対象として Open MPI[6] を用いる 実行プログラムには IMB4.0 (Intel MPI Bench- marks4.0) [5] を用いた この時 pingpong 処理の時間を 一方 ACP 基本層では スターターメモリに対して put get を処理する時間を計測する この時 IMB4.0 の実行回 数に合わせて ACP 基本層のグローバルメモリアクセスを 同数実行する これらの通信実行の平均時間から通信性能 計測する 5.2 実験環境 実験環境としては PRIMERGY RX200 S7 を用いた 計算ノード数は 16 で 各ノードに Intel Xeon プロセッサ E5-2609 (2.40 GHz) が搭載されている メモリは 8 GB を比較する 図 5 通信遅延 Fig. 5 Elapsed time 計算ノード間は InfiniBand QDR スイッチで接続され そ のスループットは片方向 4.0 GB/s となる 2015 Information Processing Society of Japan 5

6. ACP InfiniBand RC Dinamically Connection (DC) (CREST) [1] ACP vol.2014-hpc-143-8 (2014) [2] ACP vol.2014-hpc-143-9 (2014) [3] ACP vol.2014-hpc-143-10 (2014) [4] ACE Project (online), available from http://aceproject.kyushu-u.ac.jp/index.html [5] Intel MPI Benchmarks 4.0 (online), available from https://software.intel.com/en-us/articles/intel-mpibenchmarks [6] Open MPI: Open Source High Performance Computing (online), available from http://www.open-mpi.org/ 6