IPSJ SIG Technical Report Vol.2013-HPC-139 No /5/29

A Parallel Processing Technique on the NICT Science Cloud via Gfarm/Pwrake

KEN T. MURATA 1, HIDENOBU WATANABE 1, KAZUNORI YAMAMOTO 1, YASUBUMI KUBOTA 1, OSAMU TATEBE 2, MASAHIRO TANAKA 2, KEIICHIRO FUKAZAWA 3, EIZEN KIMURA 4, KENTARO UKAWA 5, KAZUYA MURANAGA 5, YUTAKA SUZUKI 5, FUSAKO ISODA 6

1 National Institute of Information and Communications Technology
2 Center for Computational Sciences, University of Tsukuba
3 Research Institute for Information Technology, Kyushu University
4 Department of Medical Informatics, Ehime University Graduate School of Medicine
5 Systems Engineering Consultants Co., LTD
6 Science Service Inc.

Abstract: Data-intensive science on cloud systems requires techniques for DIC (Data-Intensive Computing) in addition to HTC (High-Throughput Computing), MTC (Many-Task Computing), and HPC (High-Performance Computing). DIC is a new concept in large-scale data processing that pays attention to data distribution, data-parallel execution, and exploiting data locality by scheduling computations close to the data. As data files grow larger, the I/O time needed to read and write data is no longer negligible compared with the data processing time. We herein develop a DIC technique on a science cloud using Gfarm/Pwrake. Gfarm/Pwrake has been developed as an integrated system combining a distributed file system with a parallel data processing system. By identifying the file system nodes (FSN) with the processing client nodes (CN) and giving higher priority to processing files on the local disk than files on remote disks, we improved the total performance of processing large-scale data files.

1. 3 1 2 [3] 19 20 3 21 3 4 The Fourth Paradigm: Data-Intensive Science [1] BigData BigData

[4] 2008 10TB 100TB CPU I/O HPC I/O NICT Gfarm Gfarm Pwrake

2. NICT NICT NICT [4] JGN-X 1 I/O HPC NICT I/O 1TB 6 Gfarm 2.5.8 Gfarm Pwrake[2] 1 NICT

Figure 1 Construction of the NICT Science Cloud.
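The abstract describes giving files on the local disk priority over files on remote disks. A minimal sketch of such a locality-first policy follows. It is an illustration under stated assumptions, not Pwrake's actual scheduler: the function name `schedule_locality_first`, the round-based model in which every task takes the same time, and the rule of taking remote work from the node with the longest backlog are all hypothetical.

```python
from collections import deque


def schedule_locality_first(file_locations, nodes):
    """Assign files to nodes, preferring files stored on the node itself.

    file_locations: dict mapping file name -> node that stores the file
    nodes: list of node names; each node acts as both FSN and CN (assumption)
    Returns a dict mapping node -> list of (file name, "local" or "remote").
    """
    # One queue of pending files per node, keyed by where the file is stored.
    queues = {node: deque() for node in nodes}
    for fname, node in file_locations.items():
        queues[node].append(fname)

    plan = {node: [] for node in nodes}
    # Hypothetical model: in each round every node runs at most one task.
    while any(queues.values()):
        for node in nodes:
            if queues[node]:
                # Local file available: read from the node's own disk.
                plan[node].append((queues[node].popleft(), "local"))
            else:
                # No local work left: take a file from the busiest node,
                # which implies a remote read over the network.
                donor = max(nodes, key=lambda n: len(queues[n]))
                if queues[donor]:
                    plan[node].append((queues[donor].pop(), "remote"))
    return plan


if __name__ == "__main__":
    locations = {"f0": "H1", "f1": "H1", "f2": "H1", "f3": "H1", "f4": "H2"}
    for node, tasks in schedule_locality_first(locations, ["H1", "H2", "H3"]).items():
        print(node, tasks)
```

In this toy run the nodes that hold data process their own files first, and a node with no local files only picks up remote files once it would otherwise be idle.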

3.
3.1 2 1 NICT 6 Gfarm FSN CN Gfarm DELL PowerConnect 6224 10GbE 1 2 CN FSN CN FSN I/O 782 FSN Gfarm Pwrake [2] Pwrake Gfarm I/O CN 2

Figure 2 The computer system for the experiment.

Table 1 Specifications of the computers used in the present experiments.
Item                 Spec.
CPU number/node      8
CPU                  Intel Xeon X5550 @ 2.67GHz
Main Memory          144GB
OS                   openSUSE 11.1 (x86_64)
HDD                  SATA 3 x4 (RAID5)
HDD (read)           371 MB/sec
HDD (write)          137 MB/sec
NIC                  10GbE

Table 2 Data files for the present experiments.
Item                 Spec.
Number of data files 782
File size            2.2GB/file
Total file size      1.72TB

2 3 MHD 782 6 FSN Gfarm 6 CN NICT 3 Gfarm 3

Figure 3 Data files for the present experiments.

3.2 4 782 4 6 6 1 4 I/O I/O 4 40 782
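The figures in Table 1 and Table 2 can be cross-checked with a few lines of arithmetic. The sketch below assumes decimal units (1 TB = 1000 GB, 1 GB = 1000 MB) and treats the HDD read rate in Table 1 as a sustained sequential rate for a single file. Both are assumptions, and the resulting read time is only a rough lower bound that ignores network transfer, caching, and concurrent access.

```python
# Values taken from Table 1 and Table 2.
N_FILES = 782          # number of data files
FILE_SIZE_GB = 2.2     # size of one file in GB
HDD_READ_MB_S = 371.0  # HDD read throughput in MB/sec


def total_size_tb(n_files, file_size_gb):
    """Total data volume in TB, assuming 1 TB = 1000 GB."""
    return n_files * file_size_gb / 1000.0


def read_time_s(file_size_gb, rate_mb_s):
    """Time to read one file at the given rate, assuming 1 GB = 1000 MB."""
    return file_size_gb * 1000.0 / rate_mb_s


total_tb = total_size_tb(N_FILES, FILE_SIZE_GB)
per_file_read_s = read_time_s(FILE_SIZE_GB, HDD_READ_MB_S)

print(f"total size: {total_tb:.2f} TB")
print(f"local read time per file: {per_file_read_s:.2f} s")
```

Under these assumptions the total volume matches the 1.72TB listed in Table 2, and reading one 2.2GB file from the local disk takes on the order of six seconds.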

I/O CN 1 4 CN I/O I/O 4 CN 6 I/O CN 1 I/O

Figure 4 Upper: Processing time and I/O time at each step with 6 cores on each node. Lower: The same result with 1 core (process) on each node. Red part: data processing time. White part: data I/O time.

Table 3 Data processing results on each node.
Node  Core (process)  Step (file) number  Average time (sec.)  Total processing time (sec.)
H1    6               140                 84.58                1973.52
H2    6               151                 79.29                1995.40
H3    6               142                 84.26                1994.08
H4    6               155                 76.64                1979.86
H5    6               100                 118.50               1974.95
H6    6               95                  125.35               1984.74

5 CN 6 I/O I/O

Figure 5 Load balance between nodes: Total time (data processing time and I/O time) for each data file.

3.3 CN FIFO 100 5 3
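The numbers in Table 3 appear to follow the relation total time ≈ step number × average time / cores. This relation is an observation from the tabulated values, not a formula stated in the text, so the sketch below should be read as a consistency check under that assumption. It also computes the relative spread of the total processing times across nodes as one simple measure of load balance. The names `estimated_total` and `imbalance` are illustrative.

```python
# Rows of Table 3: (node, cores, steps, average time [s], total time [s]).
TABLE3 = [
    ("H1", 6, 140, 84.58, 1973.52),
    ("H2", 6, 151, 79.29, 1995.40),
    ("H3", 6, 142, 84.26, 1994.08),
    ("H4", 6, 155, 76.64, 1979.86),
    ("H5", 6, 100, 118.50, 1974.95),
    ("H6", 6, 95, 125.35, 1984.74),
]


def estimated_total(cores, steps, avg_time_s):
    """Wall-clock estimate if `cores` processes share `steps` tasks evenly."""
    return steps * avg_time_s / cores


# Difference between the estimate and the tabulated total, per node.
differences = {
    node: abs(estimated_total(cores, steps, avg) - total)
    for node, cores, steps, avg, total in TABLE3
}

# Relative spread between the slowest and the fastest node.
totals = [row[4] for row in TABLE3]
imbalance = (max(totals) - min(totals)) / max(totals)

for node, diff in differences.items():
    print(f"{node}: estimate differs from table by {diff:.3f} s")
print(f"relative spread of total times: {imbalance:.2%}")
```

Although H5 and H6 processed fewer files with longer average times, the total processing times of all six nodes differ by only about one percent, which is consistent with the load balance shown in Figure 5.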

I/O I/O CN FSN I/O [5] FSN CN FSN I/O FSN I/O CN FSN 6 6 1 / I/O Gfarm/Pwrake I/O 6 Gfarm Pwrake 6 5 4 2 6 14 FSN CN 7 14 8 6 1 6 I/O

Figure 6 Reference experiment: Load balance between nodes.

6 CN Gfarm/Pwrake I/O 4 I/O CPU I/O 7

Figure 7 The computer system for the reference experiment.

4. 4 [1] IT [4] 10 CPU TB PB

10TB 100TB I/O HPC I/O NICT Gfarm [5] Gfarm Pwrake [2] TB (1) (2)I/O NICT

1) Edited by Tony Hey, Stewart Tansley, and Kristin Tolle, The Fourth Paradigm: Data-Intensive Scientific Discovery, ISBN 978-0-9825442-0-4, 2009.
2) Pwrake JAXA Research and development report JAXA-RR-11-007, pp.67-76, 2012-03-30. http://office.microsoft.com/ja-jp/word-help/ch010097020.aspx
3) 1996.
4) Murata, K. T., Watari, S., Nagatsuma, T., Kunitake, M., Watanabe, H., Yamamoto, K., Kubota, Y., Kato, H., Tsugawa, T., Ukawa, K., Muranaga, K., Kimura, E., Tatebe, O., Fukazawa, K. and Murayama, Y., A Science Cloud for Data Intensive Sciences, Data Science Journal, Vol. 12, pp. WDS139-WDS146 (2013).
5) Gfarm File System, ISBN-10: 6133490381, ISBN-13: 978-6133490383.