橡3_2石川.PDF

Similar documents
develop

卒業論文

1重谷.PDF

Myrinet2000 ご紹介

untitled

Second-semi.PDF

HPC

install

09中西

GPGPU

untitled

23 Fig. 2: hwmodulev2 3. Reconfigurable HPC 3.1 hw/sw hw/sw hw/sw FPGA PC FPGA PC FPGA HPC FPGA FPGA hw/sw hw/sw hw- Module FPGA hwmodule hw/sw FPGA h

mobicom.dvi

Microsoft PowerPoint - intro.ppt

BRANCH SRX <2010Q3 > 2 Copyright 2010 Juniper Networks, Inc.

Shonan Institute of Technology MEMOIRS OF SHONAN INSTITUTE OF TECHNOLOGY Vol. 41, No. 1, 2007 Ships1 * ** ** ** Development of a Small-Mid Range Paral

Microsoft Word - D JP.docx

PC Development of Distributed PC Grid System,,,, Junji Umemoto, Hiroyuki Ebara, Katsumi Onishi, Hiroaki Morikawa, and Bunryu U PC WAN PC PC WAN PC 1 P

CPU Levels in the memory hierarchy Level 1 Level 2... Increasing distance from the CPU in access time Level n Size of the memory at each level 1: 2.2


28 Docker Design and Implementation of Program Evaluation System Using Docker Virtualized Environment

Cisco 1711/1712セキュリティ アクセス ルータの概要

00.目次_ope

TOOLS for UR44 Release Notes for Windows


スライド 1

5シンポジウム2001予稿小野寺011121

4.1 % 7.5 %

untitled

第3回戦略シンポジウム緑川公開用

ITAOI2003第三屆離島資訊與應用研討會論文範例

N Express5800/R320a-E4 N Express5800/R320a-M4 ユーザーズガイド

Express5800/R320a-E4, Express5800/R320b-M4ユーザーズガイド

A Feasibility Study of Direct-Mapping-Type Parallel Processing Method to Solve Linear Equations in Load Flow Calculations Hiroaki Inayoshi, Non-member

Express5800/320Fc-MR

Express5800/R320a-E4/Express5800/R320b-M4ユーザーズガイド

[1] [2] [3] (RTT) 2. Android OS Android OS Google OS 69.7% [4] 1 Android Linux [5] Linux OS Android Runtime Dalvik Dalvik UI Application(Home,T

Vol. 48 No. 4 Apr LAN TCP/IP LAN TCP/IP 1 PC TCP/IP 1 PC User-mode Linux 12 Development of a System to Visualize Computer Network Behavior for L

Express5800/R110a-1Hユーザーズガイド

3_23.dvi

Express5800/320Fa-L/320Fa-LR

fx-9860G Manager PLUS_J

untitled

1 2

soturon.dvi

1 M32R Single-Chip Multiprocessor [2] [3] [4] [5] Linux/M32R UP(Uni-processor) SMP(Symmetric Multi-processor) MMU CPU nommu Linux/M32R Linux/M32R 2. M

HP ProLiant 500シリーズ

The Effect of the Circumferential Temperature Change on the Change in the Strain Energy of Carbon Steel during the Rotatory Bending Fatigue Test by Ch

NKK NEWS 2012

A Responsive Processor for Parallel/Distributed Real-time Processing

XcalableMP入門

Microsoft Word - 01マニュアル・入稿原稿p1-112.doc



DL1010.PDF


220 28;29) 30 35) 26;27) % 8.0% 9 36) 8) 14) 37) O O 13 2 E S % % 2 6 1fl 2fl 3fl 3 4

Microsoft PowerPoint - CCS学際共同boku-08b.ppt

はじめに

Development of Induction and Exhaust Systems for Third-Era Honda Formula One Engines Induction and exhaust systems determine the amount of air intake

Microsoft PowerPoint - GPU_computing_2013_01.pptx

Itanium2ベンチマーク

CTA 82: CTA A A B B A B A, C A A A D A B Max-Planck-Inst. fuer Phys. C D

P2P P2P Winny 3 P2P P2P 1 P2P, i

,,,,., C Java,,.,,.,., ,,.,, i

untitled

00-COVER.P65

1 2


2017 (413812)

NEC Storage series NAS Device

スパコンに通じる並列プログラミングの基礎

スパコンに通じる並列プログラミングの基礎

Express5800/110Ee Pentium 1. Express5800/110Ee N N Express5800/110Ee Express5800/110Ee ( /800EB(256)) ( /800EB(256) 20W) CPU L1 L2 CD-

スパコンに通じる並列プログラミングの基礎

きずなプロジェクト-表紙.indd

6 2. AUTOSAR 2.1 AUTOSAR AUTOSAR ECU OSEK/VDX 3) OSEK/VDX OS AUTOSAR AUTOSAR ECU AUTOSAR 1 AUTOSAR BSW (Basic Software) (Runtime Environment) Applicat

17 TCP (ACK:ACKnowledge) (RTT:Round Trip Time) TCP (Transmission Control Protocol) PSPacer (Precise Software Pacer) JGN2 TCP FAST TCP UDP PSPacer

I TCP 1/2 1

,4) 1 P% P%P=2.5 5%!%! (1) = (2) l l Figure 1 A compilation flow of the proposing sampling based architecture simulation

スライド 1

fiš„v8.dvi


ISSN NII Technical Report Patent application and industry-university cooperation: Analysis of joint applications for patent in the Universit

【生】④安藤 幸先生【本文】4c/【生】④安藤 幸先生【本文】

LAN LAN LAN LAN LAN LAN,, i



main.dvi


Core1 FabScalar VerilogHDL Cache Cache FabScalar 1 CoreConnect[2] Wishbone[3] AMBA[4] AMBA 1 AMBA ARM L2 AMBA2.0 AMBA2.0 FabScalar AHB APB AHB AMBA2.0

1 Table 1: Identification by color of voxel Voxel Mode of expression Nothing Other 1 Orange 2 Blue 3 Yellow 4 SSL Humanoid SSL-Vision 3 3 [, 21] 8 325

IPSJ SIG Technical Report * Wi-Fi Survey of the Internet connectivity using geolocation of smartphones Yoshiaki Kitaguchi * Kenichi Nagami and Yutaka

<30375F97E996D88E812E696E6464>

Express5800/53Xg, Y53Xg インストレーションガイド(Windows編)

Express5800/320Fa-L/320Fa-LR/320Fa-M/320Fa-MR

VMware VirtualCenter: Virtual Infrastructure Management Software

卒業論文2.dvi

Cisco ESRのメンテナンス

Transcription:

PC RWC 01/10/31 2 1

SCore 1,024 PC SCore III PC 01/10/31 3 SCore SCore Aug. 1995 Feb. 1996 Oct. 1996 1997-1998 Oct. 1999 Oct. 2000 April. 2001 01/10/31 4 2

SCore University of Bonn, Germany University of Heidelberg, Germany University of Tuebingen, Germany Oxford University, England Warwick University, England 01/10/31 5 Host PC RWC SCore III NEC Express Servers Dual Pentium III 933 MHz 512 Mbytes of Main Memory # of Hosts 512 Hosts (1,024 Processors) Networks Myrinet-2000 2 Ethernet Links Linpack Result 618.3 Gflops This is the world fastest PC cluster at August of 2001 01/10/31 6 3

Myrinet-2000 2000 2 Gbps full duplex NIC Lanai DMA Engines HOST NIC Outgoing/Incoming Message 16 port switch warm hall routing 01/10/31 7 SCore Version 4 System Software High Performance Communication Libs PMv2 15.0 usec Round Trip time 233 MB/s Bandwidth MPICH-SCore MPI Library 24.4 usec Round Trip time 228 MB/s Bandwidth PM/Ethernet Network Trunking Utilizing more than one NIC Global Operating System SCore-D Single/Multi User Environment Gang scheduling Checkpoint and restart Parallel Programming Language MPC++ Multi-Thread Template Library Shared Memory Programming Support Omni OpenMP on SCASH 10 times faster than Fast Ethernet + TCP/IP Three times as fast as Gigabit Ethernet + TCP/IP OMNI/SCASH SCASH PM/Shmem PM/Shmem driver Applications MPC++ MPICH-SCore SCore-D Global Operating System PM/Myrinet PM/Myrinet driver PMv2 Myrinet NIC PM firmware PM/Ethernet PM/Ethernet driver PVM-SCore Ethernet driver PM/UDP Socket UDP/IP Ethernet NIC PBS Linux User Level Kernel Level NIC Level 01/10/31 8 4

PM PM vs. GM DMA GM DMA 01/10/31 9 MPI Point to Point MPI Communication Bandwidth 2.50E+08 PM/Myrinet GM 1.0E+08 PM/Ethernet PM/Ethernet (2Way) 2.00E+08 LAM/MPI Bandwidth (Byte/sec) 1.50E+08 1.00E+08 Bandwidth (Byte/sec) 1.0E+07 1.0E+06 5.00E+07 0.00E+00 1.E+00 1.E+01 1.E+02 1.E+03 1.E+04 1.E+05 1.E+06 1.E+07 Message Size (Byte) 1.0E+05 1.0E+00 1.0E+01 1.0E+02 1.0E+03 1.0E+04 1.0E+05 1.0E+06 1.0E+07 Message Size (Byte) 01/10/31 10 5

Application Benchmark 512x256x256 01/10/31 11 Application Benchmark IS (Class C) 700 600 500 PM/Myrinet GM PM/Ethernet PM/Ethernet (2Way) TCP/IP(LAM) Total Mops 400 300 200 100 0 0 50 100 150 200 250 300 Number of Procs 01/10/31 12 6

Application Benchmark FT (Class C) FFT 12000 10000 8000 PM/Myrinet GM PM/Ethernet PM/Ethernet (2Way) TCP/IP(LAM) Total Mops 6000 4000 2000 0 0 50 100 150 200 250 300 Number of Procs 01/10/31 13 Application Benchmark Total Mops 40000 35000 30000 25000 20000 15000 10000 5000 LU (Class C) PM/Myrinet GM PM/Ethernet PM/Ethernet (2Way) TCP/IP(LAM) SSOR(Symmetric Successive Over-Relaxation) CFD 0 0 50 100 150 200 250 300 Number of Procs 01/10/31 14 7

Application Benchmark MG (Class C) 16000 14000 12000 PM/Myrinet GM PM/Ethernet PM/Ethernet (2Way) TCP/IP(LAM) Total Mops 10000 8000 6000 4000 2000 0 0 50 100 150 200 250 300 Number of Procs 01/10/31 15 PC 01/10/31 16 8

SCore III : 2 2 PC 2 Ethernet Myrinet Myrinet Clos128 Gigabit Ethernet 01/10/31 17 Myrinet-2000 2000 E128 Switches are connected by eight port switches. #6 #4 #2 #0 #1 E128 E128 nodes nodes nodes nodes nodes nodes #3 #5 #7 01/10/31 18 9

Ethernet eth1 Summit7i eth0 Summit7i #6 #4 #6 #2 #4 #0 #2 #0 #3 #1 #3 #7 #1 #5 #5 #7 01/10/31 19 01/10/31 20 10

01/10/31 21 01/10/31 22 11

01/10/31 23 16 Ethernet 4 2 Ethernet PC 5 01/10/31 24 12

(1/2) (800 Mbytes) RPM anaconda installation tool Score MBR The First Partition contains the installation image Other Partitions are empty SCore II NEC Express Servers (2U type) with Myrinet 5 SCSI disks 2 6 1 minutes and half for one disk copy 01/10/31 25 (2/2) 1st stage 2nd stage IP Kickstart anaconda 01/10/31 26 13

Each rack, Each modules, and All racks Score rcstest all-to-all Stressing Myrinet network in terms of network packets memory and Lanai Processor of Myrinet NIC Stressing processors and memory in Some initial hardware failures appear at at the the stress test!!!! 01/10/31 27 Connection between Myrinet Card and PCI bus slot Performance degradation We have not found the reason Connection between Myrinet Line Card and back-plane No communication CRC errors Connection between Myrinet Card and Cable No communication 01/10/31 28 14

PC PC Ethernet Ethernet Linux Ethernet NIC Ethernet Switch 01/10/31 29 8 01/10/31 30 15

PC PC 01/10/31 31 NIC 01/10/31 32 16

PC vs vs 01/10/31 33 PC PC PC Linux SCore 01/10/31 34 17

PC 19, 2, 7 SCore Omni OpenMP www.pccluster.org PC 01/10/31 35 Real World Computing Partnership is over, but SCore Development is continued 01/10/31 36 18