untitled

Similar documents
1重谷.PDF

卒業論文

09中西

橡3_2石川.PDF

HP High Performance Computing(HPC)

Itanium2ベンチマーク

PROLIANT ML

develop

HPEハイパフォーマンスコンピューティング ソリューション

スライド 1

HP High Performance Computing(HPC)

ProLiant BL20p Generation 4 システム構成図

OVERVIEW ProLiant ML330 ProLiantML330(P1266)ATA PAQ V 0 A IDE CD-ROM PCI 64 PCI ( ) A B B B C D E 0 1 (1 ) (IDE ) 3.5 (1 ) (SCSI ) ProLi

ProLiant BL460c システム構成図

Myrinet2000 ご紹介


Microsoft PowerPoint - ★13_日立_清水.ppt

ProLiant BL25p Generation 2システム構成図

untitled

PROLIANT ML

Second-semi.PDF

ProLiant ML110 Generation 4 システム構成図

PC Development of Distributed PC Grid System,,,, Junji Umemoto, Hiroyuki Ebara, Katsumi Onishi, Hiroaki Morikawa, and Bunryu U PC WAN PC PC WAN PC 1 P


untitled

HP ProLiant 500シリーズ

OVERVIEW ProLiant ML370(X2400, X2800, X3060, X3200) ProLiant ML370 A B D B C D B C 2 () (2 ) (1.6 ) 3.5 LED 48 IDE CD-ROM 5.25 Wide Ultra3/U

HPC (pay-as-you-go) HPC Web 2

HPC


Microsoft PowerPoint - CCS学際共同boku-08b.ppt

Standard Features 550MHz ProLiant 6400R 6/550-2M /550-1M / MHz ProLiant 6400R 6/500-

HP StoreVirtual(LeftHand)

システムユニット構成ツリーの見方

ProLiant ML115 Generation 1 システム構成図

ProLiant ML110 システム構成図

ProLiant DL380 SAN Storageモデル システム構成図

ProLiant SL6000 Sclable System システム構成図

HP Blade Workstation HP RCS Remote Client Solution HP Blade Workstation CO2 2

HP ProLiant ML110 Generation 5 システム構成図

ProLiant BL35p システム構成図

OVERVIEW hp StorageWorks NAS 500s hp StorageWorks NAS 500s A C D () 4 SATA RAID Ultra320 SCSI Serial ATA 3.5 DVD-ROM hp StorageWorks NAS 500s

PowerPoint プレゼンテーション

CPU Levels in the memory hierarchy Level 1 Level 2... Increasing distance from the CPU in access time Level n Size of the memory at each level 1: 2.2


Microsoft PowerPoint _AMD.ppt

main.dvi

スライド 1

untitled

SharePoint 2003 Performance White Paper


ProLiant ML115 Generation 1 システム構成図

GPU n Graphics Processing Unit CG CAD

untitled

<834E C F D E657073>

Express5800/120Ed

クララパンフレット2011冬1P-P40

Express5800/140Ma

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,,

ProLiant DL165 G6 システム構成図

supercomputer2010.ppt

OVERVIEW hp StorageWorks NAS 2000s hp StorageWorks NAS 2000s A 3.5 B 3.5 IDE DVD-ROM C LED LED Ultra320 SCSI ( ) NAS 2000s NAS 2000s NAS

ProLiant DL380 Generation 4 システム構成図

スパコンに通じる並列プログラミングの基礎

Microsoft PowerPoint - intro.ppt

HP ProLiant Gen8とRed Hatで始めるHadoop™ ~Hadoop™スタートアップ支援サービス~

untitled

PowerEdge R730xd Contents RAID /RAID & P3-6 PCIe P P P P OS P P P P7 P8 P9 P10-11 P12-17 P P112

HP StorageWorks P4000 G2 SAN Solutions (LeftHand ) システム構成図

スパコンに通じる並列プログラミングの基礎

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,,

1 All Rights Reserved, Copyright 2004, NEC Corporation 2 All Rights Reserved, Copyright 2004, NEC Corporation

BL tc2120ml110dl140dl145 ProLiant Essentials Foundation ProLiant Essentials Foundation OS DOS Insight ProLiant Essentials Foundation ( MS-DOS ) ProLia

Express5800/120Mc

Express5800/120Lc

ProLiant ML110 Generation 4 システム構成図

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,

OVERVIEW ProLiant ML350(X2800, X3060) ProLiant ML350 A B C D IDE CD-ROM Wide Ultra3/Ultra320 PCI (64 /100Mz PCI-X 4 32 /33Mz PCI 1) ( ) (CP) P

第3回戦略シンポジウム緑川公開用

HP ProLiant ML370 Generation

HP ProLiant ML310 Generation 3 システム構成図

スライド 1

ProLiant DL380 Generation 4 システム構成図

BL tc2120ml110dl140dl145 ProLiant Essentials Foundation ProLiant Essentials Foundation OS DOS Insight ProLiant Essentials Foundation ( MS-DOS ) ProLia

OVERVIEW ProLiant ML570 Generation 3 ProLiant ML570 (SCSI ) SCSI ID SCSI LED IDE DVD-ROM ( ) Ultra320 SCSI A B C 3.5 ( ) ( ) ProL

システムソリューションのご紹介

次世代スーパーコンピュータのシステム構成案について

Microsoft Word - PowerEdge_M-Series_Competitive_Power_Study_-_August_2010[1]_j.docx

Express5800/140Ma

アライドテレシス x900 Day ~最新製品とサービスが織りなすネットワーク・ソリューション~ 基調講演「医用ネットワーク設計の勘どころ」

GRAPE GRAPE-DR V-GRAPE

Express5800/110Ee Pentium 1. Express5800/110Ee N N Express5800/110Ee Express5800/110Ee ( /800EB(256)) ( /800EB(256) 20W) CPU L1 L2 CD-

OVERVIEW ProLiant ML350 G5 Storage Server ProLiant ML350 G5 Storage Server LFF(3.5") A B C D SFF(2.5") 4 16 DVD 2 6 LFF(3.5") /SATA A B C D E 6(PCI Ex

Pentium III Standard Features 600MHz ProLiant 3000R 6/ Smart3200 Wide Ultra2 550MHz ProLiant 3000R 6/

HPE Moonshot System ~ビッグデータ分析&モバイルワークプレイスを新たなステージへ~

Ver Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI

OVERVIEW ProLiant ML370(X2800, X3060, X3200) ProLiant ML370 A B D B C D B C 2 () (2 ) (1.6 ) 3.5 LED 48 IDE CD-ROM 5.25 Wide Ultra3/

Po w eredge M000e Index? & 00% 5 32CPU 256 0U PowerEdge M000e PowerEdge M000eI/O 6

LinuxDeviceDriver2003-PDF.PDF

OVERVIEW ProLiant ML110 G2 Storage Server ProLiant ML110 G2 Storage Server A C D SATA NH 320GB 01 (1TB) (1TB) Ultra320 SCSI 6 SATA RAID Serial

Transcription:

taisuke@cs.tsukuba.ac.jp http://www.hpcs.is.tsukuba.ac.jp/~taisuke/

CP-PACS HPC PC post CP-PACS CP-PACS II

1990 HPC RWCP,

HPC

かつての世界最高速計算機も 1996年11月のTOP500 第一位 ピーク性能 614 GFLOPS Linpack性能 368 GFLOPS (地球シミュレータの前 に日本が一位を取った 最後の計算機 2003年11月のTOP500 ついに drop off!! CCSシンポジウム (2004/06/10)

6Gflops 1000 6 Tflops Infiniband (x4): 1 Gbyte/s MyrinetXP (dual): 500 Mbyte/s CP-PACS 16 bank

Flare cluster DELL PowerEdge 1750 Xeon 3.06GHz dual 12 nodes, 72 GFLOPS Gigabit Ethernet Linux CPU Orion cluster Compaq AlphaServer DS20L Alpha EV68 833MHz dual 30 nodes, 100 GFLOPS Fast Ethernet Linux + SCore CPU Perseus cluster HP ProLiant DL360G3 Xeon 2.8GHz dual 37 nodes, 414 GFLOPS Myrinet2000 Linux + SCore (HMCS) Corona cluster HP ProLiant DL380G3 Xeon 3.06GHz dual 8 nodes, 48 GFLOPS Gigabit Ethernet x 6 Linux+SCore trunk

Perseus custer Xeon dual, Myrinet2000, 37 nodes) SCore+PBS+CMU MPI on PM/Myrinet, no SCore-D GRAPE-6 HMCS: Heterogeneous Multi-Computer System) 6 13 200GFLOPS Myrinet2000 full connection

PC-Cluster (Xeon dual) Parallel I/O System PAVEMENT/PIO MPP for Particle Simulation (GRAPE-6) Paralel File Server (SGI Origin2000) 100base-TX Switches 32bit PCI N Hybrid System Communication Cluster (Compaq Alpha) Parallel Visualization Server (SGI Onyx2) Parallel Visualization System PAVEMENT/VIZ

CPU (Alpha EV68 dual, Xeon dual) QCDpost processing WS

CP-PACS CPU 99% 10 20TFLOPS

full QCD FFT + CG HMCS MPP

IntelXeon, Opteron, Itanium2 Dual CPU Network bound SAN (System Area Network) MyrinetXP: dual connection Infiniband: x4 GbEthernet

ex. InfiniBand, Quadrix vs ex. GbE n

PC QCD Lattice QCD x, y, z, t PC

[flop] Load [byte] Store [byte] [byte/flop] t 288 672 192 3.00 x 336 864 192 3.14 y 336 864 192 3.14 z 336 864 192 3.14 clover 600 864 192 1.76 5088[byte]/1896[flop] = 2.68 [byte/flop]

4 Nx*Ny*Nz*Nt3 3nx*ny*nz x y z [byte] 12*2*(Nt/2+1)*(Ny/ny)*(Nz/nz)*16 12*2*(Nt/2+1)*(Nx/nx)*(Ny/ny)*16 12*2*(Nt/2+1)*(Nx/nx)*(Ny/ny)*16 1 Nx=Ny=Nz=Ns, nx=ny=nz=ns 0.608 * ((Nt+2)/Nt) / (Ns/ns) [byte/flop]

P4 (Pentium4, 2.4GHz): HyperThreading PC3200 memory single CPU Xeon (Pentium Xeon, 2.8 GHz): PC2100 memory, 2 CPU SMP EV7 (Alpha EV7, 1.15 GHz): PC3200 memory, 16 CPU HyperTransport connected Alpha EV7 HP

CPU [Mflops] No copy P4 Copy No copy P4, Xeon SSE2 Xeon Copy No copy EV7 Copy 2*2*2*64 1251 957 811 598 1190 949 4*4*4*64 1020 878 633 536 1144 1034 8*8*8*64 1045 958 686 625 1140 1082 16*16*16*64 N/A N/A 604 573 1122 1101 No Copy: CPU Copy:

EV7 16 CPU ( 16*16*16*64) CPU 1 2 4 8 16 1 0.94 0.86 0.72 0.32 8 CPU HyperTransport

32*32*32*64 1 trajectory=20284 [Tflop] [Tflop] [Tbyte] [Tflop/Tbyte] 1 20284.0 ----- ----- 8 2535.50 77.1 32.9 64 316.938 19.3 16.4 512 39.6 4.82 8.22 4096 4.95 1.20 4.11 32768 0.619 0.301 2.06

1Gflops0.6Gbyte/s 32*32*32*645000 trajectory CPU 512 (ns=8) 4096 (ns=16) [ ] 2768 405 [%] 17 29

Xeon 3.06GHz, 1CPU)

Xeon 3.06GHz)

PC3200 (3.2 Gbyte/s) x 2 3MB L2 short vector SSE2 SSE3

CPU CPU

CPU NIC+Switch

24.6 Tflops (4GHz CPU) 3072 CPU (1536 boards) 2 CPU CPU 6.1 TB 1.05 PB (RAID0 mirror) GbEthernet trunk (dual link 3 ) PCI-X dual Gigabit Ethernet 3-D Hyper Crossbar 88 (Node=48, Switch=40)

IDE CPU CPU IDE HDD HDD chip-set memory chip-set memory x0, x1: X dual link y0, y1: Y dual link management net (100Mbps) data net (GbE x 6) data net (GbE x 6) management net (100Mbps) z0, z1: Z dual link x0 x1 y0 y1 z0 z1 x0 x1 y0 y1 z0 z1 management network switch

X Z=12 Y Z CPU Y=16 1 CPU CPU CPU X=16 dual link

CPU([0-F],[0-F],[0-1])1/6=512 CPU [Z] Y(x=0 15, z=z) [Y] Z(x=0 15, y=y) Y1-Y2,Z CPU (x=0 F,y=Y1 Y2,z=Z) Y(1128 ) Z(132 ) [Z] X(y=0 15, z=z) X(1128 ) 0-3,0 8-B,0 4-7,0 C-F,0 0-3,1 8-B,1 4-7,1 C-F,1 [0] [0] [2] [2] [1] [1] [3] [3] [2] [0] [1] [3] [6] [4] [5] [7] [4] [4] [6] [6] [5] [5] [7] [7] [A] [8] [9] [B] [E] [C] [D] [F] [8] [8] [A] [A] [9] [9] [B] [B] 0-3,2 8-B,2 4-7,2 C-F,2 0-3,3 8-B,3 4-7,3 C-F,3 0-3,4 8-B,4 4-7,4 C-F,4 0-3,5 8-B,5 4-7,5 C-F,5 0-3,6 8-B,6 4-7,6 C-F,6 0-3,7 8-B,7 4-7,7 C-F,7 0-3,8 8-B,8 4-7,8 C-F,8 0-3,9 8-B,9 4-7,9 C-F,9 0-3,A 8-B,A 4-7,A C-F,A 0-3,B 8-B,B 4-7,B C-F,B

CPUSSE dual CPU SMP 3

PC