次世代スーパーコンピュータのシステム構成案について

Size: px
Start display at page:

Download "次世代スーパーコンピュータのシステム構成案について"

Transcription

1

2 A 3.3 B /4/27 4 1

3 /4/27 4 2

4 NEC NHF PFLOPS2.5PB 30MW 3, SimFold, GAMESS, Modylas, RSDFT, NICAM, LatticeQCD, LANS HPL, NPB-FT /4/27 4 3

5 NH 1,280 N 40,960 SMP CPU 40, , PFLOPS : 2.5PB N 2TB Fat-tree Fat-tree 16GB/s Gbps 17.5MW (Linpack) SW2 #00 SW2 #15 SW2 #16 SW2 #31 SW2 #32 SW2 #47 SW2 #48 SW2 #63 Fat-tree 4 SW1 SW1 SW1 #00 #15 # SW0 SW0 SW0 #00 #15 #16 16 SW1 SW1 #31 #32 SW0 SW0 #31 #32 SW1 SW1 #47 #48 SW0 SW0 #47 #48 SW1 SW1 #63 #64 SW0 SW0 #63 #64 SW1 #79 SW0 #79 16GB/s x 16links x 2 16GB/s x 16links x 2 N : 32CPU, 128Core, 8.19TFLOPS, 2TB N : 32CPU, 128Core, 8.19TFLOPS, 2TB N NUMA 16GB/s x 16links x 2 N NUMA 16GB/s x 16links x 2 CPU: 256GFLOPS CPU: 256GFLOPS CPU: 256GFLOPS CPU: 256GFLOPS Core: 2GHz Core: (2FMA 2GHz Core: x 8VPP) (2FMA 2GHz Core: 64GFLOPS x 8VPP) (2FMA 2GHz 64GFLOPS x 8VPP) (2FMA 64GFLOPS x 8VPP) 64GFLOPS (32) Core: 2GHz Core: (2FMA 2GHz Core: x 8VPP) (2FMA 2GHz Core: 64GFLOPS x 8VPP) (2FMA 2GHz 64GFLOPS x 8VPP) (2FMA 64GFLOPS x 8VPP) 64GFLOPS (1280 N ) Core: 2GHz Core: (2FMA 2GHz Core: x 8VPP) (2FMA 2GHz Core: 64GFLOPS x 8VPP) (2FMA 2GHz 64GFLOPS x 8VPP) (2FMA 64GFLOPS x 8VPP) 64GFLOPS (32) Core: 2GHz Core: (2FMA 2GHz Core: x 8VPP) (2FMA 2GHz Core: 64GFLOPS x 8VPP) (2FMA 2GHz 64GFLOPS x 8VPP) (2FMA 64GFLOPS x 8VPP) 64GFLOPS L2$: 8MB L2$: 8MB L2$: 8MB L2$: 8MB 128GB/s 128GB/s 128GB/s 128GB/s MEM: 64GB MEM: 64GB MEM: 64GB MEM: 64GB 2007/4/27 4 4

6 NH 45nmCPU 256GFLOPS CPU42GHz 2FMAx8128KB 8MB 4RDB Reusable Data Buffering L2 1CPU4 SMP 40,960CPU10.48PFLOPS2.5PB N 32CPU OSMPI CPU CPU 140W Linpack 328TB/s 3Fat tree 1280 N 2007/4/27 4 5

7 NH OS: LinuxIO OS : : OpenMP MPI : Fortran HPF CAF C/C++ MPI 2007/4/27 4 6

8 F 82,944 CPU 82, , PFLOPS 2.53PB32GB ToFu: +3D 18CPU D 5.0GB/s GB/s MW (Linpack) 3D 30GB/s x 6 2 /9 / 30GB/s x 6 2 /9 / CPU: 2GHz, 128GFLOPS (8Cores) Core: Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core SIMD(4FMA) SIMD(4FMA) 16GFLOPS L2$: 6MB MEM: 32GB 64GB/s 82,944 CPU: 2GHz, 128GFLOPS (8Cores) Core: Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core SIMD(4FMA) Core: SIMD(4FMA) SIMD(4FMA) 16GFLOPS L2$: 6MB MEM: 32GB 64GB/s 2.5GB/s x 8 links x 2 180GB/s 2.5GB/s x 8 links x 2 180GB/s 2007/4/27 4 7

9 F 45nm 1CPU LSI 128GFLOPS 1CPU82GHz FP128SPARC-V9 4 SIMD4FMA 4 HPC 6MB L28 / 82,944CPU 10.6PFLOPS2.53PB Linpack 58W/CPU 20 ToFu Torus-connected Full connection 18CPU 1 3D /4/27 4 8

10 F OS POSIX UNIX OS OpenMP MPI : 8SMP8SMP D ToFu Fortran XP Fortran HPF CAF C/C++ MPI 2007/4/27 4 9

11 NH F PFLOPS PB PB / m 2 1,446 / 2,976 1,475 / 3,198 / MW 17.5 / 23 Linpack 15.5 / 22.8 Linpack CPU 40,960 82, , ,552 Fat Tree D 2007/4/

12 NH F GHz 2 GFLOPS : 2FMA x 8VPP) SIMD 4FMA GFLOPS CPU Byte/Flop L2 0.5 MB 8 6 Byte/Flop /4/

13 2GHz Thin Fat NH 40,960 F 82,944 () HPC 2007/4/

14 2 NH 4 16 F 8 66 NH F SIMD NH Fat Tree F D 2007/4/

15 SimFold GAMESS Modylas RSDFT NICAM LatticeQCD LANS HPL High Performance LinpackNPB-FT 2007/4/

16 NH F PFLOPS SimFold GAMESS Modylas RSDFT NICAM LatticeQCD LANS HPL NPB-FT 7 HPL NPB- FT 2007/4/

17 NH LatticeQCD LANS NH F NH F NH F NH F NH F NH F NH F NH F SimFold GAMESS Modylas RSDFT NICAM LatticeQCD LANS NPB-FT RSDFTNPB-FT 2007/4/

18 12 NH F PFLOPS BMT 2007/4/

19 10PFLOPS2.5PB 30MW 3,200 BMT CPU F NH F NH 2007/4/

20 /4/

21 /4/

22 1. LINPACK 10PFLOPS 2. 10PFLOPS 10PFLOPS 3-5PFLOPS PC 3. 3PFLOPS 3PFLOPS 1PFLOPS 2007/4/

23 F 10PFLOPSNH 3PFLOPS A B Fat Tree Fat Tree ToFu Fat Tree NIC F NH F NH F NH ToFu /4/

24 FNH 10PFLOPS 3PFLOPS Linpack 10PFLOPS A B ToFu Fat Tree F NH 2007/4/

25 1/ /4/

26 2/3 SIMD 2007/4/

27 3/3 CPU 2007/4/

28 /4/

29 A B ToFu Fat Tree A B LINPACK 10PFLOPS A 10PFLOPS B 3PFLOPS A: 11.2PFLOPS x 85% LINPACK =9.52PFLOPS B: 3.1PFLOPS x 90% LINPACK =2.79PFLOPS 1.2TB/ 15PB F 80PB NH 5PB A+B LINPACK 90% 11.08PFLOPS 85% 10.46PFLOPS 80% 9.85PFLOPS A 1/8 B/FLOPS B 1/4 1/8 B/FLOPS 100PB A B10 A B B 2007/4/

30 On-the-fly 2007/4/

31 On-the-fly 10PFLOPS t 1 t 2 t 3 2, 2, 2, t 1 t 2 t 3 A B 2007/4/

32 On-the-fly 10A A 2 10TB B 10PFLOPS 3PFLOPS 10TB 10TB 1PFLOPS 2 on N TB 10TB 2 on N TB 10TB 2 on N n GB/CPU 1.0 1TB/ A B 2007/4/

33 - e - e I - I - 3 3PF 1PF 30GB 40TB 45GB 0.3GB SCF-CI 4GB 3GB 2007/4/

34 A 10PFLOPS 15PB A ToFu F 80PB B Fat Tree NH 5PB B 1PFLOPS 1TB NUMA 2007/4/

35 A 13PF A 10PF+B 3PF (1PF )10 A 13PF A 10PF B 3PF 151 5, ,500 B : NICAM 1 : 1.9 LANS 1 : /4/

36 /4/

37 CPU 99, , PFLOPS PB 100PB A B 24MW 3, MW/PFLOPS 266 /PFLOPS 15PB CPU 87, , PFLOPS 1.34PB 15.2MW 1,900 CPU 12,288 49, PFLOPS PB 6.8MW 900 5PB 80PB 1.2TB/s 2.0MW /4/

38 MPI A MPI B 2007/4/

39 A B ACL MPI API 2007/4/

40 A B 57m 52m A B 1, m 3,800 36m 70m 12.5m 17m 2007/4/

41 2007/4/ A LSI OS LSI OS B

42 3.2 A 2007/4/

43 A 87,552 CPU 87, , PFLOPS 1.34PB16GB ToFu +3D 18CPU 1 20x16x16 =5,120 3D 15.2 MW (Linpack) 3D 30GB/s x 6 2 /9 / 30GB/s x 6 2 /9 / CPU: 2GHz, 128GFLOPS (8Cores) Core: Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core SIMD(4FMA) SIMD(4FMA) 16GFLOPS L2$: 6MB MEM: 16GB 64GB/s 87,552 CPU: 2GHz, 128GFLOPS (8Cores) Core: Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core: SIMD(4FMA) Core SIMD(4FMA) Core: SIMD(4FMA) SIMD(4FMA) 16GFLOPS L2$: 6MB MEM: 16GB 64GB/s 2.5GB/s x 8 links x 2 180GB/s 2.5GB/s x 8 links x 2 180GB/s 2007/4/

44 A 45nm 1CPU(LSI)128GFLOPS 1CPU82GHz FP128SPARC-V9 4 ) SIMD (4FMA 4 ) HPC 6MB L28 / 42W/CPU Linpack 58W/CPU20 ToFu (Torus-Full connection) 18CPU /4/

45 8128FP 2GHz SIMD GFLOPS CPU 128GFLOPS 6MB 64GB/s 32GB/s 32GB/s L2 L1 2B/FLOP L2 0.5B/FLOP CPU 128GF 16GFx8 2GH 8 2 2SIMD 2 2SIMD KB(2way) 116KB(2way) 2 6MB(12way) 64GB/s 2007/4/

46 SIMD 4,8 (1) SIMD 2 (2) SIMD 4 Basic FPR(%b0-%b63) FPR(%e0-%e63) FPR(%b0-%b63) FPR(%e0-%e63) FMA FMA FMA FMA A-pipe B-pipe C-pipe D-pipe FMA FMA FMA FMA A-pipe B-pipe C-pipe D-pipe Extend (3) SIMD1 SIMD2 FPR(%b0-%b63) FPR(%e0-%e63) FPR(%b0-%b63) FPR(%e0-%e63) FMA FMA FMA FMA A-pipe B-pipe C-pipe D-pipe FMA FMA FMA FMA A-pipe B-pipe C-pipe D-pipe 2007/4/

47 CPU 16GB SBCPU 2 32GB ICC Interconnect Controller CPU-ICC 32GB/s ICCPCI Express gen2 DIMM DIMM 32GB/s 32GB/s DIMM CPU CPU 32GB/s 32GB/s 82GB/s ICC DIMM 32GB/s 32GB/s PCIe Gen2 4GB/s x3 ToFu 6.4Gbps / differential pair PCIe Gen2... 5Gbps / differential pair Full / ToFu 5GB/s x8 Torus / ToFu 10GB/s x 2(+1) 2007/4/

48 ToFu ToFu Torus-connected Full-connection 2 9SB 2.5GB/s 2) ToFu 2 20x16x16 3 5GB/s x 3 x 2 = 30GB/s MPI D 2007/4/

49 mm 3 52m 36m 2007/4/

50 25 SW (8) (50) 10GbE SW 10GbE 50TB RAID SCFB 50TB RAID SW SW IO SB SW SW (320) (320) (320) (320) 10GbE SW 10GbE SW 10GbE SW 10GbE SW 1GbE SW 1GbE SW 1(12) 8GFC PB 2007/4/

51 OS POSIXUNIX OS SW OpenMP MPI : Fortran HPF CAF XP Fortran C/C++ A 8SMP 87,552 B ToFu 2007/4/

52 RAS CPU ECC RAM /4/

53 3.3 B 2007/4/

54 B 12, N CPU 12,288 49, PFLOPS PB32-64GB N 32CPUs NUMA1TB-2TB 2 Fat-tree ( ) x 16 7MW 900 Fat-tree SW2 24 #00 SW0 #00 16 SW0 #02 SW2 #02 SW0 #03 SW2 #15 SW0 #23 16GB/s x 16links x 2 16GB/s x 16links x 2 N : 32CPU, 128Core, 8.19TFLOPS, 1-2TB N : 32CPU, 128Core, 8.19TFLOPS, 1-2TB N NUMA 16GB/s x 16links x 2 N NUMA 16GB/s x 16links x 2 CPU: 256GFLOPS CPU: 256GFLOPS CPU: 256GFLOPS CPU: 256GFLOPS Core: 2GHz Core: (2FMA 2GHz Core: x 8VPP) (2FMA 2GHz Core: 64GFLOPS x 8VPP) (2FMA 2GHz 64GFLOPS x 8VPP) (2FMA 64GFLOPS x 8VPP) 64GFLOPS (32) Core: 2GHz Core: (2FMA 2GHz Core: x 8VPP) (2FMA 2GHz Core: 64GFLOPS x 8VPP) (2FMA 2GHz 64GFLOPS x 8VPP) (2FMA 64GFLOPS x 8VPP) 64GFLOPS (384 N ) Core: 2GHz Core: (2FMA 2GHz Core: x 8VPP) (2FMA 2GHz Core: 64GFLOPS x 8VPP) (2FMA 2GHz 64GFLOPS x 8VPP) (2FMA 64GFLOPS x 8VPP) 64GFLOPS (32) Core: 2GHz Core: (2FMA 2GHz Core: x 8VPP) (2FMA 2GHz Core: 64GFLOPS x 8VPP) (2FMA 2GHz 64GFLOPS x 8VPP) (2FMA 64GFLOPS x 8VPP) 64GFLOPS L2$: 8MB L2$: 8MB L2$: 8MB L2$: 8MB 256GB/s 256GB/s 256GB/s 256GB/s MEM: 32-64GB MEM: 32-64GB MEM: 32-64GB MEM: 32-64GB 2007/4/

55 B 45nmCPU 256GFLOPS CPU42GHz 8FMAx2128KB 8MB L24 RDB (Reusable Data Buffering) 12,288CPU3.14PFLOPS PB N 32CPU OS : 140W/CPU Linpack 98TB/s 2Fat tree 384 N 2007/4/

56 4 1 8MB L2 2GHz 64GFLOPS CPU 256GFLOPS 1B/FLOP 8MB L2 256GB/s 128GB/s 1B/FLOP L2 4B/FLOP RDB (Reusable Data Buffering) 256GF 64GFx4 8MB 8way- 64B/4 Unified 1B/FLOP 16GB/s 2 256GB/s 2007/4/

57 128 4way 8 2 / /4/

58 N 4CPU 1U 8U 32CPU I/O NUMA 2CPUN33x33 16GB/s x 2 I/Ox86 NN N N 16GB/s x x 16 MM MM MM MM C C C C C C C C C C C C C C C C MM MM MM MM MM MM MM MM C C C C C C C C C C C C C C C C MM MM MM MM MM MM MM MM C C C C C C C C C C C C C C C C MM MM MM MM I/O I/O U #0 U #1 U #7 CPU CPU 2007/4/

59 N 2Fat-tree 16GB/s Gbps N N 98TB/s SW2 24 #00 SW0 SW0 #00 #02 16 N 16 CPU #0~3 CPU #4~7 CPU #28~31 SW2 #02 SW2 #15 SW0 #03 SW0 # /4/

60 54.5m 2 N ) 1I/O mm 2000mm 1000mm 2N mm I/O SW 8SW 1000mm 600mm 800mm I/O m 800mm 1000mm 2007/4/

61 OS: LinuxIO OS : SW : OpenMP MPI : Fortran HPF CAF C/C++ UPC 2007/4/

62 RAS CPU ECCRAM(L2 ) I/F RAM MOD-N Out-of-N BIST (Built-In Test) / LSI ECC 1 N / OS CPU N I/O NN RAID6 I/O 2007/4/

63 /4/

64 A SIMD ToFu SIMD RAS B Fat-tree VCSEL 20Gbps SerDes RAS 2007/4/

65 A LSI (1/2) LSI 45nm LSI 8 HPC SIMD 6MB 128GFLOPS /101 / ) - RAM - Vth - - Vdd, Vbs 2007/4/

66 A LSI (2/2) ( 10 ) LSI R A M L1$ L1 $ SEC DED ECC L2$ SEC DED ECC SEC DED ECC mtlb GPR FPR GUB FUB PC PSTATE ALU SHIFT FMA 2007/4/

67 2007/4/ A (1/2) I/O 6.25Gbps 6.25Gbps PT 15 IDC 3.125Gbps SystemBoard ICC SystemBoard ICC SystemBoard ICC SystemBoard ICC SystemBoard ICC SystemBoard ICC SystemBoard ICC SystemBoard ICC SystemBoard ICC SystmBoard ICC CN SystmBoard ICC CN SystmBoard ICC CN SystmBoard ICC CN SystmBoard ICC CN SystmBoard ICC CN SystmBoard ICC CN SystmBoard ICC CN SystmBoard ICC CN

68 A (2/2) ToFu MPI 100PetaFlops HPC /4/

69 A (1/2) (SB) SB (SB) 2007/4/

70 A (2/2) CPU0.006( ) / / / 2007/4/

71 A SIMD (1/2) Basic, Extend 2 2/ Basic, Extend SIMD 2 SIMD DO I=1,N IF ((I)) then A(I)=B(I)+C(I) ELSE X(I)=Y(I)*Z(I) ENDIF ENDDO L2, L1 DO I=1,N,2 IF ((I)) then IF ((I+1)) then A(I)=B(I)+C(I) A(I+1)=B(I+1)+C(I+1) ELSE A(I)=B(I)+C(I) X(I+1)=Y(I+1)*Z(I+1) ENDIF ELSE IF ((I+1)) then X(I)=Y(I)*Z(I) A(I+1)=B(I+1)+C(I+1) ELSE X(I)=Y(I)*Z(I) X(I+1)=Y(I +1)*Z(I+1) ENDIF ENDIF ENDDO 2007/4/

72 A SIMD (2/2) Venus 8 ( ) SIMD / / 2007/4/

73 B LSI(1/2) (1) NMOS PMOS N+ N+ P_well P+ P+ N_well P_sub / 90nm 65nm 45nm (2) 45nmCMOS Low etc /SRAM etc Vth etc etc 2007/4/

74 B LSI(2/2) LSI TEG LSI LSI 2007/1 2009/1 2009/ fix RTL LSI TEG TO LSI LSI LSI 2007/4/

75 B(1/2) (1) (2) 20Gbps SerDes ITRS Gbps 1000/LSI 1/200 1/ /4/ G bps 10G 1G ITRS 2 20Gbps

76 B(2/2) FIX 2007/4 2007/4 2007/2 2008/2 2009/2 2009/ fix RTL LSI 2007/4/

77 B(1/2) Program DO DO i = 1, 1, n +B(i-1)+ +B(i)+ = +B(i+1) END END DO DO i-1 i i+1 VL i-1 i i+1 VL 2007/4/

78 B(2/2) 2007/4Q 2008/4Q 2009/4Q 2010/4Q fix RTL LSI 2007/4/

79 /4/

80 SimFold GAMESS Modylas RSDFT NICAM LatticeQCD LANS HPL High Performance LinpackNPB-FT 2007/4/

81 2007/4/

82 2007/4/

83 2007/4/

84 2007/4/

85 F 1/2 (SPARC64VI 1Core) SIMD SIMD or 2007/4/

86 F 2/2 2007/4/

統合汎用スーパーコンピュータシステムの設計状況と施設整備状況

統合汎用スーパーコンピュータシステムの設計状況と施設整備状況 81 200942 2142 1 A B / HPC Challenge Award 2009/4/2 1 1 2009/4/2 1 2 2009/4/2 1 3 11PB CPU 88,128 705,024 11.28PFLOPS 1.34PB 16MW 1,470 CPU 12,288 49,152 3.1PFLOPS 0.375PB 7MW 1,070 7.6PB 30PB 2MW 1000

More information

1重谷.PDF

1重谷.PDF RSCC RSCC RSCC BMT 1 6 3 3000 3000 200310 1994 19942 VPP500/32PE 19992 VPP700E/128PE 160PE 20043 2 2 PC Linux 2048 CPU Intel Xeon 3.06GHzDual) 12.5 TFLOPS SX-7 32CPU/256GB 282.5 GFLOPS Linux 3 PC 1999

More information

卒業論文

卒業論文 PC OpenMP SCore PC OpenMP PC PC PC Myrinet PC PC 1 OpenMP 2 1 3 3 PC 8 OpenMP 11 15 15 16 16 18 19 19 19 20 20 21 21 23 26 29 30 31 32 33 4 5 6 7 SCore 9 PC 10 OpenMP 14 16 17 10 17 11 19 12 19 13 20 1421

More information

01_OpenMP_osx.indd

01_OpenMP_osx.indd OpenMP* / 1 1... 2 2... 3 3... 5 4... 7 5... 9 5.1... 9 5.2 OpenMP* API... 13 6... 17 7... 19 / 4 1 2 C/C++ OpenMP* 3 Fortran OpenMP* 4 PC 1 1 9.0 Linux* Windows* Xeon Itanium OS 1 2 2 WEB OS OS OS 1 OS

More information

supercomputer2010.ppt

supercomputer2010.ppt nanri@cc.kyushu-u.ac.jp 1 !! : 11 12! : nanri@cc.kyushu-u.ac.jp! : Word 2 ! PC GPU) 1997 7 http://wiredvision.jp/news/200806/2008062322.html 3 !! (Cell, GPU )! 4 ! etc...! 5 !! etc. 6 !! 20km 40 km ) 340km

More information

untitled

untitled Power Wall HPL1 10 B/F EXTREMETECH Supercomputing director bets $2,000 that we won t have exascale computing by 2020 One of the biggest problems standing in our way is power. [] http://www.extremetech.com/computing/155941

More information

スーパーコンピュータ「京」の概要

スーパーコンピュータ「京」の概要 Overview of the K computer System 宮崎博行 草野義博 新庄直樹 庄司文由 横川三津夫 渡邊貞 あらまし HPCI CPUOS LINPACK 10 PFLOPSCPU 8 Abstract RIKEN and Fujitsu have been working together to develop the K computer, with the aim of beginning

More information

スパコンに通じる並列プログラミングの基礎

スパコンに通じる並列プログラミングの基礎 2018.06.04 2018.06.04 1 / 62 2018.06.04 2 / 62 Windows, Mac Unix 0444-J 2018.06.04 3 / 62 Part I Unix GUI CUI: Unix, Windows, Mac OS Part II 2018.06.04 4 / 62 0444-J ( : ) 6 4 ( ) 6 5 * 6 19 SX-ACE * 6

More information

スパコンに通じる並列プログラミングの基礎

スパコンに通じる並列プログラミングの基礎 2016.06.06 2016.06.06 1 / 60 2016.06.06 2 / 60 Windows, Mac Unix 0444-J 2016.06.06 3 / 60 Part I Unix GUI CUI: Unix, Windows, Mac OS Part II 0444-J 2016.06.06 4 / 60 ( : ) 6 6 ( ) 6 10 6 16 SX-ACE 6 17

More information

スパコンに通じる並列プログラミングの基礎

スパコンに通じる並列プログラミングの基礎 2018.09.10 furihata@cmc.osaka-u.ac.jp ( ) 2018.09.10 1 / 59 furihata@cmc.osaka-u.ac.jp ( ) 2018.09.10 2 / 59 Windows, Mac Unix 0444-J furihata@cmc.osaka-u.ac.jp ( ) 2018.09.10 3 / 59 Part I Unix GUI CUI:

More information

Microsoft PowerPoint 知る集い(京都)最終.ppt

Microsoft PowerPoint 知る集い(京都)最終.ppt 次世代スパコンについて知る集い 配布資料 世界最高性能を目指すシステム開発について ー次世代スパコンのシステム構成と施設の概要 - 平成 22 年 1 月 28 日 理化学研究所次世代スーパーコンピュータ開発実施本部横川三津夫 高性能かつ大規模システムの課題と対応 演算性能の向上 CPU のマルチコア化,SIMD( ベクトル化 ) 機構 主記憶へのアクセス頻度の削減 - CPU 性能とメモリアクセス性能のギャップ

More information

untitled

untitled taisuke@cs.tsukuba.ac.jp http://www.hpcs.is.tsukuba.ac.jp/~taisuke/ CP-PACS HPC PC post CP-PACS CP-PACS II 1990 HPC RWCP, HPC かつての世界最高速計算機も 1996年11月のTOP500 第一位 ピーク性能 614 GFLOPS Linpack性能 368 GFLOPS (地球シミュレータの前

More information

NEC All rights reserved 1

NEC All rights reserved 1 NEC All rights reserved 1 NEC All rights reserved 2 NEC All rights reserved 3 (Founder) (Langchao Langchao) NEC All rights reserved 4 2.1 GB/s 64 bits wide 266 MHz 4 MB L3 on board, 96k L2, 32k L1 on -die

More information

GPU GPU CPU CPU CPU GPU GPU N N CPU ( ) 1 GPU CPU GPU 2D 3D CPU GPU GPU GPGPU GPGPU 2 nvidia GPU CUDA 3 GPU 3.1 GPU Core 1

GPU GPU CPU CPU CPU GPU GPU N N CPU ( ) 1 GPU CPU GPU 2D 3D CPU GPU GPU GPGPU GPGPU 2 nvidia GPU CUDA 3 GPU 3.1 GPU Core 1 GPU 4 2010 8 28 1 GPU CPU CPU CPU GPU GPU N N CPU ( ) 1 GPU CPU GPU 2D 3D CPU GPU GPU GPGPU GPGPU 2 nvidia GPU CUDA 3 GPU 3.1 GPU Core 1 Register & Shared Memory ( ) CPU CPU(Intel Core i7 965) GPU(Tesla

More information

23_33.indd

23_33.indd 23 16 26 25 24 2 30 2 19 20 1 21 1 22 9 11 15 14 23 2 3 5 1 6 12 14 29 P.26 P.26 P.26 P.26 P.2 P.26 P.2 P.2 P.2 P.2 P.2 P.2 P.24 P.24 P.24 P.24 P.24 MAC 10. 10.6 10.5 1TB 2TB XP XP MAC 10. 10. 10.6 10.5

More information

040312研究会HPC2500.ppt

040312研究会HPC2500.ppt 2004312 e-mail : m-aoki@jp.fujitsu.com 1 2 PRIMEPOWER VX/VPP300 VPP700 GP7000 AP3000 VPP5000 PRIMEPOWER 2000 PRIMEPOWER HPC2500 1998 1999 2000 2001 2002 2003 3 VPP5000 PRIMEPOWER ( 1 VU 9.6 GF 16GB 1 VU

More information

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,,

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,, PowerEdge R930 Contents RAID /RAID & P3-5 P6 P7 P7 P8-P9 P10-13 P14-57 P58 PCIe P59-71 P72-73 P74-77 P78-81 OS P82-88 P88-89 P90-91 V3.8 Apr. 2017 2017 4 28 2016 4 22 Ver. 3.8 Ver. 1.0 +- NOTE E5-2630

More information

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,,

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,, PowerEdge R730 Contents RAID /RAID & PCIe GPU OS P3-5 P6 P7 P8 P9-10 P11-16 P17-55 P56 P57-66 P67-69 P70-72 P72 P73 P74-77 P78-81 P82-88 P88-89 P90-91 V3.8 Apr. 2017 2017 4 28 2016 4 22 Ver. 3.8 Ver. 1.0

More information

26102 (1/2) LSISoC: (1) (*) (*) GPU SIMD MIMD FPGA DES, AES (2/2) (2) FPGA(8bit) (ISS: Instruction Set Simulator) (3) (4) LSI ECU110100ECU1 ECU ECU ECU ECU FPGA ECU main() { int i, j, k for { } 1 GP-GPU

More information

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI, PowerEdge T630 Contents RAID /RAID & PCIe GPU OS v3.8 Apr. 2017 P3-5 P6 P7 P8-9 P10-11 P12-16 P17-79 P80-85 P86-87 P88-90 P90 P91-92 P93-96 P97-100 P101-107 P107-108 P109-110 2017 4 28 2016 4 22 Ver. 3.8

More information

ProLiant BL35p システム構成図

ProLiant BL35p システム構成図 HP ProLiant BL p-class Server BL35p 2007 8 9 1 OVERVIEW HP BladeSystem p-class Hardware Component 2 BladeSystem p-class BladeSystem p-class BladeSystem p-class () 3U () 1U HP BladeSystem p-class Common

More information

Microsoft Word - .....J.^...O.|Word.i10...j.doc

Microsoft Word - .....J.^...O.|Word.i10...j.doc P 1. 2. R H C H, etc. R' n R' R C R'' R R H R R' R C C R R C R' R C R' R C C R 1-1 1-2 3. 1-3 1-4 4. 5. 1-5 5. 1-6 6. 10 1-7 7. 1-8 8. 2-1 2-2 2-3 9. 2-4 2-5 2-6 2-7 10. 2-8 10. 2-9 10. 2-10 10. 11. C

More information

HP High Performance Computing(HPC)

HP High Performance Computing(HPC) ACCELERATE HP High Performance Computing HPC HPC HPC HPC HPC 1000 HPHPC HPC HP HPC HPC HPC HP HPCHP HP HPC 1 HPC HP 2 HPC HPC HP ITIDC HP HPC 1HPC HPC No.1 HPC TOP500 2010 11 HP 159 32% HP HPCHP 2010 Q1-Q4

More information

untitled

untitled A = QΛQ T A n n Λ Q A = XΛX 1 A n n Λ X GPGPU A 3 T Q T AQ = T (Q: ) T u i = λ i u i T {λ i } {u i } QR MR 3 v i = Q u i A {v i } A n = 9000 Quad Core Xeon 2 LAPACK (4/3) n 3 O(n 2 ) O(n 3 ) A {v i }

More information

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,,

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,, PowerEdge R730xd Contents RAID /RAID & P3-6 PCIe P112-122 P123-125 P126-129 P130-133 OS P134-140 P140-141 P142-143 P7 P8 P9 P10-11 P12-17 P18-110 P111 v3.8 Apr. 2017 2017 4 28 2016 4 22 Ver. 3.8 Ver. 1.0

More information

Itanium2ベンチマーク

Itanium2ベンチマーク HPC CPU mhori@ile.osaka-u.ac.jp Special thanks Timur Esirkepov HPC 2004 2 25 1 1. CPU 2. 3. Itanium 2 HPC 2 1 Itanium2 CPU CPU 3 ( ) Intel Itanium2 NEC SX-6 HP Alpha Server ES40 PRIMEPOWER SR8000 Intel

More information

untitled

untitled 1 NAREGI 2 (NSF) CyberInfrastructure Teragrid (EU) E-Infrastructure EGEE Enabling Grids for E-science E ) DEISA (Distributed European Infrastructure for Supercomputing applications) EPSRC) UK e-science

More information

untitled

untitled 2005 2 1 105-0004 5-34-3 Tel: 03-3431-4002 Fax: 03-3431-4044 1 SRL/ISTEC 1 1 SFQ SFQ SFQ 2004 9 4 SFQ SFQ / LSI 269 230 230 230 269 230 SFQ SFQ 2005 2 ISTEC 2005 All rights reserved. - 1 - 2005 2 1 105-0004

More information

1 / 1 idrac8 CPU 1 Intel Xeon E v5 Intel Pentium Intel Core i3 Intel Celeron Intel C236 Microsoft Windows Server 2008 R2 SP1 Microsoft Windows S

1 / 1 idrac8 CPU 1 Intel Xeon E v5 Intel Pentium Intel Core i3 Intel Celeron Intel C236 Microsoft Windows Server 2008 R2 SP1 Microsoft Windows S PowerEdge T130 Contents RAID /RAID & PCIe OS P2-3 P4 P5 P5 P6 P7-8 P9-15 P16-18 P19 P19 P20-23 P24 P25-27 P27-28 P29-30 V1.1 Mar. 2016 1 / 1 idrac8 CPU 1 Intel Xeon E3-1200 v5 Intel Pentium Intel Core

More information

OVERVIEW hp StorageWorks NAS 2000s hp StorageWorks NAS 2000s A 3.5 B 3.5 IDE DVD-ROM C LED LED Ultra320 SCSI ( ) NAS 2000s NAS 2000s NAS

OVERVIEW hp StorageWorks NAS 2000s hp StorageWorks NAS 2000s A 3.5 B 3.5 IDE DVD-ROM C LED LED Ultra320 SCSI ( ) NAS 2000s NAS 2000s NAS システム構成図 2004 年 11 月 18 日 1 OVERVIEW hp StorageWorks NAS 2000s hp StorageWorks NAS 2000s A 3.5 B 3.5 IDE DVD-ROM C LED LED Ultra320 SCSI 0 5 15 1.6 1 ( ) NAS 2000s NAS 2000s NAS 2000s 364971-B21( ) 345645-001(

More information

GPU n Graphics Processing Unit CG CAD

GPU n Graphics Processing Unit CG CAD GPU 2016/06/27 第 20 回 GPU コンピューティング講習会 ( 東京工業大学 ) 1 GPU n Graphics Processing Unit CG CAD www.nvidia.co.jp www.autodesk.co.jp www.pixar.com GPU n GPU ü n NVIDIA CUDA ü NVIDIA GPU ü OS Linux, Windows, Mac

More information

コスト効率の高い業界標準サーバーへのERPの導入

コスト効率の高い業界標準サーバーへのERPの導入 IT ERP ERP IT 1 ERP 4-way 4-way ERP I/O 4-way Sudip Chahal / Karl Mailman 2009 3 IT@Intel ERP 4-way ERP I/O RISC ERP IT ERP 1 IT 4-way ERP I/O ERP 2-way 4-way 2-way ERP 1.5 2 2 4 ERP 2 3 I/O Xeon 5500

More information

11U Dell CPU RAID 1U 1 Intel Xeon E v5 Intel Pentium Intel Core i3 Intel Celeron Intel C236 Microsoft Windows Server 2008 R2/2008 R2 SP1 Standar

11U Dell CPU RAID 1U 1 Intel Xeon E v5 Intel Pentium Intel Core i3 Intel Celeron Intel C236 Microsoft Windows Server 2008 R2/2008 R2 SP1 Standar PowerEdge R230 Contents RAID /RAID & PCIe OS P2-4 P5 P6 P7 P8 P9-10 P11-26 P27-29 P30 P30 P31-34 P35 P36-38 P38-39 P40-41 V1.1 Mar. 2016 11U Dell CPU RAID 1U 1 Intel Xeon E3-1200 v5 Intel Pentium Intel

More information

untitled

untitled PC murakami@cc.kyushu-u.ac.jp muscle server blade server PC PC + EHPC/Eric (Embedded HPC with Eric) 1216 Compact PCI Compact PCIPC Compact PCISH-4 Compact PCISH-4 Eric Eric EHPC/Eric EHPC/Eric Gigabit

More information

HP Workstation 総合カタログ

HP Workstation 総合カタログ HP Workstation E5 v2 Z Z SFF E5 v2 2 HP Windows Z 3 Performance Innovation Reliability 3 HPZ HP HP Z820 Workstation P.11 HP Z620 Workstation & CPU P.12 HP Z420 Workstation P.13 17.3in WIDE HP ZBook 17

More information

PowerEdge R730xd Contents RAID /RAID & P3-6 PCIe P P P P OS P P P P7 P8 P9 P10-11 P12-17 P P112

PowerEdge R730xd Contents RAID /RAID & P3-6 PCIe P P P P OS P P P P7 P8 P9 P10-11 P12-17 P P112 PowerEdge R730xd Contents RAID /RAID & P3-6 PCIe P113-123 P124-126 P127-130 P131-134 OS P135-139 P139-140 P141-142 P7 P8 P9 P10-11 P12-17 P18-111 P112 v4.11 Apr. 2018 2018 4 30 2016 4 22 Ver. 4.11 Ver.

More information

iphone GPGPU GPU OpenCL Mac OS X Snow LeopardOpenCL iphone OpenCL OpenCL NVIDIA GPU CUDA GPU GPU GPU 15 GPU GPU CPU GPU iii OpenMP MPI CPU OpenCL CUDA OpenCL CPU OpenCL GPU NVIDIA Fermi GPU Fermi GPU GPU

More information

Ver Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI

Ver Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI PowerEdge T630 Contents RAID /RAID & PCIe GPU OS V4.10 Mar.2018 P3-5 P6 P7 P8-9 P10-11 P12-16 P17-84 P85-90 P91-92 P93-95 P95 P96-97 P98-101 P102-105 P106-110 P110-111 P112-113 2018 3 30 2016 4 22 Ver.

More information

ÊÂÎó·×»»¤È¤Ï/OpenMP¤Î½éÊâ¡Ê£±¡Ë

ÊÂÎó·×»»¤È¤Ï/OpenMP¤Î½éÊâ¡Ê£±¡Ë 2015 5 21 OpenMP Hello World Do (omp do) Fortran (omp workshare) CPU Richardson s Forecast Factory 64,000 L.F. Richardson, Weather Prediction by Numerical Process, Cambridge, University Press (1922) Drawing

More information

Microsoft PowerPoint - ★13_日立_清水.ppt

Microsoft PowerPoint - ★13_日立_清水.ppt PC クラスタワークショップ in 京都 日立テクニカルコンピューティングクラスタ 2008/7/25 清水正明 日立製作所中央研究所 1 目次 1 2 3 4 日立テクニカルサーバラインナップ SR16000 シリーズ HA8000-tc/RS425 日立自動並列化コンパイラ 2 1 1-1 日立テクニカルサーバの歴史 最大性能 100TF 10TF 30 年間で百万倍以上の向上 (5 年で 10

More information

Ver. 3.9 Ver E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,, HT,

Ver. 3.9 Ver E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,, HT, PowerEdge R630 Contents RAID /RAID & PCIe OS P3-6 P7 P8 P9 P10-11 P12-16 P17-61 P62 P63-72 P73-75 P75 P76-79 P80-83 P84-90 P90-91 P92-93 V3.9 Apr. 2017 2017 4 28 2016 4 22 Ver. 3.9 Ver. 1.0 +- E5-2630

More information

CPU Levels in the memory hierarchy Level 1 Level 2... Increasing distance from the CPU in access time Level n Size of the memory at each level 1: 2.2

CPU Levels in the memory hierarchy Level 1 Level 2... Increasing distance from the CPU in access time Level n Size of the memory at each level 1: 2.2 FFT 1 Fourier fast Fourier transform FFT FFT FFT 1 FFT FFT 2 Fourier 2.1 Fourier FFT Fourier discrete Fourier transform DFT DFT n 1 y k = j=0 x j ω jk n, 0 k n 1 (1) x j y k ω n = e 2πi/n i = 1 (1) n DFT

More information

09中西

09中西 PC NEC Linux (1) (2) (1) (2) 1 Linux Linux 2002.11.22) LLNL Linux Intel Xeon 2300 ASCIWhite1/7 / HPC (IDC) 2002 800 2005 2004 HPC 80%Linux) Linux ASCI Purple (ASCI 100TFlops Blue Gene/L 1PFlops (2005)

More information

PowerPoint Presentation

PowerPoint Presentation Its Concept and Architecture Hiroshi Nakashima (Kyoto U.) with cooperation of Mitsuhisa Sato (U. Tsukuba) Taisuke Boku (U. Tsukuba) Yutaka Ishikawa (U. Tokyo) Contents Alliance Who & Why Allied? Specification

More information

Microsoft Word - HOKUSAI_system_overview_ja.docx

Microsoft Word - HOKUSAI_system_overview_ja.docx HOKUSAI システムの概要 1.1 システム構成 HOKUSAI システムは 超並列演算システム (GWMPC BWMPC) アプリケーション演算サーバ群 ( 大容量メモリ演算サーバ GPU 演算サーバ ) と システムの利用入口となるフロントエンドサーバ 用途の異なる 2 つのストレージ ( オンライン ストレージ 階層型ストレージ ) から構成されるシステムです 図 0-1 システム構成図

More information

( 4 ) GeoFEM ( 5 ) MDTEST ( 6 ) IOR 2 Oakleaf-FX 3 Oakleaf-FX 4 Oakleaf-FX Oakleaf-FX Oakleaf-FX 1 Oakleaf-FX 1 Oakleaf- FX SR11000/J2 HA8000 T

( 4 ) GeoFEM ( 5 ) MDTEST ( 6 ) IOR 2 Oakleaf-FX 3 Oakleaf-FX 4 Oakleaf-FX Oakleaf-FX Oakleaf-FX 1 Oakleaf-FX 1 Oakleaf- FX SR11000/J2 HA8000 T Oakleaf-FX(Fujitsu PRIMEHPC FX10) 1,a) 1 1 1 1,2 1 2012 4 Oakleaf-FX (Fujitsu PRIMEHPC FX10) Oakleaf-FX SPARC64IXfx FEFS 1.13PFLOPS Performance Evaluation of Oakleaf-FX (Fujitsu PRIMEHPC FX10) Supercomputer

More information

HP xw9400 Workstation

HP xw9400 Workstation HP xw9400 Workstation HP xw9400 Workstation AMD Opteron TM PCI Express x16 64 PCI Express x16 2 USB2.0 8 IEEE1394 2 8DIMM HP HP xw9400 Workstation HP CPU HP CPU 240W CPU HP xw9400 HP CPU CPU CPU CPU Sound

More information

ProLiant DL380 Generation 4 システム構成図

ProLiant DL380 Generation 4 システム構成図 P ProLiant DL380 Generation 5 Data Protection Storage Server 2007 9 20 12 28 P ProLiant Web http://www.hp.com/jp/rack_op 1 OVERVIEW ProLiant DL380 G5 Data Protection Storage Server ProLiant DL380G5 Data

More information

HP Workstation 総合カタログ

HP Workstation 総合カタログ HP Workstation Z HP 6 Z HP HP Z840 Workstation P.9 HP Z640 Workstation & CPU P.10 HP Z440 Workstation P.11 17.3in WIDE HP ZBook 17 G2 Mobile Workstation P.15 15.6in WIDE HP ZBook 15 G2 Mobile Workstation

More information

スライド 1

スライド 1 GPU クラスタによる格子 QCD 計算 広大理尾崎裕介 石川健一 1.1 Introduction Graphic Processing Units 1 チップに数百個の演算器 多数の演算器による並列計算 ~TFLOPS ( 単精度 ) CPU 数十 GFLOPS バンド幅 ~100GB/s コストパフォーマンス ~$400 GPU の開発環境 NVIDIA CUDA http://www.nvidia.co.jp/object/cuda_home_new_jp.html

More information

ProLiant BL460c システム構成図

ProLiant BL460c システム構成図 HP BladeSystem c-class Server HP 2008 5 26 BLADE3.0 Web http://www.hp.com/jp/blade_fill/ 1 OVERVIEW HP 1 2 2.5 SAS H Xeon ( 2 ) (SFF)( 2 ) I/O PC2-5300 FB-DIMM DDR2-667 8 Smart E200i (Type Type 1 ) USB

More information

Ver. 1.1 Ver NOTE 1TB 7.2K RPM SAS 3.5, 40,100 2TB 7.2K RPM SAS 3.5, 46,600 4TB 7.2K RPM SAS 6Gbps 3.5, 63,600 PowerEdge D

Ver. 1.1 Ver NOTE 1TB 7.2K RPM SAS 3.5, 40,100 2TB 7.2K RPM SAS 3.5, 46,600 4TB 7.2K RPM SAS 6Gbps 3.5, 63,600 PowerEdge D Contents... P3... P5... P6... P8... P14... P16 RAID /RAID... P22 PCIe... P35 GPU... P41... P46... P48 OS... P52... P54... P56 Ver.1.1 Apr. 2017 2017 4 28 2017 4 14 Ver. 1.1 Ver. 1.0 +- NOTE 1TB 7.2K RPM

More information

HPEハイパフォーマンスコンピューティング ソリューション

HPEハイパフォーマンスコンピューティング ソリューション HPE HPC / AI Page 2 No.1 * 24.8% No.1 * HPE HPC / AI HPC AI SGIHPE HPC / AI GPU TOP500 50th edition Nov. 2017 HPE No.1 124 www.top500.org HPE HPC / AI TSUBAME 3.0 2017 7 AI TSUBAME 3.0 HPE SGI 8600 System

More information

VXPRO R1400® ご提案資料

VXPRO R1400® ご提案資料 Intel Core i7 プロセッサ 920 Preliminary Performance Report ノード性能評価 ノード性能の評価 NAS Parallel Benchmark Class B OpenMP 版での性能評価 実行スレッド数を 4 で固定 ( デュアルソケットでは各プロセッサに 2 スレッド ) 全て 2.66GHz のコアとなるため コアあたりのピーク性能は同じ 評価システム

More information

Ver. 3.7 Ver E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,, HT,

Ver. 3.7 Ver E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,, HT, PowerEdge T130 Contents RAID /RAID & PCIe OS P3-4 P5 P6 P6 P7 P8-9 P10-16 P17-19 P20 P20 P21-24 P25 P26-30 P30-31 P32-33 v3.7 Apr. 2017 2016 4 28 2016 4 22 Ver. 3.7 Ver. 1.1 +- E5-2630 v3 2.4GHz, 20M cache,

More information

Express5800/120Rb-1 (2002/01/22)

Express5800/120Rb-1 (2002/01/22) (2002/01/22) 1. N8100-764 N8100-765 N8100-783 ( /1BG(256)) ( /1.26G(512)) ( /1.40G(512)) CPU Pentium Pentium -S Pentium -S (1BGHz) 1( 2 ) (1.26GHz) 1( 2 ) (1.40GHz) 1( 2 ) L1 32KB L2 256KB 512KB 256MB(

More information

main.dvi

main.dvi PC 1 1 [1][2] [3][4] ( ) GPU(Graphics Processing Unit) GPU PC GPU PC ( 2 GPU ) GPU Harris Corner Detector[5] CPU ( ) ( ) CPU GPU 2 3 GPU 4 5 6 7 1 toyohiro@isc.kyutech.ac.jp 45 2 ( ) CPU ( ) ( ) () 2.1

More information

P33W・P28X カタログ

P33W・P28X カタログ P33WP28X Windows 10 24 FC-PM IoT 24 Windows 10Windows 7 2 FC98-NXP33WP28X PC FC-PM P33WP28X PC ACC 1 1HDD1 1 2HDD2 1 AC 1 2 USB 3 USB3.0 USB 4 USB3.0 USB 5 USB3.0 USB 6 USB3.0 USB 7 USB3.0 USB 8 USB3.0

More information

T330_ indd

T330_ indd PowerEdge T330 Contents RAID /RAID & PCIe OS P3-5 P6 P7 P7 P8 P9-10 P11-28 P29-31 P32 P32 P33-36 P37 P38-40 P40-41 P42-43 V1.0 Apr. 2016 2016 4 22 Ver1.0 +- E5-2630 v3 2.4GHz, 20M cache, 8.00GT/s QPI,,

More information

FY14Q4 SMB Magalog December - APJ Version

FY14Q4 SMB Magalog December - APJ Version Business 13.co.jp 13.com/learn/jp/ja/jpbsd1/campaigns/poweredge-13g-server.com/learn/jp/ja/jpbsd1/campaigns/dell-server-transition-campaign Windows Server 2012 https://marketing.dell.com/jp/13g 212-8589

More information

Ver Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI

Ver Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI PowerEdge R630 Contents RAID /RAID & PCIe OS P3-6 P7 P8 P9 P10-11 P12-16 P17-54 P55 P56-65 P66-68 P68 P69-72 P73-76 P77-81 P81-82 P84-85 V4.10 Mar. 2018 2018 3 30 2016 4 22 Ver. 4.10 Ver. 1.0 + - NOTE

More information

GRAPE GRAPE-DR V-GRAPE

GRAPE GRAPE-DR V-GRAPE V-GRAPE / CCSR 2007/1/24 GRAPE GRAPE-DR V-GRAPE http://antwrp.gsfc.nasa.gov/apod/ap950917.html ( ) SDSS GRAPE : (Barnes-Hut tree, FMM, Particle- Mesh Ewald(PPPM)...): ( ) 1988 GRAPE-1(1989) 16 8 32

More information

Myrinet2000 ご紹介

Myrinet2000 ご紹介 34 HPC -Myrinet- ES HPC http://www.sse.co.jp/myrinet/ Out Line Myrinet HPC 50 2 4 O.S. Computer Computer Computer Computer Computer Low-level Interconnection Network (transport & switching) 2-4 / / OS

More information

マルチコアPCクラスタ環境におけるBDD法のハイブリッド並列実装

マルチコアPCクラスタ環境におけるBDD法のハイブリッド並列実装 2010 GPGPU 2010 9 29 MPI/Pthread (DDM) DDM CPU CPU CPU CPU FEM GPU FEM CPU Mult - NUMA Multprocessng Cell GPU Accelerator, GPU CPU Heterogeneous computng L3 cache L3 cache CPU CPU + GPU GPU L3 cache 4

More information

Express5800/140Hb (2002/01/22)

Express5800/140Hb (2002/01/22) (2002/01/22) 1. N8100-592B N8100-594B N8100-681 ( -X/700(1)) ( -X/700(2)) ( -X/900(2)) CPU L1 Pentium Xeon (700MHz) 1 4 Pentium Xeon (700MHz) 1 4 32KB Pentium Xeon (900MHz) 1 4 L2 1MB 2MB 2MB CD-ROM LAN

More information

12 PowerEdge PowerEdge Xeon E PowerEdge 11 PowerEdge DIMM Xeon E PowerEdge DIMM DIMM 756GB 12 PowerEdge Xeon E5-

12 PowerEdge PowerEdge Xeon E PowerEdge 11 PowerEdge DIMM Xeon E PowerEdge DIMM DIMM 756GB 12 PowerEdge Xeon E5- 12ways-12th Generation PowerEdge Servers improve your IT experience 12 PowerEdge 12 1 6 2 GPU 8 4 PERC RAID I/O Cachecade I/O 5 Dell Express Flash PCIe SSD 6 7 OS 8 85.5% 9 Dell OpenManage PowerCenter

More information

Express5800/120Ed

Express5800/120Ed Pentium 60% 1. N8500-570A N8500-662 N8500-663 N8500-664 ( /800EB(256)) ( /800EB(256)-9W) ( /800EB(256)-9W2) ( /1BG(256)) Windows NT Server 4.0 Windows 2000 HDD HDD CPU Pentium 800EBMHz1 Pentium 1BGHz1

More information

1 GPU GPGPU GPU CPU 2 GPU 2007 NVIDIA GPGPU CUDA[3] GPGPU CUDA GPGPU CUDA GPGPU GPU GPU GPU Graphics Processing Unit LSI LSI CPU ( ) DRAM GPU LSI GPU

1 GPU GPGPU GPU CPU 2 GPU 2007 NVIDIA GPGPU CUDA[3] GPGPU CUDA GPGPU CUDA GPGPU GPU GPU GPU Graphics Processing Unit LSI LSI CPU ( ) DRAM GPU LSI GPU GPGPU (I) GPU GPGPU 1 GPU(Graphics Processing Unit) GPU GPGPU(General-Purpose computing on GPUs) GPU GPGPU GPU ( PC ) PC PC GPU PC PC GPU GPU 2008 TSUBAME NVIDIA GPU(Tesla S1070) TOP500 29 [1] 2009 AMD

More information

Second-semi.PDF

Second-semi.PDF PC 2000 2 18 2 HPC Agenda PC Linux OS UNIX OS Linux Linux OS HPC 1 1CPU CPU Beowulf PC (PC) PC CPU(Pentium ) Beowulf: NASA Tomas Sterling Donald Becker 2 (PC ) Beowulf PC!! Linux Cluster (1) Level 1:

More information

(^^

(^^ 57 GRACE 2012 2 21 munetomo@iic.hokudai.ac.jp 1996 1999 1998 1999 1999 (^^ 1962 2003 1979 11 43TFlops 2,000 40, Mem:128GB, 10GbE x 2 500TBytes Web Web IT SR16000 Model M1 22 Total: 172 TFlops Power 7

More information

インテル(R) Visual Fortran Composer XE

インテル(R) Visual Fortran Composer XE Visual Fortran Composer XE 1. 2. 3. 4. 5. Visual Studio 6. Visual Studio 7. 8. Compaq Visual Fortran 9. Visual Studio 10. 2 https://registrationcenter.intel.com/regcenter/ w_fcompxe_all_jp_2013_sp1.1.139.exe

More information

OpenMP (1) 1, 12 1 UNIX (FUJITSU GP7000F model 900), 13 1 (COMPAQ GS320) FUJITSU VPP5000/64 1 (a) (b) 1: ( 1(a))

OpenMP (1) 1, 12 1 UNIX (FUJITSU GP7000F model 900), 13 1 (COMPAQ GS320) FUJITSU VPP5000/64 1 (a) (b) 1: ( 1(a)) OpenMP (1) 1, 12 1 UNIX (FUJITSU GP7000F model 900), 13 1 (COMPAQ GS320) FUJITSU VPP5000/64 1 (a) (b) 1: ( 1(a)) E-mail: {nanri,amano}@cc.kyushu-u.ac.jp 1 ( ) 1. VPP Fortran[6] HPF[3] VPP Fortran 2. MPI[5]

More information

untitled

untitled A = QΛQ T A n n Λ Q A = XΛX 1 A n n Λ X GPGPU A 3 T Q T AQ = T (Q: ) T u i = λ i u i T {λ i } {u i } QR MR 3 v i = Q u i A {v i } A n = 9000 Quad Core Xeon 2 LAPACK (4/3) n 3 O(n 2 ) O(n 3 ) A {v i }

More information

テストコスト抑制のための技術課題-DFTとATEの観点から

テストコスト抑制のための技術課題-DFTとATEの観点から 2 -at -talk -talk -drop 3 4 5 6 7 Year of Production 2003 2004 2005 2006 2007 2008 Embedded Cores Standardization of core Standard format Standard format Standard format Extension to Extension to test

More information

OVERVIEW ProLiant ML110 G2 Storage Server ProLiant ML110 G2 Storage Server A C D SATA NH 320GB 01 (1TB) (1TB) Ultra320 SCSI 6 SATA RAID Serial

OVERVIEW ProLiant ML110 G2 Storage Server ProLiant ML110 G2 Storage Server A C D SATA NH 320GB 01 (1TB) (1TB) Ultra320 SCSI 6 SATA RAID Serial HP ProLiant ML110 Generation 2 Storage Server 2006 4 6 1 OVERVIEW ProLiant ML110 G2 Storage Server ProLiant ML110 G2 Storage Server A C D 01 3.5 SATA NH 320GB 01 (1TB) (1TB) Ultra320 SCSI 6 SATA RAID Serial

More information

T430_ indd

T430_ indd PowerEdge T430 Contents RAID /RAID & PCIe OS P3-6 P7 P8 P9 P10 P11-14 P15-48 P49-53 P54-55 P55 P56-59 P60-63 P64-69 P69-70 P71-72 V1.0 Apr. 2016 2016 4 22 Ver1.0 NOTE + - E5-2630 v3 2.4GHz, 20M cache,

More information

ProLiant BL20p Generation 4 システム構成図

ProLiant BL20p Generation 4 システム構成図 HP ProLiant BL p-class Server BL20p Generation 4 2007 11 15 1 OVERVIEW ProLiantBL20p Generation 4 HP BladeSystem p-class Hardware Component BladeSystem p-class BladeSystem p-class BladeSystem p-class ()

More information

フカシギおねえさん問題の高速計算アルゴリズム

フカシギおねえさん問題の高速計算アルゴリズム JST ERATO 2013/7/26 Joint work with 1 / 37 1 2 3 4 5 6 2 / 37 1 2 3 4 5 6 3 / 37 : 4 / 37 9 9 6 10 10 25 5 / 37 9 9 6 10 10 25 Bousquet-Mélou (2005) 19 19 3 1GHz Alpha 8 Iwashita (Sep 2012) 21 21 3 2.67GHz

More information

openmp1_Yaguchi_version_170530

openmp1_Yaguchi_version_170530 並列計算とは /OpenMP の初歩 (1) 今 の内容 なぜ並列計算が必要か? スーパーコンピュータの性能動向 1ExaFLOPS 次世代スハ コン 京 1PFLOPS 性能 1TFLOPS 1GFLOPS スカラー機ベクトル機ベクトル並列機並列機 X-MP ncube2 CRAY-1 S-810 SR8000 VPP500 CM-5 ASCI-5 ASCI-4 S3800 T3E-900 SR2201

More information

AMD/ATI Radeon HD 5870 GPU DEGIMA LINPACK HD 5870 GPU DEGIMA LINPACK GFlops/Watt GFlops/Watt Abstract GPU Computing has lately attracted

AMD/ATI Radeon HD 5870 GPU DEGIMA LINPACK HD 5870 GPU DEGIMA LINPACK GFlops/Watt GFlops/Watt Abstract GPU Computing has lately attracted DEGIMA LINPACK Energy Performance for LINPACK Benchmark on DEGIMA 1 AMD/ATI Radeon HD 5870 GPU DEGIMA LINPACK HD 5870 GPU DEGIMA LINPACK 1.4698 GFlops/Watt 1.9658 GFlops/Watt Abstract GPU Computing has

More information

大規模共有メモリーシステムでのGAMESSの利点

大規模共有メモリーシステムでのGAMESSの利点 Technical white paper GAMESS GAMESS Gordon Group *1 Gaussian Gaussian1 Xeon E7 8 80 2013 4 GAMESS 1 RHF ROHF UHF GVB MCSCF SCF Energy CDFpEP CDFpEP CDFpEP CD-pEP CDFpEP SCF Gradient CDFpEP CDFpEP CDFpEP

More information

資料3 今後のHPC技術に関する研究開発の方向性について(日立製作所提供資料)

資料3 今後のHPC技術に関する研究開発の方向性について(日立製作所提供資料) 今後の HPC 技術に関する 研究開発の方向性について 2012 年 5 月 30 日 ( 株 ) 日立製作所情報 通信システム社 IT プラットフォーム事業本部 Hitachi, Hitachi, Ltd. Ltd. Hitachi 2012. 2012. Ltd. 2012. All rights All rights All rights reserved. reserved. reserved.

More information

HP Z800 Workstation 製品構成ガイド

HP Z800 Workstation 製品構成ガイド HP Z800 Workstation システム構成図 0 年 3 月 3日版 HP Z800 Workstation 0 3 3 HP Z800 Workstation (/) HP Z800 / CT Workstation (0 3 3) HP Z800 / CT Workstation E5507 E560 E5640 X5650 X5660 X5667 X5670 X5677 X5680

More information

Microsoft PowerPoint - CCS学際共同boku-08b.ppt

Microsoft PowerPoint - CCS学際共同boku-08b.ppt マルチコア / マルチソケットノードに おけるメモリ性能のインパクト 研究代表者朴泰祐筑波大学システム情報工学研究科 taisuke@cs.tsukuba.ac.jp アウトライン 近年の高性能 PC クラスタの傾向と問題 multi-core/multi-socket ノードとメモリ性能 メモリバンド幅に着目した性能測定 multi-link network 性能評価 まとめ 近年の高性能 PC

More information

はじめに

はじめに hp rp2400 white paper hp-ux ... 2 hp server rp2400... 2 hp 2400... 3 rp2470... 3 rp2430... 3 hp... 4... 5... 6... 7... 7 I/O... 7 I/O... 9... 9... 9... 10 hp... 10... 10... 10... 11... 11 ECC... 11...

More information

HP ProLiant ML110 Generation 5 システム構成図

HP ProLiant ML110 Generation 5 システム構成図 HP ProLiant ML110 Generation 5 Storage Server 2009 12 10 OVERVIEW (SATA ) 1 () 2 USB 3 6 3.5 SATA NH () DVD-ROM LED LED Smart E200/128 BBWC Lights-Out 100c ( ) 1TB-SATA x64 WSS2003R2 2TB-SATA x64 WSS2003R2

More information

new_emc_panf_Hyoushi_0818

new_emc_panf_Hyoushi_0818 EMC NAS 2015 2015 8 EMC 151-0053 2-1-1 http://japan.emc.com http://japan.emc.com/contact/ Copyright 2015 EMC Corporation. All rights reserved. EMC EMC 2 EMCInsightIQ OneFS SmartConnect SmartLock SmartPools

More information

PROLIANT ML

PROLIANT ML PROLIANT ML750 2001 7 16 1 OVERVIEW ProLiant ML750 SCSI 64 PCI PCI 1 3 Pentium Xeon ( 1 1 ) 5.25 IDE CD-ROM 3.5 (/ ) (IMD) I/O A B C, D (2 ) 3.5 (2 ) Smart 4250ES ( Smart 4250ES ) ProLiant ML750( ) R01

More information

Express5800/140Ma

Express5800/140Ma Pentium Xeon Express 1. N8500-479 N8500-480 N8500-489,-490 N8500-491,-492 (-X/550(512)-25AWS) (-X/550(1)-25AWS) (-X/550(512)) (-X/550(1)) (-X/550(512)-25AWE) (-X/550(1)-25AWE) CPU L1 Pentium Xeon 550MHz1

More information

Microsoft Word - PowerEdge_M-Series_Competitive_Power_Study_-_August_2010[1]_j.docx

Microsoft Word - PowerEdge_M-Series_Competitive_Power_Study_-_August_2010[1]_j.docx : John Beckett Robert Bradfield 2010 Dell Inc. 2010 All rights reserved. Dell DELL DELL PowerEdge Dell Inc. Microsoft Windows Windows Server Microsoft Corporation SPEC SPECpower_ssj Standard Performance

More information

ProLiant BL25p Generation 2システム構成図

ProLiant BL25p Generation 2システム構成図 HP ProLiant BL p-class Server BL25p Generation 2 2007 11 15 1 OVERVIEW ProLiant BL25p Generation 2 HP BladeSystem p-class Hardware Component BladeSystem p-class BladeSystem p-class BladeSystem p-class

More information

2011年2月 Express5800シリーズ Gモデル

2011年2月 Express5800シリーズ Gモデル Express5800 20112 SOHO/ G http://www.nec.co.jp/exp/ PC G No.1.1 14No.1 93mm 3 3 2 OS2 5 ExpressSupportPack G2 5224 365 &OS PlatformSupportPack ExpressSupportPack G2OS OS HW OS PlatformSupportPack OS 24365

More information

PowerEdge R230 Contents RAID /RAID & PCIe OS P3-5 P6 P7 P8 P9 P10-11 P12-28 P29-31 P32 P32 P33-36 P37 P38-42 P42-43 P44-45 V4.11 Apr. 2018

PowerEdge R230 Contents RAID /RAID & PCIe OS P3-5 P6 P7 P8 P9 P10-11 P12-28 P29-31 P32 P32 P33-36 P37 P38-42 P42-43 P44-45 V4.11 Apr. 2018 PowerEdge R230 Contents RAID /RAID & PCIe OS P3-5 P6 P7 P8 P9 P10-11 P12-28 P29-31 P32 P32 P33-36 P37 P38-42 P42-43 P44-45 V4.11 Apr. 2018 2018 4 30 2016 4 22 Ver. 4.11 Ver. 1.1 +- E5-2630 v3 2.4GHz, 20M

More information

R630_160428_2.indd

R630_160428_2.indd PowerEdge R630 Contents RAID /RAID & PCIe OS P3-6 P7 P8 P9 P10-11 P12-16 P17-56 P57 P58-65 P66-68 P68 P69-72 P73-76 P77-82 P82-83 P84-85 V1.0 Apr. 2016 2016 4 22 Ver1.0 + - E5-2630 v3 2.4GHz, 20M cache,

More information

B 2 Thin Q=3 0 0 P= N ( )P Q = 2 3 ( )6 N N TSUB- Hub PCI-Express (PCIe) Gen 2 x8 AME1 5) 3 GPU Socket 0 High-performance Linpack 1

B 2 Thin Q=3 0 0 P= N ( )P Q = 2 3 ( )6 N N TSUB- Hub PCI-Express (PCIe) Gen 2 x8 AME1 5) 3 GPU Socket 0 High-performance Linpack 1 TSUBAME 2.0 Linpack 1,,,, Intel NVIDIA GPU 2010 11 TSUBAME 2.0 Linpack 2CPU 3GPU 1400 Dual-Rail QDR InfiniBand TSUBAME 1.0 30 2.4PFlops TSUBAME 1.0 Linpack GPU 1.192PFlops PFlops Top500 4 Achievement of

More information

Ver. 3.8 Ver E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,, HT,

Ver. 3.8 Ver E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,, HT, PowerEdge R230 Contents RAID /RAID & PCIe OS P3-5 P6 P7 P8 P9 P10-11 P12-28 P29-31 P32 P32 P33-36 P37 P38-42 P42-43 P44-45 V3.8 Apr. 2017 2017 4 28 2016 4 22 Ver. 3.8 Ver. 1.1 +- E5-2630 v3 2.4GHz, 20M

More information

Po w eredge M000e Index? & 00% 5 32CPU 256 0U PowerEdge M000e PowerEdge M000eI/O 6

Po w eredge M000e Index? & 00% 5 32CPU 256 0U PowerEdge M000e PowerEdge M000eI/O 6 PowerEdge M Designed for Efficiency Built to reduce total economic impact. PowerEdge M000e PowerEdge M90 / M70 / M70HD / M60 / M60x Xeon 2 9 0 3 4 2 5 6 3 4 7 8 5 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 Po

More information

ProLiant DL180 システム構成図

ProLiant DL180 システム構成図 P ProLiant DL180 Generation 5 2009 11 12 1 OVERVIEW ProLiant DL180 Generation 5 ProLiant DL180 G5 8DD A ( 8DD ) LED / DVD-RW DVD-ROM USB 2.0 2 4 PCI ProLiant DL180 G5 QC XE5405 2/2x6 1P 1G 8D R 456831-291(

More information

Microsoft SQL Server 2012 における EMC パフォーマンスの高速化EMC VFCache、EMC Symmetrix VMAX 10K、および EMC FAST VP

Microsoft SQL Server 2012 における EMC パフォーマンスの高速化EMC VFCache、EMC Symmetrix VMAX 10K、および EMC FAST VP Microsoft SQL Server 2012 EMC EMC VFCache SQL Server EMC FAST VP VMware vsphere EMC EMC Symmetrix VMAX 10K SQL Server 2012 EMC VFCache VFCache PCIe VFCache VMware vsphere 5 2012 7 Copyright 2012 EMC Corporation.

More information

Express5800/120Ra-1

Express5800/120Ra-1 1. CPU L1 L2 CD-ROM LAN OS OS N8100-661A ( /1BG(256)) Pentium 1.0BGHz 1 2 32KB 256KB 128MB 4GB (73.2GB 2) 10 24 100BASE-TX 10BASE-T 2 640 480 1280 1024* 2. DISK LINK/ACT(LAN1) STATUS LINK/ACT(LAN2) POWER/SLEEP

More information