ストリーミング SIMD 拡張命令2 (SSE2) を使用した SAXPY/DAXPY
|
|
- しなつ たなせ
- 5 years ago
- Views:
Transcription
1 SIMD 2(SSE2) SAXPY/DAXPY : J /12/06 1
2 Fax: * Copyright Intel Corporation 1999, /12/06 2
3 SAXPY DAXPY SAXPY DAXPY SIMD C C/C C/C SSE2 C SSE A -... A-1... A-1... A-2 01/12/06 3
4 2.0 Pentium Lawson Hanson Kincaid Krogh Basic linear algebra subprograms for Fortran usage ACM Transactions on Mathematical Software Vol. 5 No Dongarra Moler Bunch Stewart LINPACK User's Guide SIAM 1979 C++ SIMD J 1999 C/C J /12/06 4
5 1 SIMD 2(SSE2 Streaming SIMD Extensions 2) SIMD(Single Instruction Multiple Data) SIMD IA-32 SIMD SIMD (SSE) SIMD 128 SIMD 64 SIMD 3D (3D) / SAXPY/DAXPY / SSE2 SSE Pentium 4 Pentium III SAXPY DAXPY SAXPY(DAXPY) SAXPY(DAXPY) ( ) (IA) SIMD SAXPY(DAXPY) SIMD C++ C/C SAXPY(DAXPY) C/C++ SIMD 2 SAXPY DAXPY SAXPY(DAXPY) Lawson Hanson Kincaid Krogh BLAS(Basic Linear Algebra Subprograms)[Lawson, 1979] SAXPY (SA) X Y DAXPY Y = a * X + Y X Y ( 1 n) a FORTRAN 01/12/06 5
6 CALL SAXPY (N, A, X, INCX, Y, INCY) N INCX INCY SAXPY X Y 2 INCX=INCY=1 LINPACK BLAS INCX=INCY=1 [Dongarra, 1979] LINPACK BLAS SAXPY DAXPY SIMD INCX=INCY=1 SAXPY DAXPY C SAXPY DAXPY SSE2 C/C++ C/C SIMD C++ SSE2 1 C/C SIMD C++ FVEC DVEC 16 float(double) F32vec4(F64vec2) x y F32vec4(F64vec2) y[i] = scalar * x[i] + y[i] 128 4(2) n 4(2) sa(da) 1 sa(da) 4(2) 1 16 float(double) F32vec4(F64vec2) x y 2 01/12/06 6
7 x y 1 C/C++ declspec ( C/C++ ) x y 2 if SAXPY(DAXPY) C/C++ 30 SAXPY(DAXPY) C/C++ /QxK /QxW SSE SSE2 /Qvec_verbose3 SAXPY(DAXPY) C/C++ SAXPY( DAXPY) C/C++ FORTRAN /Qvec_verbose3 2 ivdep pragma for pragma 4(2) vector aligned pragma C/C++ ivdep vector aligned 2 pragma SAXPY DAXPY pragma 1 C/C++ novector pragma 1 /QxW /QaxW /QxW SSE2( Pentium III ) Pentium 4 IA SSE2 /QaxW SSE2 2 1 /QxW 01/12/06 7
8 SSE2 1 /QxW SSE SSE2 FVEC(DVEC) SIMD C++ C/C++ FVEC(DVEC) ( ) C/C++ SSE SSE2 (SSE 4 SSE2 2 ) SAXPY SSE 4 Pentium 4 Pentium III 2 SSE DAXPY SSE2 4 SSE SSE2 SIMD 2 (1)SIMD C++ ( C/C++ ) (2) C/C++ pragma Itanium TM (2) 01/12/06 8
9 5 C/C++ * *Saxpy from BLAS, Lawson, Manson, Kincaid, and Krogh (1979) * *this compilation from _Linpack Users' Guide_, Dongarra, Moler, * Bunch, & Stewart, Siam 1979, appendix A. * *These versions are "unit" saxpy (daxpy), meaning they assume stride = 1. *(note that this is what BLAS requires, and is the most common form) * * Assume all vectors are aligned. * */ void usaxpy (int n, float sa, float *sx, float *sy) { if (sa == 0.0) return; } //The latest intel compilers can now vectorize this loop. //Use the novector pragma to prevent vectorization. #pragma novector for (int i = 0; i < n; i++) sy[i] = sa * sx[i] + sy[i]; void udaxpy (int n, double da, double *dx, double *dy) { if (da == 0.0) return; //The latest intel compilers can now vectorize this loop. //Use the novector pragma to prevent vectorization. #pragma novector for (int i = 0; i < n; i++) dy[i] = da * dx[i] + dy[i]; } 01/12/06 9
10 6 SSE2 C++ /* Assumes vectors are aligned, and that the vector length n is divisible * by 4 for saxpy and 2 for daxpy. */ #include <fvec.h> #include <dvec.h> void usaxpy_fvec (int n, float sa, float *sx, float *sy) { F32vec4 *x = (F32vec4 *)sx, *y = (F32vec4 *)sy; F32vec4 a(sa); if (sa == 0.0) return; n >>= 2; for (int i = 0; i < n; i++) y[i] = a * x[i] + y[i]; } void udaxpy_dvec (int n, double da, double *dx, double *dy) { F64vec2 *x = (F64vec2 *) dx, *y = (F64vec2 *) dy; F64vec2 a(da); if (da == 0.0) return; n >>= 1; for (int i = 0; i < n; i++) y[i] = a * x[i] + y[i]; } 01/12/06 10
11 7 SSE2 /* Assumes vectors are aligned */ void usaxpy_vec (int n, float sa, float *sx, float *sy) { if (sa == 0.0) return; #pragma ivdep #pragma vector aligned for (int i = 0; i < n; i++) sy[i] = sa * sx[i] + sy[i]; } void udaxpy_vec (int n, double da, double *dx, double *dy) { if (da == 0.0) return; #pragma ivdep #pragma vector aligned for (int i = 0; i < n; i++) dy[i] = da * dx[i] + dy[i]; } 01/12/06 11
12 A GHz Pentium :SAXPY/DAXPY ( MFLOPS) Pentium III (733 MHz) SAXPY: C SAXPY: FVEC SAXPY: DAXPY: C DAXPY: DVEC N/A 951 DAXPY: N/A 672 Pentium 4 (1.20 GHz) MHz Pentium III 1.20 GHz Pentium 4 A-2 2 ( ) FLOPS SAXPY SIMD (SSE) 4 Pentium III 2 Pentium SSE 1.9 GFLOPS DAXPY SSE MFLOPS DAXPY 1 SIMD 2 SAXPY 4 01/12/06 A-1
13 2: Pentium III Pentium III (733 MHz) Desktop Board VC820 BIOS VC82010A.86A.0028.P KB 128 MB RDRAM PC Ultra ATA IBM DJNA ATA-66 Creative Labs 3D Blaster Annihilator Pro AGP nvidia GeForce256 DDR 32MB Nvidia Reference Driver 5.22 Windows : Pentium 4 Pentium 4 (1.20 GHz) Desktop Board D850GB BIOS GB85010A.86A.0014.D KB 128 MB RDRAM PC Ultra ATA IBM DJNA ATA-66 / Creative Labs 3D Blaster Annihilator Pro AGP nvidia GeForce256 DDR 32MB NVidia Reference Driver 5.22 Windows /12/06 A-2
ストリーミング SIMD 拡張命令2 (SSE2) を使用した、倍精度浮動小数点ベクトルの最大/最小要素とそのインデックスの検出
SIMD 2(SSE2) / 2.0 2000 7 : 248602J-001 01/10/30 1 305-8603 115 Fax: 0120-47-8832 * Copyright Intel Corporation 1999-2001 01/10/30 2 1...5 2...5 2.1...5 2.1.1...5 2.1.2...8 3...9 3.1...9 3.2...9 4...9
More informationCPU Levels in the memory hierarchy Level 1 Level 2... Increasing distance from the CPU in access time Level n Size of the memory at each level 1: 2.2
FFT 1 Fourier fast Fourier transform FFT FFT FFT 1 FFT FFT 2 Fourier 2.1 Fourier FFT Fourier discrete Fourier transform DFT DFT n 1 y k = j=0 x j ω jk n, 0 k n 1 (1) x j y k ω n = e 2πi/n i = 1 (1) n DFT
More informationSource: Intel.Config: Pentium III Processor-Intel Seattle SE440BX-2, 128MB PC100 CL2 SDRAM Intel 440BX-2 Chipset Platform- Diamond Viper 550 /
2002.1 4 1 2 3 Source: Intel.Config: Pentium III Processor-Intel Seattle SE440BX-2, 128MB PC100 CL2 SDRAM Intel 440BX-2 Chipset Platform- Diamond Viper 550 / nvidia TNT 2x AGP with 16MB memory, nvidia
More information(Basic Theory of Information Processing) 1
(Basic Theory of Information Processing) 1 10 (p.178) Java a[0] = 1; 1 a[4] = 7; i = 2; j = 8; a[i] = j; b[0][0] = 1; 2 b[2][3] = 10; b[i][j] = a[2] * 3; x = a[2]; a[2] = b[i][3] * x; 2 public class Array0
More informationインテル(R) Visual Fortran Composer XE
Visual Fortran Composer XE 1. 2. 3. 4. 5. Visual Studio 6. Visual Studio 7. 8. Compaq Visual Fortran 9. Visual Studio 10. 2 https://registrationcenter.intel.com/regcenter/ w_fcompxe_all_jp_2013_sp1.1.139.exe
More informationmate10„”„õŒì4
2002.10 1 2 3 4 2 LINE UP 31w 79w 3 4 LINE UP Windows XP Windows 98 Pentium 1.70GHz Pentium 1.80GHz Pentium 2A GHz Pentium 2.40GHz Pentium 2.53GHz 0 50 100 150 200 250 Processor:Pentium 4 processor 1.50
More informationスパコンに通じる並列プログラミングの基礎
2018.09.10 furihata@cmc.osaka-u.ac.jp ( ) 2018.09.10 1 / 59 furihata@cmc.osaka-u.ac.jp ( ) 2018.09.10 2 / 59 Windows, Mac Unix 0444-J furihata@cmc.osaka-u.ac.jp ( ) 2018.09.10 3 / 59 Part I Unix GUI CUI:
More informationスパコンに通じる並列プログラミングの基礎
2018.06.04 2018.06.04 1 / 62 2018.06.04 2 / 62 Windows, Mac Unix 0444-J 2018.06.04 3 / 62 Part I Unix GUI CUI: Unix, Windows, Mac OS Part II 2018.06.04 4 / 62 0444-J ( : ) 6 4 ( ) 6 5 * 6 19 SX-ACE * 6
More informationPentium 4
Pentium 4 Pentium 4... 2... 2... 2... 3... 3... 3... 3... 4 TMPGEnc Plus2.5 Ver.2.59... 5... 8... 9... 9 VTune TM... 9 C++/Fortran... 9 1 Pentium 4 HT Xeon TM Pentium 4 3.06GHz HT Pentium 4 NetBurst TM
More informationItanium2ベンチマーク
HPC CPU mhori@ile.osaka-u.ac.jp Special thanks Timur Esirkepov HPC 2004 2 25 1 1. CPU 2. 3. Itanium 2 HPC 2 1 Itanium2 CPU CPU 3 ( ) Intel Itanium2 NEC SX-6 HP Alpha Server ES40 PRIMEPOWER SR8000 Intel
More informationGRAPE GRAPE-DR V-GRAPE
V-GRAPE / CCSR 2007/1/24 GRAPE GRAPE-DR V-GRAPE http://antwrp.gsfc.nasa.gov/apod/ap950917.html ( ) SDSS GRAPE : (Barnes-Hut tree, FMM, Particle- Mesh Ewald(PPPM)...): ( ) 1988 GRAPE-1(1989) 16 8 32
More informationGRAPE GRAPE-DR V-GRAPE
GRAPE-DR / 2006/11/20-22 GRAPE GRAPE-DR V-GRAPE http://antwrp.gsfc.nasa.gov/apod/ap950917.html ( ) SDSS Genzel et al 2003 Adaptive Optics SgrA ( ) 12 1 : GRAPE : (Barnes-Hut tree, FMM, Particle- Mesh
More informationスパコンに通じる並列プログラミングの基礎
2016.06.06 2016.06.06 1 / 60 2016.06.06 2 / 60 Windows, Mac Unix 0444-J 2016.06.06 3 / 60 Part I Unix GUI CUI: Unix, Windows, Mac OS Part II 0444-J 2016.06.06 4 / 60 ( : ) 6 6 ( ) 6 10 6 16 SX-ACE 6 17
More information64bit SSE2 SSE2 FPU Visual C++ 64bit Inline Assembler 4 FPU SSE2 4.1 FPU Control Word FPU 16bit R R R IC RC(2) PC(2) R R PM UM OM ZM DM IM R: reserved
(Version: 2013/5/16) Intel CPU (kashi@waseda.jp) 1 Intel CPU( AMD CPU) 64bit SIMD Inline Assemler Windows Visual C++ Linux gcc 2 FPU SSE2 Intel CPU double 8087 FPU (floating point number processing unit)
More informationMBLAS¤ÈMLAPACK; ¿ÇÜĹÀºÅÙÈǤÎBLAS/LAPACK¤ÎºîÀ®
MBLAS MLAPACK; BLAS/LAPACK maho@riken.jp February 23, 2009 MPACK(MBLAS/MLAPACK) ( ) (2007 ) ( ) http://accc.riken.jp/maho/ BLAS/LAPACK http://mplapack.sourceforge.net/ BLAS (Basic Linear Algebra Subprograms)
More informationExpress5800/110Ee Pentium 1. Express5800/110Ee N N Express5800/110Ee Express5800/110Ee ( /800EB(256)) ( /800EB(256) 20W) CPU L1 L2 CD-
Express5800/110Ee Pentium 1. Express5800/110Ee N8500-654 N8500-655 Express5800/110Ee Express5800/110Ee ( /800EB(256)) ( /800EB(256) 20W) CPU L1 L2 CD-ROM LAN Windows NT Server 4.0 Pentium 800EBMHz 1 (
More informationExpress5800/110Ee (2002/01/22)
(2002/01/22) 1. N8100-691 ( /1BG(256)) CPU L1 L2 CD-ROM LAN OS Pentium 1.0BGHz 1 32KB 256KB 128MB 1.5GB ( IDE 60GB 3( IDE 2)) ( SCSI 18.1GB 3) 14 40 100BASE-TX 10BASE-T 640 480 1280 1024(VRAM 8MB) 2. CD-ROM
More information64bit SSE2 SSE2 FPU Visual C++ 64bit Inline Assembler 4 FPU SSE2 4.1 FPU Control Word FPU 16bit R R R IC RC(2) PC(2) R R PM UM OM ZM DM IM R: reserved
(Version: 2013/7/10) Intel CPU (kashi@waseda.jp) 1 Intel CPU( AMD CPU) 64bit SIMD Inline Assemler Windows Visual C++ Linux gcc 2 FPU SSE2 Intel CPU double 8087 FPU (floating point number processing unit)
More information(Version: 2017/4/18) Intel CPU 1 Intel CPU( AMD CPU) 64bit SIMD Inline Assemler Windows Visual C++ Linux gcc 2 FPU SSE2 Intel CPU do
(Version: 2017/4/18) Intel CPU (kashi@waseda.jp) 1 Intel CPU( AMD CPU) 64bit SIMD Inline Assemler Windows Visual C++ Linux gcc 2 FPU SSE2 Intel CPU double 8087 FPU (floating point number processing unit)
More informationuntitled
A = QΛQ T A n n Λ Q A = XΛX 1 A n n Λ X GPGPU A 3 T Q T AQ = T (Q: ) T u i = λ i u i T {λ i } {u i } QR MR 3 v i = Q u i A {v i } A n = 9000 Quad Core Xeon 2 LAPACK (4/3) n 3 O(n 2 ) O(n 3 ) A {v i }
More informationv10 IA-32 64¹ IA-64²
v10 IA-32 64¹ IA-64² 1. 2. 3. 4. 5. 6. /Od (-O0) Windows* /O1 /O2 /O3 Linux* Mac OS* -O1 -O2 -O3 /O2 ( O2) /O3 (-O3) IA-64 Core 2 /QxT ( xt) IA-32 64 IA-32 64 Itanium 2 9000 /G2-p9000 ( mtune=itanium2-p9000)
More information倍々精度RgemmのnVidia C2050上への実装と応用
.. maho@riken.jp http://accc.riken.jp/maho/,,, 2011/2/16 1 - : GPU : SDPA-DD 10 1 - Rgemm : 4 (32 ) nvidia C2050, GPU CPU 150, 24GFlops 25 20 GFLOPS 15 10 QuadAdd Cray, QuadMul Sloppy Kernel QuadAdd Cray,
More informationインテル(R) C++ Composer XE 2011 Windows版 入門ガイド
C++ Composer XE 2011 Windows* エクセルソフト株式会社 www.xlsoft.com Rev. 1.2 (2011/05/03) Copyright 1998-2011 XLsoft Corporation. All Rights Reserved. 1 / 70 ... 4... 5... 6... 8 /... 8... 10 /... 11... 11 /... 13
More informationuntitled
A = QΛQ T A n n Λ Q A = XΛX 1 A n n Λ X GPGPU A 3 T Q T AQ = T (Q: ) T u i = λ i u i T {λ i } {u i } QR MR 3 v i = Q u i A {v i } A n = 9000 Quad Core Xeon 2 LAPACK (4/3) n 3 O(n 2 ) O(n 3 ) A {v i }
More information23 Fig. 2: hwmodulev2 3. Reconfigurable HPC 3.1 hw/sw hw/sw hw/sw FPGA PC FPGA PC FPGA HPC FPGA FPGA hw/sw hw/sw hw- Module FPGA hwmodule hw/sw FPGA h
23 FPGA CUDA Performance Comparison of FPGA Array with CUDA on Poisson Equation (lijiang@sekine-lab.ei.tuat.ac.jp), (kazuki@sekine-lab.ei.tuat.ac.jp), (takahashi@sekine-lab.ei.tuat.ac.jp), (tamukoh@cc.tuat.ac.jp),
More informationRaVioli SIMD
RaVioli SIMD 17 17115074 i RaVioli SIMD PC PC PC PC CPU RaVioli RaVioli CPU RaVioli CPU SIMD RaVioli RaVioli SIMD RaVioli SIMD RaVioli SIMD 1 1 2 RaVioli 2 2.1 RaVioli.......................................
More informationWeb Web Web Web i
28 Research of password manager using pattern lock and user certificate 1170369 2017 2 28 Web Web Web Web i Abstract Research of password manager using pattern lock and user certificate Takuya Mimoto In
More informationExpress5800/110Rc-1 1. Express5800/110Rc-1 N N Express5800/110Rc-1 Express5800/110Rc-1 ( /1BG(256)) (C/850(128)) CPU Pentium (1BGHz) 1
(2002/01/22) Express5800/110Rc-1 1. Express5800/110Rc-1 N8100-665 N8100-793 Express5800/110Rc-1 Express5800/110Rc-1 ( /1BG(256)) (C/850(128)) CPU Pentium (1BGHz) 1 Celeron (850MHz) 1 L1 32KB L2 256KB 128KB
More informationG007 Panasonic CF-R7 U7700 1.33GHz 2GB 250GB 12inch 0.9Kg G008 Panasonic CF-R3 1.10GHz 768MB 40GB 10inch 0.9Kg WinXP Pro G009 Panasonic CF-R4 1.1GHz 7
リユースノートパソコン 利 用 申 請 書 に 希 望 するパソコンの 管 理 コードを 記 載 してください 全 てのPCに 以 下 のソフトがインストールされています Kingsoft Office Plus (オフィスソフト) ESET Smart Security (セキュリティソフト) Kingsoft Office は Microsoft Officeとの 互 換 性 は 大 変 高 く
More information07-二村幸孝・出口大輔.indd
GPU Graphics Processing Units HPC High Performance Computing GPU GPGPU General-Purpose computation on GPU CPU GPU GPU *1 Intel Quad-Core Xeon E5472 3.0 GHz 2 6 MB L2 cache 1600 MHz FSB 80 GFlops 1 nvidia
More informationSonicStage Ver. 2.0
3-263-346-01(1) SonicStage Ver. 2.0 SonicStage SonicStage 2004 Sony Corporation Windows SonicStage OpenMG Net MD ATRAC ATRAC3 ATRAC3plus Microsoft Windows Windows NT Windows Media Microsoft Corporation
More information01_OpenMP_osx.indd
OpenMP* / 1 1... 2 2... 3 3... 5 4... 7 5... 9 5.1... 9 5.2 OpenMP* API... 13 6... 17 7... 19 / 4 1 2 C/C++ OpenMP* 3 Fortran OpenMP* 4 PC 1 1 9.0 Linux* Windows* Xeon Itanium OS 1 2 2 WEB OS OS OS 1 OS
More informationHPC / (CfCA) HPC 2007/11/23-25
HPC / (CfCA) HPC 2007/11/23-25 CfCA GRAPE GRAPE GRAPE-DR HPC : : 1 1 (II ) Ia 100 1 ( ) 0.1 pc 1 AU 3 : 1 100 Top-down Katz and Gunn 1992 Dark Matter + + DM, : :SPH 10 4 Cray YMP 500-1000 : 10 7 Saitoh
More information橡Webcamユーザーガイド03.PDF
Desktop On-Call Version 4 Webcam extension Pak for Windows Webcam extension Pak Desktop On-Call Version 4 Web PC i Desktop On-Call Version 4 PC PC Desktop On-Call Version 4 PC Windows 98 Windows 98SE Windows
More informationSecond-semi.PDF
PC 2000 2 18 2 HPC Agenda PC Linux OS UNIX OS Linux Linux OS HPC 1 1CPU CPU Beowulf PC (PC) PC CPU(Pentium ) Beowulf: NASA Tomas Sterling Donald Becker 2 (PC ) Beowulf PC!! Linux Cluster (1) Level 1:
More information2
DX Simulator Copyright 2001-2002 Yamaha Corporation. All rights reserved. Version 1.2, 2002 YAMAHA CORPORATION 2 z x z x c 3 z Windows Macintosh Windows Macintosh x 4 z Windows Macintosh Windows Macintosh
More informationXcalableMP入門
XcalableMP 1 HPC-Phys@, 2018 8 22 XcalableMP XMP XMP Lattice QCD!2 XMP MPI MPI!3 XMP 1/2 PCXMP MPI Fortran CCoarray C++ MPIMPI XMP OpenMP http://xcalablemp.org!4 XMP 2/2 SPMD (Single Program Multiple Data)
More informationGPU GPU CPU CPU CPU GPU GPU N N CPU ( ) 1 GPU CPU GPU 2D 3D CPU GPU GPU GPGPU GPGPU 2 nvidia GPU CUDA 3 GPU 3.1 GPU Core 1
GPU 4 2010 8 28 1 GPU CPU CPU CPU GPU GPU N N CPU ( ) 1 GPU CPU GPU 2D 3D CPU GPU GPU GPGPU GPGPU 2 nvidia GPU CUDA 3 GPU 3.1 GPU Core 1 Register & Shared Memory ( ) CPU CPU(Intel Core i7 965) GPU(Tesla
More informationFileMaker Mobile 8 User’s Guide
For Windows, Mac, Palm OS, and Pocket PC FileMaker Mobile 8 Companion for Palm OS and Pocket PC 2000-2006 FileMaker, Inc. All Rights Reserved. FileMaker, Inc. 5201 Patrick Henry Drive Santa Clara, California
More information1重谷.PDF
RSCC RSCC RSCC BMT 1 6 3 3000 3000 200310 1994 19942 VPP500/32PE 19992 VPP700E/128PE 160PE 20043 2 2 PC Linux 2048 CPU Intel Xeon 3.06GHzDual) 12.5 TFLOPS SX-7 32CPU/256GB 282.5 GFLOPS Linux 3 PC 1999
More information1st-session key
1 2013/11/29 Project based Learning: Soccer Agent Program 1 2012/12/9 Project based Learning: Soccer Agent Program PBL Learning by doing Schedule 1,2 2013 11/29 Make 2013 12/6 2013 12/13 2013 12/20 2014
More information1 (bit ) ( ) PC WS CPU IEEE754 standard ( 24bit) ( 53bit)
GNU MP BNCpack tkouya@cs.sist.ac.jp 2002 9 20 ( ) Linux Conference 2002 1 1 (bit ) ( ) PC WS CPU IEEE754 standard ( 24bit) ( 53bit) 10 2 2 3 4 5768:9:; = %? @BADCEGFH-I:JLKNMNOQP R )TSVU!" # %$ & " #
More informationiphone GPGPU GPU OpenCL Mac OS X Snow LeopardOpenCL iphone OpenCL OpenCL NVIDIA GPU CUDA GPU GPU GPU 15 GPU GPU CPU GPU iii OpenMP MPI CPU OpenCL CUDA OpenCL CPU OpenCL GPU NVIDIA Fermi GPU Fermi GPU GPU
More informationdevelop
SCore SCore 02/03/20 2 1 HA (High Availability) HPC (High Performance Computing) 02/03/20 3 HA (High Availability) Mail/Web/News/File Server HPC (High Performance Computing) Job Dispatching( ) Parallel
More informationmain.dvi
PC 1 1 [1][2] [3][4] ( ) GPU(Graphics Processing Unit) GPU PC GPU PC ( 2 GPU ) GPU Harris Corner Detector[5] CPU ( ) ( ) CPU GPU 2 3 GPU 4 5 6 7 1 toyohiro@isc.kyutech.ac.jp 45 2 ( ) CPU ( ) ( ) () 2.1
More informationDPCK-US10
3-064-500-02(1) DPCK-US10 2000 Sony Corporation DPCK-US10 USB MGI PhotoSuite MGI PhotoSuite PhotoSuite V8.1 Windows ) VideoWave SE+ Windows PhotoSuite SE V1.1(Macintosh ) DPCK-US10 USB USB MGI PhotoSuite
More informationExpress5800/120Rb-1 (2002/01/22)
(2002/01/22) 1. N8100-764 N8100-765 N8100-783 ( /1BG(256)) ( /1.26G(512)) ( /1.40G(512)) CPU Pentium Pentium -S Pentium -S (1BGHz) 1( 2 ) (1.26GHz) 1( 2 ) (1.40GHz) 1( 2 ) L1 32KB L2 256KB 512KB 256MB(
More informationHP xw9400 Workstation
HP xw9400 Workstation HP xw9400 Workstation AMD Opteron TM PCI Express x16 64 PCI Express x16 2 USB2.0 8 IEEE1394 2 8DIMM HP HP xw9400 Workstation HP CPU HP CPU 240W CPU HP xw9400 HP CPU CPU CPU CPU Sound
More informationNEC All rights reserved 1
NEC All rights reserved 1 NEC All rights reserved 2 NEC All rights reserved 3 (Founder) (Langchao Langchao) NEC All rights reserved 4 2.1 GB/s 64 bits wide 266 MHz 4 MB L3 on board, 96k L2, 32k L1 on -die
More informationExpress5800/120Ra-1
1. CPU L1 L2 CD-ROM LAN OS OS N8100-661A ( /1BG(256)) Pentium 1.0BGHz 1 2 32KB 256KB 128MB 4GB (73.2GB 2) 10 24 100BASE-TX 10BASE-T 2 640 480 1280 1024* 2. DISK LINK/ACT(LAN1) STATUS LINK/ACT(LAN2) POWER/SLEEP
More informationHP Compaq Business Desktop dx7300シリーズ
本カタログは 旧製品もしくはすでに販売終了した製品のカタログです 最新版のカタログ 現在販売している製品のカタログは下記サイトにございます www.hp.com/jp/catalog その他ご不明な点は下記お問い合わせ窓口までご連絡ください HP Directplus 9 00 19 00 5/1 10 00 17 00 03-6416-6222 HP 9 00 19 00 10 00 17 00
More informationMicrosoft PowerPoint - sales2.ppt
最適化とは何? CPU アーキテクチャに沿った形で最適な性能を抽出できるようにする技法 ( 性能向上技法 ) コンパイラによるプログラム最適化 コンパイラメーカの技量 経験量に依存 最適化ツールによるプログラム最適化 KAP (Kuck & Associates, Inc. ) 人によるプログラム最適化 アーキテクチャのボトルネックを知ること 3 使用コンパイラによる性能の違い MFLOPS 90
More information1 OpenCL OpenCL 1 OpenCL GPU ( ) 1 OpenCL Compute Units Elements OpenCL OpenCL SPMD (Single-Program, Multiple-Data) SPMD OpenCL work-item work-group N
GPU 1 1 2 1, 3 2, 3 (Graphics Unit: GPU) GPU GPU GPU Evaluation of GPU Computing Based on An Automatic Program Generation Technology Makoto Sugawara, 1 Katsuto Sato, 1 Kazuhiko Komatsu, 2 Hiroyuki Takizawa
More informationGetting Started Creative Sound Blaster Live! 5.1 Creative Sound Blaster Live! 5.1 Digital Audio Creative Technology Ltd. Creative Technology Ltd. 1 Co
TM Getting Started Creative Sound Blaster Live! 5.1 Creative Sound Blaster Live! 5.1 Digital Audio Creative Technology Ltd. Creative Technology Ltd. 1 Copyright 1998-2002 by Creative Technology Ltd. All
More informationContents Windows* /Linux* C++/Fortran... 3 Microsoft* embedded Visual C++* C Microsoft* Windows* CE.NET Platform Builder C IP
Windows*/Linux* VTune TM Contents... 1... 2 Windows* /Linux* C++/Fortran... 3 Microsoft* embedded Visual C++* C++... 9 Microsoft* Windows* CE.NET Platform Builder C++... 11 IPP... 13 PCA IPP... 15 GPP...
More information卒業論文2.dvi
15 GUI A study on the system to transfer a GUI sub-picture to the enlarging viewer for operational support 1040270 2004 2 27 GUI PC PC GUI Graphical User Interface PC GUI GUI PC GUI PC PC GUI i Abstract
More informationCompiler Differences on OpenVMS I64
Itanium OpenVMS 2006 4 4 2006 4 6 OpenVMS Technical Director gaitan.dantoni@hp.com Intel Intel Inside Itanium Intel Corporation Portions 1998-2005 Intel Corporation Portions 1998-2005 Hewlett-Packard Corporation
More informationFFTSS Library Version 3.0 User's Guide
: 19 10 31 FFTSS 3.0 Copyright (C) 2002-2007 The Scalable Software Infrastructure Project, (CREST),,. http://www.ssisc.org/ Contents 1 4 2 (DFT) 4 3 4 3.1 UNIX............................................
More informationXMPによる並列化実装2
2 3 C Fortran Exercise 1 Exercise 2 Serial init.c init.f90 XMP xmp_init.c xmp_init.f90 Serial laplace.c laplace.f90 XMP xmp_laplace.c xmp_laplace.f90 #include int a[10]; program init integer
More informationUntitled
VASP 2703 2006 3 VASP 100 PC 3,4 VASP VASP VASP FFT. (LAPACK,BLAS,FFT), CPU VASP. 1 C LAPACK,BLAS VASP VASP VASP VASP bench.hg VASP CPU CPU CPU northwood LAPACK lmkl lapack64, BLAS lmkl p4 LA- PACK liblapack,
More information展開とプロビジョニングの概念
ADOBE CREATIVE SUITE 5 2010 Adobe Systems Incorporated and its licensors. All rights reserved. Adobe Creative Suite Deployment and Provisioning Concepts This guide is licensed for use under the terms of
More informationExpress5800/120Lf 1. Express5800/120Lf N N N Express5800/120Lf Express5800/120Lf Express5800/120Lf ( /1BG(256)) ( /1BG(256)) (
(2001/11/13) Express5800/120Lf 1. Express5800/120Lf N8100-748 N8100-751 N8100-754 Express5800/120Lf Express5800/120Lf Express5800/120Lf ( /1BG(256)) ( /1BG(256)) ( /1.26G(512)) CPU Hot-Plug Pentium (1.0BGHz)
More informationNW-E062 / E063 / E062K/ E063K
/ FM / 112 NW-E062 / E063 / E062K/ E063K 59 70 1 USB USB / MD CD MD CD 111 x- x- 112 NW-E062 / E063 / E062K/ E063K / FM / 112 NW-E062 / E063 / E062K/ E063K / FM / 112 MD CD MD CD x- CD 29 31 47 52 111
More informationインテル(R) Visual Fortran Composer XE 2013 Windows版 入門ガイド
Visual Fortran Composer XE 2013 Windows* エクセルソフト株式会社 www.xlsoft.com Rev. 1.1 (2012/12/10) Copyright 1998-2013 XLsoft Corporation. All Rights Reserved. 1 / 53 ... 3... 4... 4... 5 Visual Studio... 9...
More informationExpress5800/120Rb-2
Workgroup/Department 1. N8500-478 N8500-486 (/533EB(256)) (/667EB(256)) CPU Pentium Pentium 533EBMHz1 2 667EBMHz1 2 L1 32KB L2 256KB 128MB 4GB ( 72.6GB) CD-ROM 24 LAN 100BASE-TX 6404801024768 OS OS 2.
More informationDPD Software Development Products Overview
2 2007 Intel Corporation. Core 2 Core 2 Duo 2006/07/27 Core 2 precise VTune Core 2 Quad 2006/11/14 VTune Core 2 ( ) 1 David Levinthal 3 2007 Intel Corporation. PC Core 2 Extreme QX6800 2.93GHz, 1066MHz
More informationExpress5800/120Rc-2 Workgroup/Department 1. Express5800/120Rc-2 N N N Express5800/120Rc-2 Express5800/120Rc-2 Express5800/120R
Express5800/120Rc-2 Workgroup/Department 1. Express5800/120Rc-2 N8500-572 N8500-573 N8500-574 Express5800/120Rc-2 Express5800/120Rc-2 Express5800/120Rc-2 (/667(256)) (/800EB(256)) (/933(256)) CPU L1 L2
More informationExpress5800/120Le
Workgroup/Department 1. N8500-579A N8500-671 N8500-672 (/800EB (256)-27AWS) (/800EB (256)-27AW2S) (/800EB(256)) Windows NT Server 4.0 Windows 2000 Server CPU Pentium800EBMHz1 2 L1 32KB L2 256KB 128MB 4GB
More informationInfoprint 250 GA
Infoprint 250 GA88-0164-00 Infoprint 250 GA88-0164-00 ! viii 2000 5 Copyright International Business Machines Corporation 2000. All rights reserved. ii Infoprint 250 ...III... VI...VII... VIII... VIII...IX
More information,,,,., C Java,,.,,.,., ,,.,, i
24 Development of the programming s learning tool for children be derived from maze 1130353 2013 3 1 ,,,,., C Java,,.,,.,., 1 6 1 2.,,.,, i Abstract Development of the programming s learning tool for children
More informationimai@eng.kagawa-u.ac.jp No1 No2 OS Wintel Intel x86 CPU No3 No4 8bit=2 8 =256(Byte) 16bit=2 16 =65,536(Byte)=64KB= 6 5 32bit=2 32 =4,294,967,296(Byte)=4GB= 43 64bit=2 64 =18,446,744,073,709,551,615(Byte)=16EB
More informationExpress5800/120Ed
Pentium 60% 1. N8500-570A N8500-662 N8500-663 N8500-664 ( /800EB(256)) ( /800EB(256)-9W) ( /800EB(256)-9W2) ( /1BG(256)) Windows NT Server 4.0 Windows 2000 HDD HDD CPU Pentium 800EBMHz1 Pentium 1BGHz1
More informationindd
Windows Vista 2 Service pack 1 SP1 Windows Vista Windows Xp Windows Vista Windows Vista CPU Windows OS Windows Xp Windows Vista Windows 7 15 20 Windows Vista Windows Vista Windows Xp Windows Vista Windows
More informationIntel® Compilers Professional Editions
2007 6 10.0 * 10.0 6 5 Software &Solutions group 10.0 (SV) C++ Fortran OpenMP* OpenMP API / : 200 C/C++ Fortran : OpenMP : : : $ cat -n main.cpp 1 #include 2 int foo(const char *); 3 int main()
More informationPrinter Driverセットアップ編
Microsoft MS-DOS Windows Windows Server Windows Vista Internet Explorer PowerPoint Outlook Microsoft Corporation Microsoft Corporation Intel Intel Inside Itanium Pentium Intel Corporation USB-IF Universal
More informationExpress5800/120Lc
Workgroup/Department 1. N8500-371 CPU L1 L2 CD-ROM LAN OS OS (/450(512)) N8500-372 N8500-373 N8500-400 (/450(512)-25AWS) (/500(512)) (/450(512)-25AWE) StarOffice Exchange Pentium450MHz1 2 ( 72GB) 32KB
More informationExcel97関数編
Excel97 SUM Microsoft Excel 97... 1... 1... 1... 2... 3... 3... 4... 5... 6... 6... 7 SUM... 8... 11 Microsoft Excel 97 AVERAGE MIN MAX SUM IF 2 RANK TODAY ROUND COUNT INT VLOOKUP 1/15 Excel A B C A B
More informationMultiPASS B-20 MultiPASS Suite 3.10使用説明書
TM MultiPASS Suite Ver.3.10 for Windows ii iii Copyright 1999 Canon Inc. ALL RIGHTS RESERVED iv v vi vii viii ix x 1 2 3 4 5 6 7 8 9 xi xii 1 1 1-1 1 2 3 1-2 4 5 1 1-3 6 1-4 1 7 8 1-5 9 10 11 1-6 1 1-7
More informationHP Compaq Business Desktop dc7700シリーズ
本カタログは 旧製品もしくはすでに販売終了した製品のカタログです 最新版のカタログ 現在販売している製品のカタログは下記サイトにございます www.hp.com/jp/catalog その他ご不明な点は下記お問い合わせ窓口までご連絡ください HP Directplus 9 00 19 00 5/1 10 00 17 00 03-6416-6222 HP 9 00 19 00 10 00 17 00
More information1 4 1.1........................................... 4 1.2.................................. 4 1.3................................... 4 2 5 2.1 GPU.....
CPU GPU N Q07-065 2011 2 17 1 1 4 1.1........................................... 4 1.2.................................. 4 1.3................................... 4 2 5 2.1 GPU...........................................
More informationuntitled
SUBJECT: Applied Biosystems Data Collection Software v2.0 v3.0 Windows 2000 OS : 30 45 Cancel Data Collection - Applied Biosystems Sequencing Analysis Software v5.2 - Applied Biosystems SeqScape Software
More informationHPC
HPC NECHPC Express5800 1000 TX7/AzusA Express5800/Parallel PC-Cluster - AzusA AzusA SX IA-64 AzusA)Express5800 AzusA PC PC < > ( ) SX-6 HPC SX-6i CPU GFLOPS 1996 1997 1998 1999 2000 2001 2002 Car Parrinello
More informationpptx
iphone 2010 8 18 C xkozima@myu.ac.jp C Hello, World! Hello World hello.c! printf( Hello, World!\n );! os> ls! hello.c! os> cc hello.c o hello! os> ls! hello!!hello.c! os>./hello! Hello, World!! os>! os>
More information16 2020 H.264/AVC 2 H.265/HEVC 1 H.265 JCT-VC HM(HEVC Test Model) HM 5 5 SIMD HM 33%
H.265/HEVC 2014 (410808) 16 2020 H.264/AVC 2 H.265/HEVC 1 H.265 JCT-VC HM(HEVC Test Model) HM 5 5 SIMD HM 33% Abstract In recent years, high resolution video technology has been developed in order to start
More information02_C-C++_osx.indd
C/C++ OpenMP* / 2 C/C++ OpenMP* OpenMP* 9.0 1... 2 2... 3 3OpenMP*... 5 3.1... 5 3.2 OpenMP*... 6 3.3 OpenMP*... 8 4OpenMP*... 9 4.1... 9 4.2 OpenMP*... 9 4.3 OpenMP*... 10 4.4... 10 5OpenMP*... 11 5.1
More information(SAD) x86 MPSADBW H.264/AVC H.264/AVC SAD SAD x86 SAD MPSADBW SAD 3x3 3 9 SAD SAD SAD x86 MPSADBW SAD 9 SAD SAD 4.6
SAD 23 (410M520) (SAD) x86 MPSADBW H.264/AVC H.264/AVC SAD SAD x86 SAD MPSADBW SAD 3x3 3 9 SAD SAD SAD x86 MPSADBW SAD 9 SAD SAD 4.6 Abstract In recent years, the high definition of video image has made
More information単位、情報量、デジタルデータ、CPUと高速化 ~ICT用語集~
CPU ICT mizutani@ic.daito.ac.jp 2014 SI: Systèm International d Unités SI SI 10 1 da 10 1 d 10 2 h 10 2 c 10 3 k 10 3 m 10 6 M 10 6 µ 10 9 G 10 9 n 10 12 T 10 12 p 10 15 P 10 15 f 10 18 E 10 18 a 10 21
More information1 3DCG [2] 3DCG CG 3DCG [3] 3DCG 3 3 API 2 3DCG 3 (1) Saito [4] (a) 1920x1080 (b) 1280x720 (c) 640x360 (d) 320x G-Buffer Decaudin[5] G-Buffer D
3DCG 1) ( ) 2) 2) 1) 2) Real-Time Line Drawing Using Image Processing and Deforming Process Together in 3DCG Takeshi Okuya 1) Katsuaki Tanaka 2) Shigekazu Sakai 2) 1) Department of Intermedia Art and Science,
More informationM-crew for HAR-LH500 (Version 2.6J)
4-249-905-01 (1) M-crew for HAR-LH500 Version 2.6J M-crew Readme Readme M-crew for HAR-LH500 Readme 2003 Sony Corporation Program 1997-2003 Sony Corporation Documentation 2003 Sony Corporation 2003 Sony
More informationOpenMP (1) 1, 12 1 UNIX (FUJITSU GP7000F model 900), 13 1 (COMPAQ GS320) FUJITSU VPP5000/64 1 (a) (b) 1: ( 1(a))
OpenMP (1) 1, 12 1 UNIX (FUJITSU GP7000F model 900), 13 1 (COMPAQ GS320) FUJITSU VPP5000/64 1 (a) (b) 1: ( 1(a)) E-mail: {nanri,amano}@cc.kyushu-u.ac.jp 1 ( ) 1. VPP Fortran[6] HPF[3] VPP Fortran 2. MPI[5]
More information211 年ハイパフォーマンスコンピューティングと計算科学シンポジウム Computing Symposium 211 HPCS /1/18 a a 1 a 2 a 3 a a GPU Graphics Processing Unit GPU CPU GPU GPGPU G
211 年ハイパフォーマンスコンピューティングと計算科学シンポジウム Computing Symposium 211 HPCS211 211/1/18 GPU 4 8 BLAS 4 8 BLAS Basic Linear Algebra Subprograms GPU Graphics Processing Unit 4 8 double 2 4 double-double DD 4 4 8 quad-double
More informationEPSON ES-D200 パソコンでのスキャンガイド
NPD4271-00 ...4...7 EPSON Scan... 7...11 PDF...12 / EPSON Scan...13 EPSON Scan...13 EPSON Scan...14 EPSON Scan...14 EPSON Scan...15 Epson Event Manager...16 Epson Event Manager...16 Epson Event Manager...16
More informationインテル(R) Visual Fortran Composer XE 2011 Windows版 入門ガイド
Visual Fortran Composer XE 2011 Windows* エクセルソフト株式会社 www.xlsoft.com Rev. 1.0 (2010/12/20) Copyright 1998-2011 XLsoft Corporation. All Rights Reserved. 1 / 36 ... 3... 4... 5... 7 /... 7... 9 /... 9...
More informationEPSON Easy Interactive Tools Ver.4.2 Operation Guide
Easy Interactive Tools Ver.4.2 Easy Interactive Tools Ver.4.2 Easy Interactive Tools Easy Interactive Tools Easy Interactive Pen s p.12 s p.12 s p.11 s p.18 s p.20 s p.29 Easy Interactive Tools PowerPoint
More informationHP COMPAQ BUSINESS DESKTOP DC7800シリーズ
本カタログは 旧製品もしくはすでに販売終了した製品のカタログです 最新版のカタログ 現在販売している製品のカタログは下記サイトにございます www.hp.com/jp/catalog その他ご不明な点は下記お問い合わせ窓口までご連絡ください HP Directplus 9 00 19 00 5/1 10 00 17 00 03-6416-6222 HP 9 00 19 00 10 00 17 00
More informationsupercomputer2010.ppt
nanri@cc.kyushu-u.ac.jp 1 !! : 11 12! : nanri@cc.kyushu-u.ac.jp! : Word 2 ! PC GPU) 1997 7 http://wiredvision.jp/news/200806/2008062322.html 3 !! (Cell, GPU )! 4 ! etc...! 5 !! etc. 6 !! 20km 40 km ) 340km
More informationNW- E052 / E053
3 2010 Sony Corporation Printed in China 4-192-084-01 (1) NW- E052 / E053 x- 13 URL http://www.sony.jp/support/pa_common/x-appli/download/ 1 2 2 URL URL http://m.sony.jp/m/walkman/i/nwe050/mg02/ 2 URL
More informationたのしいプログラミング Pythonではじめよう!
Title of English-language original: Python for Kids A Playful Introduction to Programming ISBN 978-1-59327-407-8, published by No Starch Press, Inc. Copyright 2013 by Jason R. Briggs. Japanese-language
More information