Similar documents
FIT2013( 第 12 回情報科学技術フォーラム ) I-032 Acceleration of Adaptive Bilateral Filter base on Spatial Decomposition and Symmetry of Weights 1. Taiki Makishi Ch

Slides: TimeGraph: GPU Scheduling for Real-Time Multi-Tasking Environments

GPGPU

main.dvi

23 Fig. 2: hwmodulev2 3. Reconfigurable HPC 3.1 hw/sw hw/sw hw/sw FPGA PC FPGA PC FPGA HPC FPGA FPGA hw/sw hw/sw hw- Module FPGA hwmodule hw/sw FPGA h

07-二村幸孝・出口大輔.indd

2005 1

Agenda GRAPE-MPの紹介と性能評価 GRAPE-MPの概要 OpenCLによる四倍精度演算 (preliminary) 4倍精度演算用SIM 加速ボード 6 processor elem with 128 bit logic Peak: 1.2Gflops

スライド 1

「FPGAを用いたプロセッサ検証システムの製作」

GPU GPU CPU CPU CPU GPU GPU N N CPU ( ) 1 GPU CPU GPU 2D 3D CPU GPU GPU GPGPU GPGPU 2 nvidia GPU CUDA 3 GPU 3.1 GPU Core 1

プロセッサ・アーキテクチャ

,., ping - RTT,., [2],RTT TCP [3] [4] Android.Android,.,,. LAN ACK. [5].. 3., 1.,. 3 AI.,,Amazon, (NN),, 1..NN,, (RNN) RNN

IPSJ SIG Technical Report Vol.2014-CG-155 No /6/28 1,a) 1,2,3 1 3,4 CG An Interpolation Method of Different Flow Fields using Polar Inter

VLSI工学

Introduction Purpose This training course demonstrates the use of the High-performance Embedded Workshop (HEW), a key tool for developing software for

1 3DCG [2] 3DCG CG 3DCG [3] 3DCG 3 3 API 2 3DCG 3 (1) Saito [4] (a) 1920x1080 (b) 1280x720 (c) 640x360 (d) 320x G-Buffer Decaudin[5] G-Buffer D

次世代スーパーコンピュータのシステム構成案について

Express5800/120Ed

Microsoft PowerPoint - GPU_computing_2013_01.pptx

untitled

Express5800/110Ee (2002/01/22)

untitled

26 FPGA FPGA (Field Programmable Gate Array) ASIC (Application Specific Integrated Circuit) FPGA FPGA FPGA FPGA Linux FreeDOS skewed way L1


Express5800/120Lc

3 SIMPLE ver 3.2: SIMPLE (SIxteen-bit MicroProcessor for Laboratory Experiment) 1 16 SIMPLE SIMPLE 2 SIMPLE 2.1 SIMPLE (main memo

Express5800/110Ee Pentium 1. Express5800/110Ee N N Express5800/110Ee Express5800/110Ee ( /800EB(256)) ( /800EB(256) 20W) CPU L1 L2 CD-

untitled

FINAL PROGRAM 22th Annual Workshop SWoPP / / 2009 Sendai Summer United Workshops on Parallel, Distributed, and Cooperative Processing

Express5800/120Lf 1. Express5800/120Lf N N N Express5800/120Lf Express5800/120Lf Express5800/120Lf ( /1BG(256)) ( /1BG(256)) (

Express5800/140Ma


2nd-1.dvi

untitled

スパコンに通じる並列プログラミングの基礎

1 GPU GPGPU GPU CPU 2 GPU 2007 NVIDIA GPGPU CUDA[3] GPGPU CUDA GPGPU CUDA GPGPU GPU GPU GPU Graphics Processing Unit LSI LSI CPU ( ) DRAM GPU LSI GPU

組込みシステムシンポジウム2011 Embedded Systems Symposium 2011 ESS /10/20 FPGA Android Android Java FPGA Java FPGA Dalvik VM Intel Atom FPGA PCI Express DM

マルチコアPCクラスタ環境におけるBDD法のハイブリッド並列実装

スライド 1

セゾン保険_PDF用.indd

10D16.dvi

単位、情報量、デジタルデータ、CPUと高速化 ~ICT用語集~

Express5800/110Rc-1 1. Express5800/110Rc-1 N N Express5800/110Rc-1 Express5800/110Rc-1 ( /1BG(256)) (C/850(128)) CPU Pentium (1BGHz) 1



スパコンに通じる並列プログラミングの基礎

1 Table 1: Identification by color of voxel Voxel Mode of expression Nothing Other 1 Orange 2 Blue 3 Yellow 4 SSL Humanoid SSL-Vision 3 3 [, 21] 8 325

Express5800/140Hb (2002/01/22)

indd

Express5800/120Rb-2

HP ProLiant 500シリーズ

CPU Levels in the memory hierarchy Level 1 Level 2... Increasing distance from the CPU in access time Level n Size of the memory at each level 1: 2.2

IPSJ SIG Technical Report Vol.2013-ARC-203 No /2/1 SMYLE OpenCL (NEDO) IT FPGA SMYLEref SMYLE OpenCL SMYLE OpenCL FPGA 1

Express5800/120Rc-2 Workgroup/Department 1. Express5800/120Rc-2 N N N Express5800/120Rc-2 Express5800/120Rc-2 Express5800/120R

untitled

Express5800/120Rb-1 (2002/01/22)

AMD/ATI Radeon HD 5870 GPU DEGIMA LINPACK HD 5870 GPU DEGIMA LINPACK GFlops/Watt GFlops/Watt Abstract GPU Computing has lately attracted

GM-01A_usermanual

RaVioli SIMD

Express5800/120Le

HDL Designer Series SupportNet GUI HDL Designer Series HDL Desi

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

Design at a higher level

Slides: TimeGraph: GPU Scheduling for Real-Time Multi-Tasking Environments

,4) 1 P% P%P=2.5 5%!%! (1) = (2) l l Figure 1 A compilation flow of the proposing sampling based architecture simulation

インテル(R) Visual Fortran Composer XE

Express5800/120Ra-1

17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System

2006

1

全集’.PDF

完成卒論.PDF

(Making the electronic circuit with use of micro-processor)

HP Workstation 総合カタログ

Source: Intel.Config: Pentium III Processor-Intel Seattle SE440BX-2, 128MB PC100 CL2 SDRAM Intel 440BX-2 Chipset Platform- Diamond Viper 550 /

スパコンに通じる並列プログラミングの基礎


P2P P2P peer peer P2P peer P2P peer P2P i

price, style. Office. VAJ/DG5TFTSXGA+ Pentium III VA0J/DX.TFTXGA Pentium III VAJ/DF5TFTXGA Pentium III VA0H/DF5TFTXGA Celeron VA0J/DF5TFTXGA Pentium I

WebGL OpenGL GLSL Kageyama (Kobe Univ.) Visualization / 57

Nios II ハードウェア・チュートリアル


untitled

アセンブラ入門(CASL II) 第3版

卒業論文2.dvi

HP Workstation Xeon 5600

DRAM SRAM SDRAM (Synchronous DRAM) DDR SDRAM (Double Data Rate SDRAM) DRAM 4 C Wikipedia 1.8 SRAM DRAM DRAM SRAM DRAM SRAM (256M 1G bit) (32 64M bit)

スライド タイトルなし

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2013-HPC-139 No /5/29 Gfarm/Pwrake NICT NICT 10TB 100TB CPU I/O HPC I/O NICT Gf

12 DCT A Data-Driven Implementation of Shape Adaptive DCT

(SAD) x86 MPSADBW H.264/AVC H.264/AVC SAD SAD x86 SAD MPSADBW SAD 3x3 3 9 SAD SAD SAD x86 MPSADBW SAD 9 SAD SAD 4.6

64bit SSE2 SSE2 FPU Visual C++ 64bit Inline Assembler 4 FPU SSE2 4.1 FPU Control Word FPU 16bit R R R IC RC(2) PC(2) R R PM UM OM ZM DM IM R: reserved

IPSJ SIG Technical Report iphone iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Proc

IEEE802.11n LAN WiMAX(Mobile Worldwide Interoperability for Microwave Access) LTE(Long Term Evolution) IEEE LAN Bluetooth IEEE LAN


Vol. 48 No. 4 Apr LAN TCP/IP LAN TCP/IP 1 PC TCP/IP 1 PC User-mode Linux 12 Development of a System to Visualize Computer Network Behavior for L

Express5800/140Ma

GPU.....

富士通セミコンダクター株式会社発表資料

Transcription:

26102

(1/2) LSISoC: (1) (*) (*) GPU SIMD MIMD FPGA DES, AES

(2/2) (2) FPGA(8bit) (ISS: Instruction Set Simulator) (3) (4) LSI

ECU110100ECU1 ECU ECU ECU ECU FPGA ECU main() { int i, j, k for { } 1

GP-GPU A Parallel Logic Simulation Method using GP-GPU ~ ~ GPU GPU main(){ MemoryCopy(HostToDevice) kernel<<<blocks,threads>>> MemoryCopy(DeviceToHost) } GPU CPUHost 1 2 3 2 3 4 5 GPUSM SM STEP1 6 7 L M N STEP2 1 2 3 CG1 CG2 CG3 3 GPUDevice SM SM 2 nsm 1 1 2 1 2 3 1. GPU 2 STEP3 1 3 2. L M N SM1 SM2 SM3 GPU ModelSim GPU Geforce GTX480 PC Intel Core i7-950 3.07GHz 100,000 - SEQSim - GPUSim - ModelSim SE 6.2e 3. SEQSim GPUSim5.1 GPUSim-M 4.3 32bit processor GPU* *Geforce TITAN2688 1. cpu x 1 2148 56 cpu x 20 42599 56 cpu x 40 85179 56 5.1 4.3 4. sim

Research of Acceleration Method for Logic Simulation based on Parallel Algorithm () C SpecC Basic block JAXA - Elegant / Visual Spec 3 5 4 2 6 1 SpecC 4 ( ) () (1) ()(2) 1. 2. ModelSim - :Elegant/Visual Spec(ver4.1.6) - :ARM946E-S(200MHz) * - ModelSim SE 6.2e() - PCIntel Core i7-950 3.07GHz 10,000 1. 10.4 4.6 3. sim vs sim(8) sim 8sim cpu x810.4 sim(8) vs sim sim cpu x84.6 (3210) (2) ANDOR

FPGA A Logic Simulation Method using FPGA ~ ~ FPGA FPGA ( ) 1. SIM Onchip SRAM Offchip RAM FPGA ModelSim FPGA50MHz 10,000 ModelSim SE 6.2e( PC Intel Core i7-950 3.07GHz 2. 1. 3. 256 (FPGA_SIM256) (FPGA_SIM1) cpu x 16114 2563 (FPGA_SIM256) cpu x 164.9 BRAMFPGA offchipram 2~3 23 1 LSI10

FPGA A Proposal of FPGA Microcontroller 88 8 8 FPGA FPGA FPGA PC GUI FPGA PCFPGA FPGA110 FPGA2 2. PCFPGA PCFPGA2 FPGAPC FPGA 1. FPGA 8 : RS232C : LSI FPGA control-line : RAM(ISS) ISSFPGA Debug : PC 8 RAM/ MEM : debug_out : PC LED_out: 7SEGLED 3. GUI FPGA40MHz 5760!! FPGA 57601!! 1.ISS 2.FPGA 3.ISSFPGA

Compact Logic Optimization Method for Partial Logic Circuits GPU 1. ()

FPGA High Speed Encryption Unit based on FPGA for Mobile Terminal ~ ~ AES FPGA AES 128bit 41 PC RS232C 通信モジュール 1 PC BRAM FPGA AES 暗号化ユニット RS232C FPGA AES BRAM CLK 128bit AES Key data In data mc0 mc1 mc2 mc3 Add RoundKey Key Expansion mc00 mc01 mc02 mc03 mc10 mc11 mc12 mc13 mc20 mc21 mc22 mc23 mc30 mc31 mc32 mc33 Sub Bytes 2.AES GF2 GF3 GF1 GF1 GF1 GF2 GF3 GF1 GF1 GF1 GF2 GF3 GF3 GF1 GF1 GF2 3.MixColumns 2.4MB SpecC ARM9200MH 11.2sec ModelSim 100MHAES 0.19sec FPGA XOR XOR XOR XOR 4MixColumns GF16 9 10 11 Shift Rows Mix Columns 128bit 59 out

Proposal of Alert System using Medical Data Analysis DWH DWH ( ) : 1. DWH Y 1x1 2x2 nx n, n Y: () xn: 01 1 Y 1 e 1 ( 1 x 1 2 x 2 n x n ), n Y: xn: IF-THEN YES A 1 NO B NO 2 2. YES C 1 X1 X2 B2 X1 2 X2 B2A B1 S1 A B1 S2 1,2 : A, B, C : XML A X1 B1 = B1X1 X2 B2 = B2X2 A B1X1 B2X2 Y 1x1 2x2 S1 3. DWH Y n x n

GP-GPU Evaluation of parallel logic simulation performance using GP-GPU GP-GPU GPU 8 Graphics Processing Unit GPU GTX 480 GP-GPUGeneral Purpose GPU GPU GPU SM /SM SM GTX780, GPU 100,000 ModelSim SE 6.2e( PC Intel Core i7-950 3.07GHz 2. Adder4 x 640GPU 10 Ldpc_enGT540MQuadro600 7GTX480,780 25 80,000 GPU GP-GPU ModelSim GTX780 GTX480

262014 252013 242012 232011 222010 212009 202008 NTT NTT NEC