先進的計算基盤システムシンポジウム SACSIS2012 Symposium on Advanced Computing Systems and Infrastructures SACSIS /5/18 CPU, CPU., Memory-bound CPU,., Memory-bo

Similar documents
,4) 1 P% P%P=2.5 5%!%! (1) = (2) l l Figure 1 A compilation flow of the proposing sampling based architecture simulation

IPSJ SIG Technical Report Vol.2013-ARC-206 No /8/1 Android Dominic Hillenbrand ODROID-X2 GPIO Android OSCAR WFI 500[us] GPIO GP

IPSJ SIG Technical Report Vol.2017-ARC-225 No.12 Vol.2017-SLDM-179 No.12 Vol.2017-EMB-44 No /3/9 1 1 RTOS DefensiveZone DefensiveZone MPU RTOS

PC Development of Distributed PC Grid System,,,, Junji Umemoto, Hiroyuki Ebara, Katsumi Onishi, Hiroaki Morikawa, and Bunryu U PC WAN PC PC WAN PC 1 P

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

16.16%

2). 3) 4) 1.2 NICTNICT DCRA Dihedral Corner Reflector micro-arraysdcra DCRA DCRA DCRA 3D DCRA PC USB PC PC ON / OFF Velleman K8055 K8055 K8055

P2P P2P peer peer P2P peer P2P peer P2P i

23 Fig. 2: hwmodulev2 3. Reconfigurable HPC 3.1 hw/sw hw/sw hw/sw FPGA PC FPGA PC FPGA HPC FPGA FPGA hw/sw hw/sw hw- Module FPGA hwmodule hw/sw FPGA h

2. CABAC CABAC CABAC 1 1 CABAC Figure 1 Overview of CABAC 2 DCT 2 0/ /1 CABAC [3] 3. 2 値化部 コンテキスト計算部 2 値算術符号化部 CABAC CABAC

mobicom.dvi

B

FabHetero FabHetero FabHetero FabCache FabCache SPEC2000INT IPC FabCache 0.076%

第 55 回自動制御連合講演会 2012 年 11 月 17 日,18 日京都大学 1K403 ( ) Interpolation for the Gas Source Detection using the Parameter Estimation in a Sensor Network S. T

1

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2013-HPC-139 No /5/29 Gfarm/Pwrake NICT NICT 10TB 100TB CPU I/O HPC I/O NICT Gf

CPU Levels in the memory hierarchy Level 1 Level 2... Increasing distance from the CPU in access time Level n Size of the memory at each level 1: 2.2

GPGPU

4) 5) ) ( 1 ) ( 2 ) ( 3 ) ( 4 ) ( 5 ) ( 6 ) )8) ( 1 ) ( 2 ) ( 3 ) ( 200 9) ( 10) 1 2 (

Core1 FabScalar VerilogHDL Cache Cache FabScalar 1 CoreConnect[2] Wishbone[3] AMBA[4] AMBA 1 AMBA ARM L2 AMBA2.0 AMBA2.0 FabScalar AHB APB AHB AMBA2.0

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

1 2

26 FPGA FPGA (Field Programmable Gate Array) ASIC (Application Specific Integrated Circuit) FPGA FPGA FPGA FPGA Linux FreeDOS skewed way L1

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s

EQUIVALENT TRANSFORMATION TECHNIQUE FOR ISLANDING DETECTION METHODS OF SYNCHRONOUS GENERATOR -REACTIVE POWER PERTURBATION METHODS USING AVR OR SVC- Ju

1, 4,a) 1, 4 1, 4 1, , 4 3, 4 HPC HPC HPC Slurm 1. HPC Tianhe MW MW [1] MW CREST a)

VXPRO R1400® ご提案資料

1 OpenCL OpenCL 1 OpenCL GPU ( ) 1 OpenCL Compute Units Elements OpenCL OpenCL SPMD (Single-Program, Multiple-Data) SPMD OpenCL work-item work-group N

卒業論文

1重谷.PDF

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe


IPSJ SIG Technical Report Vol.2013-ARC-203 No /2/1 SMYLE OpenCL (NEDO) IT FPGA SMYLEref SMYLE OpenCL SMYLE OpenCL FPGA 1

1 3DCG [2] 3DCG CG 3DCG [3] 3DCG 3 3 API 2 3DCG 3 (1) Saito [4] (a) 1920x1080 (b) 1280x720 (c) 640x360 (d) 320x G-Buffer Decaudin[5] G-Buffer D

[2] 2. [3 5] 3D [6 8] Morishima [9] N n 24 24FPS k k = 1, 2,..., N i i = 1, 2,..., n Algorithm 1 N io user-specified number of inbetween omis

6 2. AUTOSAR 2.1 AUTOSAR AUTOSAR ECU OSEK/VDX 3) OSEK/VDX OS AUTOSAR AUTOSAR ECU AUTOSAR 1 AUTOSAR BSW (Basic Software) (Runtime Environment) Applicat

AMD/ATI Radeon HD 5870 GPU DEGIMA LINPACK HD 5870 GPU DEGIMA LINPACK GFlops/Watt GFlops/Watt Abstract GPU Computing has lately attracted

MAC root Linux 1 OS Linux 2.6 Linux Security Modules LSM [1] Security-Enhanced Linux SELinux [2] AppArmor[3] OS OS OS LSM LSM Performance Monitor LSMP

1_26.dvi

Bulletin of JSSAC(2014) Vol. 20, No. 2, pp (Received 2013/11/27 Revised 2014/3/27 Accepted 2014/5/26) It is known that some of number puzzles ca

揃 Lag [hour] Lag [day] 35

1 2

17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System


第62巻 第1号 平成24年4月/石こうを用いた木材ペレット

3_39.dvi

Vol. 23 No. 4 Oct Kitchen of the Future 1 Kitchen of the Future 1 1 Kitchen of the Future LCD [7], [8] (Kitchen of the Future ) WWW [7], [3

ディスプレイと携帯端末間の通信を実現する映像媒介通信技術

日立評論2007年3月号 : ソフトウェア開発への

DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

( 1) 3. Hilliges 1 Fig. 1 Overview image of the system 3) PhotoTOC 5) 1993 DigitalDesk 7) DigitalDesk Koike 2) Microsoft J.Kim 4). 2 c 2010

IPSJ SIG Technical Report Vol.2015-ARC-215 No.7 Vol.2015-OS-133 No /5/26 Just-In-Time PG 1,a) 1, Just-In-Time VM Geyser Dalvik VM Caffei

2. Eades 1) Kamada-Kawai 7) Fruchterman 2) 6) ACE 8) HDE 9) Kruskal MDS 13) 11) Kruskal AGI Active Graph Interface 3) Kruskal 5) Kruskal 4) 3. Kruskal

IPSJ SIG Technical Report Vol.2012-HCI-149 No /7/20 1 1,2 1 (HMD: Head Mounted Display) HMD HMD,,,, An Information Presentation Method for Weara

Input image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L

組込みシステムシンポジウム2011 Embedded Systems Symposium 2011 ESS /10/20 FPGA Android Android Java FPGA Java FPGA Dalvik VM Intel Atom FPGA PCI Express DM

A Study on Throw Simulation for Baseball Pitching Machine with Rollers and Its Optimization Shinobu SAKAI*5, Yuichiro KITAGAWA, Ryo KANAI and Juhachi

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

HP cafe HP of A A B of C C Map on N th Floor coupon A cafe coupon B Poster A Poster A Poster B Poster B Case 1 Show HP of each company on a user scree

1 1 CodeDrummer CodeMusician CodeDrummer Fig. 1 Overview of proposal system c

36 581/2 2012

IPSJ SIG Technical Report Vol.2010-SLDM-144 No.50 Vol.2010-EMB-16 No.50 Vol.2010-MBL-53 No.50 Vol.2010-UBI-25 No /3/27 Twitter IME Twitte

1 Table 1: Identification by color of voxel Voxel Mode of expression Nothing Other 1 Orange 2 Blue 3 Yellow 4 SSL Humanoid SSL-Vision 3 3 [, 21] 8 325

& Vol.2 No (Mar. 2012) 1,a) , Bluetooth A Health Management Service by Cell Phones and Its Us

Ver. 3.8 Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI,

B HNS 7)8) HNS ( ( ) 7)8) (SOA) HNS HNS 4) HNS ( ) ( ) 1 TV power, channel, volume power true( ON) false( OFF) boolean channel volume int

IPSJ SIG Technical Report Vol.2014-HCI-158 No /5/22 1,a) 2 2 3,b) Development of visualization technique expressing rainfall changing conditions

2) TA Hercules CAA 5 [6], [7] CAA BOSS [8] 2. C II C. ( 1 ) C. ( 2 ). ( 3 ) 100. ( 4 ) () HTML NFS Hercules ( )

fiš„v8.dvi

09中西

IPSJ SIG Technical Report iphone iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Proc

MPC MPC R p N p Z p p N (m, σ 2 ) m σ 2 floor( ), rem(v 1 v 2 ) v 1 v 2 r p e u[k] x[k] Σ x[k] Σ 2 L 0 Σ x[k + 1] = x[k] + u[k floor(l/h)] d[k]. Σ k x

untitled

TA3-4 31st Fuzzy System Symposium (Chofu, September 2-4, 2015) Interactive Recommendation System LeonardoKen Orihara, 1 Tomonori Hashiyama, 1

<95DB8C9288E397C389C88A E696E6462>

untitled


IPSJ SIG Technical Report Vol.2014-EIP-63 No /2/21 1,a) Wi-Fi Probe Request MAC MAC Probe Request MAC A dynamic ads control based on tra

The 18th Game Programming Workshop ,a) 1,b) 1,c) 2,d) 1,e) 1,f) Adapting One-Player Mahjong Players to Four-Player Mahjong

Second-semi.PDF

Chip Size and Performance Evaluations of Shared Cache for On-chip Multiprocessor Takahiro SASAKI, Tomohiro INOUE, Nobuhiko OMORI, Tetsuo HIRONAKA, Han

Iteration 0 Iteration 1 1 Iteration 2 Iteration 3 N N N! N 1 MOPT(Merge Optimization) 3) MOPT MOP

Vol. 42 No. SIG 8(TOD 10) July HTML 100 Development of Authoring and Delivery System for Synchronized Contents and Experiment on High Spe

Microsoft Word - deim2011_new-ichinose doc

Ver Ver NOTE E v3 2.4GHz, 20M cache, 8.00GT/s QPI,, HT, 8C/16T 85W E v3 1.6GHz, 15M cache, 6.40GT/s QPI

LAN LAN LAN LAN LAN LAN,, i

TF-IDF TDF-IDF TDF-IDF Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Sat

PowerEdge R730xd Contents RAID /RAID & P3-6 PCIe P P P P OS P P P P7 P8 P9 P10-11 P12-17 P P112

大学における原価計算教育の現状と課題

Shonan Institute of Technology MEMOIRS OF SHONAN INSTITUTE OF TECHNOLOGY Vol. 41, No. 1, 2007 Ships1 * ** ** ** Development of a Small-Mid Range Paral

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St

3_23.dvi

The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). The material has been made available on the website

Express5800/320Fa-L/320Fa-LR

04.™ƒ”R/’Ô”�/’Xfl©

IPSJ SIG Technical Report Vol.2014-DBS-159 No.6 Vol.2014-IFAT-115 No /8/1 1,a) 1 1 1,, 1. ([1]) ([2], [3]) A B 1 ([4]) 1 Graduate School of Info

1. HNS [1] HNS HNS HNS [2] HNS [3] [4] [5] HNS 16ch SNR [6] 1 16ch 1 3 SNR [4] [5] 2. 2 HNS API HNS CS27-HNS [1] (SOA) [7] API Web 2

6_27.dvi


Transcription:

CPU, CPU, Memory-bound CPU,, Memory-bound ( ) Performance Monitoring Counter(PMC), PMC (nmi watchdog), PMC CPU., PMC, CPU, Memory-bound, CPU-bound,, CPU,, PMC,,,, CPU, NPB 8, 5% CPU, CPU, 3%, 5% CPU, IS 5%, 15% CPU, Last Level Cache Miss PMC, Evaluation of Dynamic Voltage and Frequency Scaling Adaptation based on Memory Power Masahiro Miwa, Kohta Nakashima, Akira Hirai, Satoshi Kazama, Yasushi Hara and Akira Naruse This paper presents a novel approach of DVFS adaptation based on memory power. We find that memory power strongly correlates with CPU Frequency Dependency (the degree of application performance affected by CPU frequency). CPU-bound application consumes low memory power and Memory-bound application consumes high memory power. We make a CPU frequency scaling by memory power. So, our proposal method does not need to use the Performance Monitoring Counter that most of prior works use in order to decide CPU frequency scaling. Our method uses data acquired by sensor and could be implemented on BMC(Baseboard Management Controller), and in that case, no overhead is needed in target host for CPU frequency control. Experiment results using NPB show that under 3% performance loss(average), we could save 5% CPU energy(average) and especially for IS benchmark, 15% energy saving was achieved under 5% performance loss. 1., PC, PC ( ) Fujitsu Laboratories Ltd.,, DVFS(Dynamic Voltage and Frequency Scaling) CPU CPU,, CPU, CPU [10]. DVFS, Linux Ondemand Governor [8]. Ondemand Governor, CPU 357

CPU, CPU CPU, CPU CPU CPU CPU,,, CPU, CPU,, CPU-bound Memory-bound ( ), Memory-bound CPU Memory-bound, Memory-bound CPU [1 6],, PMC(Performance MonitoringCounter) [1 5]., PMC (nmi watchdog) Vtune, CPU PMC, PMC, CPU, CPU, CPU Memory-bound,, CPU-bound,, Memory-bound, CPU,,, PMC,, CPU, CPU CPU, 1 SPEC CPU2006, CPU -0.93,, NPB 8, 5% CPU, CPU, 3%, 5% CPU, IS 5%, 15% CPU, Last Level Cache Miss PMC, 2, 3, 4, CPU 5, 6, 2. CPU,, 2.1 CPU PC, CPU idle, CPU,. 1 CPU, 1 idle, CPU 100% idle, idle CPU, CPU 358

2!"# $!"# CPU 2.93GHz 0.3417 33.36 85.67 1.60GHz 0.6265 61.19 94.98 (%) 83.35 83.42 10.87 1 (ns) CPUbound(sec) Memorybound(sec) 2 CPU CPUbound Memorybound CPU (%) 100 13 CPU, (CPU 100%), CPU CPU, CPU,., CPU PC, CPU, CPU 2.2 CPU 2.2.1 CPU 3 CPU Intel Xeon X5570 @2.93 GHz Memory DDR3 1333MHz 12GB(4GBx3) OS CentOS 5.7 CPU 2 CPU-bound, Memory-bound CPU-bound,,, Memory-bound,, Memory-bound, CPU,, Memory-bound,, CPU 2, CPUbound Memory-bound CPU 3, CPU-bound, CPU, Memory-bound, CPU CPU-bound, Memorybound, Memory-Bound,, CPU. CPU, CPU, (1), CPU, CPU, (1), 2, CPU-bound CPU 100%, Memory-bound CPU 13% ( 1, 2). (CPU ) = ( ) ( ) (1) 2.2.2 CPU CPU CPU, Memory-bound, CPU, CPU, CPU-bound CPU, CPU, CPU,,, CPU, CPU,, CPU [1] f max, CPU (T fmax cpu ) CPU (T fmaxmem ) 359

((2) ). T fmax = T fmaxcpu + T fmaxmem (2) CPU f max f (3) T f = f max T fmax cpu f + T f max (3) mem, CPU f max f, (5) T f T fmax = f max T fmax cpu f T f max (4) cpu = ( fmax 1)T fmax (5) cpu f CPU T fmax, CPU cpu f max T fmax CPU (d), (6) T f T fmax = ( f max 1)T fmax d (6) f, (P F Loss), (7) P F Loss = T f T fmax T fmax (7) (6), (7), CPU (f) (8) f max f = (8) P F Loss /d + 1, P F Loss, CPU f, CPU f max CPU 2.3 2 CPU, CPU,, PMC [1 5], PMC Last Level Cache(LLC) Miss [2, 3, 5]. LLC Miss,, LLC Miss Memory-bound, CPU-bound, PMC PMC PMC (nmi watchdog), PMC CPU CPU, PMC, PMC,, CPU CPU 3. CPU CPU Memorybound,,, CPU-bound,,, Memory-bound CPU-bound, CPU 3, SPEC CPU2006 [11] SPEC CPU2006, (INT), (FP), Intel Compiler 12, SPEC CPU2006, train, 1 4 CPU 3, CPU CPU, -0.93,, CPU,, CPU, CPU CPU, ( 3 )., CPU SPEC CPU2006, CPU, PMC PMC (nmi watchdog) 360

0., + *& -) /,'(% & $!"## 632, +- 1*+ 40 1. 52,, - 1*+) /!"#$%$&'(% 3 (p) CPU (d) 4 LLC Miss (c) CPU (d),, BMC(Baseboard Management Controller) CPU,, CPU 4., CPU CPU,, NPB 4.1 4.1.1 CPU 3, CPU, CPU (8), CPU 4.1.2 CPU CPU, 1 100, CPU, CPU, 133MHz 1.60 2.93GHz 11, CPU, (8) CPU f, CPU 4.2 4.2.1 5(%), CPU, NAS Parallel Benchmark (NPB)(version 3.2, OpenMP ) [12]. NPB 8, C, 4, Intel Compiler 12 4.2.2 PMC PMC, LLC Miss [2, 3, 5]., Nehalem Uncore PMC, UNC L3 MISS.ANY(EventNum 09H, Umask Value: 03H) [9]., PMC CPU,, SPEC CPU2006 train 4 LLC Miss, CPU LLC Miss, CPU LLC Miss CPU -0.95 LLC Miss, ( 4 ), CPU,, CPU 4.3 3 3,,, 3 DIMM VTT. 9 361

!" #$ %&'(" %& $" &)!" #$ %&'(" %& $" &) 5 CPU (1sec) 6 CPU (100msec)! #$! %&! '(! " " )*+,&! )*! " (&! *-! "!" #$ %& '( )$ '( &$ (* 7 PMC CPU (1sec) 8 PMC CPU (100msec), NI USB-6210(National Instrument ), nidaqmxbase-3.4.5 10msec,,, CPU 9 4.4 (2.93GHz) CPU 5, 6, 7, 8, CPU CPU, ( ),, CPU, PMC CPU 5% PMC, CPU, CPU 1, 100 CPU, 5%CPU, IS 5%, 15% CPU LLC Miss PMC CPU, CPU, PMC CPU, IS, CPU, 362

4 ( CPU ) memory 1sec memory 100msec (%) (%) ( - ) (%) (%) ( - ) bt.c 3.8 3.5 0.3 4.4 5.1-0.6 cg.c 3.3 1.6 1.7 3.3 1.4 1.9 ep.c 4.8 4.9-0.1 4.7 4.9-0.2 is.c 2.0 4.9-2.9 1.9 4.8-2.9 lu-hp.c 4.3 1.6 2.7 3.8 3.0 0.7 lu.c 3.9 1.4 2.5 4.0 2.4 1.6 sp.c 4.1 3.1 0.9 3.9 5.4-1.5 ua.c 2.7 2.5 0.2 3.7 3.1 0.6 5 (PMC CPU ) pmc 1sec pmc 100msec (%) (%) ( - ) (%) (%) ( - ) bt.c 3.8 3.5 0.3 4.3 4.2 0.1 cg.c 4.8 1.4 3.4 4.9 1.2 3.7 ep.c 4.7 4.9-0.2 4.5 6.7-2.2 is.c 4.7 1.6 3.2 4.6 1.6 3.0 lu-hp.c 2.5 1.8 0.8 4.1 2.8 1.3 lu.c 4.1 1.7 2.4 4.3 1.5 2.8 sp.c 3.7 2.7 1.0 4.0 3.3 0.6 ua.c 2.8 2.7 0.0 4.1 4.1 0.0 4.5 5%,, 2 ( 1 ) CPU CPU, (133MHz), (8) CPU, CPU, (8) CPU CPU, CPU, CPU, ( 2 ) CPU 3 4,. CPU (CPU-bound), CPU, CPU (Memory-bound),,, CPU,, CPU ( ) ( ), 4, 5,, CPU, PMC CPU,,,, 5. DVFS Ondemand Governor [8], CPU CPU CPU,,, CPU, CPU CPU, [2], PMC LLC Miss, CPU, PMC, [1, 4]., PMC (nmi watchdog), Vtune, PMC CPU 363

,,, CPU, [6].,,,,, [3] PMC, MSHR (Miss Status Holding Register), MSHR,,,,, CPU [7],, PMC CPU, CPU, CPU,,, 6., CPU, CPU,, CPU, CPU CPU, CPU PMC PMC,, NPB 8, 5% CPU, CPU, 3%, 5% CPU, IS 5%, 15% CPU, Last Level Cache Miss PMC,, 1) S. Huang and W. Feng, Energy-Efficient Cluster Computing via Accurate Workload Characterization, in Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, ser. CCGRID 09. Washington, DC, USA: IEEE Computer Society, 2009, pp. 68-75. 2) R. Schone, D. Hackenberg: On-line analysis of hardware performance events for workload characterization and processor frequency scaling decisions. In Proceeding of the second joint WOSP/SIPEW international conference on Performance engineering, ICPE 11, pages 481-486. ACM, 2011. 3),,,, Vol. 45, No. SIG 6(ACS 6), pp.1-11, 2004 5. 4) R. Kotla, S. Ghiasi, T. Keller and F. Rawson, Scheduling Processor Voltage and Frequency in Server and Cluster Systems, Proceedings of the 19th IEEE International Parallel and Distributed Symposium, April 2005, pp 234-241. 5) Hrishikesh Amur, Karsten Schwan, Milos Prvulovic, Towards Optimal Power Management: Estimation of Performance Degradation due to DVFS on Modern Processors, Tech. Report GIT-CERCS-10-02. 6) P. Stanley-Marbell, M. Hsiao and U.Kremer, A Hardware Architecture for Dynamic Performance and Energy Adaptation, Power-Aware Computer Systems, Lecture Notes in Computer Science 2325, Springer Verlag, 2002. 7) Vol47 No.Sig18 (ACS16) pp.80 91 2006 11. 8) V. Pallipadi, A. Starikovskiy, The Ondemand Governor: Past, Present, and Future, The Linux Symposium, 2006. 9) Intel, Intel 64 and IA-32 Architectures Software Developer s Manual Volume 3B: System Programming Guide, Part2, 2010. 10) Intel White Paper, Enhanced Intel Speed- 364

Step Technology for the Intel Pentium M Processor, 2004. 11) SPEC CPU2006, http://www.spec.org/cpu2006/. 12) Nas Parallel Benchmark, http://www.nas.nasa.gov/. 365