Performance Counters for Intel(R) Xeon(R) Processors and Intel(R) Pentium(R) 4 Processors

Transcription:

Core Intel Intel Intel Itanium Pentium Xeon Intel Corporation 2007Intel Corporation.

VTune 9.0 VTune 1 Randy Camp R&D MUSICMATCH Inc. Core 2 Duo Core 2 Quad Windows: Microsoft Vista* Linux: hotspot Windows* Linux* Mac* IA32 Intel64 IA64 2 2007 Intel Corporation. Intel Intel Intel Corporation

Core

Core

VTune : = * (SAV)

[Figure: Core block diagram. CPU-0 and CPU-1 each have L1D = 32 KB, L1I = 32 KB, an L0/L1 DTLB, and a PMH; a 2/4 MB L2 connects to the FSB.]

4 L1 Core Duo DCU DCU Data Cache Unit IP IP L2 Pentium 4

P P
CPU_CLK_UNHALTED
BUS_DRDY_CLOCKS.ALL_AGENTS  BUSY
INST_RETIRED_ANY_P  P
BUS_DRDY_CLOCKS.THIS_AGENT  BUSY
INST_RETIRED_LOADS
MEM_LOAD_RETIRED.L2_LINE_MISS  P L2
INST_RETIRED_STORES
MMX2_PRE_MISS.T1  LSW
BUS_TRANS_ANY
MMX2_PRE_MISS.T2  L2SW
BUS_TRANS_MEM
MMX2_PRE_MISS.STORES
BUS_TRANS_BURST
L2_LINES_IN.SELF.DEMAND  SW L2
BUS_TRANS_BRD
L2_LINES_IN.SELF.PREFETCH  HW L2
BUS_TRANS_WB  NT
L2_LINES_OUT.SELF.DEMAND  L2
BUS_TRANS_RFO  RFOHW
L2_LINES_OUT.SELF.PREFETCH  HW L2

[...] = 64 * Bus_Trans_Mem * [...] / Cpu_Clk_Unhalted
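The formula at the end of this slide can be sketched in code. The factor between `*` and `/` did not survive in the transcript; the sketch below assumes it is the core clock frequency in Hz and that 64 is the cache-line size in bytes, which would make the result a bandwidth in bytes per second. The function name and its parameters are hypothetical.

```python
def bus_bandwidth_bytes_per_sec(bus_trans_mem, cpu_clk_unhalted, core_hz):
    """Estimate 64 * BUS_TRANS_MEM * core_hz / CPU_CLK_UNHALTED.

    core_hz is an ASSUMPTION: the slide's factor at this position is
    missing from the transcript; clock frequency is a plausible reading.
    """
    if cpu_clk_unhalted <= 0:
        raise ValueError("CPU_CLK_UNHALTED must be positive")
    # 64 bytes per memory transaction, scaled from "per cycle" to "per second".
    return 64 * bus_trans_mem * core_hz / cpu_clk_unhalted
```

With made-up counts, `bus_bandwidth_bytes_per_sec(1_000_000, 2_000_000_000, 2_000_000_000)` evaluates to 64,000,000 bytes per second.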

X86 OOO

2

IP OOO 4 L2 BW DTLB FE CPU

X86
CPU_CLK_UNHALTED = [...] + NON_RET_DISPATCH + RET_DISPATCH
PGO ILP RESOURCE_STALLS.BR_MISS_CLEAR

[Figure: pipeline block diagram (UOP flow). Labels: L2, 32KB, IP, 32KB, 4, (BTB), (RAT), RS 32, MOB, SIMD, FP Add, FP DIV/MUL, ROB 96, IA. Event shown: RESOURCE_STALLS.]

[Figure: the same pipeline block diagram. Event shown: RS_UOPS_DISPATCHED.]

[Figure: the same pipeline block diagram. Event shown: UOPS_RETIRED.]

OOO RS RS_UOPS_DISPATCHED CPU RS PMU*
* Performance Monitoring Unit (PMU)

[Figure: pipeline block diagram. Labels: L2, 32KB, IP, 32KB, 4, (BTB), (RAT), RS 32, MOB, SIMD, FP Add, FP DIV/MUL, ROB 96, IA. Event shown: RS_UOPS_DISPATCHED.]

VTune

PMU
Event select fields: CMASK, INV (GE / LT), EN, INT (APIC), PC, E, OS (ring 0), USR (ring 3), umask, Event #
RS_UOPS_DISPATCHED with CMASK = 1 and INV = 1 counts cycles with UOPS == 0, that is, RS_UOPS_DISPATCHED.CYCLES_NONE
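The fields on this slide can be packed into a single event select value. The sketch below is a minimal illustration that assumes the architectural IA32_PERFEVTSELx bit positions as documented in the Intel SDM (event select in bits 0-7, umask 8-15, USR 16, OS 17, E 18, PC 19, INT 20, EN 22, INV 23, CMASK 24-31). The function name is hypothetical, and the event code 0xA0 used for RS_UOPS_DISPATCHED in the usage note is an assumption to be checked against the event table for the target processor.

```python
def encode_perfevtsel(event, umask=0, usr=True, os_mode=True, edge=False,
                      pc=False, int_en=False, en=True, inv=False, cmask=0):
    """Pack event select fields into one integer (assumed SDM layout)."""
    for name, value in (("event", event), ("umask", umask), ("cmask", cmask)):
        if not 0 <= value <= 0xFF:
            raise ValueError(f"{name} must fit in 8 bits")
    value = event | (umask << 8)
    value |= int(usr) << 16       # USR: count at ring 3
    value |= int(os_mode) << 17   # OS: count at ring 0
    value |= int(edge) << 18      # E: edge detect
    value |= int(pc) << 19        # PC: pin control
    value |= int(int_en) << 20    # INT: APIC interrupt on overflow
    value |= int(en) << 22        # EN: enable the counter
    value |= int(inv) << 23       # INV: count when value < CMASK instead of >=
    value |= cmask << 24          # CMASK: per-cycle threshold
    return value
```

Under those assumptions, `hex(encode_perfevtsel(0xA0, cmask=1, inv=True))` gives `0x1c300a0`, the encoding that would count cycles in which no uop is dispatched.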

RS_UOPS_DISPATCHED:cmask=1 ~ CPU_CLK_UNHALTED
RS_UOPS_DISPATCHED:cmask=1:inv=1 CPU_CLK_UNHALTED
OOO >99% CPU PL HALT

UOP /
RS_UOPS_DISPATCHED:cmask=1:inv=1
RS_UOPS_DISPATCHED:cmask=2:inv=1
RS_UOPS_DISPATCHED:cmask=3:inv=1
RS_UOPS_DISPATCHED:cmask=4:inv=1
RS_UOPS_DISPATCHED:cmask=5:inv=1
RS_UOPS_DISPATCHED:cmask=6:inv=1
RS_UOPS_DISPATCHED:cmask=7:inv=1
N-1 uops dispatched per cycle
a[i] = exp(x[i]);
[Figure: chart with one series (Series1); x axis 0 to 8, y axis 0 to 18,000,000,000.]
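The cmask/inv series on this slide can be turned into a per-cycle histogram. With inv = 1, a counter programmed with cmask = N counts cycles in which fewer than N uops were dispatched, so the number of cycles with exactly N-1 uops is the difference between neighbouring counts. The sketch below assumes that reading; the function name and the sample counts are hypothetical.

```python
def dispatch_histogram(cycles_below):
    """Convert cumulative counts into a histogram of uops per cycle.

    cycles_below[i] is the count for cmask=i+1, inv=1, i.e. the number
    of cycles with fewer than i+1 uops dispatched.
    Returns hist where hist[k] is the number of cycles with exactly k uops.
    """
    hist = []
    previous = 0
    for cmask, count in enumerate(cycles_below, start=1):
        if count < previous:
            # Counts from separate runs can be noisy; flag it, do not hide it.
            raise ValueError(f"count for cmask={cmask} is not monotonic")
        hist.append(count - previous)
        previous = count
    return hist
```

For example, `dispatch_histogram([10, 30, 60, 100])` returns `[10, 20, 30, 40]`: 10 cycles with no uops, 20 with one, 30 with two, and 40 with three.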

[Figure: pipeline block diagram. Labels: L2, 32KB, IP, 32KB, 4, (BTB), (RAT), RS 32, MOB, SIMD, FP Add, FP DIV/MUL, ROB 96, IA. Event shown: RESOURCE_STALLS.]

RESOURCE_STALLS OOO UOP
RESOURCE_STALLS.BR_MISS_CLEAR  FP
RESOURCE_STALLS.CLEAR
RESOURCE_STALLS.ROB_FULL  ROB 96
RESOURCE_STALLS.LD_ST
RESOURCE_STALLS.RS_FULL  RS 32

[Figure: pipeline block diagram. Labels: L2, 32KB, IP, 32KB, 4, (BTB), (RAT), RS 32, MOB, SIMD, FP Add, FP DIV/MUL, ROB 96, IA. Event shown: UOPS_RETIRED.]

[Figure: plot titled "Stalls" at Retirement vs Dispatch; x axis Cycles(Disp_uops=0), y axis Cycles(Ret_uops=0), both 0 to 1.]
OOO

X86
[...] = rs_uops_dispatched.cycles_none + rs_uops_dispatched:cmask=1 + [...] ~ CPU_CLK_UNHALTED.CORE
CPU

X86
[...] ~ cycles_dispatch_retiring_uops + cycles_dispatch_non_retiring_uops
[...] UOP = rs_uops_dispatched - (uops_retired.any + uops_retired.fused)
[...] UOP [...] ~ uops / avg_uops_per_cycle
[...] = rs_uops_dispatched / (uops_retired.any + uops_retired.fused) - 1
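The uop arithmetic on this slide can be sketched as follows. The sketch assumes that the operator lost between `rs_uops_dispatched` and the parenthesised retired-uop sum is a subtraction, so the first result is the number of dispatched uops that never retire and the second is their ratio to retired uops. The function name is hypothetical.

```python
def uop_accounting(rs_uops_dispatched, uops_retired_any, uops_retired_fused):
    """Return (non_retired_uops, non_retired_per_retired).

    ASSUMPTION: non-retired uops = dispatched - (retired.any + retired.fused).
    """
    retired = uops_retired_any + uops_retired_fused
    if retired <= 0:
        raise ValueError("retired uop count must be positive")
    non_retired = rs_uops_dispatched - retired
    # Same quantity expressed as the slide's ratio: dispatched / retired - 1.
    ratio = rs_uops_dispatched / retired - 1
    return non_retired, ratio
```

With made-up counts, `uop_accounting(1500, 900, 100)` returns `(500, 0.5)`: half as many uops were dispatched without retiring as were retired.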

[Figure: chart titled "Cycle Accounting"; series: Executing, Stalls; y axis 0 to 1.2.]
Stalls = cycles with no UOP dispatched = RS_UOPS_DISPATCHED.CYCLES_NONE

[Figure: chart titled "Stall Decomposition"; series: Executing, FE + Scoreboard, Pipeline Flush, DTLB, L2 Hits, L2 Misses, Stall Total; y axis 0 to 1.2.]
[...] = RESOURCE_STALLS.BR_MISS_CLEAR / [...]
L2 [...] = (MEM_LOAD_RETIRED.L1D_LINE_MISS - MEM_LOAD_RETIRED.L2_LINE_MISS) * 12 / [...]
DTLB/L2 [...] = [...] * [...] / [...]
FE + [...] = [...]
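The L2 term on this slide can be sketched numerically. The numerator is taken directly from the slide: loads that miss L1D but not L2, times 12. The denominator is missing from the transcript; the sketch assumes it is CPU_CLK_UNHALTED.CORE, so that the result is a fraction of total cycles, and it reads the factor 12 as a per-hit cost in cycles. Both function names are hypothetical.

```python
L2_HIT_COST = 12  # factor from the slide, read here as cycles per L2 hit


def l2_hit_stall_cycles(l1d_line_miss, l2_line_miss):
    """(MEM_LOAD_RETIRED.L1D_LINE_MISS - MEM_LOAD_RETIRED.L2_LINE_MISS) * 12."""
    if l2_line_miss > l1d_line_miss:
        raise ValueError("L2 misses cannot exceed L1D misses")
    return (l1d_line_miss - l2_line_miss) * L2_HIT_COST


def l2_hit_stall_fraction(l1d_line_miss, l2_line_miss, cpu_clk_unhalted_core):
    """Stall cycles as a share of all cycles (ASSUMED denominator)."""
    if cpu_clk_unhalted_core <= 0:
        raise ValueError("cycle count must be positive")
    return l2_hit_stall_cycles(l1d_line_miss, l2_line_miss) / cpu_clk_unhalted_core
```

With made-up counts of 1000 L1D line misses, 200 L2 line misses, and 96,000 cycles, the estimate is 9600 stall cycles, or 0.1 of the run.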

[Figure: chart titled "Decomposing Unstalled Cycles"; series: Uops Retiring, OOO Bursts, Non_retired, Stalls; y axis 0.75 to 1.05.]
[...] = (1 - (Uops_retired.any + Uops_retired.fused) / RS_Uops_Dispatched) * RS_Uops_Dispatched:cmask=1 / CPU_CLK_UNHALTED.CORE
OOO [...] = Uops_Retired.Any.cycles_none
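The first formula on this slide can be sketched as below. It assumes that the operator lost after the leading `1` is a subtraction, and that the result corresponds to the "Non_retired" series in the chart legend, which the transcript does not state explicitly. The function and parameter names are hypothetical.

```python
def non_retired_cycle_fraction(uops_retired_any, uops_retired_fused,
                               rs_uops_dispatched, dispatch_cycles,
                               cpu_clk_unhalted_core):
    """(1 - retired / dispatched) * cycles(cmask=1) / CPU_CLK_UNHALTED.CORE.

    dispatch_cycles is the RS_UOPS_DISPATCHED:cmask=1 count, i.e. cycles
    in which at least one uop was dispatched.
    """
    if rs_uops_dispatched <= 0 or cpu_clk_unhalted_core <= 0:
        raise ValueError("dispatched uops and cycles must be positive")
    retired = uops_retired_any + uops_retired_fused
    wasted_share = 1 - retired / rs_uops_dispatched
    return wasted_share * dispatch_cycles / cpu_clk_unhalted_core
```

With made-up counts, `non_retired_cycle_fraction(700, 50, 1000, 800, 1000)` returns `0.2`: a quarter of the dispatched uops never retire, and dispatch is active in 80% of cycles.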

[Figure: chart titled "Cycle Decomposition"; series: Uops Retiring, OOO Bursts, Non_retired, FE + Scoreboard, Pipeline Flush, DTLB, L2 Hits, L2 Misses; y axis 0 to 1.2.]
/ FE +

Load_Blocks.ADR ~5 8 4 Load_Blocks.Overlap_Store ~6 N*4096 Load_Blocks.Overlap_Store ~6 L2 2 Load_Blocks.UNTIL_RETIRE ~22 L2 2 Load_Blocks.UNTIL_RETIRE ~20 16 ILD_STALLS ~6 FE +

Itanium event            Core event
CPU_CYCLES               CPU_CLK_UNHALTED.CORE
BACK_END_BUBBLE.ALL      RS_UOPS_DISPATCHED.CYCLES_NONE
BUS_MEMORY.ALL.SELF      BUS_TRANS_ANY.SELF
DEAR_LATENCY_GT_64       MEM_LOAD_RETIRED.L2_LINE_MISS

1

Event                                     SAV (Sample After Value)
CPU_CLK_UNHALTED.CORE                     2,000,000
RS_UOPS_DISPATCHED.CYCLES_NONE            2,000,000
UOPS_RETIRED.ANY + UOPS_RETIRED.FUSED     2,000,000
RS_UOPS_DISPATCHED                        2,000,000
MEM_LOAD_RETIRED.L2_LINE_MISS             10,000
INST_RETIRED.ANY_P                        2,000,000
BUS_TRANS_ANY.SELF                        100,000
BUS_TRANS_ANY.ALL_AGENTS                  100,000
RESOURCE_STALLS.BR_MISS_CLEAR             2,000,000

2

Event                                     SAV (Sample After Value)
MEM_LOAD_RETIRED.DTLB_MISS                20,000
MEM_LOAD_RETIRED.L2_MISS                  10,000
MEM_LOAD_RETIRED.L1D_LINE_MISS            200,000
BR_CND_MISSP_EXEC                         2,000,000
BR_CND_EXEC                               2,000,000
BR_CALL_EXEC                              200,000
BR_CALL_MISSP_EXEC                        200,000
ILD_STALLS                                200,000
LOAD_BLOCK.STORE_OVERLAP                  200,000

SAV : L1 / L2_hit 10
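The SAV values above can be used to scale sample counts back to approximate event counts. In event-based sampling the counter raises an interrupt after every SAV occurrences, so each collected sample stands for roughly SAV events. The sketch below copies a few SAV values from the two tables; the function name and the sample count in the usage note are hypothetical.

```python
# A subset of the Sample After Values listed on the two slides.
SAV = {
    "CPU_CLK_UNHALTED.CORE": 2_000_000,
    "RS_UOPS_DISPATCHED.CYCLES_NONE": 2_000_000,
    "MEM_LOAD_RETIRED.L2_LINE_MISS": 10_000,
    "BUS_TRANS_ANY.SELF": 100_000,
    "MEM_LOAD_RETIRED.DTLB_MISS": 20_000,
    "ILD_STALLS": 200_000,
}


def estimated_event_count(event, samples):
    """Approximate total events as samples * SAV for the given event."""
    if samples < 0:
        raise ValueError("sample count cannot be negative")
    return samples * SAV[event]
```

For instance, 150 samples of MEM_LOAD_RETIRED.L2_LINE_MISS at an SAV of 10,000 correspond to about 1,500,000 L2 line misses.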

FP L2 OOO

Core 2 2 SIMD 1 Core 2 PMU LLC

MEM_LOAD_RETIRED.L2_MISS L2

Core Core
BR_INST_RETIRED.MISPRED
INST_RETIRED.ANY_P
MEM_LOAD_RETIRED.DTLB_MISS
MEM_LOAD_RETIRED.L1D_LINE_MISS
EIP
MEM_LOAD_RETIRED.L1D_MISS
MEM_LOAD_RETIRED.L2_LINE_MISS
MEM_LOAD_RETIRED.L2_MISS
SIMD_INST_RETIRED.ANY
X87_OPS_RETIRED.ANY

Core
CPU_CLK_UNHALTED.CORE
INST_RETIRED.ANY
CPU_CLK_UNHALTED.CORE CPU HALT CPU
CPU_CLK_UNHALTED.CORE / [...] = 1
INST_RETIRED.ANY = [...]
CPI = CPU_CLK_UNHALTED.CORE / INST_RETIRED.ANY
CPI
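The CPI definition on this slide is a single division; a minimal sketch follows. The function name and the sample counts are hypothetical.

```python
def cycles_per_instruction(cpu_clk_unhalted_core, inst_retired_any):
    """CPI = CPU_CLK_UNHALTED.CORE / INST_RETIRED.ANY."""
    if inst_retired_any <= 0:
        raise ValueError("retired instruction count must be positive")
    return cpu_clk_unhalted_core / inst_retired_any
```

For example, 3,000,000 unhalted core cycles over 2,000,000 retired instructions gives a CPI of 1.5.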

Core CPU_CLK_UNHALTED.CORE CPU_CLK_UNHALTED.NO_OTHER CPU_CLK_UNHALTED.NO_OTHER HALT 1 CPU_CLK_UNHALTED.CORE HALT CPU_CLK_UNHALTED.NO_OTHER

VTune Xeon Pentium 4 Web
http://www.intel.co.jp/jp/developer/download/
http://www.intel.co.jp/jp/developer/software/products/
http://developer.intel.com
http://support.intel.com
http://softwareforums.intel.com/ids

Intel's Terms and Conditions of Sales ( ) * 2007 Intel Corporation.