Core Intel Intel Intel Itanium Pentium Xeon Intel Corporation 2007Intel Corporation.
VTune 9.0
VTune 1
Randy Camp, R&D, MUSICMATCH Inc.
Core 2 Duo, Core 2 Quad
Windows: Microsoft Vista*
Linux: hotspot
Windows*, Linux*, Mac*
IA32, Intel64, IA64
2007 Intel Corporation.
Core
Core
VTune: event count = number of samples * Sample After Value (SAV)
Core
[Figure: block diagram of two cores (CPU-0, CPU-1) on the FSB sharing an L2 cache (2/4MB); each core has L1D=32KB, L1I=32KB, L0/L1 DTLB, and PMH.]
4 L1 Core Duo DCU DCU Data Cache Unit IP IP L2 Pentium 4
Instruction and bus events:
CPU_CLK_UNHALTED
INST_RETIRED_ANY_P
INST_RETIRED_LOADS
INST_RETIRED_STORES
BUS_TRANS_ANY
BUS_TRANS_MEM
BUS_TRANS_BURST
BUS_TRANS_BRD
BUS_TRANS_WB
BUS_TRANS_RFO

Bus occupancy, cache, and prefetch events:
BUS_DRDY_CLOCKS.ALL_AGENTS
BUS_DRDY_CLOCKS.THIS_AGENT
MEM_LOAD_RETIRED.L2_LINE_MISS
MMX2_PRE_MISS.T1
MMX2_PRE_MISS.T2
MMX2_PRE_MISS.STORES
L2_LINES_IN.SELF.DEMAND
L2_LINES_IN.SELF.PREFETCH
L2_LINES_OUT.SELF.DEMAND
L2_LINES_OUT.SELF.PREFETCH

= 64 * Bus_Trans_Mem * / Cpu_Clk_Unhalted
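The bandwidth expression on this slide survives with one factor missing between `*` and `/`. The sketch below assumes that factor is the core clock frequency, so that `Cpu_Clk_Unhalted / frequency` is the elapsed unhalted time in seconds; the function name and the `core_hz` parameter are hypothetical and not taken from the slide.

```python
def bus_bandwidth_bytes_per_sec(bus_trans_mem, cpu_clk_unhalted, core_hz):
    """Estimate memory bus bandwidth from two counter totals.

    Assumption: every BUS_TRANS_MEM transaction moves one 64-byte line,
    and core_hz (hypothetical parameter) is the core clock in Hz.
    """
    if cpu_clk_unhalted <= 0 or core_hz <= 0:
        raise ValueError("cycle count and frequency must be positive")
    elapsed_seconds = cpu_clk_unhalted / core_hz
    return 64 * bus_trans_mem / elapsed_seconds


# Example: 1,000,000 transactions over 2e9 unhalted cycles at 2 GHz.
bandwidth = bus_bandwidth_bytes_per_sec(1_000_000, 2_000_000_000, 2.0e9)
```

If the counts come from sampling rather than direct counting, they would first need to be scaled by the Sample After Value.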
X86 OOO
2
IP OOO 4 L2 BW DTLB FE CPU
X86
CPU_CLK_UNHALTED = STALLS + NON_RET_DISPATCH + RET_DISPATCH
PGO, ILP
RESOURCE_STALLS.BR_MISS_CLEAR
UOP
[Figure: Core pipeline block diagram (L2, two 32KB L1 caches, BTB, RAT, RS with 32 entries, MOB, SIMD and FP Add / FP DIV/MUL execution units, ROB with 96 entries), annotated with RESOURCE_STALLS.]
UOP
[Figure: the same pipeline block diagram, annotated with RS_UOPS_DISPATCHED.]
UOP
[Figure: the same pipeline block diagram, annotated with UOPS_RETIRED.]
OOO RS
RS_UOPS_DISPATCHED
CPU RS PMU*
* PMU: Performance Monitoring Unit
[Figure: Core pipeline block diagram (L2, two 32KB L1 caches, BTB, RAT, RS with 32 entries, MOB, SIMD and FP Add / FP DIV/MUL execution units, ROB with 96 entries), annotated with RS_UOPS_DISPATCHED.]
VTune
PMU
GE / LT
Event select register fields: CMASK | INV | EN | INT | PC | E | OS | USR | umask | Event #
3 APIC 0
RS_UOPS_DISPATCHED with CMASK = 1, INV = 1 == RS_UOPS_DISPATCHED.CYCLES_NONE
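To make the field list concrete, here is a minimal sketch that packs those fields into a 32-bit event select value using the architectural IA32_PERFEVTSELx bit positions (event 0-7, umask 8-15, USR 16, OS 17, E 18, PC 19, INT 20, EN 22, INV 23, CMASK 24-31). The function and parameter names are hypothetical, and the event number 0xA0 used for RS_UOPS_DISPATCHED is an assumption that should be checked against the processor's event list.

```python
def encode_perfevtsel(event, umask=0, count_user=True, count_os=True,
                      edge=False, pin_control=False, interrupt=False,
                      enable=True, inv=False, cmask=0):
    """Pack event-select fields into a 32-bit value (names are illustrative)."""
    value = event & 0xFF                 # Event #
    value |= (umask & 0xFF) << 8         # umask
    value |= int(count_user) << 16       # USR: count at ring 3
    value |= int(count_os) << 17         # OS: count at ring 0
    value |= int(edge) << 18             # E: edge detect
    value |= int(pin_control) << 19      # PC
    value |= int(interrupt) << 20        # INT: APIC interrupt on overflow
    value |= int(enable) << 22           # EN
    value |= int(inv) << 23              # INV: invert the CMASK comparison
    value |= (cmask & 0xFF) << 24        # CMASK: threshold per cycle
    return value


# CMASK = 1 with INV = 1 counts cycles in which fewer than one uop was
# dispatched, which is what RS_UOPS_DISPATCHED.CYCLES_NONE reports.
# 0xA0 is an assumed event number, not taken from the slide.
cycles_none = encode_perfevtsel(0xA0, cmask=1, inv=True)
```

Without INV, the counter increments in cycles where the event count is greater than or equal to CMASK; with INV it increments where the count is less than CMASK.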
RS_UOPS_DISPATCHED:cmask=1 ~ CPU_CLK_UNHALTED
RS_UOPS_DISPATCHED:cmask=1:inv=1
CPU_CLK_UNHALTED OOO >99%
CPU PL HALT
UOP /
RS_UOPS_DISPATCHED:cmask=1:inv=1
RS_UOPS_DISPATCHED:cmask=2:inv=1
RS_UOPS_DISPATCHED:cmask=3:inv=1
RS_UOPS_DISPATCHED:cmask=4:inv=1
RS_UOPS_DISPATCHED:cmask=5:inv=1
RS_UOPS_DISPATCHED:cmask=6:inv=1
RS_UOPS_DISPATCHED:cmask=7:inv=1
cmask=N:inv=1 counts cycles with N-1 or fewer uops dispatched per cycle.
Test loop: a[i] = exp(x[i]);
[Figure: chart for the test loop, horizontal axis 0 to 8, vertical axis 0 to 18,000,000,000, single series.]
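Because each `cmask=N:inv=1` count is cumulative (cycles with fewer than N uops dispatched), differences between neighbouring counts give the number of cycles with exactly 0, 1, 2, ... uops. The following is a minimal sketch of that subtraction; the function name and the sample numbers are made up for illustration.

```python
def uops_per_cycle_histogram(inv_counts, total_cycles=None):
    """Turn cumulative cmask=N:inv=1 counts into a per-cycle histogram.

    inv_counts maps N to the count for RS_UOPS_DISPATCHED:cmask=N:inv=1,
    i.e. the number of cycles with fewer than N uops dispatched.
    Returns (histogram, remainder): histogram[k] is the number of cycles
    with exactly k uops; remainder is the number of cycles at or above
    the largest N, or None when total_cycles is not given.
    """
    thresholds = sorted(inv_counts)
    if thresholds != list(range(1, len(thresholds) + 1)):
        raise ValueError("cmask values must be contiguous and start at 1")
    histogram = {}
    previous = 0
    for n in thresholds:
        histogram[n - 1] = inv_counts[n] - previous
        previous = inv_counts[n]
    remainder = None if total_cycles is None else total_cycles - previous
    return histogram, remainder


# Illustrative counts only, not measurements from the slide.
hist, rest = uops_per_cycle_histogram({1: 100, 2: 150, 3: 180, 4: 200},
                                      total_cycles=210)
```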
[Figure: Core pipeline block diagram (L2, two 32KB L1 caches, BTB, RAT, RS with 32 entries, MOB, SIMD and FP Add / FP DIV/MUL execution units, ROB with 96 entries), annotated with RESOURCE_STALLS.]
RESOURCE_STALLS
OOO UOP
RESOURCE_STALLS.BR_MISS_CLEAR
FP
RESOURCE_STALLS.CLEAR
RESOURCE_STALLS.ROB_FULL (ROB: 96 entries)
RESOURCE_STALLS.LD_ST
RESOURCE_STALLS.RS_FULL (RS: 32 entries)
[Figure: Core pipeline block diagram (L2, two 32KB L1 caches, BTB, RAT, RS with 32 entries, MOB, SIMD and FP Add / FP DIV/MUL execution units, ROB with 96 entries), annotated with UOPS_RETIRED.]
[Figure: "Stalls" at Retirement vs Dispatch. Horizontal axis: Cycles(Disp_uops=0), 0 to 1. Vertical axis: Cycles(Ret_uops=0), 0 to 1.]
OOO
X86
= rs_uops_dispatched.cycles_none + rs_uops_dispatched:cmask=1 + ~ CPU_CLK_UNHALTED.CORE
CPU
X86
~ cycles_dispatch_retiring_uops + cycles_dispatch_non_retiring_uops
/ UOP
UOP = rs_uops_dispatched - (uops_retired.any + uops_retired.fused)
UOP ~ uops / avg_uops_per_cycle
= rs_uops_dispatched / (uops_retired.any + uops_retired.fused) - 1
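The two uop expressions on this slide can be worked through with plain arithmetic. This sketch assumes the first expression is a difference (dispatched uops minus retired uops, with the minus sign lost in extraction) and that the second is the ratio of non-retired to retired uops; the function names are hypothetical.

```python
def non_retired_uops(rs_uops_dispatched, uops_retired_any, uops_retired_fused):
    """Uops that were dispatched but never retired (assumed to be a difference)."""
    retired = uops_retired_any + uops_retired_fused
    return rs_uops_dispatched - retired


def non_retired_ratio(rs_uops_dispatched, uops_retired_any, uops_retired_fused):
    """rs_uops_dispatched / (uops_retired.any + uops_retired.fused) - 1."""
    retired = uops_retired_any + uops_retired_fused
    if retired <= 0:
        raise ValueError("retired uop count must be positive")
    return rs_uops_dispatched / retired - 1


# Illustrative counts: 1200 dispatched, 900 + 100 retired.
extra = non_retired_uops(1200, 900, 100)
ratio = non_retired_ratio(1200, 900, 100)
```

A ratio near zero would suggest that almost all dispatched work retires; a larger value points at speculative work that was thrown away.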
[Figure: Cycle Accounting. Bar chart, scale 0 to 1.2, series: Executing, Stalls.]
= UOP = RS_UOPS_DISPATCHED.CYCLES_NONE
[Figure: Stall Decomposition. Bars 1 and 2, scale 0 to 1.2, series: Executing, FE + Scoreboard, Pipeline Flush, DTLB, L2 Hits, L2 Misses, Stall Total.]
= RESOURCE_STALLS.BR_MISS_CLEAR /
L2 = (MEM_LOAD_RETIRED.L1D_LINE_MISS - MEM_LOAD_RETIRED.L2_LINE_MISS) * 12 /
DTLB/L2 = * /
FE + =
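The L2 term above can be evaluated directly: loads that miss the L1D but hit the L2 are the L1D line misses minus the L2 line misses, each charged 12 cycles. The denominator is missing in the extracted slide; the sketch below assumes it is CPU_CLK_UNHALTED.CORE, which would make the result a fraction of total cycles. Function names are hypothetical.

```python
def l2_hit_stall_cycles(l1d_line_miss, l2_line_miss, penalty_cycles=12):
    """Cycles attributed to loads that miss the L1D and hit the L2."""
    return (l1d_line_miss - l2_line_miss) * penalty_cycles


def l2_hit_stall_fraction(l1d_line_miss, l2_line_miss, cpu_clk_unhalted_core):
    """Share of cycles spent on L2 hits.

    Assumption: the lost denominator is CPU_CLK_UNHALTED.CORE.
    """
    if cpu_clk_unhalted_core <= 0:
        raise ValueError("cycle count must be positive")
    stall = l2_hit_stall_cycles(l1d_line_miss, l2_line_miss)
    return stall / cpu_clk_unhalted_core


# Illustrative counts: 1000 L1D line misses, 400 of which also miss the L2.
cycles = l2_hit_stall_cycles(1000, 400)
share = l2_hit_stall_fraction(1000, 400, 72_000)
```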
[Figure: Decomposing Unstalled Cycles. Scale 0.75 to 1.05, series: Uops Retiring, OOO Bursts, Non_retired, Stalls.]
= (1 - (Uops_retired.any + Uops_retired.fused) / RS_Uops_Dispatched) * RS_Uops_Dispatched:cmask=1 / CPU_CLK_UNHALTED.CORE
OOO = Uops_Retired.Any.cycles_none
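As a worked example of the first expression on this slide, the sketch below computes the share of all cycles spent dispatching uops that never retire. It assumes the operator lost after the leading `1` is a minus sign; the function and argument names are hypothetical.

```python
def non_retired_cycle_fraction(uops_retired_any, uops_retired_fused,
                               rs_uops_dispatched, cycles_dispatching,
                               cpu_clk_unhalted_core):
    """(1 - retired / dispatched) * RS_Uops_Dispatched:cmask=1 / total cycles.

    cycles_dispatching is the RS_Uops_Dispatched:cmask=1 count, i.e. the
    cycles in which at least one uop was dispatched.
    """
    if rs_uops_dispatched <= 0 or cpu_clk_unhalted_core <= 0:
        raise ValueError("counts must be positive")
    retired_share = (uops_retired_any + uops_retired_fused) / rs_uops_dispatched
    return (1 - retired_share) * cycles_dispatching / cpu_clk_unhalted_core


# Illustrative counts: 80% of dispatched uops retire, 800 of 1000 cycles dispatch.
fraction = non_retired_cycle_fraction(900, 100, 1250, 800, 1000)
```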
[Figure: Cycle Decomposition. Scale 0 to 1.2, series: Uops Retiring, OOO Bursts, Non_retired, FE + Scoreboard, Pipeline Flush, DTLB, L2 Hits, L2 Misses.]
/ FE +
          Load_Blocks.ADR            ~5
8 4       Load_Blocks.Overlap_Store  ~6
N*4096    Load_Blocks.Overlap_Store  ~6
L2 2      Load_Blocks.UNTIL_RETIRE   ~22
L2 2      Load_Blocks.UNTIL_RETIRE   ~20
16        ILD_STALLS                 ~6
FE +
CPU_CYCLES             CPU_CLK_UNHALTED.CORE
BACK_END_BUBBLE.ALL    RS_UOPS_DISPATCHED.CYCLES_NONE
BUS_MEMORY.ALL.SELF    BUS_TRANS_ANY.SELF
DEAR_LATENCY_GT_64     MEM_LOAD_RETIRED.L2_LINE_MISS
1
Event                                   SAV (Sample After Value)
CPU_CLK_UNHALTED.CORE                   2,000,000
RS_UOPS_DISPATCHED.CYCLES_NONE          2,000,000
UOPS_RETIRED.ANY + UOPS_RETIRED.FUSED   2,000,000
RS_UOPS_DISPATCHED                      2,000,000
MEM_LOAD_RETIRED.L2_LINE_MISS           10,000
INST_RETIRED.ANY_P                      2,000,000
BUS_TRANS_ANY.SELF                      100,000
BUS_TRANS_ANY.ALL_AGENTS                100,000
RESOURCE_STALLS.BR_MISS_CLEAR           2,000,000
SAV

2
Event                                   SAV (Sample After Value)
MEM_LOAD_RETIRED.DTLB_MISS              20,000
MEM_LOAD_RETIRED.L2_MISS                10,000
MEM_LOAD_RETIRED.L1D_LINE_MISS          200,000
BR_CND_MISSP_EXEC                       2,000,000
BR_CND_EXEC                             2,000,000
BR_CALL_EXEC                            200,000
BR_CALL_MISSP_EXEC                      200,000
ILD_STALLS                              200,000
LOAD_BLOCK.OVERLAP_STORE                200,000
SAV: L1 / L2_hit 10
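With event-based sampling, one sample is taken each time the counter reaches the Sample After Value, so an event total can be estimated as samples times SAV. The sketch below applies that to a few SAV values from the tables above; the dictionary, the function name, and the sample counts are illustrative assumptions, not data from the slides.

```python
# SAV values copied from the tables above (subset).
SAV = {
    "CPU_CLK_UNHALTED.CORE": 2_000_000,
    "INST_RETIRED.ANY_P": 2_000_000,
    "MEM_LOAD_RETIRED.L2_LINE_MISS": 10_000,
    "MEM_LOAD_RETIRED.DTLB_MISS": 20_000,
}


def estimated_event_count(event, samples):
    """Estimate the total event count for a sampled event."""
    if samples < 0:
        raise ValueError("samples must not be negative")
    return samples * SAV[event]


# Hypothetical sample counts for one function in a profile.
l2_misses = estimated_event_count("MEM_LOAD_RETIRED.L2_LINE_MISS", 150)
cycles = estimated_event_count("CPU_CLK_UNHALTED.CORE", 450)
```

Rare events use a smaller SAV so that enough samples are collected; the estimate is only as precise as the sample count allows.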
FP L2 OOO
Core 2 2 SIMD 1 Core 2 PMU LLC
MEM_LOAD_RETIRED.L2_MISS L2
Core
Core
BR_INST_RETIRED.MISPRED
INST_RETIRED.ANY_P
MEM_LOAD_RETIRED.DTLB_MISS
MEM_LOAD_RETIRED.L1D_LINE_MISS
MEM_LOAD_RETIRED.L1D_MISS
MEM_LOAD_RETIRED.L2_LINE_MISS
MEM_LOAD_RETIRED.L2_MISS
SIMD_INST_RETIRED.ANY
X87_OPS_RETIRED.ANY
EIP
Core
CPU_CLK_UNHALTED.CORE, INST_RETIRED.ANY
CPU_CLK_UNHALTED.CORE: CPU HALT CPU
CPU_CLK_UNHALTED.CORE / = 1
INST_RETIRED.ANY =
CPI = CPU_CLK_UNHALTED.CORE / INST_RETIRED.ANY
CPI
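The CPI definition on this slide is a single division, sketched here with a guard for an empty measurement. The function name and the counts are illustrative only.

```python
def cpi(cpu_clk_unhalted_core, inst_retired_any):
    """Cycles per instruction: CPU_CLK_UNHALTED.CORE / INST_RETIRED.ANY."""
    if inst_retired_any <= 0:
        raise ValueError("INST_RETIRED.ANY must be positive")
    return cpu_clk_unhalted_core / inst_retired_any


# Illustrative counts: 3,000,000 unhalted cycles for 2,000,000 instructions.
value = cpi(3_000_000, 2_000_000)
```

Because CPU_CLK_UNHALTED.CORE does not advance while the core is halted, idle time does not inflate the result.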
Core CPU_CLK_UNHALTED.CORE CPU_CLK_UNHALTED.NO_OTHER CPU_CLK_UNHALTED.NO_OTHER HALT 1 CPU_CLK_UNHALTED.CORE HALT CPU_CLK_UNHALTED.NO_OTHER
VTune, Xeon, Pentium 4
Web:
http://www.intel.co.jp/jp/developer/download/
http://www.intel.co.jp/jp/developer/software/products/
http://developer.intel.com
http://support.intel.com
http://softwareforums.intel.com/ids
Intel's Terms and conditions of Sales ( ) * 2007 Intel Corporation.