IPSJ SIG Technical Report Vol.2010-AVM-68 No /3/ High-Efficient 2-pass Video Coding Algorithm based on Macroblock Rate-Distortion Kazu

Similar documents
2. CABAC CABAC CABAC 1 1 CABAC Figure 1 Overview of CABAC 2 DCT 2 0/ /1 CABAC [3] 3. 2 値化部 コンテキスト計算部 2 値算術符号化部 CABAC CABAC

図 2: 高周波成分を用いた超解像 解像度度画像とそれらを低解像度化して得られる 低解像度画像との差により低解像度の高周波成分 を得る 高解像度と低解像度の高周波成分から位 置関係を保ったままパッチ領域をそれぞれ切り出 し 高解像度パッチ画像と低解像度パッチ画像の ペアとしてデータベースに登録する

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

kiyo5_1-masuzawa.indd

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

Input image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L

4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q

2.2 (a) = 1, M = 9, p i 1 = p i = p i+1 = 0 (b) = 1, M = 9, p i 1 = 0, p i = 1, p i+1 = 1 1: M 2 M 2 w i [j] w i [j] = 1 j= w i w i = (w i [ ],, w i [

Ł\”ƒ-2005

第90回日本感染症学会学術講演会抄録(I)

ID 3) 9 4) 5) ID 2 ID 2 ID 2 Bluetooth ID 2 SRCid1 DSTid2 2 id1 id2 ID SRC DST SRC 2 2 ID 2 2 QR 6) 8) 6) QR QR QR QR

IPSJ SIG Technical Report Vol.2009-CVIM-167 No /6/10 Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing

(3.6 ) (4.6 ) 2. [3], [6], [12] [7] [2], [5], [11] [14] [9] [8] [10] (1) Voodoo 3 : 3 Voodoo[1] 3 ( 3D ) (2) : Voodoo 3D (3) : 3D (Welc

IPSJ SIG Technical Report Pitman-Yor 1 1 Pitman-Yor n-gram A proposal of the melody generation method using hierarchical pitman-yor language model Aki

Vol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information

IPSJ SIG Technical Report Vol.2016-CE-137 No /12/ e β /α α β β / α A judgment method of difficulty of task for a learner using simple

抄録/抄録1    (1)V

日本内科学会雑誌第98巻第4号

日本内科学会雑誌第97巻第7号

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came

1 Table 1: Identification by color of voxel Voxel Mode of expression Nothing Other 1 Orange 2 Blue 3 Yellow 4 SSL Humanoid SSL-Vision 3 3 [, 21] 8 325

パーキンソン病治療ガイドライン2002

研修コーナー

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

IPSJ SIG Technical Report An Evaluation Method for the Degree of Strain of an Action Scene Mao Kuroda, 1 Takeshi Takai 1 and Takashi Matsuyama 1

28 TCG SURF Card recognition using SURF in TCG play video

P2P P2P peer peer P2P peer P2P peer P2P i

IPSJ SIG Technical Report Vol.2014-CG-155 No /6/28 1,a) 1,2,3 1 3,4 CG An Interpolation Method of Different Flow Fields using Polar Inter

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2017-CG-166 No /3/ HUNTEXHUNTER1 NARUTO44 Dr.SLUMP1,,, Jito Hiroki Satoru MORITA The

IPSJ SIG Technical Report Vol.2014-IOT-27 No.14 Vol.2014-SPT-11 No /10/10 1,a) 2 zabbix Consideration of a system to support understanding of f

人文学部研究年報12号.indb

7,, i

MDD PBL ET 9) 2) ET ET 2.2 2), 1 2 5) MDD PBL PBL MDD MDD MDD 10) MDD Executable UML 11) Executable UML MDD Executable UML

,,.,.,,.,.,.,.,,.,..,,,, i

Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels).

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. TV A310

DEIM Forum 2009 C8-4 QA NTT QA QA QA 2 QA Abstract Questions Recomme

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

IPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26 DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing

1: A/B/C/D Fig. 1 Modeling Based on Difference in Agitation Method artisoc[7] A D 2017 Information Processing

149 (Newell [5]) Newell [5], [1], [1], [11] Li,Ryu, and Song [2], [11] Li,Ryu, and Song [2], [1] 1) 2) ( ) ( ) 3) T : 2 a : 3 a 1 :

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.

2). 3) 4) 1.2 NICTNICT DCRA Dihedral Corner Reflector micro-arraysdcra DCRA DCRA DCRA 3D DCRA PC USB PC PC ON / OFF Velleman K8055 K8055 K8055

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

VRSJ-SIG-MR_okada_79dce8c8.pdf

(MIRU2008) HOG Histograms of Oriented Gradients (HOG)

(a) Picking up of six components (b) Picking up of three simultaneously. components simultaneously. Fig. 2 An example of the simultaneous pickup. 6 /

20 Method for Recognizing Expression Considering Fuzzy Based on Optical Flow

16_.....E...._.I.v2006

IPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-

& Vol.2 No (Mar. 2012) 1,a) , Bluetooth A Health Management Service by Cell Phones and Its Us

表1-表4宅建99.indd

表1-表4宅建98.indd

表1-表4宅建101.indd

表1-表4宅建いわて-表紙.indd

17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System

& 3 3 ' ' (., (Pixel), (Light Intensity) (Random Variable). (Joint Probability). V., V = {,,, V }. i x i x = (x, x,, x V ) T. x i i (State Variable),

揃 Lag [hour] Lag [day] 35

TCP/IP IEEE Bluetooth LAN TCP TCP BEC FEC M T M R M T 2. 2 [5] AODV [4]DSR [3] 1 MS 100m 5 /100m 2 MD 2 c 2009 Information Processing Society of

IPSJ SIG Technical Report Vol.2010-CVIM-170 No /1/ Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Ta

4 i

IPSJ SIG Technical Report GPS LAN GPS LAN GPS LAN Location Identification by sphere image and hybrid sensing Takayuki Katahira, 1 Yoshio Iwai 1

23_02.dvi

MmUm+FopX m Mm+Mop F-Mm(Fop-Mopum)M m+mop MSuS+FX S M S+MOb Fs-Ms(Mobus-Fex)M s+mob Fig. 1 Particle model of single degree of freedom master/ slave sy

A Feasibility Study of Direct-Mapping-Type Parallel Processing Method to Solve Linear Equations in Load Flow Calculations Hiroaki Inayoshi, Non-member

IPSJ SIG Technical Report Secret Tap Secret Tap Secret Flick 1 An Examination of Icon-based User Authentication Method Using Flick Input for

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. UWB UWB

1 Kinect for Windows M = [X Y Z] T M = [X Y Z ] T f (u,v) w 3.2 [11] [7] u = f X +u Z 0 δ u (X,Y,Z ) (5) v = f Y Z +v 0 δ v (X,Y,Z ) (6) w = Z +

23 Fig. 2: hwmodulev2 3. Reconfigurable HPC 3.1 hw/sw hw/sw hw/sw FPGA PC FPGA PC FPGA HPC FPGA FPGA hw/sw hw/sw hw- Module FPGA hwmodule hw/sw FPGA h

2007/8 Vol. J90 D No. 8 Stauffer [7] 2 2 I 1 I 2 2 (I 1(x),I 2(x)) 2 [13] I 2 = CI 1 (C >0) (I 1,I 2) (I 1,I 2) Field Monitoring Server

Q [4] 2. [3] [5] ϵ- Q Q CO CO [4] Q Q [1] i = X ln n i + C (1) n i i n n i i i n i = n X i i C exploration exploitation [4] Q Q Q ϵ 1 ϵ 3. [3] [5] [4]

浜松医科大学紀要

Vol. 42 No. SIG 8(TOD 10) July HTML 100 Development of Authoring and Delivery System for Synchronized Contents and Experiment on High Spe

B HNS 7)8) HNS ( ( ) 7)8) (SOA) HNS HNS 4) HNS ( ) ( ) 1 TV power, channel, volume power true( ON) false( OFF) boolean channel volume int

130 Oct Radial Basis Function RBF Efficient Market Hypothesis Fama ) 4) 1 Fig. 1 Utility function. 2 Fig. 2 Value function. (1) (2)

1 DHT Fig. 1 Example of DHT 2 Successor Fig. 2 Example of Successor 2.1 Distributed Hash Table key key value O(1) DHT DHT 1 DHT 1 ID key ID IP value D

IPSJ SIG Technical Report 1, Instrument Separation in Reverberant Environments Using Crystal Microphone Arrays Nobutaka ITO, 1, 2 Yu KITANO, 1

IPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and

Microsoft Word - toyoshima-deim2011.doc

IPSJ SIG Technical Report Vol.2015-CVIM-196 No /3/6 1,a) 1,b) 1,c) U,,,, The Camera Position Alignment on a Gimbal Head for Fixed Viewpoint Swi

21 Key Exchange method for portable terminal with direct input by user

1 UD Fig. 1 Concept of UD tourist information system. 1 ()KDDI UD 7) ) UD c 2010 Information Processing S

2016 [1][2] H.264/AVC HEVC HEVC

,,,,., C Java,,.,,.,., ,,.,, i

1 [1, 2, 3, 4, 5, 8, 9, 10, 12, 15] The Boston Public Schools system, BPS (Deferred Acceptance system, DA) (Top Trading Cycles system, TTC) cf. [13] [

,4) 1 P% P%P=2.5 5%!%! (1) = (2) l l Figure 1 A compilation flow of the proposing sampling based architecture simulation

2.2 6).,.,.,. Yang, 7).,,.,,. 2.3 SIFT SIFT (Scale-Invariant Feature Transform) 8).,. SIFT,,. SIFT, Mean-Shift 9)., SIFT,., SIFT,. 3.,.,,,,,.,,,., 1,

IPSJ SIG Technical Report Vol.2012-IS-119 No /3/ Web A Multi-story e-picture Book with the Degree-of-interest Extraction Function

Table 1. Assumed performance of a water electrol ysis plant. Fig. 1. Structure of a proposed power generation system utilizing waste heat from factori

DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

IHI Robust Path Planning against Position Error for UGVs in Rough Terrain Yuki DOI, Yonghoon JI, Yusuke TAMURA(University of Tokyo), Yuki IKEDA, Atsus

Wavelet HSI / [1] JPEG2000 9/7Wavelet [2][6] 2:1 9/7Wavelet Wavelet 80 Wavelet i

28 Horizontal angle correction using straight line detection in an equirectangular image

鉄鋼協会プレゼン

日本内科学会雑誌第102巻第4号

ばらつき抑制のための確率最適制御

EQUIVALENT TRANSFORMATION TECHNIQUE FOR ISLANDING DETECTION METHODS OF SYNCHRONOUS GENERATOR -REACTIVE POWER PERTURBATION METHODS USING AVR OR SVC- Ju

ax 2 + bx + c = n 8 (n ) a n x n + a n 1 x n a 1 x + a 0 = 0 ( a n, a n 1,, a 1, a 0 a n 0) n n ( ) ( ) ax 3 + bx 2 + cx + d = 0 4


H.264/AVC 2 H.265/HEVC 1 H.265 JCT-VC HM(HEVC Test Model) HM 5 5 SIMD HM 33%

Transcription:

. High-Efficient -pass Video Coding Algorithm based on Macroblock Rate-Distortion Kazuya Yokohari, Atsushi Sagata and Kazuto Kamikura In order to achieve high-efficient video coding, -pass video coding algorithms are proposed. In the existing -pass video coding algorithms, To focus on the feature quantity of the coding result in the frame, the video coding efficiency is grown in the frame. In this paper, we focus on the macroblock coding efficiency (rate-distortion). In the proposed algorithm to achieve high-efficient video coding, the quantization parameters which optimize video coding efficiency of each macroblocks are decided based on the -pass encoding result. Using these quantization parameters at the -pass encoding, the video coding efficiency is grown in the macroblock. The proposed algorithm is able to improve the video coding efficiency by about percents. QVGA (3 4) WVGA (8 48) BS 7 48, 44 8, 9 8 ) ) 3 4 5. NTT NTT Cyber Space Laboratories, NTT Corporation c Information Processing Society of Japan

Lagrange D mb R mb = D picture R picture = constant () 3) () () RD RD RD ( D/ R) QP QP-GenBit. 4) PSNR PSNR 5) AVC/H.64 JM 6) GNU GPL AVC/H.64 x64 ) x, y x, y (x, y) = [l(x, y)] α [c(x, y)] β [s(x, y)] γ () l(x, y) = µxµy + C (3) µ x + µ y + C c(x, y) = σ xσ y + C σ x + σ y + C (4) s(x, y) = σxy + C3 (5) σ x σ y + C 3 l(x, y) c(x, y) s(x, y) µ x, µ y, σ x, σ y, σ xy µ x = N x i (6) µ y = y i (7) N { } σ x = (x i µ x) (8) N { } σ y = (y i µ x) (9) N σ xy = N (x i µ x )(y i µ y ) () µ x, µ y σ x, σ y σ xy C, C, C 3, α, β, γ C = 6.55, C = 58.55, C 3 = C /, α = β = γ = 4). RD RD RD QP 5 RD c Information Processing Society of Japan

情報処理学会研究報告 した 結果を図 に示す プロットの形状から y = xn のグラフとの相関が高いことがわか 3.9 る そこで 発生符号量と の関係を表す RD 特性近似曲線を以下の () 式のように 近似する () 発生符号量.7 f (x) = ( x)a ここで x はマクロブロック最大発生符号量 (3bit) で正規化した発生符号量 f (x) は.5 が入り a はマクロブロックごとに異なる値をとる変数である 任意の QP でマクロ.3 ブロックを符号化した結果得られる発生符号量と を () 式へ与えることで a を決定. し当該マクロブロックの RD 特性近似曲線を求めることが出来る. 上で求めた RD 特性近似曲線 f (x) の精度を検証した 検証方法は以下の通りである あるマクロブロックを つの代表 QP 値で符号化した時の発生符号量 を抽出し 3 3 発生符号量 4 5 6 QP (xk, f (xk )) = (M bbit, ) を代入して ak を求める 次に 求められた当該マクロブロッ 図 QP と発生符号量 Fig. QP-Generated Bits 図 RD 特性曲線 Fig. RD Curve クの RD 特性近似曲線 f (x) = ( x)ak の x に QP が から 5 までの時の実発生符 号量を代入し f (x) 近似 を得る 得られた近似 と 実際に QP を か ら 5 までの値の時に得られる について相関係数を求めた あるマクロブロックについて実際の RD 特性曲線と RD 特性近似曲線をプロットしたも QP-MbBits Curve Approx QP-MbBits Curve のを図 3 に示す RD 特性曲線と RD 特性近似曲線を比べると 同様の傾向を持つ曲線と 代表点として3つのマクロブロックを選定し 相関係数を求めたところ最大で.994 最 GenBit [kbit/s] なっていることが確認出来る 小で.975 という結果が得られた いずれも に非常に近い値が相関係数として得られて いるため f (x) は十分 RD 特性曲線の近似式として適用出来ると考えられる.3 QP-発生符号量曲線 QP-Bit 曲線. RD Curve Approx RD Curve 先に求めた RD 特性近似曲線を用いて決定された第二符号化時の各マクロブロックのター ゲット符号量となるような QP を求めるために QP と発生符号量の関係を表す QP-発生. GenBit 3 4 5 QP 符号量曲線 QP-Bit 曲線 の近似式について検討した 図 3 RD 特性近似曲線 Fig. 3 Approx RD Curve RD 特性近似曲線を定義したときと同様に 様々なマクロブロックについて QP を か 図 4 QP-発生符号量曲線 Fig. 4 QP-Generated Bit Curve ら 5 まで設定した時の QP と発生符号量の関係をグラフにプロットし 実際の QP-発生 符号量の関係を確認する 結果を図 に示す プロットの形状からそれぞれの曲線がほと な QP-GenBit 曲線の平均値を求め その平均値と () 式に対して最小自乗法を用いて んど交差することがないことがわかる そこで 代表的な点を用いて QP と発生符号量の α =.9846975, β = 95.79875 と求めた また γ は y 軸切片を表しており曲線への依 関係を表す QP-Bit 曲線を以下の2次方程式で近似する 存性が強い値であると考えられるため γ を変数として任意の QP とマクロブロック発生符 g(x) = αx + βx + γ 号量を () 式へ与えることで 本の QP-GenBit 曲線を決定する すなわち 第一符号化 () 結果を与えることで各マクロブロックの QP-GenBit 曲線を決定する ここで x は QP g(x) はマクロブロック発生符号量が入る 今回は代表的な点として様々 3 c Information Processing Society of Japan

RD QP-Bit g(x) QP MbBit (x l, g(x l )) = (QP, MbBit) γ l QP-MbBits g(x) = αx + βx + γ l x QP 5 QP QP-Bit QP-Bit 4 QP-Bit QP-Bit 3.999.993 QP-Bit RD g(x) QP-Bit 3. RD D/ R D/ R QP-Bit QP QP 5 5.. 3. 4. E pic = D pic/ R pic E pic 5 Fig. 5 Flow chart (AvePicBit) (Ave) AvePicBit Ave () (3) E pic E(x) = f (x) = a( x) a (3) E pic = a pic ( AveP icbit) apic, here (4) ln( f(x)) ln( Ave) a pic = = ln( x) ln( AveP icbit) 5. RD (x i ) () a i RD f(x i ) = ( x i ) a i, here a i = ln( f(x i)) ln( x i ) i i 6. 5. a i RD 4. E pic (T MbBit i ) (3) E(x i) = E pic a = a i T MbBit i (5) 4 c Information Processing Society of Japan

E(x i ) = a i ( x i ) a i E pic = a i( T MbBit i) a i ( Epic ) a i T MbBit i = (6) a i 7. QP () QP-GenBit QP (QP i ) (MbBit i ) γ i QP-GenBit γ i = MbBit i α QPi β QP i (7) QP-GenBit γ i g(x i ) = T MbBit i QP (T QP i ) T QP i QP 5 8. 7. QP (T QP i ) 4. 3 ITE SDTV (7 48 45 I- JM ver. 4. 3 E pic = D pic / R pic (AvePicBit) (Ave) RD (Soccer Action) 6 Soccer Action QP ±8 3% 5. %.9.7 stenc ndenc.5 4 6 8 GenBit [kbit/s] 6 (Soccer Action) Fig. 6 Experimental Result (Soccer Action) ) Yutaka Yokoyama, Satoshi Nogaki, A Rate Control Method with Pre-Analysis for Real-Time MPEG- Video Coding, IEEE CNF, vol. 3, pp. 54-57, 7- Oct.. ) H.64/AVC, x64, http://www.videolan.org/developers/x64.html (), 3) Seishi Takamura, Naoki Kobayashi, MPEG- one-pass variable bit rate control algorithm and its LSI Implementation, Image Processing,. Proceedings. Internatioanl Conf., vol., pp. 94-945, 7- Oct.. 4) Zhou Wang, Alan Conrad Bovik, Hamid Rahim Sheikh, Eero P. Simoncelli, Image Quality Assessment: From Error Visibility to Structural Similarity, IEEE Transactions on Image Processing, vol. 3, no. 4, pp. 6-6, Apr. 4. 5) Huynh-Thu Q, Ghanbari M, Scope of Validity of PSNR in Image/Video Quality Assessment, IEEE Electronics Letters, Vol. 44, Issue. 3, pp. 8-8, June 9 8. 6) Joint Model Reference Software, JM, http://iphome.hhi.de/suehring/tml/ (), 5 c Information Processing Society of Japan