12 DCT A Data-Driven Implementation of Shape Adaptive DCT

Similar documents
28 Horizontal angle correction using straight line detection in an equirectangular image

soturon.dvi

2016 [1][2] H.264/AVC HEVC HEVC

2017 (413812)

2. CABAC CABAC CABAC 1 1 CABAC Figure 1 Overview of CABAC 2 DCT 2 0/ /1 CABAC [3] 3. 2 値化部 コンテキスト計算部 2 値算術符号化部 CABAC CABAC

7,, i

kut-paper-template.dvi

23 Fig. 2: hwmodulev2 3. Reconfigurable HPC 3.1 hw/sw hw/sw hw/sw FPGA PC FPGA PC FPGA HPC FPGA FPGA hw/sw hw/sw hw- Module FPGA hwmodule hw/sw FPGA h

,,,,., C Java,,.,,.,., ,,.,, i

20 Method for Recognizing Expression Considering Fuzzy Based on Optical Flow

Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels).

4.1 % 7.5 %

Fig. 1 Schematic construction of a PWS vehicle Fig. 2 Main power circuit of an inverter system for two motors drive

i

21 Quantum calculator simulator based on reversible operation

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

1 1 tf-idf tf-idf i

25 Removal of the fricative sounds that occur in the electronic stethoscope

08-特集04.indd

Web Web Web Web Web, i

Firewall IDS IP IP 1 HTTP 74% Quick Search 32 bit DDMP Mbps URL Filtering 59.3 Mbps i

untitled

25 D Effects of viewpoints of head mounted wearable 3D display on human task performance

06’ÓŠ¹/ŒØŒì

幅広い業種や業務に適応可能なRFIDソリューション

A Feasibility Study of Direct-Mapping-Type Parallel Processing Method to Solve Linear Equations in Load Flow Calculations Hiroaki Inayoshi, Non-member

1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,


(Visual Secret Sharing Scheme) VSSS VSSS 3 i

58 10

n 2 n (Dynamic Programming : DP) (Genetic Algorithm : GA) 2 i

21 Key Exchange method for portable terminal with direct input by user

卒業論文2.dvi

29 jjencode JavaScript

28 TCG SURF Card recognition using SURF in TCG play video


THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.

DTN DTN DTN DTN i

Virtual Window System Virtual Window System Virtual Window System Virtual Window System Virtual Window System Virtual Window System Social Networking

ï\éÜA4*

スライド 1

i


Wide Scanner TWAIN Source ユーザーズガイド

,,,,,,,,,,,,,,,,,,, 976%, i


LAN LAN LAN LAN LAN LAN,, i

社会学部紀要 114号☆/22.松村

Web Web ID Web 16 Web Web i

17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System

23 The Study of support narrowing down goods on electronic commerce sites

24 FFT Self-Timeed Pipeline Implementation of Adaptive FFT for Different Rate Signals

Web-ATMによる店舗向けトータルATMサービス

13 RoboCup The Interface System for Learning By Observation Applied to RoboCup Agents Ruck Thawonmas

2 ( ) i


23 Study on Generation of Sudoku Problems with Fewer Clues

携帯電話向け画像処理LSI:Milbeaut Mobile

1 [1, 2, 3, 4, 5, 8, 9, 10, 12, 15] The Boston Public Schools system, BPS (Deferred Acceptance system, DA) (Top Trading Cycles system, TTC) cf. [13] [

( )

29 Short-time prediction of time series data for binary option trade

SOM SOM(Self-Organizing Maps) SOM SOM SOM SOM SOM SOM i

Sport and the Media: The Close Relationship between Sport and Broadcasting SUDO, Haruo1) Abstract This report tries to demonstrate the relationship be

Wi-Fi Wi-Fi Wi-Fi Wi-Fi SAS SAS-2 Wi-Fi i

Web Basic Web SAS-2 Web SAS-2 i

デジタルカメラ用ISP:Milbeaut

21 e-learning Development of Real-time Learner Detection System for e-learning

fiš„v8.dvi

P2P P2P Winny 3 P2P P2P 1 P2P, i

Web Web Web Web i

, IT.,.,..,.. i

paper.dvi

(SAD) x86 MPSADBW H.264/AVC H.264/AVC SAD SAD x86 SAD MPSADBW SAD 3x3 3 9 SAD SAD SAD x86 MPSADBW SAD 9 SAD SAD 4.6

,.,.,,.,. X Y..,,., [1].,,,.,,.. HCI,,,,,,, i

2 1 ( ) 2 ( ) i

udc-2.dvi

86 7 I ( 13 ) II ( )

149 (Newell [5]) Newell [5], [1], [1], [11] Li,Ryu, and Song [2], [11] Li,Ryu, and Song [2], [1] 1) 2) ( ) ( ) 3) T : 2 a : 3 a 1 :

KII, Masanobu Vol.7 No Spring

大学における原価計算教育の現状と課題

IPSJ SIG Technical Report Vol.2017-ARC-225 No.12 Vol.2017-SLDM-179 No.12 Vol.2017-EMB-44 No /3/9 1 1 RTOS DefensiveZone DefensiveZone MPU RTOS

25 II :30 16:00 (1),. Do not open this problem booklet until the start of the examination is announced. (2) 3.. Answer the following 3 proble

Wavelet HSI / [1] JPEG2000 9/7Wavelet [2][6] 2:1 9/7Wavelet Wavelet 80 Wavelet i

22 Google Trends Estimation of Stock Dealing Timing using Google Trends

WebRTC P2P Web Proxy P2P Web Proxy WebRTC WebRTC Web, HTTP, WebRTC, P2P i

Webサービス本格活用のための設計ポイント

1 3DCG [2] 3DCG CG 3DCG [3] 3DCG 3 3 API 2 3DCG 3 (1) Saito [4] (a) 1920x1080 (b) 1280x720 (c) 640x360 (d) 320x G-Buffer Decaudin[5] G-Buffer D

JOURNAL OF THE JAPANESE ASSOCIATION FOR PETROLEUM TECHNOLOGY VOL. 66, NO. 6 (Nov., 2001) (Received August 10, 2001; accepted November 9, 2001) Alterna

IT i


( ) [1] [4] ( ) 2. [5] [6] Piano Tutor[7] [1], [2], [8], [9] Radiobaton[10] Two Finger Piano[11] Coloring-in Piano[12] ism[13] MIDI MIDI 1 Fig. 1 Syst

橡表紙参照.PDF

(MIRU2008) HOG Histograms of Oriented Gradients (HOG)

, (GPS: Global Positioning Systemg),.,, (LBS: Local Based Services).. GPS,.,. RFID LAN,.,.,.,,,.,..,.,.,,, i

1 DHT Fig. 1 Example of DHT 2 Successor Fig. 2 Example of Successor 2.1 Distributed Hash Table key key value O(1) DHT DHT 1 DHT 1 ID key ID IP value D

IPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and

2 Poisson Image Editing DC DC 2 Poisson Image Editing Agarwala 3 4 Agarwala Poisson Image Editing Poisson Image Editing f(u) u 2 u = (x

NINJAL Research Papers No.3

GPGPU

Transcription:

12 DCT A Data-Driven Implementation of Shape Adaptive DCT 1010431 2001 2 5

DCT MPEG H261,H263 LSI DDMP [1]DDMP MPEG4 DDMP MPEG4 SA-DCT SA-DCT DCT SA-DCT DDMP SA-DCT MPEG4, DDMP,, SA-DCT,, ο i

Abstract A Data-Driven Implementation of Shape Adaptive DCT MASAKAZU HASHIMOTO In recent years, picture compression and extension systems, such as MPEG, H261 and H263, etc., have become standards. Also, We are improving the performance of DDMP (Data Driven Multimedia Processor) as an one chip processor for image processing[1]. DDMP is a programmable device and it has enough flexibility for the various application tools of MPEG4. This research proposes a method of data driven type parallel processing for SA-DCT which is heavy tasks in image one of compression and extension the good of this paper is optimizing DDMP to MPEG4 functions. Shift operations of pixels are required by SA-DCT additional ordinal DCT. The method solves some problems in the current DDMP instruction set to utilize the parallelism of SA-DCT. Therefore, the search and examination on the bottleneck of SA-DCT and its solution is urgently required. The improvement in the speed by addition of the new instruction is proposed, and the possibility of realizations shown by a description of simple hardware mechanisms. The new added instructions should give more flexibility and can be applied in various fields of application. In this paper, a rough estimation of hardware cost is describes. key words MPEG4, DDMP, data-driven, SA-DCT, parallel processing, instruction ii

1 1 2 SA-DCT 5 2.1... 5 2.2 SA-DCT... 6 2.3 SA-DCT... 7 2.4 DDMP... 10 2.5 SA-DCT... 11 3 SA-DCT 13 3.1... 13 3.2... 14 3.2.1... 15 3.2.2... 15 3.3... 16 3.3.1 (h-read, v-read)... 16 3.3.2 2 (brpx, brln)... 19 3.3.3... 22 3.4 SA-DCT... 22 4 24 4.1... 24 4.2... 24 iii

4.3... 26 4.4... 27 4.4.1 SA-DCT... 27 4.4.2 DCT... 28 5 29 31 32 iv

1.1... 3 1.2... 4 2.1 SA-DCT... 6 2.2 SA-DCT... 7 2.3 SA-DCT... 9 2.4 DDMP... 10 2.5 DDMP-4G (OCP)... 11 3.1... 16 3.2 h-rad... 17 3.3 v-read... 18 3.4 2... 19 3.5 brpx... 20 3.6 brln... 21 3.7 SA-DCT ()... 23 v

3.1 ( DCT SA-DCT )... 13 4.1 ( SA-DCT )... 27 4.2 ( DCT )... 28 vi

1 MPEG H261,H263 TV DVD MPEG ISO/IEO (Moving Picture coding Expert Group) 1988 MPEG1MPEG2MPEG4 [2][3] MPEG1 CD-ROM MPEG2 MPEG4 TV MPEG4 MPEG4 MPEG4version2 MPEG4version2 3 MPEG4 1

MPEG4 MPEG1,2 DCT(Discrete Cosine Transform) 1 DCT () () DCT Shape Adaptive DCT (SA-DCT) SA-DCT MPEG4 SA-DCT DDMP (Data-Driven Multimedia Processors) 1 8600MOPS(Mega Operations Per Second) DDMP-4G MPEG4 VOP(Video Object Plane: ) 8 8 DCT DCT 1.1 DC AC 2

DCT 1.1 VOP VOP DCT 2 8 DC 8 1.1 1.2 VOP VOP VOP VOP VOP MPEG4-version2 DCT SA-DCT [4]SA-DCT DCT 3

DC DCT DCT DCT 0 VOP VOP VOP 1.2 SA-DCT 2 DCT 2 SA-DCT DDMP SA-DCT DDMP 3 DDMP DDMP-4G SA-DCT 4 3 4

2 SA-DCT 2.1 SA-DCT SA-DCT SA-DCT 8 8 2.1 3 1 () SA-DCT 3 1. VOP 2. DCT 3. 5

2.2 SA-DCT 2.1 SA-DCT 2.2 SA-DCT SA-DCT DCT SA-DCT DCT 2.2 VOP SA-DCT 1 DCT 1 DCT [5] DCT MPEG4 6

2.3 SA-DCT 188 DCT VOP + DCT + DCT 2.2 SA-DCT 2.3 SA-DCT 2.2 VOP SA-DCT 8 1 7

2.3 SA-DCT DCT 1 8 DCT DCT 2.3 1 1 1 DCT18 8 DCT DCT 8

2.3 SA-DCT 188 DCT VOP SA-DCT DCT DCT DCT + DCT SA-DCT DCT + DCT DCT DCT 2.3 SA-DCT 9

2.4 DDMP 2.4 DDMP DDMP DDMP 1 8600MOPS DDMP 2 () 2 DDMP 1 2.4 DDMP x y z DDMP t 2.4 DDMP 10

2.5 SA-DCT 2.5 SA-DCT DDMP DCT DDMP 2.3 DDMP-4G SA-DCT DDMP-4G 1 64 2.5 DDMP-4G OCP(Operation and Control Processor) DDMP-4G DCT PE MUL,INT(, ) IO() PE MUL(SYC) PE TBL ETM ( ) () INT ( ) GNT 2.5 DDMP-4G (OCP) DDMP 2 () 11

2.5 SA-DCT 1 1 /SA-DCT DCT DCT SA-DCT DCT DCT (N a N 2 ) 2 N 2 1 N 2 + N1N 2 2 [6] DCT SA-DCT SA-DCT DCT DCT [4] SA-DCT DCT DCT DCT 12

3 SA-DCT 3.1 DDMP-4G SA-DCT SA-DCT DDMP-4G DDMP-4G 3.1 ( DCT SA-DCT ) DCT 1 2 3 4 5 6 7 8 DCT 1 1 1 1 1 1 1 1 SA-DCT 6.3 6.3 6.6 6.9 7.2 8.1 9.3 8.4 DCT SA-DCT 3.1 SA-DCT () DCT DCT SA-DCT DCT SA-DCT 6 10 2.5 DDMP-4G 13

3.2 3.2 SA-DCT (a) 1 SA-DCT SA-DCT (b) 2 DDMP DDMP 3 2 (c) 3 DDMP 1 1 1 and or 1. 2. 14

3.2 3.2.1 1 2.3 (a) DDMP DDMP-4G () DDMP-4G 1 DDMP 3.2.2 SA-DCT DCT DCT 8 1 1 (b),(c) 15

3.3 3.3 DDMP SA-DCT 3.3.1 (h-read, v-read) (c) h-read, v-read v (0,0) (0,n-1) h token:= [line,pixel,data] 0,0,0 [i,j,n] [i,j+n-1,an] : [i,j+2,a3] [i,j+1,a2] [i,j,a1] h-read (i,j) n (px) h-read:horizontally read 0,0,0 [i,j,n] [i+n-1,j,an] : [i+2,j,a3] [i+1,j,a2] [i,j,a1] v-read (i,j) n (ln) v-read:vertically read 3.1 16

3.3 ffl h-read 3.1 (i=, j=) ( ) n= ( + j) = ) n () ffl v-read v-read n= ( + i) = ) n () (line,pixel,data) =(1,0,0) h-read (1,0,8) Data+pixel= 0+0= 0 (line,pixel,data) =(1,0,0) h-read 0 8 0 7 0x0000000f 0x000000ff0xffffffff (1,1,0x0000000f) (1,0,0x000000ff)(1,7,0xffffffff) pixel0 pixel 3.2 h-rad 17

3.3 3.2 (1,0,0)=(,, ) (a,0,8) h-read 07 8 (line,pixel,data) =(1,0,0) v-read (1,0,8) Data+line= 0+1= 1 (line,pixel,data) =(1,0,0) (2,0,0x00000fff) (1,0,0x000000ff)(8,0,0x00000001) v-read 1 8 line1 line 0 1 7 8 0x0000000f 0x000000ff0xffffffff 0x00000001 3.3 v-read 3.3 v-read 07 8 1 () 18

3.3 3.3.2 2 (brpx, brln) (b) brpx, brln 2 [i,j,n] [i,j,a] brpx aj a a+1 a+m m=max(j) brpx:branch by pixel num. [i,j,n] [i,j,a] brln ai a a+1 a+m m=max(i) brln:branch by line num. 3.4 2 ffl brpx 3.4 a ( j) n (a + j) = ) n 19

3.3 ffl brln brln a ( i) n (a + i) = ) n (line,pixel,data) =(1,0,1) brpx (1,0,2) pixel+data= 0+2 = 2 1 2 3 DCT1 DCT2 DCT8 (line,pixel,data) =(1,0,1) brpx (1,0,1) 1 2 3 DCT1 DCT2 DCT8 3.5 brpx 3.5 (1,0,1)=(,, ) (1,0,2) brpx 2 0 2 20

3.3 (line,pixel,data) =(1,0,1) brln (1,0,2) line+data= 1+2 = 3 1 2 3 DCT1 DCT2 DCT8 (line,pixel,data) =(1,0,1) brln (1,0,1) 1 2 3 DCT1 DCT2 DCT8 3.6 brln 3.6 brln 2 1 3 DCT 21

3.4 SA-DCT 3.3.3 4 SA-DCT 1 2 0 (brpx brln ) 3.4 SA-DCT 3.7 SA-DCT 2 DCT 2 DCT DCT 2 (brpx) DCT 1 DCT DCT 1 1 (v-read) 1 DCT 1 DCT 22

3.4 SA-DCT SA-DCT DCT (brpx) DCT DCT DCT DCT (h-read) SA-DCT DCT (brln) DCT DCT DCT DCT 3.7 SA-DCT () 23

4 4.1 DDMP-4G DDMP 1 1 DDMP-4G 12bit 4.2 ffl 1 DDMP () ffl 1 24

4.2 ( DCT) ffl DDMP 1 () () 64 (8 8) 16 (4 4) 1024 64(1 ) 16() = 1024() 30 30() 1024() = 30720 0,4,8,16,32,64 7 25

4.3 4.3 (1) : () (2) : (3) : 1 (DCT =s) 1 (fps) 1 (s) (1) (2) SA-DCT (3) 1 SA-DCT 1 26

4.4 4.4 SA-DCT DDMP-4G 4.4.1 SA-DCT SA-DCT SA-DCT DDMP-4G SA-DCT 4.1 SA-DCT 0 1 SA-DCT 1 DCT 4.1 ( SA-DCT ) SA-DCT SA-DCT 0 1.27 1 4 1.32 1.04 8 1.36 1.08 16 1.49 1.21 32 1.55 1.27 64 1.66 1.38 27

4.4 4.4.2 DCT DCT DCT DCT DCT SA-DCT 4.2 DCT 1 DCT DCT 1 SA-DCT 4.2 ( DCT ) DCT SA-DCT 0 1 2.05 4 1 2.14 8 1 2.23 16 1 2.49 32 1 2.62 64 1 2.83 28

5 MPEG4 SA-DCT 2 SA-DCT DDMP SA-DCT 3 SA-DCT DDMP SA-DCT 4 SA-DCT SA-DCT DCT SA-DCT DCT DCT Java SA-DCT 29

DCT SA-DCT DDMP SA- DCT 30

31

[1] H. Terada, S. Miyata, and M. Iwata, DDMP's: self-timed super-pipelined datadriven multimedia processors," Proc. of IEEE, 87(2), 282 296 (1999). [2], MPEG-4," (1999). [3] K. R. Rao, and J. J. Hwang,,,," (1999). [4],,,, MPEG-4,", 53(4), 485 491 (1999). [5],,,, MPEG-4 SA-DCT VLSI," TECHNICAL REPORT OF IEICE., 2000(35), (2000). [6] W. H. Chen, C. H. Smith, and S. C. Fralick, A Fast Computational Algorithm for the Discrete Cosine Transform," IEEE Trans. Commun., 25(9), 1004 1009 (1997). 32