IPSJ SIG Technical Report Vol.2011-CVIM-177 No /5/ TRECVID2010 SURF Bag-of-Features 1 TRECVID SVM 700% MKL-SVM 883% TRECVID2010 MKL-SVM A

Similar documents
IPSJ SIG Technical Report Vol.2010-CVIM-171 No /3/19 1. Web 1 1 Web Web Web Multiple Kernel Learning(MKL) Web ( ) % MKL 68.8% Extractin

(MIRU2009) cuboid cuboid SURF 6 85% Web. Web Abstract Extracting Spatio-te

bag-of-words bag-of-keypoints Web bagof-keypoints Nearest Neighbor SVM Nearest Neighbor SIFT Nearest Neighbor bag-of-keypoints Nearest Neighbor SVM 84

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. TRECVID2012 Instance Search {sak

28 TCG SURF Card recognition using SURF in TCG play video

IPSJ SIG Technical Report Vol.2010-CVIM-170 No /1/ Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Ta

(MIRU2008) HOG Histograms of Oriented Gradients (HOG)

Google Goggles [1] Google Goggles Android iphone web Google Goggles Lee [2] Lee iphone () [3] [4] [5] [6] [7] [8] [9] [10] :

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2013-CVIM-186 No /3/15 EMD 1,a) SIFT. SIFT Bag-of-keypoints. SIFT SIFT.. Earth Mover s Distance

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

2.2 6).,.,.,. Yang, 7).,,.,,. 2.3 SIFT SIFT (Scale-Invariant Feature Transform) 8).,. SIFT,,. SIFT, Mean-Shift 9)., SIFT,., SIFT,. 3.,.,,,,,.,,,., 1,

Duplicate Near Duplicate Intact Partial Copy Original Image Near Partial Copy Near Partial Copy with a background (a) (b) 2 1 [6] SIFT SIFT SIF

4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q

1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,

(3.6 ) (4.6 ) 2. [3], [6], [12] [7] [2], [5], [11] [14] [9] [8] [10] (1) Voodoo 3 : 3 Voodoo[1] 3 ( 3D ) (2) : Voodoo 3D (3) : 3D (Welc

IPSJ SIG Technical Report Vol.2012-CG-149 No.13 Vol.2012-CVIM-184 No /12/4 3 1,a) ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransa

スライド 1

(MIRU2010) Geometric Context Randomized Trees Geometric Context Rand

Microsoft PowerPoint - SSII_harada pptx

kut-paper-template.dvi

3 2 2 (1) (2) (3) (4) 4 4 AdaBoost 2. [11] Onishi&Yoda [8] Iwashita&Stoica [5] 4 [3] 3. 3 (1) (2) (3)

DEIM Forum 2010 A Web Abstract Classification Method for Revie

IPSJ SIG Technical Report iphone iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Proc

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2011-MBL-57 No.27 Vol.2011-UBI-29 No /3/ A Consideration of Features for Fatigue Es

Optical Flow t t + δt 1 Motion Field 3 3 1) 2) 3) Lucas-Kanade 4) 1 t (x, y) I(x, y, t)

LBP 2 LBP 2. 2 Local Binary Pattern Local Binary pattern(lbp) [6] R

1 (PCA) 3 2 P.Viola 2) Viola AdaBoost 1 Viola OpenCV 3) Web OpenCV T.L.Berg PCA kpca LDA k-means 4) Berg 95% Berg Web k-means k-means

29 jjencode JavaScript

GPGPU

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St

光学

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

Sobel Canny i

% 2 3 [1] Semantic Texton Forests STFs [1] ( ) STFs STFs ColorSelf-Simlarity CSS [2] ii

スライド 1

Input image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L

..,,,, , ( ) 3.,., 3.,., 500, 233.,, 3,,.,, i

1 Kinect for Windows M = [X Y Z] T M = [X Y Z ] T f (u,v) w 3.2 [11] [7] u = f X +u Z 0 δ u (X,Y,Z ) (5) v = f Y Z +v 0 δ v (X,Y,Z ) (6) w = Z +

IS1-09 第 回画像センシングシンポジウム, 横浜,14 年 6 月 2 Hough Forest Hough Forest[6] Random Forest( [5]) Random Forest Hough Forest Hough Forest 2.1 Hough Forest 1 2.2

,,.,.,,.,.,.,.,,.,..,,,, i

2. 30 Visual Words TF-IDF Lowe [4] Scale-Invarient Feature Transform (SIFT) Bay [1] Speeded Up Robust Features (SURF) SIFT 128 SURF 64 Visual Words Ni

IPSJ SIG Technical Report Vol.2010-SLDM-144 No.50 Vol.2010-EMB-16 No.50 Vol.2010-MBL-53 No.50 Vol.2010-UBI-25 No /3/27 Twitter IME Twitte

IPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s

IT,, i

28 Horizontal angle correction using straight line detection in an equirectangular image

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

1 Table 1: Identification by color of voxel Voxel Mode of expression Nothing Other 1 Orange 2 Blue 3 Yellow 4 SSL Humanoid SSL-Vision 3 3 [, 21] 8 325

2) 3) LAN 4) 2 5) 6) 7) K MIC NJR4261JB0916 8) 24.11GHz V 5V 3kHz 4 (1) (8) (1)(5) (2)(3)(4)(6)(7) (1) (2) (3) (4)

The 18th Game Programming Workshop ,a) 1,b) 1,c) 2,d) 1,e) 1,f) Adapting One-Player Mahjong Players to Four-Player Mahjong

yoo_graduation_thesis.dvi

IPSJ SIG Technical Report 1,a) 1,b) 1,c) 1,d) 2,e) 2,f) 2,g) 1. [1] [2] 2 [3] Osaka Prefecture University 1 1, Gakuencho, Naka, Sakai,

本文6(599) (Page 601)

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe

IPSJ SIG Technical Report Vol.2013-CVIM-187 No /5/30 1,a) 1,b), 1,,,,,,, (DNN),,,, 2 (CNN),, 1.,,,,,,,,,,,,,,,,,, [1], [6], [7], [12], [13]., [

TF-IDF TDF-IDF TDF-IDF Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Sat

Abstract This paper concerns with a method of dynamic image cognition. Our image cognition method has two distinguished features. One is that the imag

Automatic Collection of Web Video Shots Corresponding to Specific Actions using Web Images

TRECVID Semantic Indexing TaskとMultimedia Event Detection Taskへの取り組み

DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

paper.dvi

Q [4] 2. [3] [5] ϵ- Q Q CO CO [4] Q Q [1] i = X ln n i + C (1) n i i n n i i i n i = n X i i C exploration exploitation [4] Q Q Q ϵ 1 ϵ 3. [3] [5] [4]

Microsoft Word - toyoshima-deim2011.doc

4.1 % 7.5 %

,,,,., C Java,,.,,.,., ,,.,, i


IPSJ SIG Technical Report Vol.2009-CVIM-167 No /6/10 Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing

Honda 3) Fujii 4) 5) Agrawala 6) Osaragi 7) Grabler 8) Web Web c 2010 Information Processing Society of Japan

Web Stamps 96 KJ Stamps Web Vol 8, No 1, 2004

Computer Security Symposium October ,a) 1,b) Microsoft Kinect Kinect, Takafumi Mori 1,a) Hiroaki Kikuchi 1,b) [1] 1 Meiji U

SURF,,., 55%,.,., SURF(Speeded Up Robust Features), 4 (,,, ), SURF.,, 84%, 96%, 28%, 32%.,,,. SURF, i

IPSJ SIG Technical Report GPS LAN GPS LAN GPS LAN Location Identification by sphere image and hybrid sensing Takayuki Katahira, 1 Yoshio Iwai 1

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

Convolutional Neural Network A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolution

,,, Twitter,,, ( ), 2. [1],,, ( ),,.,, Sungho Jeon [2], Twitter 4 URL, SVM,, , , URL F., SVM,, 4 SVM, F,.,,,,, [3], 1 [2] Step Entered

24 Region-Based Image Retrieval using Fuzzy Clustering

3 1 Table 1 1 Feature classification of frames included in a comic magazine Type A Type B Type C Others 81.5% 10.3% 5.0% 3.2% Fig. 1 A co

[1] SBS [2] SBS Random Forests[3] Random Forests ii

SICE東北支部研究集会資料(2013年)

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came

xx/xx Vol. Jxx A No. xx 1 Fig. 1 PAL(Panoramic Annular Lens) PAL(Panoramic Annular Lens) PAL (2) PAL PAL 2 PAL 3 2 PAL 1 PAL 3 PAL PAL 2. 1 PAL

日本感性工学会論文誌

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

IPSJ SIG Technical Report Vol.2014-GN-90 No.16 Vol.2014-CDS-9 No.16 Vol.2014-DCC-6 No /1/24 1,a) 2,b) 2,c) 1,d) QUMARION QUMARION Kinect Kinect

B HNS 7)8) HNS ( ( ) 7)8) (SOA) HNS HNS 4) HNS ( ) ( ) 1 TV power, channel, volume power true( ON) false( OFF) boolean channel volume int

(VKIR) VKIR VKIR DCT (R) (G) (B) Ward DCT i

IPSJ SIG Technical Report Vol.2009-HCI-134 No /7/17 1. RDB Wiki Wiki RDB SQL Wiki Wiki RDB Wiki RDB Wiki A Wiki System Enhanced by Visibl

第62巻 第1号 平成24年4月/石こうを用いた木材ペレット

130 Oct Radial Basis Function RBF Efficient Market Hypothesis Fama ) 4) 1 Fig. 1 Utility function. 2 Fig. 2 Value function. (1) (2)

IPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26 DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing

2. Twitter Twitter 2.1 Twitter Twitter( ) Twitter Twitter ( 1 ) RT ReTweet RT ReTweet RT ( 2 ) URL Twitter Twitter 140 URL URL URL 140 URL URL

人工知能学会研究会資料 SIG-KBS-B Analysis of Voting Behavior in One Night Werewolf 1 2 Ema Nishizaki 1 Tomonobu Ozaki Graduate School of Integrated B

1 1 CodeDrummer CodeMusician CodeDrummer Fig. 1 Overview of proposal system c


& 3 3 ' ' (., (Pixel), (Light Intensity) (Random Variable). (Joint Probability). V., V = {,,, V }. i x i x = (x, x,, x V ) T. x i i (State Variable),

( ) [1] [4] ( ) 2. [5] [6] Piano Tutor[7] [1], [2], [8], [9] Radiobaton[10] Two Finger Piano[11] Coloring-in Piano[12] ism[13] MIDI MIDI 1 Fig. 1 Syst

ipod touch 1 2 Apple ipod touch ipod touch 3 ( ) ipod touch ( 1 ) Apple ( 2 ) Web 1),2) 3. ipod touch 1 2 ipod touch x y z i

知能と情報, Vol.30, No.5, pp

1: A/B/C/D Fig. 1 Modeling Based on Difference in Agitation Method artisoc[7] A D 2017 Information Processing

Transcription:

1 1 TRECVID2010 SURF Bag-of-Features 1 TRECVID SVM 700% MKL-SVM 883% TRECVID2010 MKL-SVM Analysis of video data recognition using multi-frame Kazuya Hidume 1 and Keiji Yanai 1 In this study, we aim to verify the effectiveness of a multi-frame method for shot recognition proposed in recent years. In the experiments, we extract SURF, color and spatio- temporal features from the TRECVID 2010 video data, and convert them the Bag-of-Features(BoF) representation. In the multiframe method unlike the conventional method to extract features from only one keyframe, features are extracted from multiple frames which are selected from the video, and one BoF feature vector is generated by integrating these features. In the experiment, we use five kinds of concepts out of 130 TRECVID2010 target concepts and analyze recognition performance in various settings in terms of the number of frames selected from one shot. As a result, compared to the conventional method, the recognition accu- racy in classifying by SVM raised 700% at most and by MKL-SVM advance 883%. The result of MKL-SVM in all class outperformed the average of all the teams in TRECVID2010. 1. WEB TRECVID 1) TRECVID Web TRECVID 2. TRECVID TRECVID TREC Video Retrieval EvaluationNIST Disruptive Technology Office(DTO) TRECVID 2010 TRECVID2010 6 Semantic indexing SIN SIN 1 Department of Informatics, Graduate School of Informatics and Engineering, The University of Electro-Communications 1 c 2011 Information Processing Society of Japan

1 TRECVID2010 Semantic indexing TRECVID2010 130 1 MPEG-4/H.264 200 3. TRECVID 2008 2) 4 SIFT 7 2009 10 Multiple Kernel Learning SR-KDA(Spectral Regression combined with Kernel Discriminant Analysis) 3) MK-FDA(Multiple Kernel Fisher Discriminant Analysis) 4) TRECVID 5) 2009 SIFT (MFCC) 4 4. 4.1 2 2 1 2 3 1 1 N 1 M 2 c 2011 Information Processing Society of Japan

Airplane Flying Bus Hand 3 N + 1 M M N ( 1)/M + 1 N M N = 3, 5, 10, M = 30, 15, 10 M 30 / 4.2 4 SURF Bag-of-Features SURF 2 2 Bag-of-Features SVM MKL-SVM 4 SVM RBF-χ 2 2000 5. 5.1 Bag-of-Features Bag-of-Features Bag-of-Words Bag-of-Words Bag-of-Features Bag-of-Features ( 1 ) ( 2 ) k k visual words visual words codebook ( 3 ) visual words 3 c 2011 Information Processing Society of Japan

( 4 ) bin ( SURF ) Bag-of-Features Bag-of-Features codebook 5.2 RGB RGB 3 Bag-of-Features visual words 5.3 SURF SURF(Speeded-Up Robust Feature) 6) 64 128 SIFT 7) SIFT 128 SURF SIFT SIFT SURF 5000 TRECVID2010 Web 5.4 8) Web step1 step2 step3 step4 step5 step6 SURF SURF Delaunay Web Lucas-Kanade 9) SURF SURF Dlauney Dlauney 3 SURF 64 3 = 192 SURF N SURF N/2 N M Lucas-Kanade x,y 5 N = 5 M = 5 5 20 3 + 5 = 65 SURF 192 + 65 = 257 5.5 Bag-of-Features Lazebnik 10) Bag-of-Features 4 c 2011 Information Processing Society of Japan

SURF 6. SVM(Support Vector Machine) MKL-SVM(Multiple Kernel Learning SVM) 6.1 Support Vector Machine SVM(Support Vector Machine) 2 SVM x φ(x) RBF-χ 2 K(x, y) = exp ( 1 2σ 2 6.2 Multiple Kernel Learning i ) x i y i 2 x i + y i MKL(Multiple Kernel Learning) SVM () K K K combined (x, x ) = β jk j(x, x ) β j 0, β j = 1 (2) j=1 β j MKL MKL β j β j MKL Sonnenburg 11) SVM β j j=1 (1) 7. 7.1 1 Airplane Flying 66 1000 Boat Ship 172 2500 Bus 31 1000 Cityscape 558 5000 Classroom 139 2500 TRECVID TRECVID2010 5 7.1 144,988 7.2 TRECVID2006 (Inferred Average Precision : infap) ( )/( ) N k P recision(k) infap = 1 N N P recision(k) (3) k=1 ( )/( ) TRECVID2010 TRECVID2010 Semantic Indexing 2000 7.3 SURF 3 Bag-of-Features codebook Bag-of- Features 500 Bag-of-Features 5 c 2011 Information Processing Society of Japan

2 M / M M=30 M=15 M=10 Airplane Flying 4.06/5.45 7.62/10.44 11.15/15.45 Boat Ship 5.40/6.05 10.60/11.72 15.82/17.46 Bus 19.06/5.23 37.65/10.08 56.55/14.94 Cityscape 4.80/7.77 9.20/15.23 13.61/22.67 Classroom 14.11/7.29 27.95/14.23 41.94/21.21 4.99 9.48 14.01 3 N M N=1 N=3 N=5 N=10 M=30 M=15 M=10 Airplane Flying 0.0076 0.0263 0.0282 0.0301 0.0331 0.0608 0.0400 0.0414 Boat Ship 0.0294 0.0334 0.0282 0.0278 0.0220 0.0270 0.0276 0.0274 Bus 0.0021 0.0028 0.0030 0.0032 0.0034 0.0034 0.0043 0.0043 Cityscape 0.0586 0.0818 0.0746 0.0466 0.0785 0.0796 0.0722 0.0889 Classroom 0.0002 0.0006 0.0003 0.0003 0.0012 0.0014 0.0004 0.0006 2 2 500 500 4 = 2000 SURF 2000 2 + 500 = 4500 SVM MKL-SVM RBF-χ 2 7.3.1 3 ( 1 ) ( 2 ) ( 3 ) MKL-SVM SVM M 2 MKL-SVM MKL TRECVID2010 (median) (max) 7.4 SVM 3 5 3 N = 1 4 MKL-SVM N M TRECVID2010 N=1 N=3 N=5 N=10 M=30 M=15 M=10 median max Airplane Flying 0.0114 0.0373 0.420 0.0429 0.0654 0.0742 0.0678 0.0675 0.017 0.141 Boat Ship 0.0528 0.0324 0.0322 0.0294 0.0231 0.0291 0.0286 0.0276 0.018 0.165 Bus 0.0035 0.0023 0.0035 0.0031 0.0024 0.0029 0.0036 0.0030 0.002 0.032 Cityscape 0.0996 0.1236 0.1250 0.1251 0.1255 0.1216 0.1235 0.1264 0.045 0.21 Classroom 0.0006 0.0017 0.0027 0.0031 0.0048 0.0059 0.0029 0.0032 0.002 0.116 Airplane Flying 700% M Boat Ship M 1 M=15 MKL-SVM 4 6 4 N MKL 7 Classroom MKL-SVM 883% Bus 4 SVM MKL-SVM Boat Ship MKL-SVM 6 c 2011 Information Processing Society of Japan

5 SVM 6 MKL-SVM N=1 SURF 5000 TRECVID2010 320 240 1 76800 Bag-of-Features 1% TRECVID2010 MKL-SVM SVM Classroom 4 3 1 codebook 4096 8. TRECVID2010 Se- 7 N mantic indexing 2 SURF Semantic indexing N M TRECVID2010 SVM 700% MKL-SVM 883% 7 c 2011 Information Processing Society of Japan

情報処理学会研究報告 た また TRECVID2010 の全チームの平均値と比較した結果 MKL-SVM を使用した設 10) S.Lazebnik, C.Schmid, and J.Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In Proc. of IEEE Computer Vision and Pattern Recognition, pp. 2169 2178, 2006. 11) S.Sonnenburg, G.Ra tsch, C.Scha fer, and B.Scho lkopf. Large Scale Multiple Kernel Learning. The Journal of Machine Learning Research, Vol.7, pp. 1531 1565, 2006. 定のいくつかで 5 クラスすべての値を上回る結果を得た 以上の点から ショット認識にお けるマルチフレーム手法の有効性を確認することができた 9. 今後の課題 実験ではより多くのフレームから特徴量を抽出した場合 逆に精度が悪くなるということ 付 が起こった 単純なショット中の位置でフレームを抽出するだけでなく より有用なフレー 録 MKL-SVM N=10 で実行した上位 15 ショットを示す 赤枠は正解のショットである ムを選択できるようにすることも重要である 例えば 時空間特徴の抽出で行ったような 選択した他のフレーム (この場合は既に抽出した前のフレーム) との差異を計算した場合に 一定以上特徴量が異なれば新たなフレームとして抽出する といったことが考えられる ま た そのフレームの差異が大きければ基本的な取得フレーム間隔を小さくし 逆に差異が小 さければ間隔を大きくする つまりフレームの変化によって取得間隔をショット毎に変更す るということもできる 参 考 文 献 1) TRECVID Home Page. http://www-nlpir.nist.gov/projects/trecvid/. 2) C.G.M. Snoek, KEA vande Sande, O.deRooij, etal. The mediamill trecvid 2008 semantic video search engine. In Proc. of TRECVID Workshop, 2008. 3) D.Cai, X.He, and J.Han. Efficient kernel discriminant analysis via spectral regression. In Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on, pp. 427 432. IEEE, 2008. 4) J.Ye, S.Ji, and J.Chen. Multi-class discriminant kernel learning via convex programming. The Journal of Machine Learning Research, Vol.9, pp. 719 758, 2008. 5) N.Inoue, S.Hao, T.Saito, K.Shinoda, I.Kim, and C.H. Lee. Titgt at trecvid 2009 workshop. In Proc. of TRECVID Workshop, Vol.2, 2009. 6) H.Bay, T.Tuytelaars, and L.VanGool. SURF: Speeded up robust features. In Proc. of European Conference on Computer Vision, pp. 404 415, 2006. 7) D.G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, Vol.60, No.2, pp. 91 110, 2004. 8) 野口顕嗣, 柳井啓司. 動きの連続性を考慮した動画からの局所的な時空間特徴の抽出. In MIRU, 2009. 9) B.D. Lucas and T.Kanade. An iterative image registration technique with an application to stereo vision. In International joint conference on artificial intelligence, Vol.3, pp. 674 679. Citeseer, 1981. 図8 Airplane Flying 図 9 Boat Ship 図 10 図 11 Cityscape Bus 図 12 Classroom 8 c 2011 Information Processing Society of Japan