IPSJ SIG Technical Report (Information Processing Society of Japan) Vol.2013-CVIM-186 No /3/15


A method of similar image retrieval system using EMD and SIFT

Hoshiga Fumito 1,a)  Higuchi Tatsuya 1  Nakajima Yuma 1  Shishibori Masami 1

Abstract: Content-based image retrieval methods that use SIFT features, which are local features of an image, have been studied actively in recent years. Bag-of-keypoints is a well-known retrieval technique based on SIFT features. However, because it quantizes all SIFT features extracted from an image into a single fixed-length feature vector, it cannot take the position of each SIFT feature in the image into account. The proposed method applies a color segmentation module to divide the image into regions of similarly colored pixels, and then builds a fixed-length feature vector from the SIFT features in each region. The Euclidean distance cannot be used with this representation, because the number of color-segmented regions is not fixed and the length of the resulting vector therefore varies from image to image. To solve this problem, the method uses the Earth Mover's Distance (EMD) as the distance measure instead of the Euclidean distance.

Keywords: Bag-of-keypoints, SIFT, EMD, Content-based image retrieval methods

a) hoshiga-fumito@iss.tokushima-u.ac.jp

1.

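2. Bag-of-keypoints

Bag-of-keypoints is built on SIFT (Scale Invariant Feature Transform) features.

2.1 SIFT

SIFT was proposed by Lowe [1]. Each keypoint is described by a 128-dimensional feature vector (Fig. 1).

2.2 Bag-of-keypoints

Bag-of-keypoints quantizes the 128-dimensional SIFT features of an image into visual words and represents the image by its visual words (Fig. 2).

3.

Bag-of-keypoints turns all 128-dimensional SIFT features of an image into one fixed-length vector. The proposed method instead uses EMD (Earth Mover's Distance) as the distance measure.

3.1 EMD

The Earth Mover's Distance (EMD) is a distance measure between two distributions. Let P and Q be distributions with m and n features, respectively:

$$ P = \{(p_1, w_{p_1}), \ldots, (p_m, w_{p_m})\} \quad (1) $$

$$ Q = \{(q_1, w_{q_1}), \ldots, (q_n, w_{q_n})\} \quad (2) $$

Here p_i is the i-th feature of P and w_{p_i} is its weight; likewise, q_j is the j-th feature of Q and w_{q_j} is its weight. The distance between feature i of P and feature j of Q is written d_{ij}.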

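The distance d_{ij} between features p_i and q_j is

$$ d_{ij} = \lVert p_i - q_j \rVert \quad (3) $$

Let F = \{f_{ij}\} be the flow, where f_{ij} is the amount moved from i to j. The total work is

$$ \mathrm{WORK}(P, Q, F) = \sum_{i=1}^{m} \sum_{j=1}^{n} d_{ij} f_{ij} \quad (4) $$

The flow that minimizes the work is computed under constraints (5) to (8):

$$ f_{ij} \ge 0, \quad (1 \le i \le m,\ 1 \le j \le n) \quad (5) $$

$$ \sum_{j=1}^{n} f_{ij} \le w_{p_i}, \quad (1 \le i \le m) \quad (6) $$

$$ \sum_{i=1}^{m} f_{ij} \le w_{q_j}, \quad (1 \le j \le n) \quad (7) $$

$$ \sum_{i=1}^{m} \sum_{j=1}^{n} f_{ij} = \min\left( \sum_{i=1}^{m} w_{p_i},\ \sum_{j=1}^{n} w_{q_j} \right) \quad (8) $$

EMD(P, Q) is the minimum work normalized by the total flow:

$$ \mathrm{EMD}(P, Q) = \frac{\min \mathrm{WORK}(P, Q, F)}{\sum_{i=1}^{m} \sum_{j=1}^{n} f_{ij}} \quad (9) $$

Fig. 3: EMD

3.2 Bag-of-keypoints + EMD

The proposed method combines EMD with Bag-of-keypoints. The procedure is as follows.

1. SIFT features are extracted with cv::SiftFeatureDetector and cv::SiftDescriptorExtractor of OpenCV 2.4.2.
2. Visual words are built with k-means.
3. The image is divided into color regions with ImageMagick (Fig. 5).
4. A visual-words feature vector is built for each region.
5. The EMD between the images is computed from the region feature vectors of step 4 (Fig. 7).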
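Steps 4 and 5 and Eqs. (3) to (9) can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`euclid`, `region_signature`, `emd`) are hypothetical, the weight of a region is assumed to be its share of the image's keypoints (the paper's weight definition did not survive extraction), and the transportation problem is solved with a simple successive-shortest-path min-cost flow rather than whatever solver the authors used. Descriptors are toy low-dimensional tuples instead of 128-dimensional SIFT vectors.

```python
import math

def euclid(p, q):
    """Ground distance d_ij of Eq. (3): Euclidean norm of p - q."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def region_signature(descriptors, region_ids, codebook):
    """Build [(histogram, weight), ...], one entry per color region.
    ASSUMPTION: weight = fraction of the image's keypoints in the region."""
    hists = {}
    for desc, rid in zip(descriptors, region_ids):
        hist = hists.setdefault(rid, [0.0] * len(codebook))
        # Assign the descriptor to its nearest visual word.
        word = min(range(len(codebook)), key=lambda k: euclid(desc, codebook[k]))
        hist[word] += 1.0
    total = len(descriptors)
    signature = []
    for rid in sorted(hists):
        count = sum(hists[rid])
        signature.append(([c / count for c in hists[rid]], count / total))
    return signature

def emd(P, Q):
    """EMD of Eq. (9) for signatures P, Q given as [(feature, weight), ...]."""
    m, n = len(P), len(Q)
    S, T, V = m + n, m + n + 1, m + n + 2      # source, sink, node count
    graph = [[] for _ in range(V)]             # edge: [to, capacity, cost, rev]

    def add_edge(u, v, cap, cost):
        graph[u].append([v, cap, cost, len(graph[v])])
        graph[v].append([u, 0.0, -cost, len(graph[u]) - 1])

    for i, (p, w) in enumerate(P):
        add_edge(S, i, w, 0.0)                 # supply limit, Eq. (6)
        for j, (q, _) in enumerate(Q):
            add_edge(i, m + j, math.inf, euclid(p, q))
    for j, (_, w) in enumerate(Q):
        add_edge(m + j, T, w, 0.0)             # demand limit, Eq. (7)

    eps, flow, work = 1e-12, 0.0, 0.0
    while True:
        # Bellman-Ford: cheapest augmenting path in the residual graph.
        dist, prev = [math.inf] * V, [None] * V
        dist[S] = 0.0
        for _ in range(V - 1):
            changed = False
            for u in range(V):
                if dist[u] == math.inf:
                    continue
                for k, (v, cap, cost, _) in enumerate(graph[u]):
                    if cap > eps and dist[u] + cost < dist[v] - eps:
                        dist[v], prev[v], changed = dist[u] + cost, (u, k), True
            if not changed:
                break
        if prev[T] is None:
            break                              # total flow reached Eq. (8)
        push, v = math.inf, T
        while v != S:                          # bottleneck capacity
            u, k = prev[v]
            push, v = min(push, graph[u][k][1]), u
        v = T
        while v != S:                          # update residual capacities
            u, k = prev[v]
            graph[u][k][1] -= push
            graph[v][graph[u][k][3]][1] += push
            v = u
        flow += push
        work += push * dist[T]                 # accumulates Eq. (4)
    return work / flow if flow > 0 else 0.0
```

Because the flow stops at the smaller of the two total weights, signatures with different numbers of regions (and different total weight) can be compared directly, which is the property the method relies on when the number of color regions varies between images.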

4.

Bag-of-keypoints+EMD is compared with Bag-of-keypoints and EMD. The images are taken from 10 categories of Caltech256 (Table 1), using images 0001 to 0090 of each category: 90 images per category, 900 images in total.

Table 1: The 10 Caltech256 categories
  015.bonsai-101
  016.boom-box
  023.bulldozer
  036.chandelier
  072.fire-truck
  073.fireworks
  092.grapes
  132.light-house
  213.teddy-bear
  251.airplanes-101

Bag-of-keypoints: visual words 2 to 24 (23 settings). Bag-of-keypoints+EMD: 1 to 24 (24 settings), visual words 2 to 24. 24 × 23 = 552.

4.1

Table 2: Bag-of-keypoints, EMD (900 images)
  428   389   83

Table 3: Bag-of-keypoints, EMD (90 images per row)
   20    58   12
   45    44    1
   55    28    7
   16    62   12
   46    40    4
   76     9    5
   16    45   29
   59    26    5
   33    49    8
   62    28    0

The three columns of Table 3 sum to the three values of Table 2 (428, 389, 83).

5.

6.

References
[1] Lowe, D.G.: Object recognition from local scale-invariant features, Proc. of IEEE International Conference on Computer Vision, pp. 1150-1157 (1999).