Microsoft PowerPoint - cvim_harada pptx


Transcription

1 1

2 2

3 Flickr reaches 6 billion photos on 1 Aug,

4 4

5 5

6 6

7 (rank) place, LSVRC; (rank) place, LSVRC; (rank) place. Localization: Car, Car. Categorization: Car

8
1. neck brace 2. bullet train 3. potter's wheel 4. seat belt 5. barbell
1. mountain bike 2. hartebeest 3. yurt 4. bighorn 5. coho
1. brown bear 2. otter 3. hippopotamus 4. raccoon 5. deerhound
1. volleyball 2. bittern 3. shower curtain 4. crane 5. suspension bridge
1. mask 2. ski mask 3. jack-o'-lantern 4. jellyfish 5. teddy bear
1. toilet seat 2. scanner 3. hard disc 4. scale 5. backpack
1. baseball player 2. racket, racquet 3. solar dish 4. trimaran 5. paddle
1. aircraft carrier 2. paddle 3. bullfrog 4. water ouzel 5. mantis

9 9

10 The state of the world (w) → The gathered data (d) → The processed data (r)
$I(W; D) \geq I(W; R)$
The data processing theorem states that data processing can only destroy information.
David J.C. MacKay. Information Theory, Inference, and Learning Algorithms. Cambridge University Press, 2003.
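The data processing theorem on this slide can be checked numerically. The following is a minimal sketch, assuming a randomly drawn Markov chain w → d → r with small discrete alphabets; the alphabet sizes, the random seed, and the helper name `mutual_information` are illustrative assumptions, not part of the slides.

```python
import numpy as np

def mutual_information(joint):
    """Mutual information (in bits) of a 2-D joint probability table."""
    joint = np.asarray(joint, dtype=float)
    px = joint.sum(axis=1, keepdims=True)   # marginal of the row variable
    py = joint.sum(axis=0, keepdims=True)   # marginal of the column variable
    mask = joint > 0                        # skip zero cells (0 * log 0 = 0)
    ratio = joint[mask] / (px @ py)[mask]
    return float(np.sum(joint[mask] * np.log2(ratio)))

# Hypothetical toy chain: 4 world states, 5 data values, 3 processed values.
rng = np.random.default_rng(0)
p_w = rng.dirichlet(np.ones(4))              # p(w)
p_d_given_w = rng.dirichlet(np.ones(5), 4)   # each row is p(d | w)
p_r_given_d = rng.dirichlet(np.ones(3), 5)   # each row is p(r | d)

joint_wd = p_w[:, None] * p_d_given_w        # p(w, d)
joint_wr = joint_wd @ p_r_given_d            # p(w, r) = sum_d p(w, d) p(r | d)

i_wd = mutual_information(joint_wd)
i_wr = mutual_information(joint_wr)
print(f"I(W;D) = {i_wd:.4f} bits, I(W;R) = {i_wr:.4f} bits")
```

Because r depends on w only through d, the second value should never exceed the first, whatever processing p(r | d) is chosen.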

11 11

12

13

14 S. Vijayanarasimhan and K. Grauman. Large Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds. In CVPR, 2011.

15 S. Vijayanarasimhan and K. Grauman. Large Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds. In CVPR, 2011.
HOG: deformation
LLC + max pooling: no deformation (NIPS2010)

16

17

18 S. J. Hwang, F. Sha, and K. Grauman. Sharing Features Between Objects and Their Attributes. CVPR,
V. Ferrari and A. Zisserman. Learning visual attributes. In NIPS,

19 Attributes and Classification 20

20 21

21 S. Dhar, V. Ordonez, and T. L. Berg. High Level Describable Attributes for Predicting Aesthetics and Interestingness. CVPR,

22 S. Dhar, V. Ordonez, and T. L. Berg. High Level Describable Attributes for Predicting Aesthetics and Interestingness. CVPR,

23 24

24 M. Douze, A. Ramisa, and C. Schmid. Combining attributes and Fisher vectors for efficient image retrieval. CVPR,

25 26

26 D. Parikh and K. Grauman. Relative Attributes. In ICCV,

27

28

29 30

30 31

31 Deng et al., CVPR

32 33

33

34
1) Input image
2) Detection
3) Description: local descriptors $d_1, d_2, d_3, \ldots, d_j, \ldots, d_k, \ldots, d_m, \ldots, d_N$
4) Local descriptors in feature space
5) PDF estimation: $p(d; \theta)$
6) Feature vector: $x = f(\theta)$

35 Local descriptors in feature space ($d_1, d_2, d_3, \ldots, d_j, \ldots, d_k, \ldots, d_m, \ldots, d_N$)
Descriptor matching (# of anchor points: large; computational complexity: large): SVM-KNN, Naïve Bayes Nearest Neighbor, Graph Matching Kernel
Codebook: Bag of Visual Words, Gaussian Mixture Model, ScSPM, Super Vector, LLC, Fisher Vector
Global feature (# of anchor points: small; computational complexity: small): HLAC, GLC, Global Gaussian

36

37 H. Zhang, A. C. Berg, M. Maire, and J. Malik. SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition. In CVPR,

38 T. Tuytelaars, M. Fritz, K. Saenko, and T. Darrell. The NBNN kernel. In ICCV,

39 40

40 O. Duchenne, A. Joulin and J. Ponce. A Graph Matching Kernel for Object Categorization. ICCV,

41

42

43 [Figure: codewords $w_1, w_2, w_3, w_4$ in $R^d$]

44 Bag of Visual Words and kernel codebook
[Figure: local descriptors $d_1, \ldots, d_4$ assigned to codewords $w_1, \ldots, w_4$, with one histogram over $w_1, \ldots, w_4$ per descriptor]
Kernel codebook (γ: kernel scale): $[f_{kc}(x)]_i = \dfrac{\exp(-\gamma \| d - w_i \|^2)}{\sum_{k=1}^{K} \exp(-\gamma \| d - w_k \|^2)}$
$f_{BoW} = \dfrac{1}{N} \sum_{i=1}^{N} f_{bow}(x_i)$
$f_{kc} = \dfrac{1}{N} \sum_{i=1}^{N} f_{kc}(x_i)$
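The two encodings on this slide can be sketched in a few lines. This is an illustrative sketch only: the Gaussian kernel scale `gamma`, the function names, and the random toy descriptors and codebook are assumptions made for the example, not values from the slides.

```python
import numpy as np

def squared_distances(descriptors, codebook):
    """Pairwise squared Euclidean distances, shape (N descriptors, K codewords)."""
    diff = descriptors[:, None, :] - codebook[None, :, :]
    return (diff ** 2).sum(axis=2)

def bow_encode(descriptors, codebook):
    """Hard assignment: each descriptor votes for its nearest codeword."""
    nearest = squared_distances(descriptors, codebook).argmin(axis=1)
    hist = np.bincount(nearest, minlength=len(codebook)).astype(float)
    return hist / len(descriptors)            # average over the N descriptors

def kernel_codebook_encode(descriptors, codebook, gamma=1.0):
    """Soft assignment: normalized Gaussian kernel weight per codeword."""
    logits = -gamma * squared_distances(descriptors, codebook)
    logits -= logits.max(axis=1, keepdims=True)   # numerical stability
    weights = np.exp(logits)
    weights /= weights.sum(axis=1, keepdims=True)
    return weights.mean(axis=0)               # average over the N descriptors

# Hypothetical toy data: K = 4 codewords and N = 50 descriptors in 2-D.
rng = np.random.default_rng(0)
codebook = rng.normal(size=(4, 2))
descriptors = rng.normal(size=(50, 2))

f_bow = bow_encode(descriptors, codebook)
f_kc = kernel_codebook_encode(descriptors, codebook, gamma=2.0)
print(f_bow, f_kc)
```

Both vectors sum to one. As `gamma` grows, the soft assignment concentrates on the nearest codeword and the kernel codebook vector approaches the Bag of Visual Words histogram.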

45 Image Local descriptors in feature space PDF estimation 46

46 Generative approach: Image → Local descriptors in feature space → PDF estimation → Fisher Kernel → Feature vector (Fisher Vector)
Discriminative approach: Feature vector → Discriminative classifier (e.g., SVMs) → Category
F. Perronnin and C. Dance. Fisher kernels on visual vocabularies for image categorization. CVPR,

47 48

48 49

49 50

50 51

51 net.org/challenges/lsvrc/2010/ilsvrc2010_xrce.pdf 52

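52 GMM supervectors
Image → Local descriptors in feature space → GMM
$p(x; \theta) = \sum_{k=1}^{K} w_k \, \mathcal{N}(x; \mu_k, \Sigma_k)$
Means of components: $\hat{\mu}_1, \hat{\mu}_2, \ldots, \hat{\mu}_K$
$\tilde{\mu}_k = \sqrt{w_k^{(U)}} \, (\Sigma_k^{(U)})^{-1/2} \, \hat{\mu}_k$
$\phi(X) = [\tilde{\mu}_1^\top, \tilde{\mu}_2^\top, \ldots, \tilde{\mu}_K^\top]^\top$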
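A GMM supervector of this kind can be sketched as follows. This is a rough sketch under stated assumptions: diagonal covariances, a relevance factor `tau`, and the common relevance-MAP update for the component means are choices made for the example and are not confirmed by the slides; only the final scaling by the square root of the weight and the inverse square root of the covariance follows the slide. The function name and toy parameters are hypothetical.

```python
import numpy as np

def gmm_supervector(X, weights, means, variances, tau=10.0):
    """X: (N, D) descriptors; weights: (K,); means, variances: (K, D).

    Assumes a diagonal-covariance background GMM (superscript U on the slide)
    and a relevance-MAP update of the means (an assumption for this sketch).
    """
    # log N(x_i; mu_k, diag(var_k)) for every descriptor/component pair
    diff = X[:, None, :] - means[None, :, :]
    log_gauss = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                        + np.sum(np.log(2 * np.pi * variances), axis=1))
    log_post = np.log(weights) + log_gauss
    log_post -= log_post.max(axis=1, keepdims=True)
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)      # responsibilities, shape (N, K)

    counts = post.sum(axis=0)                    # soft counts per component
    adapted = (tau * means + post.T @ X) / (tau + counts)[:, None]

    # Scale each adapted mean by sqrt(w_k) * Sigma_k^(-1/2) and stack them.
    normalized = np.sqrt(weights)[:, None] * adapted / np.sqrt(variances)
    return normalized.ravel()

# Hypothetical toy background model: K = 3 components in D = 2 dimensions.
rng = np.random.default_rng(0)
weights = np.array([0.5, 0.3, 0.2])
means = rng.normal(size=(3, 2))
variances = np.ones((3, 2))
X = rng.normal(size=(100, 2))

supervector = gmm_supervector(X, weights, means, variances)
print(supervector.shape)
```

The result has K × D entries, one normalized mean per component. With a very large `tau` the adapted means stay at the background means, so the supervector carries no image-specific information.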

53 [Equations: adapted means $\hat{\mu}_k$ and normalized means $\tilde{\mu}_k$ of the GMM supervector]
N. Inoue and K. Shinoda. A Fast MAP Adaptation Technique for GMM-supervector-based Video Semantic Indexing. ACM Multimedia, 2011.

54 55 Asymmetric Distance Computation

55 H. Jegou, M. Douze, C. Schmid, and P. Perez. Aggregating local descriptors into a compact image representation. CVPR,

56 57

57 58

58 59

59 60

60 61

61 62

62 63

63 64

64 net.org/challenges/lsvrc/2010/ilsvrc2010_nec UIUC.pdf 65

65 net.org/challenges/lsvrc/2010/ilsvrc2010_nec UIUC.pdf 66

66 J. Yang, K. Yu, Y. Gong, and T. Huang. Linear spatial pyramid matching using sparse coding for image classification. CVPR,

67

68 H. Nakayama, T. Harada, and Y. Kuniyoshi. Dense Sampling Low Level Statistics of Local Features. In CIVR,
[Figure: GMM; Single Gaussian]

69 H. Nakayama, T. Harada, and Y. Kuniyoshi. Dense Sampling Low Level Statistics of Local Features. In CIVR,

70 H. Nakayama, T. Harada, and Y. Kuniyoshi. Global Gaussian Approach for Scene Categorization Using Information Geometry. In CVPR,
[Figure: Image 1 → local descriptor space → feature vector $x^{(1)}$; Image 2 → local descriptor space → feature vector $x^{(2)}$; Similarity? Points $x^{(1)}, x^{(2)}, x^{(i)}, x^{(j)}, x^{(k)}$ on a manifold]

71 H. Nakayama, T. Harada, and Y. Kuniyoshi. Global Gaussian Approach for Scene Categorization Using Information Geometry. In CVPR,

72 H. Nakayama, T. Harada, and Y. Kuniyoshi. Global Gaussian Approach for Scene Categorization Using Information Geometry. In CVPR,

73 Super Vector Coding VLAD GMM + Bag of Visual Words Fisher Vector Sparse Coding Global Gaussian Local Coordinate Coding Bag of Visual Words Locality constrained Linear Coding 74

74 75

75

76 J. Sanchez, and F. Perronnin. High Dimensional Signature Compression for Large Scale Image Classification. In CVPR, 2011.

77 78

78 [Figure labels: Classifier (×9), CPU (×3), Data (×8), HDD (×3)]

79 [Figure: a D-dim vector split into N sub-vectors of D/N dim each; each sub-vector quantized with a codebook of $2^K$ codewords ($w_1$ to $w_4$ shown); code length NK/D [bit/dim]]
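The labels on this slide (D dim, D/N dim, 2^K, NK/D bit/dim) read like product quantization, so the sketch below assumes that interpretation. The plain k-means training loop, the function names, and the toy sizes (D = 8, N = 4, K = 2) are illustrative assumptions, not details taken from the slides.

```python
import numpy as np

def train_codebooks(X, n_sub, n_bits, n_iter=10, seed=0):
    """Run k-means with 2**n_bits centers on each of n_sub sub-vector blocks.

    Requires the dimension D to be divisible by n_sub.
    Returns an array of shape (n_sub, 2**n_bits, D // n_sub).
    """
    rng = np.random.default_rng(seed)
    n, dim = X.shape
    sub_dim = dim // n_sub
    codebooks = []
    for s in range(n_sub):
        block = X[:, s * sub_dim:(s + 1) * sub_dim]
        centers = block[rng.choice(n, size=2 ** n_bits, replace=False)].copy()
        for _ in range(n_iter):
            d2 = ((block[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
            assign = d2.argmin(axis=1)
            for c in range(len(centers)):
                members = block[assign == c]
                if len(members):                  # keep empty clusters unchanged
                    centers[c] = members.mean(axis=0)
        codebooks.append(centers)
    return np.stack(codebooks)

def encode(X, codebooks):
    """Replace each sub-vector by the index of its nearest codeword."""
    n_sub, _, sub_dim = codebooks.shape
    codes = np.empty((len(X), n_sub), dtype=np.int64)
    for s in range(n_sub):
        block = X[:, s * sub_dim:(s + 1) * sub_dim]
        d2 = ((block[:, None, :] - codebooks[s][None, :, :]) ** 2).sum(axis=2)
        codes[:, s] = d2.argmin(axis=1)
    return codes

def decode(codes, codebooks):
    """Rebuild approximate vectors by concatenating the selected codewords."""
    parts = [codebooks[s][codes[:, s]] for s in range(codebooks.shape[0])]
    return np.concatenate(parts, axis=1)

# Hypothetical toy setting: D = 8, N = 4 sub-vectors, K = 2 bits per sub-vector.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 8))
codebooks = train_codebooks(X, n_sub=4, n_bits=2)
codes = encode(X, codebooks)
X_hat = decode(codes, codebooks)
bits_per_dim = 4 * 2 / 8                          # N K / D
print(codes.shape, bits_per_dim)
```

Each vector is stored as N indices of K bits each, which gives the NK/D bits per dimension shown on the slide.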

80 81

81 net.org/challenges/lsvrc/2011/ilsvrc11.pdf 82

82 83

Microsoft PowerPoint - SSII_harada pptx

Microsoft PowerPoint - SSII_harada pptx The state of the world The gathered data The processed data w d r I( W; D) I( W; R) The data processing theorem states that data processing can only destroy information. David J.C. MacKay. Information

More information

<4D F736F F F696E74202D2091E58B4B96CD88EA94CA89E6919C94468EAF82C689E6919C955C8CBB5F947A957A97702E >

<4D F736F F F696E74202D2091E58B4B96CD88EA94CA89E6919C94468EAF82C689E6919C955C8CBB5F947A957A97702E > 1 Flickr reached 5,000,000,000 photos on September 19, 2010. http://blog.flickr.net/en/2010/09/19/5000000000/ 2 http://www.flickr.com/photos/kullin/4999988381/ 3 http://twitter.com/randizuckerberg/status/22187407218577408#

More information

IPSJ SIG Technical Report Vol.2013-CVIM-187 No /5/30 1,a) 1,b), 1,,,,,,, (DNN),,,, 2 (CNN),, 1.,,,,,,,,,,,,,,,,,, [1], [6], [7], [12], [13]., [

IPSJ SIG Technical Report Vol.2013-CVIM-187 No /5/30 1,a) 1,b), 1,,,,,,, (DNN),,,, 2 (CNN),, 1.,,,,,,,,,,,,,,,,,, [1], [6], [7], [12], [13]., [ ,a),b),,,,,,,, (DNN),,,, (CNN),,.,,,,,,,,,,,,,,,,,, [], [6], [7], [], [3]., [8], [0], [7],,,, Tohoku University a) omokawa@vision.is.tohoku.ac.jp b) okatani@vision.is.tohoku.ac.jp, [3],, (DNN), DNN, [3],

More information

IPSJ SIG Technical Report Vol.2010-CVIM-170 No /1/ Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Ta

IPSJ SIG Technical Report Vol.2010-CVIM-170 No /1/ Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Ta 1 1 1 1 2 1. Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Takayuki Okatani 1 and Koichiro Deguchi 1 This paper presents a method for recognizing the pose of a wire harness

More information

Duplicate Near Duplicate Intact Partial Copy Original Image Near Partial Copy Near Partial Copy with a background (a) (b) 2 1 [6] SIFT SIFT SIF

Duplicate Near Duplicate Intact Partial Copy Original Image Near Partial Copy Near Partial Copy with a background (a) (b) 2 1 [6] SIFT SIFT SIF Partial Copy Detection of Line Drawings from a Large-Scale Database Weihan Sun, Koichi Kise Graduate School of Engineering, Osaka Prefecture University E-mail: sunweihan@m.cs.osakafu-u.ac.jp, kise@cs.osakafu-u.ac.jp

More information

bag-of-words bag-of-keypoints Web bagof-keypoints Nearest Neighbor SVM Nearest Neighbor SIFT Nearest Neighbor bag-of-keypoints Nearest Neighbor SVM 84

bag-of-words bag-of-keypoints Web bagof-keypoints Nearest Neighbor SVM Nearest Neighbor SIFT Nearest Neighbor bag-of-keypoints Nearest Neighbor SVM 84 Bag-of-Keypoints Web G.Csurka bag-of-keypoints Web Bag-of-keypoints SVM 5.% Web Image Classification with Bag-of-Keypoints Taichi joutou and Keiji yanai Recently, need for generic image recognition is

More information

Microsoft PowerPoint - IBIS_harada _2.pptx

Microsoft PowerPoint - IBIS_harada _2.pptx 1 beach, water, people, kauai, tree ocean, coral, fish, angelfish, reefs Retrieving by flight 2 Image Annotation Results on Corel5K birds, booby, flight, rocks, water buildings, ships, bridge, flag, sky

More information

IPSJ SIG Technical Report Vol.2009-CVIM-167 No /6/10 Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing

IPSJ SIG Technical Report Vol.2009-CVIM-167 No /6/10 Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing number of HOG Features based on Real AdaBoost Chika Matsushima, 1 Yuji Yamauchi, 1 Takayoshi Yamashita 1, 2 and

More information

(b) BoF codeword codeword BoF (c) BoF Fergus Weber [11] Weber [12] Weber Fergus BoF (b) Fergus [13] Fergus 2. Fergus 2. 1 Fergus [3]

(b) BoF codeword codeword BoF (c) BoF Fergus Weber [11] Weber [12] Weber Fergus BoF (b) Fergus [13] Fergus 2. Fergus 2. 1 Fergus [3] * A Multimodal Constellation Model for Generic Object Recognition Yasunori KAMIYA, Tomokazu TAKAHASHI,IchiroIDE, and Hiroshi MURASE Bag of Features (BoF) BoF EM 1. [1] Part-based Graduate School of Information

More information

IPSJ SIG Technical Report Vol.2012-CG-149 No.13 Vol.2012-CVIM-184 No /12/4 3 1,a) ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransa

IPSJ SIG Technical Report Vol.2012-CG-149 No.13 Vol.2012-CVIM-184 No /12/4 3 1,a) ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransa 3,a) 3 3 ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransac. DB [] [2] 3 DB Web Web DB Web NTT NTT Media Intelligence Laboratories, - Hikarinooka Yokosuka-Shi, Kanagawa 239-0847 Japan a) yabushita.hiroko@lab.ntt.co.jp

More information

Microsoft PowerPoint - IBIS_harada_

Microsoft PowerPoint - IBIS_harada_ 2 Intelligence for Real World Recognition Natural Language Processing Database Pattern Recognition Information Theory Computer Vision Machine Learning Data Mining Robotics Cognitive Science Parallel Computing

More information

(MIRU2010) Geometric Context Randomized Trees Geometric Context Rand

(MIRU2010) Geometric Context Randomized Trees Geometric Context Rand (MIRU2010) 2010 7 Geometric Context Randomized Trees 487-8501 1200 E-mail: {fukuta,ky}@vision.cs.chubu.ac.jp, hf@cs.chubu.ac.jp Geometric Context Randomized Trees 10 3, Geometric Context, Abstract Image

More information

4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q

4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q x-means 1 2 2 x-means, x-means k-means Bayesian Information Criterion BIC Watershed x-means Moving Object Extraction Using the Number of Clusters Determined by X-means Clustering Naoki Kubo, 1 Kousuke

More information

,,, Twitter,,, ( ), 2. [1],,, ( ),,.,, Sungho Jeon [2], Twitter 4 URL, SVM,, , , URL F., SVM,, 4 SVM, F,.,,,,, [3], 1 [2] Step Entered

,,, Twitter,,, ( ), 2. [1],,, ( ),,.,, Sungho Jeon [2], Twitter 4 URL, SVM,, , , URL F., SVM,, 4 SVM, F,.,,,,, [3], 1 [2] Step Entered DEIM Forum 2016 C5-1 182-8585 1-5-1 E-mail: saitoh-ryoh@uec.ac.jp, terada.minoru@uec.ac.jp Twitter,, Twitter,,, Bag of Words, Latent Semantic Indexing,.,,,, Twitter,, Twitter,, 1. SNS, SNS Twitter 1,,,

More information

2 Fig D human model. 1 Fig. 1 The flow of proposed method )9)10) 2.2 3)4)7) 5)11)12)13)14) TOF 1 3 TOF 3 2 c 2011 Information

2 Fig D human model. 1 Fig. 1 The flow of proposed method )9)10) 2.2 3)4)7) 5)11)12)13)14) TOF 1 3 TOF 3 2 c 2011 Information 1 1 2 TOF 2 (D-HOG HOG) Recall D-HOG 0.07 HOG 0.16 Pose Estimation by Regression Analysis with Depth Information Yoshiki Agata 1 and Hironobu Fujiyoshi 1 A method for estimating the pose of a human from

More information

3: 2: 2. 2 Semi-supervised learning Semi-supervised learning [5,6] Semi-supervised learning Self-training [13] [14] Self-training Self-training Semi-s

3: 2: 2. 2 Semi-supervised learning Semi-supervised learning [5,6] Semi-supervised learning Self-training [13] [14] Self-training Self-training Semi-s THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. 599-8531 1-1 E-mail: tsukada@m.cs.osakafu-u.ac.jp, {masa,kise}@cs.osakafu-u.ac.jp Semi-supervised learning

More information

Dynamic Time Warping( DTW DTW 30 k-d tree Forebes [1] 2. DTW[2] DTW DTW DTW Forbes[1] k-d tree DTW Hsu[3] DTW Zhu[4] K-SVD Sun[5] Self-S

Dynamic Time Warping( DTW DTW 30 k-d tree Forebes [1] 2. DTW[2] DTW DTW DTW Forbes[1] k-d tree DTW Hsu[3] DTW Zhu[4] K-SVD Sun[5] Self-S 情報処理学会インタラクション 2015 IPSJ Interaction 2015 A62 2015/3/5 1,a) Natapon Pantuwong 2 1 1 1 Dynamic Time Warping 2 DTW DTW 30 k-d tree [1] A Rapid Motion Retrieval Technique using Simple and Discrete Representation

More information

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. TRECVID2012 Instance Search {sak

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. TRECVID2012 Instance Search {sak THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. TRECVID2012 Instance Search 599 8531 1 1 E-mail: {sakata,matozaki}@m.cs.osakafu-u.ac.jp, {kise,masa}@cs.osakafu-u.ac.jp

More information

PowerPoint プレゼンテーション

PowerPoint プレゼンテーション 東京大学大学院情報理工学系研究科創造情報学専攻中山研究室中山英樹 どちらがハヤブサでしょう? http://plaza.rakuten.co.jp http://birds.mints.ne.jp 2 どちらがハヤブサでしょう? タカ ハヤブサ http://plaza.rakuten.co.jp http://birds.mints.ne.jp 3 車種はなんでしょう? 4 車種はなんでしょう?

More information

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2013-CVIM-186 No /3/15 EMD 1,a) SIFT. SIFT Bag-of-keypoints. SIFT SIFT.. Earth Mover s Distance

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2013-CVIM-186 No /3/15 EMD 1,a) SIFT. SIFT Bag-of-keypoints. SIFT SIFT.. Earth Mover s Distance EMD 1,a) 1 1 1 SIFT. SIFT Bag-of-keypoints. SIFT SIFT.. Earth Mover s Distance (EMD), Bag-of-keypoints,. Bag-of-keypoints, SIFT, EMD, A method of similar image retrieval system using EMD and SIFT Hoshiga

More information

& 3 3 ' ' (., (Pixel), (Light Intensity) (Random Variable). (Joint Probability). V., V = {,,, V }. i x i x = (x, x,, x V ) T. x i i (State Variable),

& 3 3 ' ' (., (Pixel), (Light Intensity) (Random Variable). (Joint Probability). V., V = {,,, V }. i x i x = (x, x,, x V ) T. x i i (State Variable), .... Deeping and Expansion of Large-Scale Random Fields and Probabilistic Image Processing Kazuyuki Tanaka The mathematical frameworks of probabilistic image processing are formulated by means of Markov

More information

35_3_9.dvi

35_3_9.dvi 180 Vol. 35 No. 3, pp.180 185, 2017 Image Recognition by Deep Learning Hironobu Fujiyoshi and Takayoshi Yamashita Chubu University 1. 1990 2000 Scale-Invariant Feature Transform SIFT Histogram of Oriented

More information

Computational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate catego

Computational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate catego Computational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate category preservation 1 / 13 analogy by vector space Figure

More information

Microsoft Word - toyoshima-deim2011.doc

Microsoft Word - toyoshima-deim2011.doc DEIM Forum 2011 E9-4 252-0882 5322 252-0882 5322 E-mail: t09651yt, sashiori, kiyoki @sfc.keio.ac.jp CBIR A Meaning Recognition System for Sign-Logo by Color-Shape-Based Similarity Computations for Images

More information

IPSJ SIG Technical Report Vol.2011-CVIM-177 No /5/ TRECVID2010 SURF Bag-of-Features 1 TRECVID SVM 700% MKL-SVM 883% TRECVID2010 MKL-SVM A

IPSJ SIG Technical Report Vol.2011-CVIM-177 No /5/ TRECVID2010 SURF Bag-of-Features 1 TRECVID SVM 700% MKL-SVM 883% TRECVID2010 MKL-SVM A 1 1 TRECVID2010 SURF Bag-of-Features 1 TRECVID SVM 700% MKL-SVM 883% TRECVID2010 MKL-SVM Analysis of video data recognition using multi-frame Kazuya Hidume 1 and Keiji Yanai 1 In this study, we aim to

More information

情報処理学会研究報告 プレートマッチングによりリアルタイムに物体検出や追跡 を行うアプリケーションが提案されるなど近年モバイルと 画像認識の研究が盛んに行われている 本研究では視覚的 変化の大きい料理に対してスマートフォンの計算資源のみ を用いてリアルタイムに料理認識を行う ユーザインタラクティブな

情報処理学会研究報告 プレートマッチングによりリアルタイムに物体検出や追跡 を行うアプリケーションが提案されるなど近年モバイルと 画像認識の研究が盛んに行われている 本研究では視覚的 変化の大きい料理に対してスマートフォンの計算資源のみ を用いてリアルタイムに料理認識を行う ユーザインタラクティブな Bag-of-SURF fast χ 2 kernel SVMs 5 GrabCut SVM 5 8.55%. CPU PC χ2 2. [] SIFT HOGGabor MKL-SVM Yang [2] FoodLog[3] c 22 Information Processing Society of Japan 情報処理学会研究報告 プレートマッチングによりリアルタイムに物体検出や追跡 を行うアプリケーションが提案されるなど近年モバイルと

More information

1 IDC Wo rldwide Business Analytics Technology and Services 2013-2017 Forecast 2 24 http://www.soumu.go.jp/johotsusintokei/whitepaper/ja/h24/pdf/n2010000.pdf 3 Manyika, J., Chui, M., Brown, B., Bughin,

More information

FoodLog [3] TADAproject [4] Google Goggles 1 Kumar [5] () Leaf snap Maruyama [6] 3 Lee [7] Yu [8] Gist SVM Active Query Sensing(AQS)

FoodLog [3] TADAproject [4] Google Goggles 1 Kumar [5] () Leaf snap Maruyama [6] 3 Lee [7] Yu [8] Gist SVM Active Query Sensing(AQS) DEIM Forum 213 D3-4 食事認識を用いたモバイル食事管理システム 河野 憲之 柳井 啓司 電気通信大学 電気通信学部 情報工学科 182-8585 東京都調布市調布ヶ丘 1-5-1 電気通信大学 大学院情報理工学研究科 総合情報学専攻 182-8585 東京都調布市調布ヶ丘 1-5-1 E-mail: kawano-y@mm.inf.uec.ac.jp, yanai@cs.uec.ac.jp

More information

[6, 7] Caltech101[8] Caltech256[9] 20 1 Pascal VOC ,492 LSP15[10] Caltech % [8] %[11] 2.2. TinyImage

[6, 7] Caltech101[8] Caltech256[9] 20 1 Pascal VOC ,492 LSP15[10] Caltech % [8] %[11] 2.2. TinyImage Large-Scale Generic Image Recognition and Image Representation Tatsuya Harada 1,2 1 The University of Toyo, 2 JST PRESTO harada@isi.imi.i.u-toyo.ac.jp Abstract 1. [1] [2] 3 2. Web Web Web 2.1. Corel5K[3]

More information

SICE東北支部研究集会資料(2017年)

SICE東北支部研究集会資料(2017年) 307 (2017.2.27) 307-8 Deep Convolutional Neural Network X Detecting Masses in Mammograms Based on Transfer Learning of A Deep Convolutional Neural Network Shintaro Suzuki, Xiaoyong Zhang, Noriyasu Homma,

More information

(MIRU2009) cuboid cuboid SURF 6 85% Web. Web Abstract Extracting Spatio-te

(MIRU2009) cuboid cuboid SURF 6 85% Web. Web Abstract Extracting Spatio-te (MIRU2009) 2009 7 182 8585 1 5 1 E-mail: noguchi-a@mm.cs.uec.ac.jp, yanai@cs.uec.ac.jp cuboid cuboid SURF 6 85% Web. Web Abstract Extracting Spatio-temporal Local Features Considering Consecutiveness of

More information

本文6(599) (Page 601)

本文6(599) (Page 601) (MIRU2008) 2008 7 525 8577 1 1 1 E-mail: matsuzaki@i.ci.ritsumei.ac.jp, shimada@ci.ritsumei.ac.jp Object Recognition by Observing Grasping Scene from Image Sequence Hironori KASAHARA, Jun MATSUZAKI, Nobutaka

More information

(MIRU2008) HOG Histograms of Oriented Gradients (HOG)

(MIRU2008) HOG Histograms of Oriented Gradients (HOG) (MIRU2008) 2008 7 HOG - - E-mail: katsu0920@me.cs.scitec.kobe-u.ac.jp, {takigu,ariki}@kobe-u.ac.jp Histograms of Oriented Gradients (HOG) HOG Shape Contexts HOG 5.5 Histograms of Oriented Gradients D Human

More information

2008 : 80725872 1 2 2 3 2.1.......................................... 3 2.2....................................... 3 2.3......................................... 4 2.4 ()..................................

More information

2.2 6).,.,.,. Yang, 7).,,.,,. 2.3 SIFT SIFT (Scale-Invariant Feature Transform) 8).,. SIFT,,. SIFT, Mean-Shift 9)., SIFT,., SIFT,. 3.,.,,,,,.,,,., 1,

2.2 6).,.,.,. Yang, 7).,,.,,. 2.3 SIFT SIFT (Scale-Invariant Feature Transform) 8).,. SIFT,,. SIFT, Mean-Shift 9)., SIFT,., SIFT,. 3.,.,,,,,.,,,., 1, 1 1 2,,.,.,,, SIFT.,,. Pitching Motion Analysis Using Image Processing Shinya Kasahara, 1 Issei Fujishiro 1 and Yoshio Ohno 2 At present, analysis of pitching motion from baseball videos is timeconsuming

More information

理工ジャーナル 23‐1☆/1.外村

理工ジャーナル 23‐1☆/1.外村 Yoshinobu TONOMURA Professor, Department of Media Informatics 1 10 YouTube 2 1900 100 1 3 2 3 3 3 1 2 3 4 90 1 90 MIT Project Athena 1983 1991 2 3 4 5 6 7 8 9 10 2 90 11 12 7 13 14 15 16 17 18 19 390 5

More information

例題ではじめる部分空間法 - パターン認識へのいざない -

例題ではじめる部分空間法  - パターン認識へのいざない - - - ( ) 69 2012 5 22 (1) ( ) MATLAB/Octave 3 download http://www.tuat.ac.jp/ s-hotta/rsj2012 (2) ( ) [1] 対応付け 0 1 2 3 4 未知パターン ( クラスが未知 ) 利用 5 6 7 8 クラス ( 概念 ) 9 訓練パターン ( クラスが既知 ) (3) [1] 識別演算部 未知パターン

More information

kut-paper-template.dvi

kut-paper-template.dvi 26 Discrimination of abnormal breath sound by using the features of breath sound 1150313 ,,,,,,,,,,,,, i Abstract Discrimination of abnormal breath sound by using the features of breath sound SATO Ryo

More information

IPSJ SIG Technical Report Vol.2013-CG-153 No.19 Vol.2013-CVIM-189 No /11/29 1,a) 0 1 SIFT SURF 1. Scale-Invariant Feature Transform (SIFT)[16]

IPSJ SIG Technical Report Vol.2013-CG-153 No.19 Vol.2013-CVIM-189 No /11/29 1,a) 0 1 SIFT SURF 1. Scale-Invariant Feature Transform (SIFT)[16] 1,a) 0 1 SIFT SURF 1. Scale-Invariant Feature Transform (SIFT)[16] [14], [17] [6] 1 *1 SIFT 1 Shibuya CROSS TOWER 28th Floor 2-15-1 Shibuya Shibuya-ku Tokyo, 150-0002 Japan a) manbai@d-itlab.co.jp *1 Binary

More information

IPSJ SIG Technical Report Vol.2010-MPS-77 No /3/5 VR SIFT Virtual View Generation in Hallway of Cybercity Buildings from Video Sequen

IPSJ SIG Technical Report Vol.2010-MPS-77 No /3/5 VR SIFT Virtual View Generation in Hallway of Cybercity Buildings from Video Sequen VR 1 1 1 1 1 SIFT Virtual View Generation in Hallway of Cybercity Buildings from Video Sequences Sachiyo Yoshida, 1 Masami Takata 1 and Joe Kaduki 1 Appearance of Three-dimensional (3D) building model

More information

BDH Cao BDH BDH Cao Cao Cao BDH ()*$ +,-+.)*$!%&'$!"#$ 2. 1 Weng [4] Metric Learning Weng DB DB Yang [5] John [6] Sparse Coding sparse coding DB [7] K

BDH Cao BDH BDH Cao Cao Cao BDH ()*$ +,-+.)*$!%&'$!#$ 2. 1 Weng [4] Metric Learning Weng DB DB Yang [5] John [6] Sparse Coding sparse coding DB [7] K Bucket Distance Hashing Metric Learning 1,a) 1,b) 1,c) 1,d) (DB) [1] DB Cao [2] Cao Metric Learning Cao Cao Cao Cao Cao 100 DB 10% 1. m DB DB DB 1 599 8531 1 1 Graduate School of Engineering, Osaka Prefecture

More information

Optical Flow t t + δt 1 Motion Field 3 3 1) 2) 3) Lucas-Kanade 4) 1 t (x, y) I(x, y, t)

Optical Flow t t + δt 1 Motion Field 3 3 1) 2) 3) Lucas-Kanade 4) 1 t (x, y) I(x, y, t) http://wwwieice-hbkborg/ 2 2 4 2 -- 2 4 2010 9 3 3 4-1 Lucas-Kanade 4-2 Mean Shift 3 4-3 2 c 2013 1/(18) http://wwwieice-hbkborg/ 2 2 4 2 -- 2 -- 4 4--1 2010 9 4--1--1 Optical Flow t t + δt 1 Motion Field

More information

一般画像認識のための単語概念の視覚性の分析

一般画像認識のための単語概念の視覚性の分析 Bag-of-keypoints による カテゴリー認識 第 14 回画像センシングシンポジウム (SSII2008) 2008 年 6 月 13 日 電気通信大学 柳井啓司 情報工学科 2 アウトライン 1. イントロダクション 2. Bag-of-keypoints アプローチ その具体的な方法の詳細 3. Bag-of-keypoints アプローチの拡張 位置情報, 色情報の利用 4. 確率的言語モデルの画像への適用

More information

_314I01BM浅谷2.indd

_314I01BM浅谷2.indd 587 ネットワークの表現学習 1 1 1 1 Deep Learning [1] Google [2] Deep Learning [3] [4] 2014 Deepwalk [5] 1 2 [6] [7] [8] 1 2 1 word2vec[9] word2vec 1 http://www.ai-gakkai.or.jp/my-bookmark_vol31-no4 588 31 4 2016

More information

ばらつき抑制のための確率最適制御

ばらつき抑制のための確率最適制御 ( ) http://wwwhayanuemnagoya-uacjp/ fujimoto/ 2011 3 9 11 ( ) 2011/03/09-11 1 / 46 Outline 1 2 3 4 5 ( ) 2011/03/09-11 2 / 46 Outline 1 2 3 4 5 ( ) 2011/03/09-11 3 / 46 (1/2) r + Controller - u Plant y

More information

LBP 2 LBP 2. 2 Local Binary Pattern Local Binary pattern(lbp) [6] R

LBP 2 LBP 2. 2 Local Binary Pattern Local Binary pattern(lbp) [6] R DEIM Forum 24 F5-4 Local Binary Pattern 6 84 E-mail: {tera,kida}@ist.hokudai.ac.jp Local Binary Pattern (LBP) LBP 3 3 LBP 5 5 5 LBP improved LBP uniform LBP.. Local Binary Pattern, Gradient Local Auto-Correlations,,,,

More information

Trial for Value Quantification from Exceptional Utterances 37-066593 1 5 1.1.................................. 5 1.2................................ 8 2 9 2.1.............................. 9 2.1.1.........................

More information

Twitter‡Ì”À‰µ…c…C†[…g‡ðŠŸŠp‡µ‡½…^…C…•…›…C…fi‘ã‡Ì…l…^…o…„‘îŁñ„�™m

Twitter‡Ì”À‰µ…c…C†[…g‡ðŠŸŠp‡µ‡½…^…C…•…›…C…fi‘ã‡Ì…l…^…o…„‘îŁñ„�™m 27 Twitter 1431050 2016 3 14 1 Twitter,,.,.,., Twitter,.,,.,,. URL,,,. BoW(Bag of Words), LSI(Latent Semantic Indexing)., URL,,,,., Accuracy, AUC(Area Under the Curve), Precision, Recall, F,. URL,,,.,

More information

Convolutional Neural Network A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolution

Convolutional Neural Network A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolution Convolutional Neural Network 2014 3 A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolutional Neural Network Fukui Hiroshi 1940 1980 [1] 90 3

More information

IPSJ SIG Technical Report iphone iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Proc

IPSJ SIG Technical Report iphone iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Proc iphone 1 1 1 iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Processing Unit)., AR Realtime Natural Feature Tracking Library for iphone Makoto

More information

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s 1 1 1, Extraction of Transmitted Light using Parallel High-frequency Illumination Kenichiro Tanaka 1 Yasuhiro Mukaigawa 1 Yasushi Yagi 1 Abstract: We propose a new sharpening method of transmitted scene

More information

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro TV 1,2,a) 1 2 2015 1 26, 2015 5 21 Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Rotation Using Mobile Device Hiroyuki Kawakita 1,2,a) Toshio Nakagawa 1 Makoto Sato

More information

11) 13) 11),12) 13) Y c Z c Image plane Y m iy O m Z m Marker coordinate system T, d X m f O c X c Camera coordinate system 1 Coordinates and problem

11) 13) 11),12) 13) Y c Z c Image plane Y m iy O m Z m Marker coordinate system T, d X m f O c X c Camera coordinate system 1 Coordinates and problem 1 1 1 Posture Esimation by Using 2-D Fourier Transform Yuya Ono, 1 Yoshio Iwai 1 and Hiroshi Ishiguro 1 Recently, research fields of augmented reality and robot navigation are actively investigated. Estimating

More information

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St 1 2 1, 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical Structures based on Phrase Similarity Yuma Ito, 1 Yoshinari Takegawa, 2 Tsutomu Terada 1, 3 and Masahiko Tsukamoto

More information

Microsoft PowerPoint _KAIST_NTCIR5_Patent-DigitalPoster.ppt

Microsoft PowerPoint _KAIST_NTCIR5_Patent-DigitalPoster.ppt Patent Document Retrieval and Classification at KAIST KAIST CS Dept. / BOLA 2005. 12. 8. Jae-Ho Kim, Jin-Xia Huang, Ha-Yong Jung, Key-Sun Choi Introduction Tasks Document retrieval subtask Theme categorization

More information

スライド 1

スライド 1 A SURF-based Spatio-Temporal Feature for Feature-fusion-based Action Recognition 1. Background & Objective action recognition object/scene recognition Bag-of-features (BoF) of spatiotemporal features [Dollar

More information

IPSJ SIG Technical Report Vol.2010-CVIM-171 No /3/19 1. Web 1 1 Web Web Web Multiple Kernel Learning(MKL) Web ( ) % MKL 68.8% Extractin

IPSJ SIG Technical Report Vol.2010-CVIM-171 No /3/19 1. Web 1 1 Web Web Web Multiple Kernel Learning(MKL) Web ( ) % MKL 68.8% Extractin 1. Web 1 1 Web Web Web Multiple Kernel Learning(MKL) Web ( ) 200 57.2% MKL 68.8% Extracting Spatio-Temporal Local Features for Classifying Web Video Shots Akitsugu Noguchi 1 and Keiji Yanai 1 Nowadays,

More information

Mining Regional Representative Photos from a Large-scale Geotagged Image Database

Mining Regional Representative Photos from a Large-scale Geotagged Image Database Web 上のジオタグ画像を用いた 世界各地の文化的差異の発見 2009 年度人工知能学会全国大会 2009 年 6 月高松 柳井啓司 電気通信大学情報工学科 研究の背景 Web には, ラーメンがいっぱい やっぱり, どこのラーメンか知りたい! アウトライン 研究の背景 目的 関連研究 方法 実験結果 まとめと今後の課題 背景 : 大量のジオタグ画像の登場 近年, 位置情報付き画像 (geo-tagged

More information




