Microsoft PowerPoint - cvim_harada pptx
|
|
- いおり みおか
- 5 years ago
- Views:
Transcription
1 1
2 2
3 Flickr reaches 6 billion photos on 1 Aug,
4 4
5 5
6 6
7 位 LSVRC 位 LSVRC 位 Localization Car Car Categorization Car
8 1. neck brace 2. bullet train 3. potter's wheel 4. seat belt 5. barbell 1. mountain bike 2. hartebeest 3. yurt 4. bighorn 5. coho 1. brown bear 2. otter 3. hippopotamus 4. raccoon 5. deerhound 1. volleyball 2. bittern 3. shower curtain 4. crane 5. suspension bridge 1. mask 2. ski mask 3. jack-o'-lantern 4. jellyfish 5. teddy bear 1. toilet seat 2. scanner 3. hard disc 4. scale 5. backpack 1. baseball player 2. racket, racquet 3. solar dish 4. trimaran 5. paddle 1. aircraft carrier 2. paddle 3. bullfrog 4. water ouzel 5. mantis 8
9 9
10 The state of the world The gathered data The processed data w d r I( W; D) I( W; R) The data processing theorem states that data processing can only destroy information. 10 David J.C. MacKay. Information Theory, Inference, and Learning Algorithms. Cambridge University Press 2003.
11 11
12
13
14 S. Vijayanarasimhan and K. Grauman. Large Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds. In CVPR, 2011.
15 S. Vijayanarasimhan and K. Grauman. Large Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds. In CVPR, HOG deformation LLC+max pooling No deformation NIPS2010
16
17
18 S. J. Hwang, F. Sha, and K. Grauman. Sharing Features Between Objects and Their Attributes. CVPR, V. Ferrari and A. Zisserman. Learning visual attributes. In NIPS,
19 Attributes and Classification 20
20 21
21 S. Dhar, V. Ordonez, and T. L Berg. High Level Describable Attributes for Predicting Aesthetics and Interestingness. CVPR,
22 S. Dhar, V. Ordonez, and T. L Berg. High Level Describable Attributes for Predicting Aesthetics and Interestingness. CVPR,
23 24
24 M. Douze, A. Ramisa, and C. Schmid. Combining attributes and Fisher vectors for efficient image retrieval. CVPR,
25 26
26 D. Parikh and K. Grauman. Relative Attributes. In ICCV,
27
28
29 30
30 31
31 Deng et al., CVPR
32 33
33
34 d 2 d 3 d m d 1 d k d j d N 1) Input Image d m 2) Detection 3) Description p( d; θ) d N d 2 d 1 d k x f (θ) d j d 3 4) Local descriptors in feature space 5) PDF estimation 6) Feature vector 35
35 d 2 d 1 d m Local descriptors in feature space d k d N d j d 3 Descriptor matching Codebook Global feature # of anchor points: large # of anchor points: small Computational complexity: large Computational complexity: small SVM KNN Naïve Bayes Nearest Neighbor Graph Matching Kernel Bag of Visual Words Gaussian Mixture Model ScSPM, Super Vector, LLC Fisher Vector HLAC GLC Global Gaussian 36
36
37 H. Zhang, A. C. Berg, M. Maire, and J. Malik. SVM KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition. In CVPR,
38 T. Tuytelaars, M. Fritz, K. Saenko, and T. Darrell. The NBNN kernel. In ICCV,
39 40
40 O. Duchenne, A. Joulin and J. Ponce. A Graph Matching Kernel for Object Categorization. ICCV,
41
42
43 w 3 w 1 w 4 R d w 2 44
44 Bag of Visual Words Kernel codebook d 1 w1 w2 w3 w4 [ f kc ( x )] i k K j 1 exp 2 exp d 2 i w d j k 2 w k 2 w1 w2 w3 w4 d 1 d 2 w1 w2 w3 w4 d 1 w1 w2 w3 w4 d 2 d 3 w1 w2 w3 w4 w 1 d 2 w 3 d 3 w1 w2 w3 w4 d 3 d 4 w1 w2 w3 w4 w 2 w1 w2 w3 w4 d 4 f BoW 1 N fbow( xi ) N i 1 d 4 w 4 f kc 1 N fkc ( xi ) N i 1 w1 w2 w3 w4 w1 w2 w3 w4 45
45 Image Local descriptors in feature space PDF estimation 46
46 Generative approach Image Local descriptors in feature space PDF estimation Fisher Kernel Feature vector Fisher Vector Discriminative classifier F. Perronnin and C. Dance. Fisher kernels on visual vocabularies for image categorization. CVPR, Discriminative approach Classifier e.g., SVMs Category 47
47 48
48 49
49 50
50 51
51 net.org/challenges/lsvrc/2010/ilsvrc2010_xrce.pdf 52
52 K p( x; ) w Ν ( x;, k 1 k k k ) ˆμ 1 Image Local descriptors in feature space U U μ ~ ( ) ( ) 1/ k wk ( k ) 2 μˆ k μˆ GMM ( U ) ˆ k k N i1 μˆ K ˆμ 2 Means of components N i1 ( k) x i i ( k) i μ~ 1 μ~ 2 ( X ) μ ~ K GMM supervectors 53
53 54 N i i N i i i U k k k x k 1 1 ) ( ) ( ) ( ˆ ˆ μ N i i i U k U k N i i i U k N i k U k k U k U k k x k w N x k i w w 1 2 1/ ) ( ) ( 1 2 1/ ) ( 1 ) ( 2 1/ ) ( ) ( ) ( ) ( 1 ) ( ) ( ) ( ˆ ) ( ~ μ μ 0 N i i Nw k k 1 ) ( N i k i k i k i k w N g 1 2 1/, ) ( 1 μ x N. Inoue and K. Shinoda. A Fast MAP Adaptation Technique for GMMsupervector based Video Semantic Indexing. ACM Multimedia, 2011.
54 55 Asymmetric Distance Computation
55 H. Jegou, M. Douze, C. Schmid, and P. Perez. Aggregating local descriptors into a compact image representation. CVPR,
56 57
57 58
58 59
59 60
60 61
61 62
62 63
63 64
64 net.org/challenges/lsvrc/2010/ilsvrc2010_nec UIUC.pdf 65
65 net.org/challenges/lsvrc/2010/ilsvrc2010_nec UIUC.pdf 66
66 J. Yang, K. Yu, Y. Gong, and T. Huang. Linear spatial pyramid matching using sparse coding for image classification. CVPR,
67
68 H. Nakayama, T. Harada, and Y. Kuniyoshi. Dense Sampling Low Level Statistics of Local Features. In CIVR, GMM Single Gaussian 69
69 H. Nakayama, T. Harada, and Y. Kuniyoshi. Dense Sampling Low Level Statistics of Local Features. In CIVR,
70 H. Nakayama, T. Harada, and Y. Kuniyoshi. Global Gaussian Approach for Scene Categorization Using Information Geometry. In CVPR, Image 1 Local descriptor space Feature vector Feature vector Local descriptor space Image 2 (1) x (2) x Similarity? ( j) x (i) x (2) x (k ) x (1) x Manifold
71 H. Nakayama, T. Harada, and Y. Kuniyoshi. Global Gaussian Approach for Scene Categorization Using Information Geometry. In CVPR,
72 H. Nakayama, T. Harada, and Y. Kuniyoshi. Global Gaussian Approach for Scene Categorization Using Information Geometry. In CVPR,
73 Super Vector Coding VLAD GMM + Bag of Visual Words Fisher Vector Sparse Coding Global Gaussian Local Coordinate Coding Bag of Visual Words Locality constrained Linear Coding 74
74 75
75
76 J. Sanchez, and F. Perronnin. High Dimensional Signature Compression for Large Scale Image Classification. In CVPR, 2011.
77 78
78 識別機 CPU 識別機 識別機 識別機 CPU 識別機 識別機 識別機 CPU 識別機 識別機 データデータデータ データ データ データ HDD データ データ HDD HDD 79
79 D dim D/N dim D/N dim w 3 w 3 2^K w 3 w 3 w 1 w 4 w 1 w 4 w 1 w 4 w 1 w 4 w 2 w 2 w 2 w 2 NK/D [bit/dim] NK/D [bit/dim] NK/D [bit/dim] NK/D [bit/dim] 80
80 81
81 net.org/challenges/lsvrc/2011/ilsvrc11.pdf 82
82 83
Microsoft PowerPoint - SSII_harada pptx
The state of the world The gathered data The processed data w d r I( W; D) I( W; R) The data processing theorem states that data processing can only destroy information. David J.C. MacKay. Information
More information<4D F736F F F696E74202D2091E58B4B96CD88EA94CA89E6919C94468EAF82C689E6919C955C8CBB5F947A957A97702E >
1 Flickr reached 5,000,000,000 photos on September 19, 2010. http://blog.flickr.net/en/2010/09/19/5000000000/ 2 http://www.flickr.com/photos/kullin/4999988381/ 3 http://twitter.com/randizuckerberg/status/22187407218577408#
More informationIPSJ SIG Technical Report Vol.2013-CVIM-187 No /5/30 1,a) 1,b), 1,,,,,,, (DNN),,,, 2 (CNN),, 1.,,,,,,,,,,,,,,,,,, [1], [6], [7], [12], [13]., [
,a),b),,,,,,,, (DNN),,,, (CNN),,.,,,,,,,,,,,,,,,,,, [], [6], [7], [], [3]., [8], [0], [7],,,, Tohoku University a) omokawa@vision.is.tohoku.ac.jp b) okatani@vision.is.tohoku.ac.jp, [3],, (DNN), DNN, [3],
More informationIPSJ SIG Technical Report Vol.2010-CVIM-170 No /1/ Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Ta
1 1 1 1 2 1. Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Takayuki Okatani 1 and Koichiro Deguchi 1 This paper presents a method for recognizing the pose of a wire harness
More informationDuplicate Near Duplicate Intact Partial Copy Original Image Near Partial Copy Near Partial Copy with a background (a) (b) 2 1 [6] SIFT SIFT SIF
Partial Copy Detection of Line Drawings from a Large-Scale Database Weihan Sun, Koichi Kise Graduate School of Engineering, Osaka Prefecture University E-mail: sunweihan@m.cs.osakafu-u.ac.jp, kise@cs.osakafu-u.ac.jp
More informationbag-of-words bag-of-keypoints Web bagof-keypoints Nearest Neighbor SVM Nearest Neighbor SIFT Nearest Neighbor bag-of-keypoints Nearest Neighbor SVM 84
Bag-of-Keypoints Web G.Csurka bag-of-keypoints Web Bag-of-keypoints SVM 5.% Web Image Classification with Bag-of-Keypoints Taichi joutou and Keiji yanai Recently, need for generic image recognition is
More informationMicrosoft PowerPoint - IBIS_harada _2.pptx
1 beach, water, people, kauai, tree ocean, coral, fish, angelfish, reefs Retrieving by flight 2 Image Annotation Results on Corel5K birds, booby, flight, rocks, water buildings, ships, bridge, flag, sky
More informationIPSJ SIG Technical Report Vol.2009-CVIM-167 No /6/10 Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing
Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing number of HOG Features based on Real AdaBoost Chika Matsushima, 1 Yuji Yamauchi, 1 Takayoshi Yamashita 1, 2 and
More information(b) BoF codeword codeword BoF (c) BoF Fergus Weber [11] Weber [12] Weber Fergus BoF (b) Fergus [13] Fergus 2. Fergus 2. 1 Fergus [3]
* A Multimodal Constellation Model for Generic Object Recognition Yasunori KAMIYA, Tomokazu TAKAHASHI,IchiroIDE, and Hiroshi MURASE Bag of Features (BoF) BoF EM 1. [1] Part-based Graduate School of Information
More informationIPSJ SIG Technical Report Vol.2012-CG-149 No.13 Vol.2012-CVIM-184 No /12/4 3 1,a) ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransa
3,a) 3 3 ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransac. DB [] [2] 3 DB Web Web DB Web NTT NTT Media Intelligence Laboratories, - Hikarinooka Yokosuka-Shi, Kanagawa 239-0847 Japan a) yabushita.hiroko@lab.ntt.co.jp
More informationMicrosoft PowerPoint - IBIS_harada_
2 Intelligence for Real World Recognition Natural Language Processing Database Pattern Recognition Information Theory Computer Vision Machine Learning Data Mining Robotics Cognitive Science Parallel Computing
More information(MIRU2010) Geometric Context Randomized Trees Geometric Context Rand
(MIRU2010) 2010 7 Geometric Context Randomized Trees 487-8501 1200 E-mail: {fukuta,ky}@vision.cs.chubu.ac.jp, hf@cs.chubu.ac.jp Geometric Context Randomized Trees 10 3, Geometric Context, Abstract Image
More information4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q
x-means 1 2 2 x-means, x-means k-means Bayesian Information Criterion BIC Watershed x-means Moving Object Extraction Using the Number of Clusters Determined by X-means Clustering Naoki Kubo, 1 Kousuke
More information,,, Twitter,,, ( ), 2. [1],,, ( ),,.,, Sungho Jeon [2], Twitter 4 URL, SVM,, , , URL F., SVM,, 4 SVM, F,.,,,,, [3], 1 [2] Step Entered
DEIM Forum 2016 C5-1 182-8585 1-5-1 E-mail: saitoh-ryoh@uec.ac.jp, terada.minoru@uec.ac.jp Twitter,, Twitter,,, Bag of Words, Latent Semantic Indexing,.,,,, Twitter,, Twitter,, 1. SNS, SNS Twitter 1,,,
More information2 Fig D human model. 1 Fig. 1 The flow of proposed method )9)10) 2.2 3)4)7) 5)11)12)13)14) TOF 1 3 TOF 3 2 c 2011 Information
1 1 2 TOF 2 (D-HOG HOG) Recall D-HOG 0.07 HOG 0.16 Pose Estimation by Regression Analysis with Depth Information Yoshiki Agata 1 and Hironobu Fujiyoshi 1 A method for estimating the pose of a human from
More information3: 2: 2. 2 Semi-supervised learning Semi-supervised learning [5,6] Semi-supervised learning Self-training [13] [14] Self-training Self-training Semi-s
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. 599-8531 1-1 E-mail: tsukada@m.cs.osakafu-u.ac.jp, {masa,kise}@cs.osakafu-u.ac.jp Semi-supervised learning
More informationDynamic Time Warping( DTW DTW 30 k-d tree Forebes [1] 2. DTW[2] DTW DTW DTW Forbes[1] k-d tree DTW Hsu[3] DTW Zhu[4] K-SVD Sun[5] Self-S
情報処理学会インタラクション 2015 IPSJ Interaction 2015 A62 2015/3/5 1,a) Natapon Pantuwong 2 1 1 1 Dynamic Time Warping 2 DTW DTW 30 k-d tree [1] A Rapid Motion Retrieval Technique using Simple and Discrete Representation
More informationTHE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. TRECVID2012 Instance Search {sak
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. TRECVID2012 Instance Search 599 8531 1 1 E-mail: {sakata,matozaki}@m.cs.osakafu-u.ac.jp, {kise,masa}@cs.osakafu-u.ac.jp
More informationPowerPoint プレゼンテーション
東京大学大学院情報理工学系研究科創造情報学専攻中山研究室中山英樹 どちらがハヤブサでしょう? http://plaza.rakuten.co.jp http://birds.mints.ne.jp 2 どちらがハヤブサでしょう? タカ ハヤブサ http://plaza.rakuten.co.jp http://birds.mints.ne.jp 3 車種はなんでしょう? 4 車種はなんでしょう?
More information情報処理学会研究報告 IPSJ SIG Technical Report Vol.2013-CVIM-186 No /3/15 EMD 1,a) SIFT. SIFT Bag-of-keypoints. SIFT SIFT.. Earth Mover s Distance
EMD 1,a) 1 1 1 SIFT. SIFT Bag-of-keypoints. SIFT SIFT.. Earth Mover s Distance (EMD), Bag-of-keypoints,. Bag-of-keypoints, SIFT, EMD, A method of similar image retrieval system using EMD and SIFT Hoshiga
More information& 3 3 ' ' (., (Pixel), (Light Intensity) (Random Variable). (Joint Probability). V., V = {,,, V }. i x i x = (x, x,, x V ) T. x i i (State Variable),
.... Deeping and Expansion of Large-Scale Random Fields and Probabilistic Image Processing Kazuyuki Tanaka The mathematical frameworks of probabilistic image processing are formulated by means of Markov
More information35_3_9.dvi
180 Vol. 35 No. 3, pp.180 185, 2017 Image Recognition by Deep Learning Hironobu Fujiyoshi and Takayoshi Yamashita Chubu University 1. 1990 2000 Scale-Invariant Feature Transform SIFT Histogram of Oriented
More informationComputational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate catego
Computational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate category preservation 1 / 13 analogy by vector space Figure
More informationMicrosoft Word - toyoshima-deim2011.doc
DEIM Forum 2011 E9-4 252-0882 5322 252-0882 5322 E-mail: t09651yt, sashiori, kiyoki @sfc.keio.ac.jp CBIR A Meaning Recognition System for Sign-Logo by Color-Shape-Based Similarity Computations for Images
More informationIPSJ SIG Technical Report Vol.2011-CVIM-177 No /5/ TRECVID2010 SURF Bag-of-Features 1 TRECVID SVM 700% MKL-SVM 883% TRECVID2010 MKL-SVM A
1 1 TRECVID2010 SURF Bag-of-Features 1 TRECVID SVM 700% MKL-SVM 883% TRECVID2010 MKL-SVM Analysis of video data recognition using multi-frame Kazuya Hidume 1 and Keiji Yanai 1 In this study, we aim to
More information情報処理学会研究報告 プレートマッチングによりリアルタイムに物体検出や追跡 を行うアプリケーションが提案されるなど近年モバイルと 画像認識の研究が盛んに行われている 本研究では視覚的 変化の大きい料理に対してスマートフォンの計算資源のみ を用いてリアルタイムに料理認識を行う ユーザインタラクティブな
Bag-of-SURF fast χ 2 kernel SVMs 5 GrabCut SVM 5 8.55%. CPU PC χ2 2. [] SIFT HOGGabor MKL-SVM Yang [2] FoodLog[3] c 22 Information Processing Society of Japan 情報処理学会研究報告 プレートマッチングによりリアルタイムに物体検出や追跡 を行うアプリケーションが提案されるなど近年モバイルと
More information1 IDC Wo rldwide Business Analytics Technology and Services 2013-2017 Forecast 2 24 http://www.soumu.go.jp/johotsusintokei/whitepaper/ja/h24/pdf/n2010000.pdf 3 Manyika, J., Chui, M., Brown, B., Bughin,
More informationFoodLog [3] TADAproject [4] Google Goggles 1 Kumar [5] () Leaf snap Maruyama [6] 3 Lee [7] Yu [8] Gist SVM Active Query Sensing(AQS)
DEIM Forum 213 D3-4 食事認識を用いたモバイル食事管理システム 河野 憲之 柳井 啓司 電気通信大学 電気通信学部 情報工学科 182-8585 東京都調布市調布ヶ丘 1-5-1 電気通信大学 大学院情報理工学研究科 総合情報学専攻 182-8585 東京都調布市調布ヶ丘 1-5-1 E-mail: kawano-y@mm.inf.uec.ac.jp, yanai@cs.uec.ac.jp
More information[6, 7] Caltech101[8] Caltech256[9] 20 1 Pascal VOC ,492 LSP15[10] Caltech % [8] %[11] 2.2. TinyImage
Large-Scale Generic Image Recognition and Image Representation Tatsuya Harada 1,2 1 The University of Toyo, 2 JST PRESTO harada@isi.imi.i.u-toyo.ac.jp Abstract 1. [1] [2] 3 2. Web Web Web 2.1. Corel5K[3]
More informationSICE東北支部研究集会資料(2017年)
307 (2017.2.27) 307-8 Deep Convolutional Neural Network X Detecting Masses in Mammograms Based on Transfer Learning of A Deep Convolutional Neural Network Shintaro Suzuki, Xiaoyong Zhang, Noriyasu Homma,
More information(MIRU2009) cuboid cuboid SURF 6 85% Web. Web Abstract Extracting Spatio-te
(MIRU2009) 2009 7 182 8585 1 5 1 E-mail: noguchi-a@mm.cs.uec.ac.jp, yanai@cs.uec.ac.jp cuboid cuboid SURF 6 85% Web. Web Abstract Extracting Spatio-temporal Local Features Considering Consecutiveness of
More information本文6(599) (Page 601)
(MIRU2008) 2008 7 525 8577 1 1 1 E-mail: matsuzaki@i.ci.ritsumei.ac.jp, shimada@ci.ritsumei.ac.jp Object Recognition by Observing Grasping Scene from Image Sequence Hironori KASAHARA, Jun MATSUZAKI, Nobutaka
More information(MIRU2008) HOG Histograms of Oriented Gradients (HOG)
(MIRU2008) 2008 7 HOG - - E-mail: katsu0920@me.cs.scitec.kobe-u.ac.jp, {takigu,ariki}@kobe-u.ac.jp Histograms of Oriented Gradients (HOG) HOG Shape Contexts HOG 5.5 Histograms of Oriented Gradients D Human
More information2008 : 80725872 1 2 2 3 2.1.......................................... 3 2.2....................................... 3 2.3......................................... 4 2.4 ()..................................
More information2.2 6).,.,.,. Yang, 7).,,.,,. 2.3 SIFT SIFT (Scale-Invariant Feature Transform) 8).,. SIFT,,. SIFT, Mean-Shift 9)., SIFT,., SIFT,. 3.,.,,,,,.,,,., 1,
1 1 2,,.,.,,, SIFT.,,. Pitching Motion Analysis Using Image Processing Shinya Kasahara, 1 Issei Fujishiro 1 and Yoshio Ohno 2 At present, analysis of pitching motion from baseball videos is timeconsuming
More information理工ジャーナル 23‐1☆/1.外村
Yoshinobu TONOMURA Professor, Department of Media Informatics 1 10 YouTube 2 1900 100 1 3 2 3 3 3 1 2 3 4 90 1 90 MIT Project Athena 1983 1991 2 3 4 5 6 7 8 9 10 2 90 11 12 7 13 14 15 16 17 18 19 390 5
More information例題ではじめる部分空間法 - パターン認識へのいざない -
- - ( ) 69 2012 5 22 (1) ( ) MATLAB/Octave 3 download http://www.tuat.ac.jp/ s-hotta/rsj2012 (2) ( ) [1] 対応付け 0 1 2 3 4 未知パターン ( クラスが未知 ) 利用 5 6 7 8 クラス ( 概念 ) 9 訓練パターン ( クラスが既知 ) (3) [1] 識別演算部 未知パターン
More informationkut-paper-template.dvi
26 Discrimination of abnormal breath sound by using the features of breath sound 1150313 ,,,,,,,,,,,,, i Abstract Discrimination of abnormal breath sound by using the features of breath sound SATO Ryo
More informationIPSJ SIG Technical Report Vol.2013-CG-153 No.19 Vol.2013-CVIM-189 No /11/29 1,a) 0 1 SIFT SURF 1. Scale-Invariant Feature Transform (SIFT)[16]
1,a) 0 1 SIFT SURF 1. Scale-Invariant Feature Transform (SIFT)[16] [14], [17] [6] 1 *1 SIFT 1 Shibuya CROSS TOWER 28th Floor 2-15-1 Shibuya Shibuya-ku Tokyo, 150-0002 Japan a) manbai@d-itlab.co.jp *1 Binary
More informationIPSJ SIG Technical Report Vol.2010-MPS-77 No /3/5 VR SIFT Virtual View Generation in Hallway of Cybercity Buildings from Video Sequen
VR 1 1 1 1 1 SIFT Virtual View Generation in Hallway of Cybercity Buildings from Video Sequences Sachiyo Yoshida, 1 Masami Takata 1 and Joe Kaduki 1 Appearance of Three-dimensional (3D) building model
More informationBDH Cao BDH BDH Cao Cao Cao BDH ()*$ +,-+.)*$!%&'$!"#$ 2. 1 Weng [4] Metric Learning Weng DB DB Yang [5] John [6] Sparse Coding sparse coding DB [7] K
Bucket Distance Hashing Metric Learning 1,a) 1,b) 1,c) 1,d) (DB) [1] DB Cao [2] Cao Metric Learning Cao Cao Cao Cao Cao 100 DB 10% 1. m DB DB DB 1 599 8531 1 1 Graduate School of Engineering, Osaka Prefecture
More informationOptical Flow t t + δt 1 Motion Field 3 3 1) 2) 3) Lucas-Kanade 4) 1 t (x, y) I(x, y, t)
http://wwwieice-hbkborg/ 2 2 4 2 -- 2 4 2010 9 3 3 4-1 Lucas-Kanade 4-2 Mean Shift 3 4-3 2 c 2013 1/(18) http://wwwieice-hbkborg/ 2 2 4 2 -- 2 -- 4 4--1 2010 9 4--1--1 Optical Flow t t + δt 1 Motion Field
More information一般画像認識のための単語概念の視覚性の分析
Bag-of-keypoints による カテゴリー認識 第 14 回画像センシングシンポジウム (SSII2008) 2008 年 6 月 13 日 電気通信大学 柳井啓司 情報工学科 2 アウトライン 1. イントロダクション 2. Bag-of-keypoints アプローチ その具体的な方法の詳細 3. Bag-of-keypoints アプローチの拡張 位置情報, 色情報の利用 4. 確率的言語モデルの画像への適用
More information_314I01BM浅谷2.indd
587 ネットワークの表現学習 1 1 1 1 Deep Learning [1] Google [2] Deep Learning [3] [4] 2014 Deepwalk [5] 1 2 [6] [7] [8] 1 2 1 word2vec[9] word2vec 1 http://www.ai-gakkai.or.jp/my-bookmark_vol31-no4 588 31 4 2016
More informationばらつき抑制のための確率最適制御
( ) http://wwwhayanuemnagoya-uacjp/ fujimoto/ 2011 3 9 11 ( ) 2011/03/09-11 1 / 46 Outline 1 2 3 4 5 ( ) 2011/03/09-11 2 / 46 Outline 1 2 3 4 5 ( ) 2011/03/09-11 3 / 46 (1/2) r + Controller - u Plant y
More informationLBP 2 LBP 2. 2 Local Binary Pattern Local Binary pattern(lbp) [6] R
DEIM Forum 24 F5-4 Local Binary Pattern 6 84 E-mail: {tera,kida}@ist.hokudai.ac.jp Local Binary Pattern (LBP) LBP 3 3 LBP 5 5 5 LBP improved LBP uniform LBP.. Local Binary Pattern, Gradient Local Auto-Correlations,,,,
More informationTrial for Value Quantification from Exceptional Utterances 37-066593 1 5 1.1.................................. 5 1.2................................ 8 2 9 2.1.............................. 9 2.1.1.........................
More informationTwitter‡Ì”À‰µ…c…C†[…g‡ðŠŸŠp‡µ‡½…^…C…•…›…C…fi‘ã‡Ì…l…^…o…„‘îŁñ„�™m
27 Twitter 1431050 2016 3 14 1 Twitter,,.,.,., Twitter,.,,.,,. URL,,,. BoW(Bag of Words), LSI(Latent Semantic Indexing)., URL,,,,., Accuracy, AUC(Area Under the Curve), Precision, Recall, F,. URL,,,.,
More informationConvolutional Neural Network A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolution
Convolutional Neural Network 2014 3 A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolutional Neural Network Fukui Hiroshi 1940 1980 [1] 90 3
More informationIPSJ SIG Technical Report iphone iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Proc
iphone 1 1 1 iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Processing Unit)., AR Realtime Natural Feature Tracking Library for iphone Makoto
More information(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s
1 1 1, Extraction of Transmitted Light using Parallel High-frequency Illumination Kenichiro Tanaka 1 Yasuhiro Mukaigawa 1 Yasushi Yagi 1 Abstract: We propose a new sharpening method of transmitted scene
More information& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro
TV 1,2,a) 1 2 2015 1 26, 2015 5 21 Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Rotation Using Mobile Device Hiroyuki Kawakita 1,2,a) Toshio Nakagawa 1 Makoto Sato
More information11) 13) 11),12) 13) Y c Z c Image plane Y m iy O m Z m Marker coordinate system T, d X m f O c X c Camera coordinate system 1 Coordinates and problem
1 1 1 Posture Esimation by Using 2-D Fourier Transform Yuya Ono, 1 Yoshio Iwai 1 and Hiroshi Ishiguro 1 Recently, research fields of augmented reality and robot navigation are actively investigated. Estimating
More informationIPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St
1 2 1, 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical Structures based on Phrase Similarity Yuma Ito, 1 Yoshinari Takegawa, 2 Tsutomu Terada 1, 3 and Masahiko Tsukamoto
More informationMicrosoft PowerPoint _KAIST_NTCIR5_Patent-DigitalPoster.ppt
Patent Document Retrieval and Classification at KAIST KAIST CS Dept. / BOLA 2005. 12. 8. Jae-Ho Kim, Jin-Xia Huang, Ha-Yong Jung, Key-Sun Choi Introduction Tasks Document retrieval subtask Theme categorization
More informationスライド 1
A SURF-based Spatio-Temporal Feature for Feature-fusion-based Action Recognition 1. Background & Objective action recognition object/scene recognition Bag-of-features (BoF) of spatiotemporal features [Dollar
More informationIPSJ SIG Technical Report Vol.2010-CVIM-171 No /3/19 1. Web 1 1 Web Web Web Multiple Kernel Learning(MKL) Web ( ) % MKL 68.8% Extractin
1. Web 1 1 Web Web Web Multiple Kernel Learning(MKL) Web ( ) 200 57.2% MKL 68.8% Extracting Spatio-Temporal Local Features for Classifying Web Video Shots Akitsugu Noguchi 1 and Keiji Yanai 1 Nowadays,
More informationMining Regional Representative Photos from a Large-scale Geotagged Image Database
Web 上のジオタグ画像を用いた 世界各地の文化的差異の発見 2009 年度人工知能学会全国大会 2009 年 6 月高松 柳井啓司 電気通信大学情報工学科 研究の背景 Web には, ラーメンがいっぱい やっぱり, どこのラーメンか知りたい! アウトライン 研究の背景 目的 関連研究 方法 実験結果 まとめと今後の課題 背景 : 大量のジオタグ画像の登場 近年, 位置情報付き画像 (geo-tagged
More informationHOG HOG LBP LBP 4) LBP LBP Wang LBP HOG LBP 5) LBP LBP 1 r n 1 n, 1
1 1 1 Shwartz Histgrams of Oriented Gradients HOG PLS PLS KPLS INRIA PLS KPLS KPLS PLS Pedestrian Detection Using Kernel Partial Least Squares Analysis Takashi Abe, 1 Takayuki Okatani 1 and Kouichiro Deguchi
More informationx T = (x 1,, x M ) x T x M K C 1,, C K 22 x w y 1: 2 2
Takio Kurita Neurosceince Research Institute, National Institute of Advanced Indastrial Science and Technology takio-kurita@aistgojp (Support Vector Machine, SVM) 1 (Support Vector Machine, SVM) ( ) 2
More information% 2 3 [1] Semantic Texton Forests STFs [1] ( ) STFs STFs ColorSelf-Simlarity CSS [2] ii
2012 3 A Graduation Thesis of College of Engineering, Chubu University High Accurate Semantic Segmentation Using Re-labeling Besed on Color Self Similarity Yuko KAKIMI 2400 90% 2 3 [1] Semantic Texton
More information*2.5mm ”ŒŠá‡ÆfiÁ™¥‡Ì…Z†[…t…X…N…−†[…j…fi…O
I. Takeuchi, Nagoya Institute of Technology 1/38 f(x) = w 1 x 1 + w 2 x 2 +... + w d x d f(x) = α 1 K(x, x 1 ) + α 2 K(x, x 2 ) +... + α n K(x, x n ) {wj } d j=1 f {αi } n i=1 f I. Takeuchi, Nagoya Institute
More information情報処理学会研究報告 IPSJ SIG Technical Report Vol.2014-GN-90 No.6 Vol.2014-CDS-9 No.6 Vol.2014-DCC-6 No /1/23 Bullet Time 1,a) 1 Bullet Time Bullet Time
Bullet Time 1,a) 1 Bullet Time Bullet Time Generation Technique and Eveluation on High-Resolution Bullet-Time Camera Work Ryuuki Sakamoto 1,a) Ding Chen 1 Abstract: The multi-camera environment have been
More information0. Intro ( K CohFT etc CohFT 5.IKKT 6.
E-mail: sako@math.keio.ac.jp 0. Intro ( K 1. 2. CohFT etc 3. 4. CohFT 5.IKKT 6. 1 µ, ν : d (x 0,x 1,,x d 1 ) t = x 0 ( t τ ) x i i, j, :, α, β, SO(D) ( x µ g µν x µ µ g µν x ν (1) g µν g µν vector x µ,y
More informationLinear Distance Metric Learning for Large-scale Generic Image Recognition Hideki Nakayama Graduate School of Information Science and Technology The University of Tokyo A thesis submitted for the degree
More information48_16_1.dvi
Vol. 48 No. SIG 16(CVIM 19) Nov. 2007 1 1 101 6 The Current State and Future Directions on Generic Object Recognition Keiji Yanai Generic object recognition aims at enabling a computer to recognize objects
More informationIntroduction of Self-Organizing Map * 1 Ver. 1.00.00 (2017 6 3 ) *1 E-mail: furukawa@brain.kyutech.ac.jp i 1 1 1.1................................ 2 1.2...................................... 4 1.3.......................
More informationIPSJ SIG Technical Report Vol.2015-MUS-107 No /5/23 HARK-Binaural Raspberry Pi 2 1,a) ( ) HARK 2 HARK-Binaural A/D Raspberry Pi 2 1.
HARK-Binaural Raspberry Pi 2 1,a) 1 1 1 2 3 () HARK 2 HARK-Binaural A/D Raspberry Pi 2 1. [1,2] [2 5] () HARK (Honda Research Institute Japan audition for robots with Kyoto University) *1 GUI ( 1) Python
More information2007/8 Vol. J90 D No. 8 Stauffer [7] 2 2 I 1 I 2 2 (I 1(x),I 2(x)) 2 [13] I 2 = CI 1 (C >0) (I 1,I 2) (I 1,I 2) Field Monitoring Server
a) Change Detection Using Joint Intensity Histogram Yasuyo KITA a) 2 (0 255) (I 1 (x),i 2 (x)) I 2 = CI 1 (C>0) (I 1,I 2 ) (I 1,I 2 ) 2 1. [1] 2 [2] [3] [5] [6] [8] Intelligent Systems Research Institute,
More information[1] SBS [2] SBS Random Forests[3] Random Forests ii
Random Forests 2013 3 A Graduation Thesis of College of Engineering, Chubu University Proposal of an efficient feature selection using the contribution rate of Random Forests Katsuya Shimazaki [1] SBS
More informationDEIM Forum 2019 A7-1 Flexible Distance-based Hashing mori
DEIM Forum 2019 A7-1 Flexible Distance-based Hashing 731 3194 E-mail: mc66023@e.hiroshima-cu.ac.jp,{wakaba,s naga,inagi,yoko}@hiroshima-cu.ac.jp, morikei18@gmail.com Flexible Distance-based Hashing(FDH)
More informationmain.dvi
CDMA 1 CDMA ( ) CDMA CDMA CDMA 1 ( ) Hopfield [1] Hopfield 1 E-mail: okada@brain.riken.go.jp 1 1: 1 [] Hopfield Sourlas Hopfield [3] Sourlas 1? CDMA.1 DS/BPSK CDMA (Direct Sequence; DS) (Binary Phase-Shift-Keying;
More informationthesis.dvi
2007 Graph Cuts Graph Cuts Graph Cuts Graph Cuts t-link Interactive Graph Cuts 4.7% Mean Shift Segmentation 1 1 2 3 2.1.................... 3 2.1.1............................. 3 2.2...........................
More informationIPSJ SIG Technical Report Vol.2014-CG-155 No /6/28 1,a) 1,2,3 1 3,4 CG An Interpolation Method of Different Flow Fields using Polar Inter
,a),2,3 3,4 CG 2 2 2 An Interpolation Method of Different Flow Fields using Polar Interpolation Syuhei Sato,a) Yoshinori Dobashi,2,3 Tsuyoshi Yamamoto Tomoyuki Nishita 3,4 Abstract: Recently, realistic
More information特別寄稿.indd
特別寄稿 ソフトインフラとしてのデジタル地図を活用した自動運転システム Autonomous vehicle using digital map as a soft infrastructure 菅沼直樹 Naoki SUGANUMA 1. はじめに 1) 2008 2012 ITS 2) CO 2 3) 4) Door to door Door to door Door to door DARPA(
More informationIPSJ-CVIM
1 1 2 1 Estimation of Shielding Object Distribution in Scattering Media by Analyzing Light Transport Shosei Moriguchi, 1 Yasuhiro Mukaigawa, 1 Yasuyuki Matsushita 2 and Yasushi Yagi 1 In this paper, we
More informationIS2-06 第21回画像センシングシンポジウム 横浜 2015年6月 画像をスーパーピクセルに変換する手法として SLIC[5] を用いる Achanta らによって提案された SLIC 2.2 グラフマッチング は K-means をベースにした手法で 単純な K-means に いる SPIN
Cosegmentation E-mail: {tamanaha, nakayama}@nlab.ci.i.u-tokyo.ac.jp Abstract Cosegmentation Cosegmentation Cosegmentation 1 Never Ending Image Learner[1] Google Cosegmentation Cosegmentation Rother [2]
More informationIPSJ SIG Technical Report Vol.2012-CVIM-180 No /1/20 RGB-D 1 1, 2 1 RGB-D Interactive Object Recognition for Service Robot using an RGB-D Camer
RGB-D 1 1, 2 1 RGB-D Interactive Object Recognition for Service Robot using an RGB-D Camera Hisato Fukuda, 1 Yoshinori Kobayashi 1, 2 and Yoshinori Kuno 1 Service robots need to be able to recognize objects
More informationSobel Canny i
21 Edge Feature for Monochrome Image Retrieval 1100311 2010 3 1 3 3 2 2 7 200 Sobel Canny i Abstract Edge Feature for Monochrome Image Retrieval Naoto Suzue Content based image retrieval (CBIR) has been
More information28 TCG SURF Card recognition using SURF in TCG play video
28 TCG SURF Card recognition using SURF in TCG play video 1170374 2017 3 2 TCG SURF TCG TCG OCG SURF Bof 20 20 30 10 1 SURF Bag of features i Abstract Card recognition using SURF in TCG play video Haruka
More information,.,. NP,., ,.,,.,.,,, (PCA)...,,. Tipping and Bishop (1999) PCA. (PPCA)., (Ilin and Raiko, 2010). PPCA EM., , tatsukaw
,.,. NP,.,. 1 1.1.,.,,.,.,,,. 2. 1.1.1 (PCA)...,,. Tipping and Bishop (1999) PCA. (PPCA)., (Ilin and Raiko, 2010). PPCA EM., 152-8552 2-12-1, tatsukawa.m.aa@m.titech.ac.jp, 190-8562 10-3, mirai@ism.ac.jp
More information22_04.dvi
Vol. 1 No. 2 32 40 (July 2008) 1, 2 1 Speaker Segmentation Using Audiovisual Correlation Yuyu Liu 1, 2 and Yoichi Sato 1 Audiovisual correlation has been used successfully for audio source localization.
More informationMicrosoft Word - mitomi_v06.doc
MSS mitomi@edm.bosai.go.jp matsuoka@edm.bosai.go.jp yamazaki@edm.bosai.go.jp taniguchi@manage.nitech.ac.jp 1 MSS MSS 2 2 1 m MSS CCT CCT Fig.1 CCT b02-b0 b0-b0b-b b-b1 CCT Landsat/TM MSS S/N 21x21 21x21
More informationohpmain.dvi
fujisawa@ism.ac.jp 1 Contents 1. 2. 3. 4. γ- 2 1. 3 10 5.6, 5.7, 5.4, 5.5, 5.8, 5.5, 5.3, 5.6, 5.4, 5.2. 5.5 5.6 +5.7 +5.4 +5.5 +5.8 +5.5 +5.3 +5.6 +5.4 +5.2 =5.5. 10 outlier 5 5.6, 5.7, 5.4, 5.5, 5.8,
More information12_39.dvi
Vol. 52 No. 12 3588 3592 (Dec. 2011) Web 1, 1 1 2 1 1 1 Web GPS Creation of a Sight-seeing Map with Visual Classification of Photos on the Web Jiani Wang, 1, 1 Masafumi Noda, 1 Tomokazu Takahashi, 2 Daisuke
More informationISCO自動コーディングシステムの分類精度向上に向けて―SSM およびJGSS データセットによる実験の結果―
ISCO SSM JGSS Improvement of Classification Accuracy in an ISCO Automatic Coding System: Results of Experiments Using both the SSM Dataset and the JGSS Dataset Kazuko TAKAHASHI Faculty of International
More information塗装深み感の要因解析
17 Analysis of Factors for Paint Depth Feeling Takashi Wada, Mikiko Kawasumi, Taka-aki Suzuki ( ) ( ) ( ) The appearance and quality of objects are controlled by paint coatings on the surfaces of the objects.
More information1 Kinect for Windows M = [X Y Z] T M = [X Y Z ] T f (u,v) w 3.2 [11] [7] u = f X +u Z 0 δ u (X,Y,Z ) (5) v = f Y Z +v 0 δ v (X,Y,Z ) (6) w = Z +
3 3D 1,a) 1 1 Kinect (X, Y) 3D 3D 1. 2010 Microsoft Kinect for Windows SDK( (Kinect) SDK ) 3D [1], [2] [3] [4] [5] [10] 30fps [10] 3 Kinect 3 Kinect Kinect for Windows SDK 3 Microsoft 3 Kinect for Windows
More informationスライド 1
CMOS : swk(at)ic.is.tohoku.ac.jp [ 2003] [Wong1999] 2 : CCD CMOS 3 : CCD Q Q V 4 : CMOS V C 5 6 CMOS light input photon shot noise α quantum efficiency dark current dark current shot noise dt time integration
More information,,,,,,,,,,,,,,,,,,, 976%, i
20 Individual Recognition using positions of facial parts 1115081 2009 3 5 ,,,,,,,,,,,,,,,,,,, 976%, i Abstract Individual Recognition using positions of facial parts YOSHIHIRO Arisawa A facial recognition
More informationERATO100913
ERATO September 13, 2010, DC2 1/25 1. 2 2. 2/25 3/25 3/25 2 3/25 2 3/25 1 1 0.5 0.5 0 0 0.5 1 0 0 0.5 1 4/25 1 1 0.5 0.5 0 0 0.5 1 (0, 0) 0 0 0.5 1 4/25 1 1 0.5 0.5 0 0 0.5 1 (0, 0) ( 1, 0) 0 0 0.5 1 4/25
More information1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2
CHLAC 1 2 3 3,. (CHLAC), 1).,.,, CHLAC,.,. Suspicious Behavior Detection based on CHLAC Method Hideaki Imanishi, 1 Toyohiro Hayashi, 2 Shuichi Enokida 3 and Toshiaki Ejima 3 We have proposed a method for
More informationuntitled
DEIM Forum 2019 I2-4 305-8573 1-1-1 305-8573 1-1-1 305-8573 1-1-1 ( ) 151-0053 1-3-15 6F 101-8430 2-1-2 CNN LSTM,,,, Measuring Beginner Friendliness / Visiual Intelligibility of Web Pages explaining Academic
More information情報処理学会研究報告 Vol.2013-CG-153 No.6 Vol.2013-CVIM-189 No /11/28 IPSJ SIG Technical Report 逐次的四面体カービング法を用いた 3D モデリング 鳥居 秋彦1 杉浦 貴行1 奥富 正敏1 概要 画像が次々と入力
情報処理学会研究報告 逐次的四面体カービング法を用いた 3D モデリング 鳥居 秋彦 杉浦 貴行 奥富 正敏 概要 画像が次々と入力され Structure from Motion (SfM) によって疎な 3D 点群とカメラポーズが与え られる場合 逐次的に効率良くサーフェス生成を行う手法を提案する 提案手法では 四面体を削り出す サーフェス抽出法を 視線と四面体の交差の効率的な検出方法と ダイナミックグラフカットを適用した
More informationNo. 3 Oct The person to the left of the stool carried the traffic-cone towards the trash-can. α α β α α β α α β α Track2 Track3 Track1 Track0 1
ACL2013 TACL 1 ACL2013 Grounded Language Learning from Video Described with Sentences (Yu and Siskind 2013) TACL Transactions of the Association for Computational Linguistics What Makes Writing Great?
More informationuntitled
c ILSVRC LeNet 1. 1 convolutional neural network 1980 Fukushima [1] [2] 80 LeCun (back propagation) LeNet [3, 4] LeNet 2. 2.1 980 8579 6 6 01 okatani@vision.is.tohoku.ac.jp (simple cell) (complex cell)
More informationDirichlet process mixture Dirichlet process mixture 2 /40 MIRU2008 :
Dirichlet Process : joint work with: Max Welling (UC Irvine), Yee Whye Teh (UCL, Gatsby) http://kenichi.kurihara.googlepages.com/miru_workshop.pdf 1 /40 MIRU2008 : Dirichlet process mixture Dirichlet process
More informationGoogle Goggles [1] Google Goggles Android iphone web Google Goggles Lee [2] Lee iphone () [3] [4] [5] [6] [7] [8] [9] [10] :
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.,, 182-8585 1-5-1 E-mail: {maruya-t,akiyama-m}@mm.inf.uec.ac.jp, yanai@cs.uec.ac.jp SURF Bag-of-Features
More informationuntitled
(Robot Vision) Vision ( (computer) Machine VisionComputer Vision ( ) ( ) ( ) ( ) ( ) 1 DTV 2 DTV D 3 ( ( ( ( ( DTV D 4 () 5 A B C D E F G H I A B C D E F G H I I = A + D + G - C - F - I J = A + B + C -
More information