IPSJ SIG Technical Report Vol.2013-CVIM-187 No /5/30 1,a) 1,b), 1,,,,,,, (DNN),,,, 2 (CNN),, 1.,,,,,,,,,,,,,,,,,, [1], [6], [7], [12], [13]., [

Similar documents
Convolutional Neural Network A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolution

IPSJ SIG Technical Report Vol.2012-CG-149 No.13 Vol.2012-CVIM-184 No /12/4 3 1,a) ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransa

IPSJ SIG Technical Report Vol.2010-CVIM-170 No /1/ Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Ta

SICE東北支部研究集会資料(2017年)

光学

(MIRU2010) Geometric Context Randomized Trees Geometric Context Rand

Microsoft PowerPoint - SSII_harada pptx

untitled

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s

Duplicate Near Duplicate Intact Partial Copy Original Image Near Partial Copy Near Partial Copy with a background (a) (b) 2 1 [6] SIFT SIFT SIF

3: 2: 2. 2 Semi-supervised learning Semi-supervised learning [5,6] Semi-supervised learning Self-training [13] [14] Self-training Self-training Semi-s

IPSJ SIG Technical Report iphone iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Proc

IPSJ SIG Technical Report Vol.2017-CVIM-207 No /5/10 GAN 1,a) 2,b) Generative Adversarial Networks GAN GAN CIFAR-10 10% GAN GAN Stacked GAN Sta

4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q

03_特集2_3校_0929.indd

IPSJ SIG Technical Report Vol.2009-CVIM-167 No /6/10 Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing

28 TCG SURF Card recognition using SURF in TCG play video

[1] SBS [2] SBS Random Forests[3] Random Forests ii

IPSJ SIG Technical Report Vol.2017-MUS-116 No /8/24 MachineDancing: 1,a) 1,b) 3 MachineDancing MachineDancing MachineDancing 1 MachineDan

3 2 2 (1) (2) (3) (4) 4 4 AdaBoost 2. [11] Onishi&Yoda [8] Iwashita&Stoica [5] 4 [3] 3. 3 (1) (2) (3)

untitled

本文6(599) (Page 601)

_314I01BM浅谷2.indd

Silhouette on Image Object Silhouette on Images Object 1 Fig. 1 Visual cone Fig. 2 2 Volume intersection method Fig. 3 3 Background subtraction Fig. 4

Deep Learning によるビッグデータ解析 ~ 手法や CUDA による高速化 2014 年 9 月 5 日 G-DEP ソリューションパートナー株式会社システム計画研究所奥村義和

Mastering the Game of Go without Human Knowledge ( ) AI 3 1 AI 1 rev.1 (2017/11/26) 1 6 2

Microsoft Word - toyoshima-deim2011.doc

LBP 2 LBP 2. 2 Local Binary Pattern Local Binary pattern(lbp) [6] R

No. 3 Oct The person to the left of the stool carried the traffic-cone towards the trash-can. α α β α α β α α β α Track2 Track3 Track1 Track0 1

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. TRECVID2012 Instance Search {sak

Google Goggles [1] Google Goggles Android iphone web Google Goggles Lee [2] Lee iphone () [3] [4] [5] [6] [7] [8] [9] [10] :

ディープラーニングとオープンサイエンス ~研究の爆速化が引き起こす摩擦なき情報流通へのシフト~

bag-of-words bag-of-keypoints Web bagof-keypoints Nearest Neighbor SVM Nearest Neighbor SIFT Nearest Neighbor bag-of-keypoints Nearest Neighbor SVM 84

(MIRU2009) cuboid cuboid SURF 6 85% Web. Web Abstract Extracting Spatio-te

1 Kinect for Windows M = [X Y Z] T M = [X Y Z ] T f (u,v) w 3.2 [11] [7] u = f X +u Z 0 δ u (X,Y,Z ) (5) v = f Y Z +v 0 δ v (X,Y,Z ) (6) w = Z +

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

35_3_9.dvi

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

(MIRU2008) HOG Histograms of Oriented Gradients (HOG)

IPSJ SIG Technical Report Vol.2015-MPS-103 No.29 Vol.2015-BIO-42 No /6/24 Deep Convolutional Neural Network 1,a) 1,b),c) X CT (Computer Aided D

% 2 3 [1] Semantic Texton Forests STFs [1] ( ) STFs STFs ColorSelf-Simlarity CSS [2] ii

PowerPoint Presentation

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2013-CVIM-186 No /3/15 EMD 1,a) SIFT. SIFT Bag-of-keypoints. SIFT SIFT.. Earth Mover s Distance


paper.dvi

,,.,.,,.,.,.,.,,.,..,,,, i

2 Fig D human model. 1 Fig. 1 The flow of proposed method )9)10) 2.2 3)4)7) 5)11)12)13)14) TOF 1 3 TOF 3 2 c 2011 Information

Microsoft PowerPoint - cvim_harada pptx

Haiku Generation Based on Motif Images Using Deep Learning Koki Yoneda 1 Soichiro Yokoyama 2 Tomohisa Yamashita 2 Hidenori Kawamura Scho

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2011-MBL-57 No.27 Vol.2011-UBI-29 No /3/ A Consideration of Features for Fatigue Es

SICE東北支部研究集会資料(2013年)

Vol. 44 No. SIG 9(CVIM 7) ) 2) 1) 1 2) 3 7) 1) 2) 3 3) 4) 5) (a) (d) (g) (b) (e) (h) No Convergence? End (f) (c) Yes * ** * ** 1

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

2. 30 Visual Words TF-IDF Lowe [4] Scale-Invarient Feature Transform (SIFT) Bay [1] Speeded Up Robust Features (SURF) SIFT 128 SURF 64 Visual Words Ni

IPSJ SIG Technical Report Vol.2011-CVIM-177 No /5/ TRECVID2010 SURF Bag-of-Features 1 TRECVID SVM 700% MKL-SVM 883% TRECVID2010 MKL-SVM A

IS1-09 第 回画像センシングシンポジウム, 横浜,14 年 6 月 2 Hough Forest Hough Forest[6] Random Forest( [5]) Random Forest Hough Forest Hough Forest 2.1 Hough Forest 1 2.2

IPSJ SIG Technical Report Vol.2015-MUS-107 No /5/23 HARK-Binaural Raspberry Pi 2 1,a) ( ) HARK 2 HARK-Binaural A/D Raspberry Pi 2 1.

24 Perceived depth position of autostereoscopic stimulus

2.2 6).,.,.,. Yang, 7).,,.,,. 2.3 SIFT SIFT (Scale-Invariant Feature Transform) 8).,. SIFT,,. SIFT, Mean-Shift 9)., SIFT,., SIFT,. 3.,.,,,,,.,,,., 1,

Vol1-CVIM-172 No.7 21/5/ Shan 1) 2 2)3) Yuan 4) Ancuti 5) Agrawal 6) 2.4 Ben-Ezra 7)8) Raskar 9) Image domain Blur image l PSF b / = F(

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf


画像分野におけるディープラーニングの新展開

(b) BoF codeword codeword BoF (c) BoF Fergus Weber [11] Weber [12] Weber Fergus BoF (b) Fergus [13] Fergus 2. Fergus 2. 1 Fergus [3]

Computer Security Symposium October ,a) 1,b) Microsoft Kinect Kinect, Takafumi Mori 1,a) Hiroaki Kikuchi 1,b) [1] 1 Meiji U

Gaze Head Eye (a) deg (b) 45 deg (c) 9 deg 1: - 1(b) - [5], [6] [7] Stahl [8], [9] Fang [1], [11] Itti [12] Itti [13] [7] Fang [1],

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St

[1], []. AlphaZero 4 TPU 7 elmo 90 8 [3] AlphaZero 1 TPU TPU 64 [3] AlphaZero elmo AlphaZero [3] [4]AlphaZero [3].3 Saliency Map [5] Smooth- Gra

The 18th Game Programming Workshop ,a) 1,b) 1,c) 2,d) 1,e) 1,f) Adapting One-Player Mahjong Players to Four-Player Mahjong


DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

@08470030ヨコ/篠塚・窪田 221号

Deep Learningとは

The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). The material has been made available on the website

main.dvi

1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,

2017 (413812)

II III I ~ 2 ~

中堅中小企業向け秘密保持マニュアル


PR映画-1


- 2 -


1 (1) (2)

( ) /

1(a) (b),(c) - [5], [6] Itti [12] [13] gaze eyeball head 2: [time] [7] Stahl [8], [9] Fang [1], [11] 3 -

34 (2017 ) Advances in machine learning technologies make inductive programming a reality. As opposed to the conventional (deductive) programming, the

PowerPoint プレゼンテーション

yoo_graduation_thesis.dvi

Fig. 2 28th Ryuou Tournament, Match 5, 59th move. The last move is Black s Rx5f. 1 Tic-Tac-Toe Fig. 1 AsearchtreeofTic-Tac-Toe. [2] [3], [4]

IPSJ SIG Technical Report Vol.2010-CVIM-171 No /3/19 1. Web 1 1 Web Web Web Multiple Kernel Learning(MKL) Web ( ) % MKL 68.8% Extractin

SURF,,., 55%,.,., SURF(Speeded Up Robust Features), 4 (,,, ), SURF.,, 84%, 96%, 28%, 32%.,,,. SURF, i

(3.6 ) (4.6 ) 2. [3], [6], [12] [7] [2], [5], [11] [14] [9] [8] [10] (1) Voodoo 3 : 3 Voodoo[1] 3 ( 3D ) (2) : Voodoo 3D (3) : 3D (Welc

26 Development of Learning Support System for Fixation of Basketball Shoot Form

24 Depth scaling of binocular stereopsis by observer s own movements

IPSJ SIG Technical Report Vol.2010-MPS-77 No /3/5 VR SIFT Virtual View Generation in Hallway of Cybercity Buildings from Video Sequen

29 jjencode JavaScript

Juntendo Medical Journal

xx/xx Vol. Jxx A No. xx 1 Fig. 1 PAL(Panoramic Annular Lens) PAL(Panoramic Annular Lens) PAL (2) PAL PAL 2 PAL 3 2 PAL 1 PAL 3 PAL PAL 2. 1 PAL

Transcription:

,a),b),,,,,,,, (DNN),,,, (CNN),,.,,,,,,,,,,,,,,,,,, [], [6], [7], [], [3]., [8], [0], [7],,,, Tohoku University a) omokawa@vision.is.tohoku.ac.jp b) okatani@vision.is.tohoku.ac.jp, [3],, (DNN), DNN, [3], DNN,,, DNN,,,,,,,, DNN, DNN, (CNN)[4], [9], (i) CNN, c 03 Information Processing Society of Japan

(ii) CNN (iii) CNN., [7], [], [3] Motoyoshi [3],,,,,,,, [], [6],,,, Dror [6],, SVM,,,,,, [5], Abe [], Liu [0],,,, SIFT, Edge-slice, Hu [8] Large Margin Nearest Neighbor Liu, Timofte van Gool, [5] BIF(Basic Image Features),, [7],,,,, 3. 3., CG, Autodesk Maya., Maya. : Interior. : Nature. mental ray,, mentalray mia material, miamaterial,,, 8 8 3.., Dosch Design Interior NatureV HDR 40, 80,, Interior, Nature, DNN,,, y 0, 7, 44, 6, 88,, x, z 30, 5, 0, 5, 30,, x, z,,,, 5 5 5 5 3.. mia material, 5, (refl_gloss),,., 90 (brdf_90_degree_refl).0, 0 (brdf_0_degree_refl), 0.0.0..0,., 5, (0.,0.8), (0.8,0.8), (0.5,0.5), (0.8,0.), (0.,0.).,. c 03 Information Processing Society of Japan

3 4 5 5. ID. Interior, Nature. 3.,.,,., Interior Nature 3, 8.,, 8000( 3 5), 000( 8 5). 3 CNN.,,, softmax. CNN,,.,,,,., CNN, 50. 3.3, cuda-convnet * CNN.,,.,, C, P,, N, N.,,,,., C7, x, y, 7 7.,, 3,,, 7 7., f(x) =max(0,x) NN [9].,,,,. 3, CNN, C 7P 5 N 4 C 3P 3 N. CNN, 5,. * http://code.google.com/p/cuda-convnet/, [9]. 3.4 3.4. SIFT-BoF CNN, bag-of-features,, 3 SIFT, 7 7, SIFT 000 visual words visual words, k-means, BoF,, SVM,,,, BoF, SIFT-BoF,,,, CNN,, 3.4.,, 0 5 4,, Interior, Nature 8,, 80(= 5 8), 80 c 03 Information Processing Society of Japan 3

0.95 0.03 0.0 0.94 0.0 0.04 0.00 0.006 0.97 0.03 3 0.03 0.05 0.75 0.7 4 0.05 0.3 0.70 0.0 0.80 0.8 0.0 3 0.00 0.00 0.99 0.003 4 0.009 0.08 0.3 0.59 0.006 5 0.09 0.0 0.89 5 0.37 0.004 0.07 0.03 0.53 4 5. CNNsingle, -double and -triple,,,3 CNN. 3 4 5 (a) 0.90 0.00 0.00 0.0 3 4 5 (b) 0.97 0.00 0.03, 3, 5, 80,,,, EIZO FlexScan SX76W LCD 4. 5 0.00 0.87 0.06 0.07 3 0.0 0.006 0.85 0.3 0.003 4 0.003 0.005 0. 0.85 0.0 5 0.0 0.003 0.0 0.87 3 4 5 (c) 0.8 0.6 0.03 3 0.0 0.77 0.3 4 0.07 0.93 0.00 5 0.07 0.0 0.0 0.90 3 4 5 (d). (a). (b) CNN( ). (c) CNN( ). (d) CNN(3 ). 4. 4,,, SIFT-BoF(SVM),,,,3 CNN,,, 3 CNN 5, CNN, CNN,, C7P 3N 8, C7P 5 N4 C3P 3 N, C7P 3 N 9 C4 N 9 P3 C3 N 9 P3, CNN, SIFT-BoF(SVM),, 3 CNN, CNN,, 5,, CNN, CNN, 5, 3 4, CNN, 3, 4. 6, CNN,, [0:55], CNN,,,,,, 7 Motoyoshi [3].,,,.,,.,,,,,, Motoyoshi, [3],,,, 7,,, NN., CNN [9], c 03 Information Processing Society of Japan 4

CNN( ) CNN( ) CNN(3 ) 6. CNN,,,,. [4] CNN,,,, miamaterial,,, 5,,,,, 4 8, 5 CNN,.% 6.95% CNN,,,,, 4, 5, 5 8,,,,,., 6,,,,, CNN, 5,,, Interiror Nature, CNN 95, 8,,, CNN,,,.,, CNN,,,, [9],, 5., CNN, CNN,,, CNN, CNN,, Motoyoshi,,, Motoyoshi,, [], CNN,, [] T. Abe, T. Okatani, and K. Deguchi. Recognizing surface qualities from natural images based on learning to rank. In Proc. ICPR, pages 37-375, 0. [] B. L. Anderson and J. Kim. Image statistics do not explain the perception of gloss and lightness. journal of Vision, 9(), 009. [3] Y. Bengio, A. C. Courville, and P. Vincent. Unsuper- c 03 Information Processing Society of Japan 5

(Interior/Nature) 8 CNN 3.,,, (Interior/Nature). vised feature learning and deep learning: A review and new perspectives. CoRR, abs/06.5538, 0. [4] D. C. Ciresan, U. Meier, and J. Schmidhuber. Multicolumn deep neural networks for image classification. In Proc. CVPR, 0. [5] M. Crosier and L. D. Griffin. Using basic image features for texture classification. International Journal of Computer Vision, 88(3):447-460, 00. [6] R. O. Dror, E. H. Adelson, and A. S. Willsky. Recognition of surface reflectance properties from a single image under unknown real-world illumination. In Proc. the Workshop on Identifying Objects Across Variations in Lighting at CVPR, 00. [7] C. Hiramatsu, N. Goda, and H. Komatsu. Transformation from image-based to perceptual representation of materials along the human ventral visual pathway. NeuroImage, 57():48-494, 0. [8] D. Hu, L. Bo, and X. Ren. Toward robust material recognition for everyday objects. In Proc. BMVC, pages 48.- 48., 0. [9] A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 0. [0] C. Liu, L. Sharan, E. H. Adelson, and R. Rosenholtz. Exploring features in a bayesian framework for material recognition. In Proc. CVPR, 00. [] S. Lombardi and K. Nishino. Reflectance and natural illumination from a single image. In Proc. ECCV, 0. [] I. Motoyoshi. Highlight shading relationship as a cue for the perception of translucent and transparent materials. Journal of Vision, 0(9), 00. [3] I. Motoyoshi, S. Nishida, L. Sharan, and E. H. Adelson. Image statistics and the perception of surface qualities. Nature, 447(74):06-09, 007. [4] B. A. Olshausen and D. J. Field. Emergence of simplecell receptive field properties by learning a sparse code for natural images. Nature, 38(6583):607-609, 996. [5] D. Parikh and K. Grauman. Relative attributes. In Proc. ICCV, 0. [6] F. Romeiro and T. Zickler. Blind reflectrometry. In Proc. ECCV, 00. [7] R. Timofte and L. Van Gool. A training-free classification framework for textures, writers,and materials. In Proc. BMVC, pages 93.-93., 0. c 03 Information Processing Society of Japan 6