( ) /
|
|
- ひでき かなり
- 5 years ago
- Views:
Transcription
1 NAIST-IS-MT
2 ( ) /
3 , NAIST-IS-MT , i
4 80% ii
5 Finding Important People in a Video using a Deep Neural Network with Conditional Random Field Atsushi Nishida Abstract Finding important regions is essential for applications like content-aware video compression and video retargeting, which automatically crops an important region in a video for small screens. Various models for important region estimation have been proposed. Since people are one of the main content of videos, some methods for finding important regions use face detection. However, those existing methods usually do not distinguish important people from passers-by in a video. This thesis proposes a method to classify people in a video frame into important or non-important ones. Generally, this classification problem is not well designed because who is important or not may differ viewer by viewer. Therefore, instead of the viewers perspective, we use videographers perspective. That is, our method finds people who are important for the videographer. Since viewers try to understand what the videographer wants to express in the video, important people for viewers and videographers may highly correlate. It is considered that videographers have a certain tendency in, e.g, how to move the camera when taking the video, such as placing important people near the center of the video frame. Since videographers such behavior is reflected in the trajectories and sizes of face regions, we use them as features for the classification. In addition, Master s Thesis, Graduate School of Information Science, Nara Institute of Science and Technology, NAIST-IS-MT , March 16, iii
6 as visual cues like the orientation of faces are helpful for important person classification, the proposed method exploits visual features such as color histograms. The proposed method uses a conditional random field (CRF) built upon a deep neural network (DNN), which can capture the various types of relationships, such as spatial one, among people in a video frame in order to facilitate the classification. Experimental results demonstrate that our models trained on a dataset of user-generated videos achieve the accuracy of over 80%. Our experiments also verify the effectiveness of the proposed model and the effect of the conditional random field by comparing our model with baselines, such as a support vector machines and a DNN without a CRF. Keywords: Neural network Conditional random field Important people classification iv
7 v
8 Itti [1] Yang [2] (1) (5) (2a) (5a) (2a) (5a) (2a) (5a) (2b) (5b) (2b) (5b) (2b) (5b) (1) (5) vi
9 1. [3,4] [5 8] [1,9,10] [2, 11, 12] Itti [1] Itti [1] Yang [2] Ma [11] Ma [11] 1 2 2(a) 1 2(b) 1 1
10 図 1: 重要人物と非重要人物の例 (a) 全ての人物を重要領域と考えた場合 (b) 人物の重要度を考慮した場合 図 2: 図 1 のリターゲティング処理例 ティングのようなアプリケーションの性能が損なわれる場合がある 本研究では このような複数の人物を撮影した映像から重要人物だけを含む重 要領域を抽出するために 映像中の人物の重要度推定に取り組む 具体的には 映像中から検出した人物をそれぞれが映像中において重要な人物か あるいは偶 然写り込んだ非重要人物かを判定する識別器を開発する この識別結果を用いて 非重要人物の領域を重要領域の候補から除去することにより 非重要人物を含ま ない重要領域推定が可能となる 一般に 映像中の人物が重要か 非重要かは視聴者によって異なり 一意に決 2
11 1 1 (Deep Neural Network: DNN) (Conditional Random Fields: CRF) CRF DNN CRF End-to-End YouTube 3
12 2 CRF DNN 3 CRF DNN 4 5 4
13 [1,9,10,13] Itti [1] Itti Baldi [9,10] Bayesian Surprise Achanta [13, 14] Lab 3(b) 3(a) 3(c) Itti [1] 3(d) Yang [2] 5
14 (a) (b) (c) (d) 3: Itti [1] 4 Ma [11] [12] Ma [11] 6
15 (a) 入力画像 (b) 自転車を重視した重要度マップ (c) 車を重視した重要度マップ (d) 人物を重視した重要度マップ 図 4: Yang ら [2] の手法による重要領域推定 有無を重要領域推定の指標として用いた手法は その人物の映像中における重要 度を考慮しないため 複数の人物を含む映像において重要度の低い人物も重要領 域に含む場合がある 重要でない人物が重要領域に含まれると リターゲティン グのようなアプリケーションの性能が損なわれる場合がある このような課題を解決するため Nakashima ら [15] は撮影者の観点に基づき 複数の人物を含む映像の重要人物を識別をする手法を提案した Nakashima らは 同じフレーム中の重要人物同士は大きさや動きの軌跡に相関があるという考えと 重要人物や非重要人物は短い期間では入れ替わらないという考えのもとに 条件 付き確率場を用いたモデルを採用した 本論文では さらなる精度向上のため Nakashima ら [15, 16] の手法を拡張し CRF を取り入れた DNN を用いた識別手 7
16 CRF DNN 2.2 CRF (Markov Random Field: MRF) x y CRF p(y x) p(y x) = 1 Z e E(y,x) (1) E(y, x) = i f i (x i y) + ij f ij (x i, x j y) (2) E(y, x) x i f i (x i y) x i, x j f ij (x i, x j y) Z (Partition function) DNN DNN CRF [17 23] Bengio [17] (Convolutional Neural Networks: CNN) Yao Wang [18, 19] CRF Ma [24] Long Short Term Memory CRF CRF CNN [20 23] [25] [26] Arnab [22] CNN 4 8
17 CRF Farabet [27] CNN CRF CNN CRF Liu [26] CNN CRF Chanra [23] CRF DNN End-to-End CRF CRF Contrastive Divergence [28] 2.3 DNN Nakashima [15,16] CRF DNN CRF CRF 9
18 CRF DNN CRF
19 5: 11
20 (a) 注目フレームから 100 フレーム前 (b) 注目フレーム (c) 注目フレームから 100 フレーム後 (d) トラッキングから得られた人物の軌跡 図 6: トラッキングの例 は人物の重要度は映像中の人物の位置や大きさに反映されるとして 人物の動き から得られる特徴量を重要人物識別に用いる まず注目フレームから検出された 人物を前後 100 フレームの間トラッキングし その人物の顔領域の大きさと位置 の変化を取得する 本手法では 顔領域を追跡するために KCF トラッカー [29] を採用した 図 6 はトラッキングの例である 図 6(a) は注目フレームから 100 フレーム前 6(c) は注目フレームの 100 フレーム後を表しており 青色の矩形が顔領域である 図 6(d) の黄色の線が顔領域中心の変化を表している こうしてある人物 i から得られた 前後 100 フレームにおける顔領域から座標 600 を人物の動きの特 と大きさを抽出し この 3 次元ベクトルを連結した xm i R 徴量とする なお 図 6(b) の奥の人物のように トラッキング対象の人物が 移 動やオクルージョンにより画面上から消失した場合 トラッキングを中止し 残 りフレームの顔領域の大きさおよび位置は 0 とする 人物の見えの特徴量 重要人物はカメラに対して正面か 少なくとも顔が見えるように撮影されること 12
21 0.2 Histogram 0.1 (a) x (b) (a) 0.2 Histogram 0.1 (c) x (d) (b) 7: DNN [30] 2 R G B 50 x l i R (a) 7(c) 7(d) DNN DNN FaceNet [30] x l i
22 CRF i x m i xl i f i h m i = ρ(w m x m i + b m ) (3) h l i = ρ(w l x l i + b l ) (4) f i = ρ(w h ml i + b ml ) (5) W m R W l R d 100 W R x l i d = 150 DNN d = 128 ρ Rectificed Linear Unit [31] (5) h ml i h m i h l i CRF i f i (i = 1,..., I) t 1,..., t I i t i t i = 1 0 CRF ϕ 0 (f i ) = ρ(v 0 f i + k 0 ) (6) ϕ 1 (f i ) = ρ(v 1 f i + k 1 ) (7) v 0 v 1 R 100 k 0, k 1 ϕ 0 (fi) ϕ 1 (fi) i 0 1 t i = 0 14
23 8: 15
24 2 ψ 00 (f ij ) = ρ(u 00f ij + c 00 ) (8) ψ 01 (f ij ) = ρ(u 01f ij + c 01 ) (9) ψ 10 (f ij ) = ρ(u 10f ij + c 10 ) (10) ψ 11 (f ij ) = ρ(u 11f ij + c 11 ) (11) f ij (5) f i, f j ψ 00 (f ij ), ψ 01 (f ij ), ψ 10 (f ij ), ψ 11 (f ij ) 2 (0 (1) T = {t i i = 1... I} F = {f i i = 1... I} E(T F ) E(T F ) = i ϕ ti (f i ) + ij ψ ti t j (f ij ) (12) p(t F ) = 1 Z e E(T F ) (13) Z Z = T e E(T F ) (14) (14) p(t F ) Z CRF Contrastive Divergence [28] 16
25 : (14) Z 17
26 Z ϕ(f i ) = V f i + K (15) ψ(f ij ) = Uf ij + C (16) V = (v 0 v 1 ) K = (k 0 k 1 ) U = (u 00 u 01 u 10 u 11 ) C = (c 00 c 01 c 10 c 11 ) ϕ ψ ϕ (1) (0) 2 ψ 4 E ϕ, ψ 3.4 L L(T m, F m ) = m log p(t m F m ) (17) T m F m m Dropout [32] [33] 18
27 4. CRF 4.1 [15] YouTube YouTube 20 YouTube 6 YouTube , , , , 764 [15] YouTube 19
28 (a) (b) 10: 55, , CRF 20
29 1: YouTube ,955 82, ,655 39, ,336 37,431 Nakashima [15] Nakashima CRF CRF CRF CRF DNN (1) (5) 5 (1) Nakashima [15] (2) (3) CRF (4) ( ) (5) 21
30 (1) (2) (3) (4) (2) (2) CRF CRF (3) CRF (3) CRF 1 Softmax Cross-Entropy z a = exp(u a ) 1 b=0 exp(u b) (18) a (a = 0, 1) u 0 u z 0 z 1 i f i t i t i = { 1 (z1 0.5) 0 (otherwise) (19) (4) ( ) (4) CRF (4) CRF Chainer [34] 22
31 4.3 (1) (5) T P (True Positive) F N(False Negative) REC = T P T P + F N (20) F P (False Positive) T N (True Negative) F P R: False positive rate F P R = F P F P + T N P RE (precision) ACC (Accuracy) F (F1-measure) P RE = T P T P + F P ACC = T P + T N T P + T N + F P + F N F 1 = P RE REC 2 P RE + REC (21) (22) (23) (24) 2 2 F FaceNet
32 2: (1) (5) REC(%) PRE(%) FPR(%) ACC(%) F1(%) (1) Nakashima [15] (5) (2a) (3a) CRF (4a) ( ) (5a) FaceNet (2b) (3b) CRF (4b) ( ) (5b)
33 手法 (1) frame 1350 frame 1355 frame 1360 手法 (5) frame 1350 frame 1355 frame 1360 手法 (1) frame 85 frame 90 frame 95 手法 (5) frame 85 frame 90 frame 95 11: (1) (5) 25
34 手法 (2a) frame 1350 frame 1355 frame 1360 手法 (3a) frame 1350 frame 1355 frame 1360 手法 (4a) frame 1350 frame 1355 frame 1360 手法 (5a) frame 1350 frame 1355 frame : (2a) (5a) 1 26
35 手法 (2a) frame 85 frame 90 frame 95 手法 (3a) frame 85 frame 90 frame 95 手法 (4a) frame 85 frame 90 frame 95 手法 (5a) frame 85 frame 90 frame 95 13: (2a) (5a) 2 27
36 手法 (2a) frame 5 frame 15 frame 25 手法 (3a) frame 5 frame 15 frame 25 手法 (4a) frame 5 frame 15 frame 25 手法 (5a) frame 5 frame 15 frame 25 14: (2a) (5a) 3 28
37 手法 (2b) frame 1350 frame 1355 frame 1360 手法 (3b) frame 1350 frame 1355 frame 1360 手法 (4b) frame 1350 frame 1355 frame 1360 手法 (5b) frame 1350 frame 1355 frame : (2b) (5b) 1 29
38 手法 (2b) frame 85 frame 90 frame 95 手法 (3b) frame 85 frame 90 frame 95 手法 (4b) frame 85 frame 90 frame 95 手法 (5b) frame 85 frame 90 frame 95 16: (2b) (5b) 2 30
39 手法 (2b) frame 5 frame 15 frame 25 手法 (3b) frame 5 frame 15 frame 25 手法 (4b) frame 5 frame 15 frame 25 手法 (5b) frame 5 frame 15 frame 25 17: (2b) (5b) 3 31
40 4.4 (4) (4) F (4) F P 13 (4a) 13 (5a) CRF F P (2) (3) (2) (3) F (2) (3) 12 (2a) (3a) CRF 12 CRF 18 (2) (3) 32
41 (2a) (3a) (5a) 18: DNN FaceNet [30] (4) 55, , 431 (4) FaceNet CRF FaceNet 33
42 5. CRF DNN CRF CRF YouTube CRF DNN CRF CRF FaceNet End-to-End 34
43 ( ) 35
44 [1] L. Itti, C. Koch, and E. Niebur, A model of saliency-based visual attention for rapid scene analysis, IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), vol. 20, no. 11, pp , [2] J. Yang and M.-H. Yang, Top-down visual saliency via joint CRF and dictionary learning, in Proc. IEEE Computer Society Conf. Computer Vision and Pattern Recognition (CVPR), pp , [3] F. Liu and M. Gleicher, Video retargeting: Automating pan and scan, in Proc. ACM Int. Conf. Multimedia (MM), pp , [4] X. Fan, X. Xie, H.-Q. Zhou, and W.-Y. Ma, Looking into video frames on small displays, in Proc. ACM Int. Conf. Multimedia (MM), pp , [5] L. Itti, Automatic foveation for video compression using a neurobiological model of visual attention, IEEE Trans. Image Processing, vol. 13, no. 10, pp , [6] W. Lai, X.-D. Gu, R.-H. Wang, W.-Y. Ma, and H.-J. Zhang, A contentbased bit allocation model for video streaming, in Proc. IEEE Int. Conf. Multimedia and Expo (ICME), vol. 2, pp , [7] M.-H. Hsiao, Y.-W. Chen, H.-T. Chen, K.-H. Chou, and S.-Y. Lee, Contentaware video adaptation under low-bitrate constraint, EURASIP Journal on Advances in Signal Processing, vol. 2007, no. 2, 17 pages, [8] M. Sun, A. Farhadi, B. Taskar, and S. Seitz, Salient montages from unconstrained videos, in Proc. European Conf. Computer Vision (ECCV), pp , [9] L. Itti and P. Baldi, Bayesian surprise attracts human attention, in Proc. Neural Information Processing Systems (NIPS), pp ,
45 [10] P. Baldi and L. Itti, Of bits and wows: A Bayesian theory of surprise with applications to attention, Neural Networks, vol. 23, no. 5, pp , [11] Y.-F. Ma, X.-S. Hua, L. Lu, and H.-J. Zhang, A generic framework of user attention model and its application in video summarization, IEEE Trans. Multimedia, vol. 7, no. 5, pp , [12] D. Walther and C. Koch, Modeling attention to salient proto-objects, Neural networks, vol. 19, no. 9, pp , [13] R. Achanta, S. Hemami, F. Estrada, and S. Susstrunk, Frequency-tuned salient region detection, in Proc. IEEE Computer Society Conf. Computer Vision and Pattern Recognition (CVPR), pp , [14] R. Achanta, F. Estrada, P. Wils, and S. Süsstrunk, Salient region detection and segmentation, in Proc. Int. Conf. Computer Vision Systems, pp , [15] Y. Nakashima, N. Babaguchi, and J. Fan, Intended human object detection for automatically protecting privacy in mobile video surveillance, Multimedia Systems, vol. 18, no. 2, pp , [16] Y. Nakashima, N. Babaguchi, and J. Fan, Privacy protection for social video via background estimation and CRF-based videographer s intention modeling, IEICE Trans. Information and Systems, vol. E99.D, no. 4, pp , [17] Y. Bengio, Y. LeCun, and D. Henderson, Globally trained handwritten word recognizer using spatial representation, convolutional neural networks, and hidden markov models, in Proc. Neural Information Processing Systems (NIPS), pp ,
46 [18] K. Yao, B. Peng, G. Zweig, D. Yu, X. Li, and F. Gao, Recurrent conditional random field for language understanding, in Proc. IEEE Conf. Acoustics, Speech and Signal Processing (ICASSP), pp , [19] W. Wang, S. J. Pan, D. Dahlmeier, and X. Xiao, Recursive neural conditional random fields for aspect-based sentiment analysis, in Proc. ACL Conf. Empirical Methods Natural Language Processing (EMNLP), pp , [20] X. Liang, X. Shen, J. Feng, L. Lin, and S. Yan, Semantic object parsing with graph LSTM, in Proc. European Conf. Computer Vision (ECCV), pp , [21] S. Zheng, S. Jayasumana, B. Romera-Paredes, V. Vineet, Z. Su, D. Du, C. Huang, and P. H. S. Torr, Conditional random fields as recurrent neural networks, in Proc. IEEE Int. Conf. Computer Vision (ICCV), pp , [22] A. Arnab, S. Jayasumana, S. Zheng, and P. H. S. Torr, Higher order conditional random fields in deep neural networks, in Proc. European Conf. Computer Vision (ECCV), pp , [23] S. Chandra and I. Kokkinos, Fast, exact and multi-scale inference for semantic image segmentation with deep gaussian CRFs, in Proc. European Conf. Computer Vision (ECCV), pp , [24] X. Ma and E. Hovy, End-to-end sequence labeling via bi-directional LSTM- CNNs-CRF, in Proc. Association for Computational Linguistics (ACL), 10 pages, [25] X. Chu, W. Ouyang, H. Li, and X. Wang, CRF-CNN: Modeling structured information in human pose estimation, in Proc. Neural Information Processing Systems (NIPS), pp ,
47 [26] F. Liu, C. Shen, and G. Lin, Deep convolutional neural fields for depth estimation from a single image, in Proc. IEEE Computer Society Conf. Computer Vision and Pattern Recognition (CVPR), pp , [27] C. Farabet, C. Couprie, L. Najman, and Y. LeCun, Learning hierarchical features for scene labeling, IEEE Trans. Pattern Aalysis and Machine Intelligence (PAMI), vol. 35, no. 8, pp , [28] G. E. Hinton, Training products of experts by minimizing contrastive divergence, Neural Computation, vol. 14, no. 8, pp , [29] J. F. Henriques, R. Caseiro, P. Martins, and J. Batista, Exploiting the circulant structure of tracking-by-detection with kernels, in Proc. European Conf. Computer Vision (ECCV), pp , [30] F. Schroff, D. Kalenichenko, and J. Philbin, FaceNet: A unified embedding for face recognition and clustering, in Proc. IEEE Computer Society Conf. Computer Vision and Pattern Recognition (CVPR), pp , [31] V. Nair and G. E. Hinton, Rectified linear units improve restricted boltzmann machines, in Proc. Int. Conf. Machine Learning (ICML), pp , [32] G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. R. Salakhutdinov, Dropout: A simple way to prevent neural networks from overfitting, Jounal of Machine Learning Research, vol. 15, no. 1, pp , [33] D. Kingma and J. Ba, Adam: A method for stochastic optimization, in Proc. Int. Conf. Learning Representations (ICLR), 13 pages, [34] S. Tokui, K. Oono, S. Hido, and J. Clayton, Chainer: A next-generation open source framework for deep learning, in Proc. Neural Information Processing Systems (NIPS), 6 pages,
Convolutional Neural Network A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolution
Convolutional Neural Network 2014 3 A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolutional Neural Network Fukui Hiroshi 1940 1980 [1] 90 3
More informationIPSJ SIG Technical Report Vol.2010-CVIM-170 No /1/ Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Ta
1 1 1 1 2 1. Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Takayuki Okatani 1 and Koichiro Deguchi 1 This paper presents a method for recognizing the pose of a wire harness
More information,,.,.,,.,.,.,.,,.,..,,,, i
22 A person recognition using color information 1110372 2011 2 13 ,,.,.,,.,.,.,.,,.,..,,,, i Abstract A person recognition using color information Tatsumo HOJI Recently, for the purpose of collection of
More information3 2 2 (1) (2) (3) (4) 4 4 AdaBoost 2. [11] Onishi&Yoda [8] Iwashita&Stoica [5] 4 [3] 3. 3 (1) (2) (3)
(MIRU2012) 2012 8 820-8502 680-4 E-mail: {d kouno,shimada,endo}@pluto.ai.kyutech.ac.jp (1) (2) (3) (4) 4 AdaBoost 1. Kanade [6] CLAFIC [12] EigenFace [10] 1 1 2 1 [7] 3 2 2 (1) (2) (3) (4) 4 4 AdaBoost
More informationHaiku Generation Based on Motif Images Using Deep Learning Koki Yoneda 1 Soichiro Yokoyama 2 Tomohisa Yamashita 2 Hidenori Kawamura Scho
Haiku Generation Based on Motif Images Using Deep Learning 1 2 2 2 Koki Yoneda 1 Soichiro Yokoyama 2 Tomohisa Yamashita 2 Hidenori Kawamura 2 1 1 School of Engineering Hokkaido University 2 2 Graduate
More informationIPSJ SIG Technical Report Vol.2012-CG-149 No.13 Vol.2012-CVIM-184 No /12/4 3 1,a) ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransa
3,a) 3 3 ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransac. DB [] [2] 3 DB Web Web DB Web NTT NTT Media Intelligence Laboratories, - Hikarinooka Yokosuka-Shi, Kanagawa 239-0847 Japan a) yabushita.hiroko@lab.ntt.co.jp
More information(MIRU2008) HOG Histograms of Oriented Gradients (HOG)
(MIRU2008) 2008 7 HOG - - E-mail: katsu0920@me.cs.scitec.kobe-u.ac.jp, {takigu,ariki}@kobe-u.ac.jp Histograms of Oriented Gradients (HOG) HOG Shape Contexts HOG 5.5 Histograms of Oriented Gradients D Human
More information& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro
TV 1,2,a) 1 2 2015 1 26, 2015 5 21 Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Rotation Using Mobile Device Hiroyuki Kawakita 1,2,a) Toshio Nakagawa 1 Makoto Sato
More information28 Horizontal angle correction using straight line detection in an equirectangular image
28 Horizontal angle correction using straight line detection in an equirectangular image 1170283 2017 3 1 2 i Abstract Horizontal angle correction using straight line detection in an equirectangular image
More informationIPSJ SIG Technical Report Vol.2013-CVIM-187 No /5/30 1,a) 1,b), 1,,,,,,, (DNN),,,, 2 (CNN),, 1.,,,,,,,,,,,,,,,,,, [1], [6], [7], [12], [13]., [
,a),b),,,,,,,, (DNN),,,, (CNN),,.,,,,,,,,,,,,,,,,,, [], [6], [7], [], [3]., [8], [0], [7],,,, Tohoku University a) omokawa@vision.is.tohoku.ac.jp b) okatani@vision.is.tohoku.ac.jp, [3],, (DNN), DNN, [3],
More information[1] SBS [2] SBS Random Forests[3] Random Forests ii
Random Forests 2013 3 A Graduation Thesis of College of Engineering, Chubu University Proposal of an efficient feature selection using the contribution rate of Random Forests Katsuya Shimazaki [1] SBS
More information4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q
x-means 1 2 2 x-means, x-means k-means Bayesian Information Criterion BIC Watershed x-means Moving Object Extraction Using the Number of Clusters Determined by X-means Clustering Naoki Kubo, 1 Kousuke
More information_314I01BM浅谷2.indd
587 ネットワークの表現学習 1 1 1 1 Deep Learning [1] Google [2] Deep Learning [3] [4] 2014 Deepwalk [5] 1 2 [6] [7] [8] 1 2 1 word2vec[9] word2vec 1 http://www.ai-gakkai.or.jp/my-bookmark_vol31-no4 588 31 4 2016
More information2017 (413812)
2017 (413812) Deep Learning ( NN) 2012 Google ASIC(Application Specific Integrated Circuit: IC) 10 ASIC Deep Learning TPU(Tensor Processing Unit) NN 12 20 30 Abstract Multi-layered neural network(nn) has
More information1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2
CHLAC 1 2 3 3,. (CHLAC), 1).,.,, CHLAC,.,. Suspicious Behavior Detection based on CHLAC Method Hideaki Imanishi, 1 Toyohiro Hayashi, 2 Shuichi Enokida 3 and Toshiaki Ejima 3 We have proposed a method for
More informationReal AdaBoost HOG 2009 3 A Graduation Thesis of College of Engineering, Chubu University Efficient Reducing Method of HOG Features for Human Detection based on Real AdaBoost Chika Matsushima ITS Graphics
More informationIPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came
3DCG 1,a) 2 2 2 2 3 On rigid body animation taking into account the 3D computer graphics camera viewpoint Abstract: In using computer graphics for making games or motion pictures, physics simulation is
More informationComputational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate catego
Computational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate category preservation 1 / 13 analogy by vector space Figure
More information,,,,,,,,,,,,,,,,,,, 976%, i
20 Individual Recognition using positions of facial parts 1115081 2009 3 5 ,,,,,,,,,,,,,,,,,,, 976%, i Abstract Individual Recognition using positions of facial parts YOSHIHIRO Arisawa A facial recognition
More information20 Method for Recognizing Expression Considering Fuzzy Based on Optical Flow
20 Method for Recognizing Expression Considering Fuzzy Based on Optical Flow 1115084 2009 3 5 3.,,,.., HCI(Human Computer Interaction),.,,.,,.,.,,..,. i Abstract Method for Recognizing Expression Considering
More informationIPSJ SIG Technical Report Vol.2009-CVIM-167 No /6/10 Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing
Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing number of HOG Features based on Real AdaBoost Chika Matsushima, 1 Yuji Yamauchi, 1 Takayoshi Yamashita 1, 2 and
More information2 Fig D human model. 1 Fig. 1 The flow of proposed method )9)10) 2.2 3)4)7) 5)11)12)13)14) TOF 1 3 TOF 3 2 c 2011 Information
1 1 2 TOF 2 (D-HOG HOG) Recall D-HOG 0.07 HOG 0.16 Pose Estimation by Regression Analysis with Depth Information Yoshiki Agata 1 and Hironobu Fujiyoshi 1 A method for estimating the pose of a human from
More information1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf
1,a) 2,b) 4,c) 3,d) 4,e) Web A Review Supporting System for Whiteboard Logging Movies Based on Notes Timeline Taniguchi Yoshihide 1,a) Horiguchi Satoshi 2,b) Inoue Akifumi 4,c) Igaki Hiroshi 3,d) Hoshi
More information音響モデル triphone 入力音声 音声分析 デコーダ 言語モデル N-gram bigram HMM の状態確率として利用 出力層 triphone: 3003 ノード リスコア trigram 隠れ層 2048 ノード X7 層 1 Structure of recognition syst
1,a) 1 1 1 deep neural netowrk(dnn) (HMM) () GMM-HMM 2 3 (CSJ) 1. DNN [6]. GPGPU HMM DNN HMM () [7]. [8] [1][2][3] GMM-HMM Gaussian mixture HMM(GMM- HMM) MAP MLLR [4] [3] DNN 1 1 triphone bigram [5]. 2
More information( )
NAIST-IS-MT1051071 2012 3 16 ( ) Pustejovsky 2 2,,,,,,, NAIST-IS- MT1051071, 2012 3 16. i Automatic Acquisition of Qualia Structure of Generative Lexicon in Japanese Using Learning to Rank Takahiro Tsuneyoshi
More information(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s
1 1 1, Extraction of Transmitted Light using Parallel High-frequency Illumination Kenichiro Tanaka 1 Yasuhiro Mukaigawa 1 Yasushi Yagi 1 Abstract: We propose a new sharpening method of transmitted scene
More information1 Table 1: Identification by color of voxel Voxel Mode of expression Nothing Other 1 Orange 2 Blue 3 Yellow 4 SSL Humanoid SSL-Vision 3 3 [, 21] 8 325
社団法人人工知能学会 Japanese Society for Artificial Intelligence 人工知能学会研究会資料 JSAI Technical Report SIG-Challenge-B3 (5/5) RoboCup SSL Humanoid A Proposal and its Application of Color Voxel Server for RoboCup SSL
More information28 TCG SURF Card recognition using SURF in TCG play video
28 TCG SURF Card recognition using SURF in TCG play video 1170374 2017 3 2 TCG SURF TCG TCG OCG SURF Bof 20 20 30 10 1 SURF Bag of features i Abstract Card recognition using SURF in TCG play video Haruka
More information2.2 6).,.,.,. Yang, 7).,,.,,. 2.3 SIFT SIFT (Scale-Invariant Feature Transform) 8).,. SIFT,,. SIFT, Mean-Shift 9)., SIFT,., SIFT,. 3.,.,,,,,.,,,., 1,
1 1 2,,.,.,,, SIFT.,,. Pitching Motion Analysis Using Image Processing Shinya Kasahara, 1 Issei Fujishiro 1 and Yoshio Ohno 2 At present, analysis of pitching motion from baseball videos is timeconsuming
More information17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System
1. (1) ( MMI ) 2. 3. MMI Personal Computer(PC) MMI PC 1 1 2 (%) (%) 100.0 95.2 100.0 80.1 2 % 31.3% 2 PC (3 ) (2) MMI 2 ( ),,,, 49,,p531-532,2005 ( ),,,,,2005,p66-p67,2005 17 Proposal of an Algorithm of
More informationIPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-
1 3 5 4 1 2 1,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-View Video Contents Kosuke Niwa, 1 Shogo Tokai, 3 Tetsuya Kawamoto, 5 Toshiaki Fujii, 4 Marutani Takafumi,
More information258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS 2 3 4 5 2. 2.1 3 1) GPS Global Positioning System
Vol. 52 No. 1 257 268 (Jan. 2011) 1 2, 1 1 measurement. In this paper, a dynamic road map making system is proposed. The proposition system uses probe-cars which has an in-vehicle camera and a GPS receiver.
More information4.1 % 7.5 %
2018 (412837) 4.1 % 7.5 % Abstract Recently, various methods for improving computial performance have been proposed. One of these various methods is Multi-core. Multi-core can execute processes in parallel
More informationSilhouette on Image Object Silhouette on Images Object 1 Fig. 1 Visual cone Fig. 2 2 Volume intersection method Fig. 3 3 Background subtraction Fig. 4
Image-based Modeling 1 1 Object Extraction Method for Image-based Modeling using Projection Transformation of Multi-viewpoint Images Masanori Ibaraki 1 and Yuji Sakamoto 1 The volume intersection method
More information本文6(599) (Page 601)
(MIRU2008) 2008 7 525 8577 1 1 1 E-mail: matsuzaki@i.ci.ritsumei.ac.jp, shimada@ci.ritsumei.ac.jp Object Recognition by Observing Grasping Scene from Image Sequence Hironori KASAHARA, Jun MATSUZAKI, Nobutaka
More information情報処理学会研究報告 IPSJ SIG Technical Report Vol.2013-CVIM-186 No /3/15 EMD 1,a) SIFT. SIFT Bag-of-keypoints. SIFT SIFT.. Earth Mover s Distance
EMD 1,a) 1 1 1 SIFT. SIFT Bag-of-keypoints. SIFT SIFT.. Earth Mover s Distance (EMD), Bag-of-keypoints,. Bag-of-keypoints, SIFT, EMD, A method of similar image retrieval system using EMD and SIFT Hoshiga
More information1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.,, 464 8601 470 0393 101 464 8601 E-mail: matsunagah@murase.m.is.nagoya-u.ac.jp, {ide,murase,hirayama}@is.nagoya-u.ac.jp,
More informationVol. 48 No. 4 Apr LAN TCP/IP LAN TCP/IP 1 PC TCP/IP 1 PC User-mode Linux 12 Development of a System to Visualize Computer Network Behavior for L
Vol. 48 No. 4 Apr. 2007 LAN TCP/IP LAN TCP/IP 1 PC TCP/IP 1 PC User-mode Linux 12 Development of a System to Visualize Computer Network Behavior for Learning to Associate LAN Construction Skills with TCP/IP
More informationGaze Head Eye (a) deg (b) 45 deg (c) 9 deg 1: - 1(b) - [5], [6] [7] Stahl [8], [9] Fang [1], [11] Itti [12] Itti [13] [7] Fang [1],
1 1 1 Structure from Motion - 1 Ville [1] NAC EMR-9 [2] 1 Osaka University [3], [4] 1 1(a) 1(c) 9 9 9 c 216 Information Processing Society of Japan 1 Gaze Head Eye (a) deg (b) 45 deg (c) 9 deg 1: - 1(b)
More informationMicrosoft PowerPoint - SSII_harada pptx
The state of the world The gathered data The processed data w d r I( W; D) I( W; R) The data processing theorem states that data processing can only destroy information. David J.C. MacKay. Information
More information21 Pitman-Yor Pitman- Yor [7] n -gram W w n-gram G Pitman-Yor P Y (d, θ, G 0 ) (1) G P Y (d, θ, G 0 ) (1) Pitman-Yor d, θ, G 0 d 0 d 1 θ Pitman-Yor G
ol2013-nl-214 No6 1,a) 2,b) n-gram 1 M [1] (TG: Tree ubstitution Grammar) [2], [3] TG TG 1 2 a) ohno@ilabdoshishaacjp b) khatano@maildoshishaacjp [4], [5] [6] 2 Pitman-Yor 3 Pitman-Yor 1 21 Pitman-Yor
More information動画コンテンツ 動画 1 動画 2 動画 3 生成中の映像 入力音楽 選択された素片 テンポによる伸縮 音楽的構造 A B B B B B A C C : 4) 6) Web Web 2 2 c 2009 Information Processing S
1 2 2 1 Web An Automatic Music Video Creation System by Reusing Dance Video Content Sora Murofushi, 1 Tomoyasu Nakano, 2 Masataka Goto 2 and Shigeo Morishima 1 This paper presents a system that automatically
More information29 jjencode JavaScript
Kochi University of Technology Aca Title jjencode で難読化された JavaScript の検知 Author(s) 中村, 弘亮 Citation Date of 2018-03 issue URL http://hdl.handle.net/10173/1975 Rights Text version author Kochi, JAPAN http://kutarr.lib.kochi-tech.ac.jp/dspa
More information24 Region-Based Image Retrieval using Fuzzy Clustering
24 Region-Based Image Retrieval using Fuzzy Clustering 1130323 2013 3 9 Visual-key Image Retrieval(VKIR) k-means Fuzzy C-means 2 200 2 2 20 VKIR 5 18% 54% 7 30 Fuzzy C-means i Abstract Region-Based Image
More information1 4 4 [3] SNS 5 SNS , ,000 [2] c 2013 Information Processing Society of Japan
SNS 1,a) 2 3 3 2012 3 30, 2012 10 10 SNS SNS Development of Firefighting Knowledge Succession Support SNS in Tokyo Fire Department Koutarou Ohno 1,a) Yuki Ogawa 2 Hirohiko Suwa 3 Toshizumi Ohta 3 Received:
More information図 2: 高周波成分を用いた超解像 解像度度画像とそれらを低解像度化して得られる 低解像度画像との差により低解像度の高周波成分 を得る 高解像度と低解像度の高周波成分から位 置関係を保ったままパッチ領域をそれぞれ切り出 し 高解像度パッチ画像と低解像度パッチ画像の ペアとしてデータベースに登録する
Exemplar-Based Super-Resolution of Human Body Image in Surveillance Video 1 1,2 1 1 1 Kento Nishibori 1, Tomokazu TAKAHASHI 1,2, Daisuke DEGUCHI 1, Ichiro IDE 1 and Hiroshi MURASE 1 1 2 nishiborik@murase.m.is.nagoya-u.ac.jp
More informationIPSJ SIG Technical Report iphone iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Proc
iphone 1 1 1 iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Processing Unit)., AR Realtime Natural Feature Tracking Library for iphone Makoto
More informationOutline ACL 2017 ACL ACL 2017 Chairs/Presidents
ACL 2017, 2017/9/7 Outline ACL 2017 ACL ACL 2017 Chairs/Presidents ACL ACL he annual meeting of the Association for Computational Linguistics (Computational Linguistics) (Natural Language Processing) /
More informationInput image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L
1,a) 1,b) 1/f β Generation Method of Animation from Pictures with Natural Flicker Abstract: Some methods to create animation automatically from one picture have been proposed. There is a method that gives
More information2008 : 80725872 1 2 2 3 2.1.......................................... 3 2.2....................................... 3 2.3......................................... 4 2.4 ()..................................
More information(3.6 ) (4.6 ) 2. [3], [6], [12] [7] [2], [5], [11] [14] [9] [8] [10] (1) Voodoo 3 : 3 Voodoo[1] 3 ( 3D ) (2) : Voodoo 3D (3) : 3D (Welc
1,a) 1,b) Obstacle Detection from Monocular On-Vehicle Camera in units of Delaunay Triangles Abstract: An algorithm to detect obstacles by using a monocular on-vehicle video camera is developed. Since
More information1 (PCA) 3 2 P.Viola 2) Viola AdaBoost 1 Viola OpenCV 3) Web OpenCV T.L.Berg PCA kpca LDA k-means 4) Berg 95% Berg Web k-means k-means
Web, Web k-means 62% Associating Faces and Names in Web Photo News Akio Kitahara and Keiji Yanai We propose a system which extracts faces and person names from news articles with photographs on the Web
More informationWeb Basic Web SAS-2 Web SAS-2 i
19 Development of moving image delivery system for elementary school 1080337 2008 3 10 Web Basic Web SAS-2 Web SAS-2 i Abstract Development of moving image delivery system for elementary school Ayuko INOUE
More information1(a) (b),(c) - [5], [6] Itti [12] [13] gaze eyeball head 2: [time] [7] Stahl [8], [9] Fang [1], [11] 3 -
Vol216-CVIM-22 No18 216/5/12 1 1 1 Structure from Motion - 1 8% Tobii Pro TX3 NAC EMR ACTUS Eye Tribe Tobii Pro Glass NAC EMR-9 Pupil Headset Ville [1] EMR-9 [2] 1 Osaka University Gaze Head Eye (a) deg
More informationSobel Canny i
21 Edge Feature for Monochrome Image Retrieval 1100311 2010 3 1 3 3 2 2 7 200 Sobel Canny i Abstract Edge Feature for Monochrome Image Retrieval Naoto Suzue Content based image retrieval (CBIR) has been
More information03_特集2_3校_0929.indd
MEDICAL IMAGING TECHNOLOGY Vol. 35 No. 4 September 2017 187 CT 1 1 convolutional neural network; ConvNet CT CT ConvNet 2D ConvNet CT ConvNet CT CT Med Imag Tech 35 4 : 187 193, 2017 1. CT MR 1 501-1194
More information( )
NAIST-IS-MT0851100 2010 2 4 ( ) CR CR CR 1980 90 CR Kerberos SSH CR CR CR CR CR CR,,, ID, NAIST-IS- MT0851100, 2010 2 4. i On the Key Management Policy of Challenge Response Authentication Schemes Toshiya
More informationDuplicate Near Duplicate Intact Partial Copy Original Image Near Partial Copy Near Partial Copy with a background (a) (b) 2 1 [6] SIFT SIFT SIF
Partial Copy Detection of Line Drawings from a Large-Scale Database Weihan Sun, Koichi Kise Graduate School of Engineering, Osaka Prefecture University E-mail: sunweihan@m.cs.osakafu-u.ac.jp, kise@cs.osakafu-u.ac.jp
More information1 Kinect for Windows M = [X Y Z] T M = [X Y Z ] T f (u,v) w 3.2 [11] [7] u = f X +u Z 0 δ u (X,Y,Z ) (5) v = f Y Z +v 0 δ v (X,Y,Z ) (6) w = Z +
3 3D 1,a) 1 1 Kinect (X, Y) 3D 3D 1. 2010 Microsoft Kinect for Windows SDK( (Kinect) SDK ) 3D [1], [2] [3] [4] [5] [10] 30fps [10] 3 Kinect 3 Kinect Kinect for Windows SDK 3 Microsoft 3 Kinect for Windows
More informationVRSJ-SIG-MR_okada_79dce8c8.pdf
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. 630-0192 8916-5 E-mail: {kaduya-o,takafumi-t,goshiro,uranishi,miyazaki,kato}@is.naist.jp,.,,.,,,.,,., CG.,,,
More informationTF-IDF TDF-IDF TDF-IDF Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Sat
1 1 2 1. TF-IDF TDF-IDF TDF-IDF. 3 18 6 Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Satoshi Date, 1 Teruaki Kitasuka, 1 Tsuyoshi Itokawa 2
More informationIPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe
1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Speech Visualization System Based on Augmented Reality Yuichiro Nagano 1 and Takashi Yoshino 2 As the spread of the Augmented Reality(AR) technology and service,
More informationuntitled
JAIS 1 2 1 2 In this paper, we focus on the pauses that partly characterize the utterances of simultaneous interpreters, and attempt to analyze the results of experiments conducted using human subjects
More informationVol. 42 No. SIG 8(TOD 10) July HTML 100 Development of Authoring and Delivery System for Synchronized Contents and Experiment on High Spe
Vol. 42 No. SIG 8(TOD 10) July 2001 1 2 3 4 HTML 100 Development of Authoring and Delivery System for Synchronized Contents and Experiment on High Speed Networks Yutaka Kidawara, 1 Tomoaki Kawaguchi, 2
More informationkut-paper-template.dvi
26 Discrimination of abnormal breath sound by using the features of breath sound 1150313 ,,,,,,,,,,,,, i Abstract Discrimination of abnormal breath sound by using the features of breath sound SATO Ryo
More informationTHE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. E-mail: {ytamura,takai,tkato,tm}@vision.kuee.kyoto-u.ac.jp Abstract Current Wave Pattern Analysis for Anomaly
More informationIPSJ SIG Technical Report Vol.2014-GN-90 No.16 Vol.2014-CDS-9 No.16 Vol.2014-DCC-6 No /1/24 1,a) 2,b) 2,c) 1,d) QUMARION QUMARION Kinect Kinect
1,a) 2,b) 2,c) 1,d) QUMARION QUMARION Kinect Kinect Using a Human-Shaped Input Device for Remote Pose Instruction Yuki Tayama 1,a) Yoshiaki Ando 2,b) Misaki Hagino 2,c) Ken-ichi Okada 1,d) Abstract: There
More informationB HNS 7)8) HNS ( ( ) 7)8) (SOA) HNS HNS 4) HNS ( ) ( ) 1 TV power, channel, volume power true( ON) false( OFF) boolean channel volume int
SOA 1 1 1 1 (HNS) HNS SOA SOA 3 3 A Service-Oriented Platform for Feature Interaction Detection and Resolution in Home Network System Yuhei Yoshimura, 1 Takuya Inada Hiroshi Igaki 1, 1 and Masahide Nakamura
More informationgengo.dvi
4 97.52% tri-gram 92.76% 98.49% : Japanese word segmentation by Adaboost using the decision list as the weak learner Hiroyuki Shinnou In this paper, we propose the new method of Japanese word segmentation
More informationIPSJ SIG Technical Report Pitman-Yor 1 1 Pitman-Yor n-gram A proposal of the melody generation method using hierarchical pitman-yor language model Aki
Pitman-Yor Pitman-Yor n-gram A proposal of the melody generation method using hierarchical pitman-yor language model Akira Shirai and Tadahiro Taniguchi Although a lot of melody generation method has been
More informationIS2-06 第21回画像センシングシンポジウム 横浜 2015年6月 画像をスーパーピクセルに変換する手法として SLIC[5] を用いる Achanta らによって提案された SLIC 2.2 グラフマッチング は K-means をベースにした手法で 単純な K-means に いる SPIN
Cosegmentation E-mail: {tamanaha, nakayama}@nlab.ci.i.u-tokyo.ac.jp Abstract Cosegmentation Cosegmentation Cosegmentation 1 Never Ending Image Learner[1] Google Cosegmentation Cosegmentation Rother [2]
More informationIPSJ SIG Technical Report Vol.2014-CG-155 No /6/28 1,a) 1,2,3 1 3,4 CG An Interpolation Method of Different Flow Fields using Polar Inter
,a),2,3 3,4 CG 2 2 2 An Interpolation Method of Different Flow Fields using Polar Interpolation Syuhei Sato,a) Yoshinori Dobashi,2,3 Tsuyoshi Yamamoto Tomoyuki Nishita 3,4 Abstract: Recently, realistic
More information..,,,, , ( ) 3.,., 3.,., 500, 233.,, 3,,.,, i
25 Feature Selection for Prediction of Stock Price Time Series 1140357 2014 2 28 ..,,,,. 2013 1 1 12 31, ( ) 3.,., 3.,., 500, 233.,, 3,,.,, i Abstract Feature Selection for Prediction of Stock Price Time
More information35_3_9.dvi
180 Vol. 35 No. 3, pp.180 185, 2017 Image Recognition by Deep Learning Hironobu Fujiyoshi and Takayoshi Yamashita Chubu University 1. 1990 2000 Scale-Invariant Feature Transform SIFT Histogram of Oriented
More informationFig. 2 28th Ryuou Tournament, Match 5, 59th move. The last move is Black s Rx5f. 1 Tic-Tac-Toe Fig. 1 AsearchtreeofTic-Tac-Toe. [2] [3], [4]
1,a) 2 3 2017 4 6, 2017 9 5 Predicting Moves in Comments for Shogi Commentary Generation Hirotaka Kameko 1,a) Shinsuke Mori 2 Yoshimasa Tsuruoka 3 Received: April 6, 2017, Accepted: September 5, 2017 Abstract:
More information先端社会研究 ★5★号/4.山崎
71 72 5 1 2005 7 8 47 14 2,379 2,440 1 2 3 2 73 4 3 1 4 1 5 1 5 8 3 2002 79 232 2 1999 249 265 74 5 3 5. 1 1 3. 1 1 2004 4. 1 23 2 75 52 5,000 2 500 250 250 125 3 1995 1998 76 5 1 2 1 100 2004 4 100 200
More informationNo. 3 Oct The person to the left of the stool carried the traffic-cone towards the trash-can. α α β α α β α α β α Track2 Track3 Track1 Track0 1
ACL2013 TACL 1 ACL2013 Grounded Language Learning from Video Described with Sentences (Yu and Siskind 2013) TACL Transactions of the Association for Computational Linguistics What Makes Writing Great?
More informationIPSJ SIG Technical Report Secret Tap Secret Tap Secret Flick 1 An Examination of Icon-based User Authentication Method Using Flick Input for
1 2 3 3 1 Secret Tap Secret Tap Secret Flick 1 An Examination of Icon-based User Authentication Method Using Flick Input for Mobile Terminals Kaoru Wasai 1 Fumio Sugai 2 Yosihiro Kita 3 Mi RangPark 3 Naonobu
More informationOptical Flow t t + δt 1 Motion Field 3 3 1) 2) 3) Lucas-Kanade 4) 1 t (x, y) I(x, y, t)
http://wwwieice-hbkborg/ 2 2 4 2 -- 2 4 2010 9 3 3 4-1 Lucas-Kanade 4-2 Mean Shift 3 4-3 2 c 2013 1/(18) http://wwwieice-hbkborg/ 2 2 4 2 -- 2 -- 4 4--1 2010 9 4--1--1 Optical Flow t t + δt 1 Motion Field
More informationIPSJ-CVIM
1 1 2 1 Estimation of Shielding Object Distribution in Scattering Media by Analyzing Light Transport Shosei Moriguchi, 1 Yasuhiro Mukaigawa, 1 Yasuyuki Matsushita 2 and Yasushi Yagi 1 In this paper, we
More informationA Study on Throw Simulation for Baseball Pitching Machine with Rollers and Its Optimization Shinobu SAKAI*5, Yuichiro KITAGAWA, Ryo KANAI and Juhachi
A Study on Throw Simulation for Baseball Pitching Machine with Rollers and Its Optimization Shinobu SAKAI*5, Yuichiro KITAGAWA, Ryo KANAI and Juhachi ODA Department of Human and Mechanical Systems Engineering,
More information25 Removal of the fricative sounds that occur in the electronic stethoscope
25 Removal of the fricative sounds that occur in the electronic stethoscope 1140311 2014 3 7 ,.,.,.,.,.,.,.,,.,.,.,.,,. i Abstract Removal of the fricative sounds that occur in the electronic stethoscope
More information% 2 3 [1] Semantic Texton Forests STFs [1] ( ) STFs STFs ColorSelf-Simlarity CSS [2] ii
2012 3 A Graduation Thesis of College of Engineering, Chubu University High Accurate Semantic Segmentation Using Re-labeling Besed on Color Self Similarity Yuko KAKIMI 2400 90% 2 3 [1] Semantic Texton
More informationDPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)
1 2 1 3 Experimental Evaluation of Convenient Strain Measurement Using a Magnet for Digital Public Art Junghyun Kim, 1 Makoto Iida, 2 Takeshi Naemura 1 and Hiroyuki Ota 3 We present a basic technology
More information1 1 tf-idf tf-idf i
14 A Method of Article Retrieval Utilizing Characteristics in Newspaper Articles 1055104 2003 1 31 1 1 tf-idf tf-idf i Abstract A Method of Article Retrieval Utilizing Characteristics in Newspaper Articles
More information2007/8 Vol. J90 D No. 8 Stauffer [7] 2 2 I 1 I 2 2 (I 1(x),I 2(x)) 2 [13] I 2 = CI 1 (C >0) (I 1,I 2) (I 1,I 2) Field Monitoring Server
a) Change Detection Using Joint Intensity Histogram Yasuyo KITA a) 2 (0 255) (I 1 (x),i 2 (x)) I 2 = CI 1 (C>0) (I 1,I 2 ) (I 1,I 2 ) 2 1. [1] 2 [2] [3] [5] [6] [8] Intelligent Systems Research Institute,
More information( ) [1] [4] ( ) 2. [5] [6] Piano Tutor[7] [1], [2], [8], [9] Radiobaton[10] Two Finger Piano[11] Coloring-in Piano[12] ism[13] MIDI MIDI 1 Fig. 1 Syst
情報処理学会インタラクション 2015 IPSJ Interaction 2015 15INT014 2015/3/7 1,a) 1,b) 1,c) Design and Implementation of a Piano Learning Support System Considering Motivation Fukuya Yuto 1,a) Takegawa Yoshinari 1,b) Yanagi
More information7,, i
23 Research of the authentication method on the two dimensional code 1145111 2012 2 13 7,, i Abstract Research of the authentication method on the two dimensional code Karita Koichiro Recently, the two
More information2) 3) LAN 4) 2 5) 6) 7) K MIC NJR4261JB0916 8) 24.11GHz V 5V 3kHz 4 (1) (8) (1)(5) (2)(3)(4)(6)(7) (1) (2) (3) (4)
ドップラーセンサ 送信波 観測対象 1 1 1 SVM 2 9 Activity and State Recognition without Body-Attached Sensor Using Microwave Doppler Sensor Masatoshi Sekine, 1 Kurato Maeno 1 and Masanori Nozaki 1 To spread context-aware
More information@08470030ヨコ/篠塚・窪田 221号
Abstract Among three distinctive types of Japanese writing systems Kanji, Hiragana and Katakana, a behavioral experiment using 97 university students as subjects implies that Katakana is regarded as most
More informationGID Haar-like Mean-Shift Multi-Viewpoint Human Tracking Based on Face Detection Using Haar-like Features and Mean-Shift Yu Ito (Shizuoka Univers
GID-08-6 Haar-like Mean-Shift Multi-Viewpoint Human Tracking Based on Face Detection Using Haar-like Features and Mean-Shift Yu Ito (Shizuoka University), Atsushi Yamashita, Toru Kaneko (Shizuoka University)
More informationIPSJ SIG Technical Report Vol.2014-HCI-158 No /5/22 1,a) 2 2 3,b) Development of visualization technique expressing rainfall changing conditions
1,a) 2 2 3,b) Development of visualization technique expressing rainfall changing conditions with a still picture Yuuki Hyougo 1,a) Hiroko Suzuki 2 Tadanobu Furukawa 2 Kazuo Misue 3,b) Abstract: In order
More informationIPSJ SIG Technical Report Vol.2010-GN-74 No /1/ , 3 Disaster Training Supporting System Based on Electronic Triage HIROAKI KOJIMA, 1 KU
1 2 2 1, 3 Disaster Training Supporting System Based on Electronic Triage HIROAKI KOJIMA, 1 KUNIAKI SUSEKI, 2 KENTARO NAGAHASHI 2 and KEN-ICHI OKADA 1, 3 When there are a lot of injured people at a large-scale
More information,.,.,,.,. X Y..,,., [1].,,,.,,.. HCI,,,,,,, i
23 Experimental investigation of Natural Use Profiles of Pen Pressure, Tilt and Azimuth 1120230 2012 3 1 ,.,.,,.,. X Y..,,., [1].,,,.,,.. HCI,,,,,,, i Abstract Experimental investigation of Natural Use
More informationStudies of Foot Form for Footwear Design (Part 9) : Characteristics of the Foot Form of Young and Elder Women Based on their Sizes of Ball Joint Girth
Studies of Foot Form for Footwear Design (Part 9) : Characteristics of the Foot Form of Young and Elder Women Based on their Sizes of Ball Joint Girth and Foot Breadth Akiko Yamamoto Fukuoka Women's University,
More information2 4 2 3 4 3 [12] 2 3 4 5 1 1 [5, 6, 7] [5, 6] [7] 1 [8] 1 1 [9] 1 [10, 11] [10] [11] 1 [13, 14] [13] [14] [13, 14] [10, 11, 13, 14] 1 [12]
Walking Person Recognition by Matching Video Fragments Masashi Nishiyama, Mayumi Yuasa, Tomokazu Wakasugi, Tomoyuki Shibata, Osamu Yamaguchi ( ), Corporate Research and Development Center, TOSHIBA Corporation
More information2.2 (a) = 1, M = 9, p i 1 = p i = p i+1 = 0 (b) = 1, M = 9, p i 1 = 0, p i = 1, p i+1 = 1 1: M 2 M 2 w i [j] w i [j] = 1 j= w i w i = (w i [ ],, w i [
RI-002 Encoding-oriented video generation algorithm based on control with high temporal resolution Yukihiro BANDOH, Seishi TAKAMURA, Atsushi SHIMIZU 1 1T / CMOS [1] 4K (4096 2160 /) 900 Hz 50Hz,60Hz 240Hz
More informationIPSJ SIG Technical Report Vol.2014-IOT-27 No.14 Vol.2014-SPT-11 No /10/10 1,a) 2 zabbix Consideration of a system to support understanding of f
1,a) 2 zabbix Consideration of a system to support understanding of fault occurrences based on the similarity of the time series Miyaza Nao 1,a) Masuda Hideo 2 Abstract: With the development of network
More information(a) (b) 2 2 (Bosch, IR Illuminator 850 nm, UFLED30-8BD) ( 7[m] 6[m]) 3 (PointGrey Research Inc.Grasshopper2 M/C) Hz (a) (b
(MIRU202) 202 8 AdrianStoica 89 0395 744 89 0395 744 Jet Propulsion Laboratory 4800 Oak Grove Drive, Pasadena, CA 909, USA E-mail: uchino@irvs.ait.kyushu-u.ac.jp, {yumi,kurazume}@ait.kyushu-u.ac.jp 2 nearest
More information;~ (Summary) The Study on the Effects of Foot Bathing on Urination Kumiko Toyoda School of Human Nursing, University of Shiga Prefecture Background Foot bathing is one of the important nursing care for
More information