jpaper : 2017/4/17(17:52),,.,,,.,.,.,, Improvement in Domain Specific Word Segmentation by Symbol Grounding suzushi tomori, hirotaka kameko, takashi n

Similar documents
x i 2 x x i i 1 i xi+ 1xi+ 2x i+ 3 健康児に本剤を接種し ( 窓幅 3 n-gram 長の上限 3 の場合 ) 文字 ( 種 )1-gram: -3/ 児 (K) -2/ に (H) -1/ 本 (K) 1/ 剤 (K) 2/ を (H) 3/ 接 (K) 文字 (

A Japanese Word Dependency Corpus ÆüËܸì¤Îñ¸ì·¸¤ê¼õ¤±¥³¡¼¥Ñ¥¹

Fig. 2 28th Ryuou Tournament, Match 5, 59th move. The last move is Black s Rx5f. 1 Tic-Tac-Toe Fig. 1 AsearchtreeofTic-Tac-Toe. [2] [3], [4]

¥ì¥·¥Ô¤Î¸À¸ì½èÍý¤Î¸½¾õ

Vol. 22 No. 2 June 2015 and language expressions. Based on these backgrounds, in this study, we discuss the definition of a tag set for recipe terms a

21 Pitman-Yor Pitman- Yor [7] n -gram W w n-gram G Pitman-Yor P Y (d, θ, G 0 ) (1) G P Y (d, θ, G 0 ) (1) Pitman-Yor d, θ, G 0 d 0 d 1 θ Pitman-Yor G

No. 3 Oct The person to the left of the stool carried the traffic-cone towards the trash-can. α α β α α β α α β α Track2 Track3 Track1 Track0 1

IPSJ SIG Technical Report Vol.2010-CVIM-170 No /1/ Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Ta

自然言語処理24_705

[1] [2] [3] 1 GPS 1 Twitter *1 *1 GPS [4] [5] [6] 2 [7] 1 [8] Restricted Boltzmann Machine RBM RBM

Modal Phrase MP because but 2 IP Inflection Phrase IP as long as if IP 3 VP Verb Phrase VP while before [ MP MP [ IP IP [ VP VP ]]] [ MP [ IP [ VP ]]]

gengo.dvi

The 18th Game Programming Workshop ,a) 1,b) 1,c) 2,d) 1,e) 1,f) Adapting One-Player Mahjong Players to Four-Player Mahjong

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

Haiku Generation Based on Motif Images Using Deep Learning Koki Yoneda 1 Soichiro Yokoyama 2 Tomohisa Yamashita 2 Hidenori Kawamura Scho

main.dvi

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe

1 7.35% 74.0% linefeed point c 200 Information Processing Society of Japan

IPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-

4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q

Computational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate catego

IPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.

DEIM Forum 2010 A Web Abstract Classification Method for Revie

(MIRU2008) HOG Histograms of Oriented Gradients (HOG)

1 1 CodeDrummer CodeMusician CodeDrummer Fig. 1 Overview of proposal system c

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s

JFE.dvi

自然言語処理21_249

[1], B0TB2053, i

TF-IDF TDF-IDF TDF-IDF Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Sat

,,.,.,,.,.,.,.,,.,..,,,, i

johnny-paper2nd.dvi

Input image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

Q [4] 2. [3] [5] ϵ- Q Q CO CO [4] Q Q [1] i = X ln n i + C (1) n i i n n i i i n i = n X i i C exploration exploitation [4] Q Q Q ϵ 1 ϵ 3. [3] [5] [4]

_314I01BM浅谷2.indd

1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,

IPSJ SIG Technical Report Vol.2017-SLP-115 No /2/18 1,a) 1 1,2 Sakriani Sakti [1][2] [3][4] [5][6][7] [8] [9] 1 Nara Institute of Scie

( : A8TB2163)

Study on Throw Accuracy for Baseball Pitching Machine with Roller (Study of Seam of Ball and Roller) Shinobu SAKAI*5, Juhachi ODA, Kengo KAWATA and Yu

IPSJ SIG Technical Report Vol.2009-CVIM-167 No /6/10 Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing

日本感性工学会論文誌

Vol. 42 No MUC-6 6) 90% 2) MUC-6 MET-1 7),8) 7 90% 1 MUC IREX-NE 9) 10),11) 1) MUCMET 12) IREX-NE 13) ARPA 1987 MUC 1992 TREC IREX-N

IPSJ SIG Technical Report 1,a) 1,b) 1,c) 1,d) 2,e) 2,f) 2,g) 1. [1] [2] 2 [3] Osaka Prefecture University 1 1, Gakuencho, Naka, Sakai,

Vol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information

[1] SBS [2] SBS Random Forests[3] Random Forests ii

IPSJ SIG Technical Report Vol.2014-HCI-158 No /5/22 1,a) 2 2 3,b) Development of visualization technique expressing rainfall changing conditions

Convolutional Neural Network A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolution

COM COM 4) 5) COM COM 3 4) 5) COM COM 6) 7) 10) COM Bonanza 6) Bonanza Hearts COM 7) 10) Hearts 3 2,000 4,000

2 ( ) i

3 2 2 (1) (2) (3) (4) 4 4 AdaBoost 2. [11] Onishi&Yoda [8] Iwashita&Stoica [5] 4 [3] 3. 3 (1) (2) (3)

Outline ACL 2017 ACL ACL 2017 Chairs/Presidents

3. ( 1 ) Linear Congruential Generator:LCG 6) (Mersenne Twister:MT ), L 1 ( 2 ) 4 4 G (i,j) < G > < G 2 > < G > 2 g (ij) i= L j= N

[1] AI [2] Pac-Man Ms. Pac-Man Ms. Pac-Man Pac-Man Ms. Pac-Man IEEE AI Ms. Pac-Man AI [3] AI 2011 UCT[4] [5] 58,990 Ms. Pac-Man AI Ms. Pac-Man 921,360

一般社団法人電子情報通信学会 THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGIN

24312.dvi

A Study on Throw Simulation for Baseball Pitching Machine with Rollers and Its Optimization Shinobu SAKAI*5, Yuichiro KITAGAWA, Ryo KANAI and Juhachi

IPSJ-TOD

[1] B =b 1 b n P (S B) S S O = {o 1,2, o 1,3,, o 1,n, o 2,3,, o i,j,, o n 1,n } D = {d 1, d 2,, d n 1 } S = O, D o i,j 1 i

Q-Learning Support-Vector-Machine NIKKEI NET Infoseek MSN i

TA3-4 31st Fuzzy System Symposium (Chofu, September 2-4, 2015) Interactive Recommendation System LeonardoKen Orihara, 1 Tomonori Hashiyama, 1

独立行政法人情報通信研究機構 Development of the Information Analysis System WISDOM KIDAWARA Yutaka NICT Knowledge Clustered Group researched and developed the infor

3807 (3)(2) ,267 1 Fig. 1 Advertisement to the author of a blog. 3 (1) (2) (3) (2) (1) TV 2-0 Adsense (2) Web ) 6) 3

IPSJ SIG Technical Report Vol.2014-GN-90 No.16 Vol.2014-CDS-9 No.16 Vol.2014-DCC-6 No /1/24 1,a) 2,b) 2,c) 1,d) QUMARION QUMARION Kinect Kinect

[4], [5] [6] [7] [7], [8] [9] 70 [3] 85 40% [10] Snowdon 50 [5] Kemper [3] 2.2 [11], [12], [13] [14] [15] [16]

HASC2012corpus HASC Challenge 2010,2011 HASC2011corpus( 116, 4898), HASC2012corpus( 136, 7668) HASC2012corpus HASC2012corpus

149 (Newell [5]) Newell [5], [1], [1], [11] Li,Ryu, and Song [2], [11] Li,Ryu, and Song [2], [1] 1) 2) ( ) ( ) 3) T : 2 a : 3 a 1 :

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

動画コンテンツ 動画 1 動画 2 動画 3 生成中の映像 入力音楽 選択された素片 テンポによる伸縮 音楽的構造 A B B B B B A C C : 4) 6) Web Web 2 2 c 2009 Information Processing S

DEIM Forum 2009 C8-4 QA NTT QA QA QA 2 QA Abstract Questions Recomme

IPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26 DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing

: ( 1) () 1. ( 1) 2. ( 1) 3. ( 2)

Izard 10 [1]Plutchik 8 [2] [3] Izard Neviarouskaya [4][5] 2.2 Hao [6] 1 Twitter[a] a) Shook Wikipedia

untitled

( : A9TB2096)

IPSJ SIG Technical Report Vol.2009-DBS-149 No /11/ Bow-tie SCC Inter Keyword Navigation based on Degree-constrained Co-Occurrence Graph

2). 3) 4) 1.2 NICTNICT DCRA Dihedral Corner Reflector micro-arraysdcra DCRA DCRA DCRA 3D DCRA PC USB PC PC ON / OFF Velleman K8055 K8055 K8055

HP cafe HP of A A B of C C Map on N th Floor coupon A cafe coupon B Poster A Poster A Poster B Poster B Case 1 Show HP of each company on a user scree

DEIM Forum 2009 B4-6, Str

36 Theoretical and Applied Linguistics at Kobe Shoin No. 20, 2017 : Key Words: syntactic compound verbs, lexical compound verbs, aspectual compound ve

IHI Robust Path Planning against Position Error for UGVs in Rough Terrain Yuki DOI, Yonghoon JI, Yusuke TAMURA(University of Tokyo), Yuki IKEDA, Atsus

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

27 YouTube YouTube UGC User Generated Content CDN Content Delivery Networks LRU Least Recently Used UGC YouTube CGM Consumer Generated Media CGM CGM U

2) TA Hercules CAA 5 [6], [7] CAA BOSS [8] 2. C II C. ( 1 ) C. ( 2 ). ( 3 ) 100. ( 4 ) () HTML NFS Hercules ( )

Lytro [11] The Franken Camera [12] 2.2 Creative Coding Community Creative Coding Community [13]-[19] Sketch Fork 2.3 [20]-[23] 3. ourcam 3.1 ou

untitled

計量国語学 アーカイブ ID KK 種別 特集 招待論文 A タイトル Webコーパスの概念と種類, 利用価値 語史研究の情報源としてのWebコーパス Title The Concept, Types and Utility of Web Corpora: Web Corpora as

レビューテキストの書き の評価視点に対する評価点の推定 29 3

2015 9

1 Kinect for Windows M = [X Y Z] T M = [X Y Z ] T f (u,v) w 3.2 [11] [7] u = f X +u Z 0 δ u (X,Y,Z ) (5) v = f Y Z +v 0 δ v (X,Y,Z ) (6) w = Z +

‰gficŒõ/’ÓŠ¹

Microsoft Word - toyoshima-deim2011.doc

% 95% 2002, 2004, Dunkel 1986, p.100 1

IPSJ SIG Technical Report An Evaluation Method for the Degree of Strain of an Action Scene Mao Kuroda, 1 Takeshi Takai 1 and Takashi Matsuyama 1

IPSJ SIG Technical Report iphone iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Proc

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came



2014/1 Vol. J97 D No. 1 2 [2] [3] 1 (a) paper (a) (b) (c) 1 Fig. 1 Issues in coordinating translation services. (b) feast feast feast (c) Kran

Transcription:

,,.,,,.,.,.,, Improvement in Domain Specific Word Segmentation by Symbol Grounding suzushi tomori, hirotaka kameko, takashi ninomiya, shinsuke mori and yoshimasa tsuruoka We propose a novel framework for improving a word segmenter using information acquired from symbol grounding. The framework uses a dataset consisting of pairs of non-textual information and a commentary. We generate a pseudo-stochastically segmented corpus from the commentaries, and then build a neural network to predict relationships between non-textual information and the words. We generate a domain specific term dictionary by using the neural network for word segmenter. We applied our method to game records of Japanese chess with commentaries. The experimental results show that the accuracy of a word segmenter can be improved by incorporating the generated dictionary. Key Words: symbol grounding, word segmentation, dictionary, Graduate School of Informatics, Kyoto University, Graduate School of Engineering, The University of Tokyo, Graduate School of Science and Engineering, Ehime University, Academic Center for Computing and Media Studies, Kyoto University

Vol. 13 No. 2 April 2006 AB!"#$%&'()!"#$%&'() *+!"#$%&'() *+ *+, >?@ -./0!"#"$ '%(%)%*%+, '(%)*%+%, '(%)%*+%, '%(%)*%+%, 1.23 $%'()9:, CDEFGHIJKEL!-#"$ '( 678 -.;<= &'()()('*()+ &'()()('*()+ &'()()('*()+ ' ( '(!,#"$ -./0 45 $%%'(%)%9:%, ) 1 1,, web.,, (Farhadi, Hejrati, Sadeghi, Young, Rashtchian, Hockenmaier, and Forsyth 2010)(Yang, Teo, Daumé, and Aloimonos 2011)(Rohrbach, Qiu, Titov, Thater, Pinkal, and Schiele 2013). Kiros (Kiros, Salakhutdinov, and Zemel 2014).,.,.,., (Mori, Richardson, Ushiku, Sasada, Kameko, and Tsuruoka 2016).,,.. 3 ( 1).,.., 2

,,,,.,.. 2. 3,. 4,,,. 5, 6. 7. 2,., (Mori and Takuma 2004).,.,,,,., ( 2009). 2.1 C r (, x nr 1 ) P i. P i x i x i+1. x i x i+1 (Fan, Chang, Hsieh, Wang, and Lin 2008).,. 1 (P 0 = P nr = 1). f r (w). f r (w) = k 1 P i (1 P i+j ) P i+k (1) i O j=1 O = {i x i+k i+1 = w}, O x i+k i+1 i. 3

Vol. 13 No. 2 April 2006 2.2.,,, ( 2009).,.,,,., 1,,.,,..,. For i = 1 to n r 1 (1) x i (2) 0 < p < 1 p (3) if p < P i : otherwise: m, x i x i+1 m P i. m, P i P i 0. 3,., (Mori et al. 2016)., (Regneri, Rohrbach, Wetzel, Thater, Schiele, and Pinkal 2013). 3.1 2,.,. 4

,,,,,,.,,. 3.2 S i (i = 1,..., n) C i, S i f(s i ). n., C i C i. C i m C ij (j = 1,..., m),, ( m = 4 ). f(s i ) C i 3. 100,. d (d ),,. 1, 0, 2., Bag-of-Words 3. (Tsuruoka, Yokoyama, and Chikayama 2002),.. a: b: c: a b d: c, 2 ( 2 ) 3, d,. a, b, c, 94.7%, 87.9%.,,..,,,. 5

Vol. 13 No. 2 April 2006 4. 4.1, (Neubig, Nakata, and Mori 2011). x = x 1,..., x nr. (Fan et al. 2008) P i = 1 P i = 0. 6 n-gram n-gram (n = 1, 2, 3)., n-gram. 4.2,...,,.,., S i d.,. sum d, R%. max d, R%. each d R%., S 1, S 2 [,,,, ] 5, 40%. S 1 [1.4, 1.5, 0.2, 0.5, 3.8] S 2 [4.9, 0.8, 0.1, 0.9, 3.2] 6

,,,, 1 ( ) 33,151 - - 33,151 BCCWJ-train 56,753 1324,951 1,911,660 0 8,164 240,097 361,843 0 11,700 147,809 197,941 0 ( ) 253 3,898 4,961 137 BCCWJ-test 6,025 148,929 212,261 0 ( ) 3,000 21,261 26,767 0 ( ) 1,788 31,220 41,104 928 sum S 1, S 2 [6.3, 2.3, 0.3, 1.4, 7.0] 40%. max [4.9, 1.5, 0.2, 0.9, 3.8]. each [1.4, 1.5, 0.2, 0.5, 3.8] [4.9, 0.8, 0.1, 0.9, 3.2] 40%. ( ).,. 5 4., ( ). 5.1 1., ( ) / /., 33,151.,., ( 7

Vol. 13 No. 2 April 2006 )., (BCCWJ) (Maekawa, Yamazaki, Ogiso, Maruyama, Ogura, Kashino, Koiso, Yamaguchi, Tanaka, and Den 2014), (1990-2000),. BCCWJ,. 5, 041, (253 ), (3, 000 ), (1, 788 ) 3. 1,.,.,,. 5.2 2. : UniDic (234, 652 ) 2. + :,. + :,. UniDic. UniDic,., 1. P i, P i 0.5, P i. m = 4, 4.,, R %,., (sum, max, each) R, R (F ). 1 /. 2 http://pj.ninjal.ac.jp/corpus_center/unidic/ 8

,,,, 2 BCCWJ (6, 025 ) F 99.36% 96.37% 99.37 + 99.34% 99.35% 99.34 3 (4, 788 ) F 90.78% 91.03% 90.90 + 90.84% 91.53% 91.19 + 90.92% 91.57% 91.24 R = 0.074, 110., each R = 0.074, 110. 5.3,, F. = = F = 2 + 2 BCCWJ, 3.. BCCWJ ( 2) ( 3),.. 3,, 1%. (Liu, Zhang, Che, Liu, and Wu 2014).. 3,, 9

Vol. 13 No. 2 April 2006 4 (3, 000 ) F 90.90% 88.91% 89.89 + 90.96% 89.51% 90.23 + 90.95% 89.44% 90.19 5 (1, 788 ) F 90.70% 92.47% 91.58 + 90.76% 92.91% 91.83 + 90.89% 93.03% 91.95.,.,,.,.., 2. 4 5. 4, 2,,..,.,, ( 5).,.,.,, 2., (4, 788 ) (F ). 12, 000. 10

,,,,!"#&)%!"#&(%!"#&'%!!"!"#&&%!"#&"%!"#&%!"#"!%!"#"$% *% '***% +***%!***% "&***% ")***% "$***% &"***% &(***% &,***% '"***% ''")"%!"#$ 2 6. (Nagata 1994)., Sproat (Sproat, Gale, Shih, and Chang 1996).., Neubig (Neubig et al. 2011),, BI. BIES (Xue 2003). BIES,,, 1. BI BIES CRF. 1., BIES CRF.., (Yang and Vozila 2014)(Jiang, Sun, Lü, Yang, and Liu 2013)(Liu et al. 2014).,, CRF., Tsuboi 11

Vol. 13 No. 2 April 2006 (Tsuboi, Kashima, Mori, Oda, and Matsumoto 2008), Mori (Mori and Nagao 1996). (Roy and Pentland 2002; Nguyen, Vogel, and Smith 2010). Roy,.,,. Nguyen..,,. 7,.,,..,., (Ma and Hinrichs 2015),. JSPS 26540190 16K00293, 25280084.. Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., and Lin, C.-J. (2008). LIBLINEAR: A Library for Large Linear Classification. Journal of Machine Learning Research, 9, pp. 1871 1874. Farhadi, A., Hejrati, M., Sadeghi, M. A., Young, P., Rashtchian, C., Hockenmaier, J., and Forsyth, D. (2010). Every Picture Tells a Story: Generating Sentences from Images. In 12

,,,, Proceedings of the 11th European Conference on Computer Vision, pp. 15 29. Jiang, W., Sun, M., Lü, Y., Yang, Y., and Liu, Q. (2013). Discriminative Learning with Natural Annotations: Word Segmentation as a Case Study. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 761 769. Kiros, R., Salakhutdinov, R., and Zemel, R. (2014). Multimodal Neural Language Models. In Proceedings of the 31st International Conference on Machine Learning, pp. 595 603. Liu, Y., Zhang, Y., Che, W., Liu, T., and Wu, F. (2014). Domain Adaptation for CRF-based Chinese Word Segmentation using Free Annotations. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, pp. 864 874. Ma, J. and Hinrichs, E. W. (2015). Accurate Linear-Time Chinese Word Segmentation via Embedding Matching. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics, pp. 1733 1743. Maekawa, K., Yamazaki, M., Ogiso, T., Maruyama, T., Ogura, H., Kashino, W., Koiso, H., Yamaguchi, M., Tanaka, M., and Den, Y. (2014). Balanced corpus of contemporary written Japanese. Language Resources and Evaluation, 48, pp. 345 371. Mori, S. and Nagao, M. (1996). Word Extraction from Corpora and Its Part-of-Speech Estimation Using Distributional Analysis. In Proceedings of the 16th International Conference on Computational Linguistics, pp. 1119 1122. Mori, S., Richardson, J., Ushiku, A., Sasada, T., Kameko, H., and Tsuruoka, Y. (2016). A Japanese Chess Commentary Corpus. In Proceedings of the Tenth International Conference on Language Resources and Evaluation, pp. 1415 1420. Mori, S. and Takuma, D. (2004). Word n-gram probability estimation from a Japanese raw corpus. In Proceedings of the Eighth International Conference on Speech and Language Processing, pp. 1037 1040. Nagata, M. (1994). A Stochastic Japanese Morphological Analyzer Using a forward-dp backward-a* N-best Search Algorithm. In Proceedings of the 15th Conference on Computational Linguistics, pp. 201 207. Neubig, G., Nakata, Y., and Mori, S. (2011). Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 529 533. Nguyen, T., Vogel, S., and Smith, N. A. (2010). Nonparametric Word Segmentation for Machine Translation. In Proceedings of the 23rd International Conference on Computational Linguistics, pp. 815 823. Regneri, M., Rohrbach, M., Wetzel, D., Thater, S., Schiele, B., and Pinkal, M. (2013). Ground- 13

Vol. 13 No. 2 April 2006 ing Action Descriptions in Videos. Transactions of the Association for Computational Linguistics, 1 (Mar), pp. 25 36. Rohrbach, M., Qiu, W., Titov, I., Thater, S., Pinkal, M., and Schiele, B. (2013). Translating Video Content to Natural Language Descriptions. In Proceedings of the 14th International Conference on Computer Vision, pp. 433 440. Roy, D. K. and Pentland, A. P. (2002). Learning words from sights and sounds: a computational model. Cognitive Science, 26 (1), pp. 113 146. Sproat, R., Gale, W., Shih, C., and Chang, N. (1996). A Stochastic Finite-state Wordsegmentation Algorithm for Chinese. Computational Linguistics, 22 (3), pp. 377 404. Tsuboi, Y., Kashima, H., Mori, S., Oda, H., and Matsumoto, Y. (2008). Training Conditional Random Fields Using Incomplete Annotations. In Proceedings of the 22nd International Conference on Computational Linguistics, pp. 897 904. Tsuruoka, Y., Yokoyama, D., and Chikayama, T. (2002). Game-Tree Search Algorithm Based On Realization Probability. Journal of the International Computer Games Association, 25, p. 2002. Xue, N. (2003). Chinese Word Segmentation as Character Tagging. The Association for Computational Linguistics and Chinese Language Processing, 8, pp. 29 48. Yang, F. and Vozila, P. (2014). Semi-Supervised Chinese Word Segmentation Using Partial- Label Learning With Conditional Random Fields. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, pp. 90 98. Yang, Y., Teo, C. L., Daumé, III, H., and Aloimonos, Y. (2011). Corpus-guided Sentence Generation of Natural Images. In Proceedings of 2011 the Conference on Empirical Methods in Natural Language Processing, pp. 444 454. (2009).., 16 (5), pp. 7 21. 2016.. 2015.. 2001.. 2006. 2010, 2017. 14

,,,, ( ).,,,,, ACL, ACM. 1998.,. 2007, 2016. ( ). 1997. 2010, 2013. 2010 58.,,, ACL. 2002.. 2005. 2009. 2011..,,,. 15