自然言語処理21_249

Similar documents
¥ì¥·¥Ô¤Î¸À¸ì½èÍý¤Î¸½¾õ

A Japanese Word Dependency Corpus ÆüËܸì¤Îñ¸ì·¸¤ê¼õ¤±¥³¡¼¥Ñ¥¹

IPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp

(2008) JUMAN *1 (, 2000) google MeCab *2 KH coder TinyTextMiner KNP(, 2000) google cabocha(, 2001) JUMAN MeCab *1 *2 h

21 Pitman-Yor Pitman- Yor [7] n -gram W w n-gram G Pitman-Yor P Y (d, θ, G 0 ) (1) G P Y (d, θ, G 0 ) (1) Pitman-Yor d, θ, G 0 d 0 d 1 θ Pitman-Yor G

1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,

( )

Modal Phrase MP because but 2 IP Inflection Phrase IP as long as if IP 3 VP Verb Phrase VP while before [ MP MP [ IP IP [ VP VP ]]] [ MP [ IP [ VP ]]]

kut-paper-template.dvi

大学における原価計算教育の現状と課題

自然言語処理24_705

2016

untitled

36 Theoretical and Applied Linguistics at Kobe Shoin No. 20, 2017 : Key Words: syntactic compound verbs, lexical compound verbs, aspectual compound ve

1 1 tf-idf tf-idf i

IPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-

0701073‐立命‐社会システム15号/15‐9-招待-横井

Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels).

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

/ p p

2 : Open Clip Art Library [4] Microsoft Office PowerPoint Web PowerPoint 2 Yahoo! Web [5] SlideShare Yahoo! Web Yahoo! Web

Vol. 23 No. 4 Oct Kitchen of the Future 1 Kitchen of the Future 1 1 Kitchen of the Future LCD [7], [8] (Kitchen of the Future ) WWW [7], [3

3_23.dvi

@08470030ヨコ/篠塚・窪田 221号

The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). The material has been made available on the website

,,,,., C Java,,.,,.,., ,,.,, i

NINJAL Project Review Vol.3 No.3

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe

tikeya[at]shoin.ac.jp The Function of Quotation Form -tte as Sentence-final Particle Tomoko IKEYA Kobe Shoin Women s University Institute of Linguisti

自然言語処理16_2_45

fiš„v8.dvi

本文.indd

IPSJ SIG Technical Report Vol.2011-CE-110 No /7/9 Bebras 1, 6 1, 2 3 4, 6 5, 6 Bebras 2010 Bebras Reporting Trial of Bebras Contest for K12 stud

NINJAL Research Papers No.8

IPSJ SIG Technical Report Vol.2010-GN-74 No /1/ , 3 Disaster Training Supporting System Based on Electronic Triage HIROAKI KOJIMA, 1 KU

% 95% 2002, 2004, Dunkel 1986, p.100 1

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

大学論集第42号本文.indb

Unknown

-like BCCWJ CD-ROM CiNii NII BCCWJ BCCWJ


149 (Newell [5]) Newell [5], [1], [1], [11] Li,Ryu, and Song [2], [11] Li,Ryu, and Song [2], [1] 1) 2) ( ) ( ) 3) T : 2 a : 3 a 1 :

corpus.indd

Ł\1,4.ai

[4], [5] [6] [7] [7], [8] [9] 70 [3] 85 40% [10] Snowdon 50 [5] Kemper [3] 2.2 [11], [12], [13] [14] [15] [16]

THE JAPANESE JOURNAL OF PERSONALITY 2007, Vol. 15 No. 2, 217–227

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

e-learning e e e e e-learning 2 Web e-leaning e 4 GP 4 e-learning e-learning e-learning e LMS LMS Internet Navigware


A B C B C ICT ICT ITC ICT

田中ゆかり・早川洋平・冨田悠・林直樹



1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf



untitled

1: A/B/C/D Fig. 1 Modeling Based on Difference in Agitation Method artisoc[7] A D 2017 Information Processing

HASC2012corpus HASC Challenge 2010,2011 HASC2011corpus( 116, 4898), HASC2012corpus( 136, 7668) HASC2012corpus HASC2012corpus

wki_shuronn.pdf


【教】⑮長島真人先生【本文】/【教】⑮長島真人先生【本文】

Transcription:

1,327 Annotation of Focus for Negation in Japanese Text Suguru Matsuyoshi This paper proposes an annotation scheme for the focus of negation in Japanese text. Negation has a scope, and its focus falls within this scope. The scope of negation is the part of the sentence that is negated. The focus of negation is the part of the scope that is prominently negated. In natural language processing, correct interpretation of negated statements requires precise detection of the focus of negation in the statements. As a foundation for developing a focus detector, we have annotated a part of Rakuten Travel: User Review Data and a part of a newspaper subcorpus of the Balanced Corpus of Contemporary Written Japanese, with our annotation scheme. In this scheme, a negation cue in the text data is linked to the focus by annotation with identifying clues. These clues include focus particles such as wa and shika, and other expressions in the context. We report 1,327 negation cues and the foci in the corpora. Key Words: Negation, Focus of Negation, Corpus Annotation, Modality, Interdisciplinary Graduate School of Medicine and Engineering, University of Yamanashi

Vol. 21 No. 2 April 2014 1 MeCab 1 JUMAN 2 CaboCha 3 KNP 4 5 KNP SynCha 6 ( 2007) (1) [ ] (2) [ ] (1) (1) (2) (1) (2) (1) (2) 1 http://mecab.googlecode.com/svn/trunk/mecab/doc/index.html 2 http://nlp.ist.i.kyoto-u.ac.jp/index.php?juman 3 http://code.google.com/p/cabocha/ 4 http://nlp.ist.i.kyoto-u.ac.jp/index.php?knp 5 6 https://www.cl.cs.titech.ac.jp/ ryu-i/syncha/ 250

( 2007; Blanco and Moldovan 2011a) 2 ( 2009) 2 3 4 5 2 6 2 (Huddleston and Pullum 2002; 2010; 2007) ( 2007, 2009) ( 1986; 1999; 2009; 2009) BioScope (Vincze, Szarvas, Farkas, Móra, and Csirik 2008) not without Morante 251

Vol. 21 No. 2 April 2014 (Morante, Liekens, and Daelemans 2008) Li BioScope (Li, Zhou, Wang, and Zhu 2010) *SEM 2012 7 Shared task 1 Conan Doyle 8 ( 2011) Blanco PropBank (Babko-Malaya 2005) (Blanco and Moldovan 2011a) (1) not MNEG (2) MNEG (3) A0, A1, A2, TMP, LOC 9 Blanco (Blanco and Moldovan 2011a, 2011b) *SEM 2012 Shared task 1 10 Rosenberg 4 (Rosenberg and Bergler 2012) 1 ( 2010) 3 7 http://ixa2.si.ehu.es/starsem/ 8 http://www.clips.ua.ac.be/sem2012-st-neg/ 9 MNEG 10 http://www.clips.ua.ac.be/sem2012-st-neg/ 252

3.1 (BCCWJ) 11 12 (3) [PN1a 00002] (4) [PN2f 00002] (5) [PN2f 00003] (6) [PN4g 00001] 1 ( 2007) 13 (3) (4) (5) (6) WHO 11 http://www.ninjal.ac.jp/corpus center/bccwj/ 12 PN BCCWJ 13 c- ( 2010; 2006) 253

Vol. 21 No. 2 April 2014 ( 2007; Blanco and Moldovan 2011a) (5) (4) 1 3.2 3 ( 1989; 2007) 1 (7) [PN1b 00004] 3.3 2 1 2 254

2 ( 1998; 2000) ( 1989; 1998) 3.4 (8) [PN2f 00002] (9) [PN2g 00004] (10) [PN3b 00004] (8) (9) (10) ( 2007) 14 (8) (10) 14 ( 2007) 255

Vol. 21 No. 2 April 2014 15 1 3.1 ( 2007) (11) (12) (11) [PN1e 00004] (12) [PN1b 00002] 3.5 ( 2009) (13) [PN3d 00003] (14) [PN3b 00004] (13) (14) 16 15 3.1 1 16 (14) 256

(14) 3.1 (13) 3.1 (13 ) (13 ) [ ] ( 2009) (15) [PN1e 00003] 3.1 2 ( 1986; 1999; 2009; 2009) 2 2 ( 1999) (16) [( 1999) p. 29] 4.1 3.1 257

Vol. 21 No. 2 April 2014 3.6 2 ( 2007) i j (17) 1 j i j [ ] (18) j i j [ ] (19) k j i i j [ ] (20) j i j [ ] (17) 2 1 (18) (19) 3 2 (20) 17 (17) (18) 3.1 (17) 3.1 1 (19) 3.1 17 5 258

3.5 3.1 (21) i i j [ ] 4 4.1 1 3.1 (1) (2) (3) A B (3) 4.2 5 259

Vol. 21 No. 2 April 2014 ID ID 3.2 YYYYMMDD UniDic 18 7 ID ID - - - - - - 1 1 1 1 - - 18 http://sourceforge.jp/projects/unidic/ 260

19 1 20 1 1 1 4.3 4.4 1 XML 3.1 (3) 19 20 1 2 261

Vol. 21 No. 2 April 2014 1 XML [PN1a 00002] 1 <sentence> <SUW> <tok> 1 ID BCCWJ XML -f 3 CaboCha <sentence> ID XML <wsb:negation> <wsb:focus> <wsb:description> <wsb:clue> 21 <wsb:negation> 1 <sentence> 4.2 @wsb:orthtoken : @wsb:morphid : ID @wsb:pos : @wsb:doublenegative : 21 wsb 262

@wsb:lastupdate : <wsb:focus> <wsb:negation> 1 @wsb:scope <wsb:description> <wsb:clue> <wsb:focus> 22 @wsb:orthtoken : @wsb:morphid : ID @wsb:argtypes : @wsb:toritate : @wsb:class : <wsb:description> 1 <wsb:clue> @wsb:sid : ID @wsb:orthtokens :. @wsb:morphids : ID. <wsb:clue> 2 <sentence> 1 BCCWJ XML XML CaboCha ( 2010) 5 2 (1) 23 : (2) BCCWJ (PN) 22 @wsb:numofcandidates 4.2 1 pl 23 http://travel.rakuten.co.jp/ 263

Vol. 21 No. 2 April 2014 5.1 : : ( 2012) 24 90% 1 58 10 58 40 5,178 1,246 5.2 BCCWJ BCCWJ 1/100 25 340 1 54 A 1 XML <sentence> 2,708 406 5.3 4.4 XML HTML HTML 2 HTML XML 100 3 XML 2 2 304 103 2 2 3 24 25 http://d.hatena.ne.jp/masayua/20120807/1344313720 264

2 HTML 1 1 1 5.4 2 1 2 1,023 304 2 301 72 29% (301/1,023) 24% (72/304) 30% 265

Vol. 21 No. 2 April 2014 2 1 2 3 2 35% (129/373) 3.5 4 - - 26 1 2 637 173 810 116 33 149 19 34 53 211 53 264 28 6 34 12 5 17 (1,023) (304) (1,327) 94 30 124 121 72 193 8 0 8 (223) (102) (325) 1,246 406 1,652 141 18 159 30 5 35 7 6 13 49 11 60 17 6 23 5 4 9 3 2 5 3 1 4 1 2 3 20 7 27 8 8 16 1 0 1 1 2 3 1 0 1 14 0 14 301 72 373 26 1 266

86 20 8 1 160 4.2 5 2 2 373 375 87% (327/375) 3 4 66 13 79 34 7 41 7 1 8 0 1 1 107 22 129-13 5 18-27 12 39-10 9 19-40 3 43-10 5 15-43 12 55-125 15 140-19 11 30-14 0 14 301 72 373 5 271 56 327 32 16 48 303 72 375 267

Vol. 21 No. 2 April 2014 6 2 3 1 2 BCCWJ 3 ( 2013) BCCWJ (B) 25870278 268

Babko-Malaya, O. (2005). PropBank Annotation Guidelines. ACE (Automatic Content Extraction) Program. http://verbs.colorado.edu/~mpalmer/projects/ace/pbguidelines. pdf. Blanco, E. and Moldovan, D. (2011a). Semantic Representation of Negation Using Focus Detection. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, pp. 581 589. Blanco, E. and Moldovan, D. (2011b). Some Issues on Detecting Negation from Text. In Proceedings of the 24th International Florida Artificial Intelligence Research Society Conference, pp. 228 233. (1998)... Huddleston, R. and Pullum, G. K. (Eds.) (2002). The Cambridge Grammar of the English Language. Cambridge University Press. (2006)... (2010)... (2011). Ver.2.4. Technical Report of Department of Information Science, Ochanomizu University. (2009).., 136, pp. 121 151. (2012).. 18, pp. 1188 1191. Li, J., Zhou, G., Wang, H., and Zhu, Q. (2010). Learning the Scope of Negation via Shallow Semantic Parsing. In Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010), pp. 671 679. (1998)... (2010)... D,, 93 (6), pp. 705 713. (1999).., 28, pp. 27 36. Morante, R., Liekens, A., and Daelemans, W. (2008). Learning the Scope of Negation in Biomedical Texts. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 715 724. (1989)... 269

Vol. 21 No. 2 April 2014 (2007). 3.. (2009). 5.. (2000)... (2009)... (1986)... (2013).. 19, pp. 936 939. Rosenberg, S. and Bergler, S. (2012). UConcordia: CLaC Negation Focus Detection at *Sem 2012. In Proceedings of the 1st Joint Conference on Lexical and Computational Semantics: SemEval 12, pp. 294 300. Vincze, V., Szarvas, G., Farkas, R., Móra, G., and Csirik, J. (2008). The BioScope Corpus: Biomedical Texts Annotated for Uncertainty, Negation and their Scopes. In BMC Bioinformatics, pp. 1 9. 2003 2008 2013 9 20 2013 11 28 2013 12 13 270