untitled



Similar documents
K-A05.dvi

1 1 tf-idf tf-idf i

自然言語処理21_125

DEIM Forum 2009 E

Vol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s

IPSJ SIG Technical Report Vol.2010-SLDM-144 No.50 Vol.2010-EMB-16 No.50 Vol.2010-MBL-53 No.50 Vol.2010-UBI-25 No /3/27 Twitter IME Twitte

untitled

1 Web Web 1,,,, Web, Web : - i -

H1-H4*.ai

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

IPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-

TF-IDF TDF-IDF TDF-IDF Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Sat

Microsoft Word - toyoshima-deim2011.doc

1 4 4 [3] SNS 5 SNS , ,000 [2] c 2013 Information Processing Society of Japan

IPSJ SIG Technical Report Secret Tap Secret Tap Secret Flick 1 An Examination of Icon-based User Authentication Method Using Flick Input for

Vol. 9 No. 5 Oct (?,?) A B C D 132

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

SNS GIS Abstract The Tourism-based Country Promotion Basic Act was enacted in Japan over a decade ago. Tourism is expected to be the primary contribut

[2] , [3] 2. 2 [4] 2. 3 BABOK BABOK(Business Analysis Body of Knowledge) BABOK IIBA(International Institute of Business Analysis) BABOK 7

untitled

IPSJ SIG Technical Report An Evaluation Method for the Degree of Strain of an Action Scene Mao Kuroda, 1 Takeshi Takai 1 and Takashi Matsuyama 1

A Japanese Word Dependency Corpus ÆüËܸì¤Îñ¸ì·¸¤ê¼õ¤±¥³¡¼¥Ñ¥¹

1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,

DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

21 Pitman-Yor Pitman- Yor [7] n -gram W w n-gram G Pitman-Yor P Y (d, θ, G 0 ) (1) G P Y (d, θ, G 0 ) (1) Pitman-Yor d, θ, G 0 d 0 d 1 θ Pitman-Yor G

kut-paper-template.dvi

2006 [3] Scratch Squeak PEN [4] PenFlowchart 2 3 PenFlowchart 4 PenFlowchart PEN xdncl PEN [5] PEN xdncl DNCL 1 1 [6] 1 PEN Fig. 1 The PEN

知能と情報, Vol.30, No.5, pp

2 : Open Clip Art Library [4] Microsoft Office PowerPoint Web PowerPoint 2 Yahoo! Web [5] SlideShare Yahoo! Web Yahoo! Web

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

IPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp

29 jjencode JavaScript

3_23.dvi

(2008) JUMAN *1 (, 2000) google MeCab *2 KH coder TinyTextMiner KNP(, 2000) google cabocha(, 2001) JUMAN MeCab *1 *2 h

1 7.35% 74.0% linefeed point c 200 Information Processing Society of Japan

Vol. 48 No. 4 Apr LAN TCP/IP LAN TCP/IP 1 PC TCP/IP 1 PC User-mode Linux 12 Development of a System to Visualize Computer Network Behavior for L

untitled

Web

IT i

Core Ethics Vol. Nerriere D.Hon EU GS NPO GS GS Oklahoma State University Kyoto Branch OSU-K OSU-K OSU-K

60 90% ICT ICT [7] [8] [9] 2. SNS [5] URL 1 A., B., C., D. Fig. 1 An interaction using Channel-Oriented Interface. SNS SNS SNS SNS [6] 3. Processing S


22 Google Trends Estimation of Stock Dealing Timing using Google Trends

大学における原価計算教育の現状と課題


DEIM Forum 2010 A Web Abstract Classification Method for Revie


36 Theoretical and Applied Linguistics at Kobe Shoin No. 20, 2017 : Key Words: syntactic compound verbs, lexical compound verbs, aspectual compound ve

2reN-A14.dvi


( )

日本感性工学会論文誌

07九州工業大学.indd

行動経済学 第5巻 (2012)

06’ÓŠ¹/ŒØŒì

12) NP 2 MCI MCI 1 START Simple Triage And Rapid Treatment 3) START MCI c 2010 Information Processing Society of Japan

16_.....E...._.I.v2006


揃 Lag [hour] Lag [day] 35

Ł×

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.

2. Twitter Twitter 2.1 Twitter Twitter( ) Twitter Twitter ( 1 ) RT ReTweet RT ReTweet RT ( 2 ) URL Twitter Twitter 140 URL URL URL 140 URL URL

IPSJ SIG Technical Report Vol.2016-CE-137 No /12/ e β /α α β β / α A judgment method of difficulty of task for a learner using simple

1 AND TFIDF Web DFIWF Wikipedia Web Web AND 5. Wikipedia AND 6. Wikipedia Web Ma [4] Ma URL AND Tian [8] Tian Tian Web Cimiano [3] [

Introduction Purpose This training course describes the configuration and session features of the High-performance Embedded Workshop (HEW), a key tool

ISSN NII Technical Report Patent application and industry-university cooperation: Analysis of joint applications for patent in the Universit

Web Web Web Web 1 1,,,,,, Web, Web - i -


FA

% 95% 2002, 2004, Dunkel 1986, p.100 1

/ p p


IT,, i

2006 3

Microsoft Word - deim2011_new-ichinose doc

1 2. Nippon Cataloging Rules NCR [6] (1) 5 (2) 4 3 (3) 4 (4) 3 (5) ISSN 7 International Standard Serial Number ISSN (6) (7) 7 16 (8) ISBN ISSN I

Core Ethics Vol.

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St

市区町村別平均寿命の全国順位の変化からみた青森県市町村平均寿命の解析

¥ì¥·¥Ô¤Î¸À¸ì½èÍý¤Î¸½¾õ

EQUIVALENT TRANSFORMATION TECHNIQUE FOR ISLANDING DETECTION METHODS OF SYNCHRONOUS GENERATOR -REACTIVE POWER PERTURBATION METHODS USING AVR OR SVC- Ju

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

計量国語学 アーカイブ ID KK 種別 特集 招待論文 A タイトル Webコーパスの概念と種類, 利用価値 語史研究の情報源としてのWebコーパス Title The Concept, Types and Utility of Web Corpora: Web Corpora as

総 説 6 6 PIMs P S J 7

DEIM Forum 2009 C8-4 QA NTT QA QA QA 2 QA Abstract Questions Recomme

IPSJ SIG Technical Report Vol.2011-DBS-153 No /11/3 Wikipedia Wikipedia Wikipedia Extracting Difference Information from Multilingual Wiki

IPSJ SIG Technical Report Pitman-Yor 1 1 Pitman-Yor n-gram A proposal of the melody generation method using hierarchical pitman-yor language model Aki

kiyo5_1-masuzawa.indd

framing 2 3 reframing 4 LRT LRT LRT LRT 5 2LRT LRT 2.1 LRT JR JR8.0km 45, JR LRT LRT JR3 5 7,000 6,000 5,000 4,000

Table 1. Assumed performance of a water electrol ysis plant. Fig. 1. Structure of a proposed power generation system utilizing waste heat from factori


04_奥田順也.indd


論文9.indd

<4D F736F F D EC959F90B781758BA38B5A82A982E982BD82CC897282DD82C982A882AF82E B837D EFC CC93C192A D34392E646F63>

& Vol.2 No (Mar. 2012) 1,a) , Bluetooth A Health Management Service by Cell Phones and Its Us

thesis.dvi

Transcription:

580 26 5 SP-G 2011 AI An Automatic Question Generation Method for a Local Councilor Search System Yasutomo KIMURA Hideyuki SHIBUKI Keiichi TAKAMARU Hokuto Ototake Tetsuro KOBAYASHI Tatsunori MORI Otaru University of Commerce kimura@res.otaru-uc.ac.jp, http://minna.ih.otaru-uc.ac.jp Yokohama National University shib@forest.eis.ynu.ac.jp Utsunomiya Kyowa University takamaru@kyowa-u.ac.jp Fukuoka University ototake@fukuoka-u.ac.jp National Institute of Informatics k-tetsu@nii.ac.jp Yokohama National University mori@forest.eis.ynu.ac.jp keywords: local politics, question generation, information extraction Summary This paper presents an automatic question generation method for a local councilor search system. Our purpose is to provide residents with information about local council activities in an easy-to-understand manner. Our designed system creates a decision tree with leaves that correspond to local councilors in order to clarify the differences in the activities of local councilors using local council minutes as the source. Moreover, our system generates questions for selecting the next branch at each condition in the decision tree. We confirmed experimentally that these questions are appropriate for the selection of branches in the decision tree. 1. TV 1 22 A4 200

581 1 n 2 3 4 5 6 2. 2 [ 08, 09, 09a, Takamaru 09, 09, 09b, 10, 10] ( ) 1 2 2 1 SVM 2 2 2 SVM 1 2 1 2 2 1 1 [ 10]

582 26 5 SP-G 2011 [ 09] 20 3 2 1 4 [ 08] 96 1 19 1 19 7,084 1 7,084 2 2 2 59 3 [ 09] XML <Paragraph> 2 1 100% 3 20 180 Web 63 4 1 1000 1010 4,236 59.8% 1011 565 8.0% 1012 821 11.6% 1020 859 12.1% 1021 543 7.7% 1030 1,112 15.7% 1050 704 9.9% 1060 479 6.8% 1061 585 8.3% 1062 471 6.6% 1100 1,500 21.2% 1101 1,739 24.5% 1120 1,306 18.4% 1121 1,268 17.9% 1160 880 12.4% 1162 498 7.0% 2000 2013 427 6.0% 2065 396 5.6% 3000 3030 1,112 5.6% 3040 988 13.9% 3060 548 7.7% 4000 4020 646 9.1% 4110 415 5.9% 5000 5030 517 7.3% <Keyword> <Keyword> Member Category <Paragraph> Member 19 2 1 2 2 3.

583 2 <Paragraph Member= 37 > <Keyword Member= Category= 4110 > </Keyword> <Keyword Member= Category= 4110 > </Keyword> <Keyword Member= Category= 1050 > </Keyword> <Keyword Member= Category= 1050 > </Keyword> </Paragraph> <Paragraph Member= > </Paragraph> <Paragraph Member= > </Paragraph> <Paragraph Member= > <Keyword Member= Category= 1010;1101 > </Keyword> <Keyword Member= Category= 1101 > </Keyword> <Keyword Member= Category= 1030 > </Keyword> <Keyword Member= Category= 1030 > </Keyword> </Paragraph> <Paragraph Member= > </Paragraph> 2 3 2 2 ID3 C4.5 C4.5 Weka J48 4 [ 03] 3 1 Weka J48 4 http://www.cs.waikato.ac.nz/ml/weka/

584 26 5 SP-G 2011 3 A B C 1 5 10 10 25 2 10 10 20 40 3 5 10 20 35 20 30 50 100 0.2 0.3 0.5 1.0 4 3 210 8.5 13 5 9 367 9.4 14 6 10 3 2 3 1 1 1 1 n P1 P2 P1 P2 M1 P2 P1 M2 n n 1 3 3 1 3 3 19 19 4 19 1 19 2-4 2 20 49 3 1,140 18,424 20 C 3 =1, 140 49 C 3 =18, 424 3 25 3 210 367 4 9 3

585 2 (1) (2) 1 19 800 2 3 3 2 3 1 1 4 2 3 A ID=1 A C A C A B B ID=2 B ID=5 ID=5 A B

586 26 5 SP-G 2011 5 A B A B 97 63 25 24 13 13 2 4. 4 1 3 2 [ 09] 6 A B A B A B A B A B 2 [ 99] [ 09] 2 - - - A B A B - A B A B A B A B 4 2 A B 19 - A B 19 A B 14,336 5 A B 4 1 5 A B - - A B IPADIC 5 - - IPADIC - IPADIC - - - A ( / / ) B A B [ 00] A BA B A B A B GoogleN-gram [Google 07] Google N-gram 7 20 20 14,336 805 6 5 http://chasen.aist-nara.ac.jp/chasen/doc/ipadic-2.6.3-j.pdf

587 A B A B A 4 3 4 1 A B 96 19 1,667 Google N-gram A B A B CaboCha [Kudo 03] CaboCha A 7 7 A B 20 3 5 3 3 2 4 A 8 6 6 0 4 4 4 3 A A B 8 20 7 A B 7 A B A B 20 90 90 6 2 79%=(71/90) 1 A B A B ( ) A B A B 7 #

588 26 5 SP-G 2011 6 Google N-gram A B Google N-gram A B 25 43 13 31 13 70 12 51 12 20 11 95 11 71 7 1 2 3 4 5 6 1 2 1 2 3 4 5 3 ( ) 1 2 3 4 5 4 1 2 3 4 5 IDF 4 5 4 4 (1) Baseline() - A B A B Google N-gram A B Baseline (2) A B A B (3) 3 (4) IDF(Inverse Document Frequency) ICF(Inverse Category Frequency) ICF IDF Document Category Document CF t ICF t

589 8 1 3.60 2 3.94 3 # 3.41 4 3.80 5 3.57 6 4.00 7 4.17 8 4.30 9 # 3.62 10 4.17 11 # 3.38 12 # 4.41 13 2.24 14 # 3.08 15 # 4.01 16 # 3.57 17 3.60 18 4.30 19 3.74 20 3.72 74.63 # A B N ICF(w i )=1+log 2 CF(w i ) w i N 19 7,084 19 96 96 96 CF( ) ICF ICF ICF ICF = 1 n n ICF(w i ) i=1 ICF ICF( ) ICF + L (L) ICF ICF ICF + L = 1 n n ICF(w i ) logl(w i ) i=1 L(w i ) w i ICF ICF( )+L 5. 5 1 A B 4 3 20

590 26 5 SP-G 2011 19 37 2 4 3 2 = 4 3 = 5 2 20 90 4-5 9 0.85(=17/20) 0.9887 8 1 2 8 4 3 74.63 ICF 3 1 2 3 Mecab 1 IPA : 2 1 : 3 : : 9 + 10 10 : 0.95(=19/20) 5 3 19 4 3

591 9 / / Baseline() 7/20 0.35 61.84/74.63 0.8286 ( ) 15/20 0.75 66.62/74.63 0.8927 ( ) 5/20 0.25 46.74/74.63 0.6227 ( ) 16/20 0.80 70.24/74.63 0.9412 17/20 0.85 73.79/74.63 0.9887 ICF 3/20 0.15 44.54/74.63 0.5968 ICF( ) 4/20 0.20 51.91/74.63 0.6956 ICF + L 6/20 0.30 40.60/74.63 0.5440 ICF( )+L 5/20 0.25 45.10/74.63 0.6043 10 / / 17/20 0.85 73.79/74.63 0.9887 + 15/20 0.75 66.62/74.63 0.8927 + 17/20 0.85 73.79/74.63 0.9887 + + 16/20 0.80 70.24/74.63 0.9412 + : 19/20 0.95 74.51/74.63 0.9983 + :+ 19/20 0.95 71.60/74.63 0.9593 11 / / Baseline() 8/37 0.22 98.86/128.36 0.7702 ( ) 21/37 0.57 109.40/128.36 0.8523 ( ) 20/37 0.54 103.26/128.36 0.8045 ( + ) 22/37 0.60 110.83/128.36 0.8634 24/37 0.65 115.74/128.36 0.9016 + 24/37 0.65 116.00/128.36 0.9037 + 25/37 0.68 117.17/128.36 0.9128 + + 25/37 0.68 117.43/128.36 0.9148 + : 23/37 0.62 112.14/128.36 0.8736 + :+ 22/37 0.60 110.11/128.36 0.8578 37 133 11 + : + : + + + + 0.6756(=25/37) + + 0.9148 + + + + 6.

592 26 5 SP-G 2011 http://www.hokkaido-politics.net 2 twitter( ) 22300086 [ 03],,, Boosting, 4 (2003) [Google 07] Google Web N 1 by Google, GSK GSK2007-C (2007) [ 08],,,,,, 2008-NL-187, pp. 23 28 (2008) [ 00],, N1 N2,, Vol. 7, No. 4, pp. 79 98 (2000) [ 09a],, 2009 (2009) [ 09b],,,,25 1, pp. 100 118 (2009) [ 10],,,, 16, pp. 563 566 (2010) [Kudo 03] Kudo, T. and Matsumoto, Y.: Fast Methods for Kernel- Based Text Analysis, ACL 2003 (2003) [ 99], A B,, Vol. 129, No. 16, pp. 109 116 (1999) [ 09],,,,, 15, pp. 298 301 (2009) [ 10],,,,, NLC2010-1, Vol. 110, No. 142, pp. 7 12 (2010) [ 09],,,,, Vol. 109, No. 234, pp. 25 30 (2009) [Takamaru 09] Takamaru, K., Shibuki, H., Kimura, Y., Hasegawa, D., Ototake, H., and Araki, K.: Extraction of Political Activity of Assemblyman from Minutes of Municipal Assemblies Using the Political Category, Proc. 11th Conference of Pacific Association for Computational Linguistics (PACLING 2009), p. B11 (2009) [ 09], - -,, Vol. 25, No. 1, pp. 61 73 (2009) 2010 8 1

593 2004 ( ) 2005 2007 2010 10 2011 9 New York 2002 ( ) 2006 ( ) 2002 2008 2004 2006 2010 2007. 2007, Rutgers, Stanford 1991 1998 2 11 Stanford CSLI ACM