305 8550 1 2 CREST fujii@slis.tsukuba.ac.jp 1 7% 2 2 3 PRIME Multi-lingual Information Retrieval 2 2.1 Cross-Language Information Retrieval CLIR 1990 CD-ROM a. b. c. d. b CLIR b 70%
CLIR CLIR 2.2 (b) 2 (c) 2.3 3 (a) 3 2.4
4 Altavista 4 Excite 5 3 3.1 PRIME 2.5 1998 4 CLIR [2,3,4,5,7,16,17] NTCIR [1, 6] 1960 [10] Cross-Language Information Retrieval PRIME Patent Retrieval In Multi-lingual Environment 6 2000 7 2001 3 [8, 14] 2001 4 2002 3 [9, 11] PRIME [15] 2002 4 2003 3 [13] [12] / TREC 1 CLEF 2 3.2 NTCIR 3 PRIME 1 3 NTCIR-3
1 http://trec.nist.gov/
2 http://clef.iei.pi.cnr.it/
3 http://research.nii.ac.jp/ntcir/
4 http://www.altavista.com
5 http://www.excite.co.jp
6 http://patolis-e.patolis.co.jp/
DB PRIME 19 100 1: PRIME 3.5 collaboration PAT-Transer 3.3 3.3.1 1 95% PAJ (Patent Abstracts of Japan) CD-ROM 1995 1999 5 175 5 1995 1999 32,000 3.3.3
7 http://chasen.aist-nara.ac.jp/
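For an input compound word U, a single translation D is output, where U = u_1, u_2, ..., u_n and D = d_1, d_2, ..., d_n. The D that maximizes P(D|U) is selected, which Eq. (1) rewrites as the D that maximizes P(U|D) P(D).

\[ \arg\max_{D} P(D \mid U) = \arg\max_{D} P(U \mid D)\, P(D) \qquad (1) \]

P(U|D) P(D) is approximated by Eq. (2).

\[ P(U \mid D)\, P(D) \approx \prod_{i=1}^{n} P(u_i \mid d_i) \cdot \prod_{i=1}^{n-1} P(d_{i+1} \mid d_i) \qquad (2) \]

P(d_{i+1}|d_i) 1 P(u_i|d_i) 3.3.2

address space control block; associative learning; associative memory; associative record; compressor research facility; correlation function; factor correlation; hash associative memory
Figure 2:

address, associative, block, compressor, control, correlation, facility, factor, function, hash, learning, memory, record, research, space
3.3.2
Figure 3: 3.3.1

EM P(u_i|d_i)
If a base word d has 3 correspondences, 1 to "associative" and 2 to "correlation", Eq. (3) gives:

\[ P(\text{associative} \mid d) = 1/3, \qquad P(\text{correlation} \mid d) = 2/3 \qquad (3) \]

P(u_i|d_i) 2 EM 23,313 10,724 600 d_i 95% P(u_i|d_i) u_i 2 3 d_i 2 3.3.3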
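The selection rule in Eqs. (1) to (3) can be illustrated with a minimal Python sketch. It assumes a relative-frequency estimate of P(u_i|d_i) from aligned base-word pairs and an exhaustive search over candidate translations. The function names, the placeholder target words `D_A`, `D_B`, `D_M`, and the bigram values are hypothetical and serve only as an illustration; they are not taken from PRIME.

```python
from collections import Counter, defaultdict
from fractions import Fraction
from itertools import product


def estimate_translation_probs(aligned_pairs):
    """Relative-frequency estimate of P(u | d) from (u, d) base-word pairs."""
    pair_counts = Counter(aligned_pairs)
    d_totals = Counter(d for _, d in aligned_pairs)
    probs = defaultdict(dict)
    for (u, d), count in pair_counts.items():
        probs[d][u] = Fraction(count, d_totals[d])
    return probs


def best_translation(source_words, trans_probs, bigram_probs):
    """Return (D, score) maximizing prod P(u_i|d_i) * prod P(d_{i+1}|d_i)."""
    # Candidate target words for each source word: every d with P(u | d) > 0.
    candidates = [
        [d for d, table in trans_probs.items() if u in table]
        for u in source_words
    ]
    best, best_score = None, Fraction(0)
    for D in product(*candidates):
        score = Fraction(1)
        for u, d in zip(source_words, D):
            score *= trans_probs[d][u]          # translation model term
        for prev, nxt in zip(D, D[1:]):
            score *= bigram_probs.get((prev, nxt), Fraction(0))  # language model term
        if score > best_score:
            best, best_score = D, score
    return best, best_score


# Toy data (hypothetical): D_A aligns with "associative" once and
# "correlation" twice, mirroring the counts behind Eq. (3).
pairs = [
    ("associative", "D_A"), ("correlation", "D_A"), ("correlation", "D_A"),
    ("associative", "D_B"), ("memory", "D_M"),
]
trans_probs = estimate_translation_probs(pairs)
bigram_probs = {("D_A", "D_M"): Fraction(1, 10), ("D_B", "D_M"): Fraction(1, 2)}

translation, score = best_translation(["associative", "memory"], trans_probs, bigram_probs)
print(translation, score)
```

Exhaustive enumeration is used here only because the toy candidate sets are tiny; for longer compounds a Viterbi-style search over the same two factors would give the same maximizer without enumerating every combination.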
3.3.3 3.3.1 transliteration (si-su-temu) system -sy -s -te -m L/R C/K 3 2 1 0 21 4 4 te-ki-su-to text $ -te - x-t $ 423 1,018 (1) (2) U D u_i d_i P(D|U) D P(D|U) P(u_i|d_i)

        te   ki   su   to    $
   t     3    1    2    3    0
   e     0    0    0    0    0
   x     1    2    1    1    0
   t     3    1    2    3    0
   $     0    0    0    0    3
Figure 4:

(2) P(D) D P(D)=0 3.4 PRIME PRIME 5 natural language processing information retrieval AND Search 6 7 8
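To show how the similarity matrix between te-ki-su-to $ and t-e-x-t $ might be used to align the two strings, here is a minimal Python sketch of a dynamic-programming search for a monotone maximum-score path through such a matrix. The matrix values are those of the te/ki/su/to/$ by t/e/x/t/$ matrix; the allowed moves (right, down, diagonal), the additive path score, and the function name `best_alignment_path` are assumptions made for illustration and are not confirmed as the procedure used in PRIME.

```python
# Rows: English letters t, e, x, t, $; columns: symbols te, ki, su, to, $.
SIMILARITY = [
    [3, 1, 2, 3, 0],
    [0, 0, 0, 0, 0],
    [1, 2, 1, 1, 0],
    [3, 1, 2, 3, 0],
    [0, 0, 0, 0, 3],
]


def best_alignment_path(sim):
    """Find a monotone path from the top-left to the bottom-right cell
    that maximizes the sum of the similarity values it visits."""
    rows, cols = len(sim), len(sim[0])
    score = [[0] * cols for _ in range(rows)]
    back = [[None] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            # Predecessors reachable by a down, right, or diagonal step.
            preds = []
            if i > 0:
                preds.append((score[i - 1][j], (i - 1, j)))
            if j > 0:
                preds.append((score[i][j - 1], (i, j - 1)))
            if i > 0 and j > 0:
                preds.append((score[i - 1][j - 1], (i - 1, j - 1)))
            best_prev = 0
            if preds:
                best_prev, back[i][j] = max(preds)
            score[i][j] = sim[i][j] + best_prev
    # Trace the path back from the bottom-right cell.
    path, cell = [], (rows - 1, cols - 1)
    while cell is not None:
        path.append(cell)
        cell = back[cell[0]][cell[1]]
    path.reverse()
    return score[-1][-1], path


total, path = best_alignment_path(SIMILARITY)
print(total, path)
```

Because every cell value is non-negative, an additive score favors longer staircase paths; a scoring scheme that penalizes horizontal or vertical steps would be one way to make the path a tighter alignment.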
5: PRIME 3.5 3.5.1 2000 1 PRIME 157 1 2 2 1 1 2
6: 2 9 10 (31) (33) (31) 21 Appl.No. 3.5.2
7: [54] [ABSTRACT] 1995 1999 5 32,000 100 3,000 11 information retrieval 4 PRIME PRIME
Figure 8:

[1] Atsushi Fujii and Tetsuya Ishikawa. Cross-language information retrieval at ULIS. In Proceedings of the 1st NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition, pp. 163-169, 1999.
[2] Atsushi Fujii and Tetsuya Ishikawa. Cross-language information retrieval for technical documents. In Proceedings of the Joint ACL SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, pp. 29-37, 1999.
[3] Atsushi Fujii and Tetsuya Ishikawa. Cross-language information retrieval using compound word translation. In Proceedings of the 18th International Conference on Computer Processing of Oriental Languages, pp. 105-110, 1999.
[4] Atsushi Fujii and Tetsuya Ishikawa. Applying machine translation to two-stage cross-language information retrieval. In Proceedings of the 4th Conference of the Association for Machine Translation in the Americas, pp. 13-24, 2000.
[5] Atsushi Fujii and Tetsuya Ishikawa. Cross-language information retrieval based on query keyword translation: An Internet search application. International Journal of Computer Processing of Oriental Languages, Vol. 13, No. 1, pp. 1-13, 2000.
[6] Atsushi Fujii and Tetsuya Ishikawa. Evaluating multilingual information retrieval and clustering at ULIS. In Proceedings of the 2nd NTCIR Workshop Meeting on Evaluation of Chinese & Japanese Text Retrieval and Text Summarization, 2001.
Figure 9:

Figure 10:

alkenyl-containing polydiorganosiloxane; ion trap mass spectrometer; indolylalkylpiperazinyl pyridine; air bag canister; diorganopolysiloxane polymer; silicone pressure sensitive adhesive compositions; cellulose triacetate photographic; cellulose triacetate photographic film base; vulcanizable elastomeric compound; thermal dye transfer system; miniature scan confocal microscope; electronic-monetary system; peristaltic pump
Figure 11:
[7] Atsushi Fujii and Tetsuya Ishikawa. Japanese/English cross-language information retrieval: Exploration of query translation and transliteration. Computers and the Humanities, Vol. 35, No. 4, pp. 389-420, 2001.
[8] Masatoshi Fukui, Shigeto Higuchi, Youichi Nakatani, Masao Tanaka, Atsushi Fujii, and Tetsuya Ishikawa. Applying a hybrid query translation method to Japanese/English cross-language patent retrieval. In ACM SIGIR Workshop on Patent Retrieval, 2000.
[9] Shigeto Higuchi, Masatoshi Fukui, Atsushi Fujii, and Tetsuya Ishikawa. PRIME: A system for multi-lingual patent retrieval. In Proceedings of MT Summit VIII, pp. 163-167, 2001.
[10] P.E. Mongar. International co-operation in abstracting services for road engineering. The Information Scientist, Vol. 3, pp. 51-62, 1969.
[11],,,.., 2001-NL-145, pp. 23-28, 2001.
[12],,.. 9, 2003.
[13],,,.., 2002-NL-151, pp. 95-101, 2002.
[14],,,.. 7, pp. 445-447, 2001.
[15],,,. prime. 8, pp. 196-199, 2002.
[16],.., Vol. 41, No. 4, pp. 1038-1045, 2000.
[17],.., Vol. J84-D-II, No. 2, pp. 362-369, 2001.