Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Similar documents

GPGPU

06佐々木雅哉_4C.indd

総研大文化科学研究第 11 号 (2015)

h1

-like BCCWJ CD-ROM CiNii NII BCCWJ BCCWJ

.O../.O....

gengo.dvi

KIT33_h1-h4-2

8y4...l

A Japanese Word Dependency Corpus ÆüËܸì¤Îñ¸ì·¸¤ê¼õ¤±¥³¡¼¥Ñ¥¹

計量国語学 アーカイブ ID KK 種別 特集 招待論文 A タイトル Webコーパスの概念と種類, 利用価値 語史研究の情報源としてのWebコーパス Title The Concept, Types and Utility of Web Corpora: Web Corpora as


結語と展望―世界史の全体構図からみた「太平洋戦争」の歴史的意味とその教訓

pp DC 2,



untitled

7_matsumoto.indd

ÿþ

1 USNET No.13

SP100 取扱説明書

IPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp

80

110527BR機能カ?イト?_110527BR機能カ?イト?

Ł\1.pdf

1) , 215, 1441, , 132, 1237, % College Analysis 2-4) 2

49148

全建総連 論文集●/P38~69 作本博昭(優秀賞)

ñ{ï 01-65

indd

新善-1208

8_p01.indd

untitled

2

1_p01.indd

40_No43.indd

2007.3„”76“ƒ


201_P1_P24(2)


sayo pdf

月信11-12pdf用.indd

広報ちくしの_ indd


katagami No.65

P01-14.indd

レッツ中央205号.indd

えふ・サポート-113号-162.indd




d


レッツ中央210号.indd


レッツ中央212号.indd

0405宅建表01.indd

広報ちくしの_ indd


:. * ** *** **** Little Lord Fauntleroy Little Lord Fauntleroy Frances Eliza Hodgson Burnett, - The Differences between the Initial Edition and First

x i 2 x x i i 1 i xi+ 1xi+ 2x i+ 3 健康児に本剤を接種し ( 窓幅 3 n-gram 長の上限 3 の場合 ) 文字 ( 種 )1-gram: -3/ 児 (K) -2/ に (H) -1/ 本 (K) 1/ 剤 (K) 2/ を (H) 3/ 接 (K) 文字 (

Vol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information

表1-表4_05

, PDD ASD p.,.,..,..,.,..,.,..,.,.,.,, 146

36 Theoretical and Applied Linguistics at Kobe Shoin No. 20, 2017 : Key Words: syntactic compound verbs, lexical compound verbs, aspectual compound ve

56

PeerPool IP NAT IP UPnP 2) Bonjour 3) PeerPool CPU 4) 2 UPnP Bonjour PeerPool CPU PeerPool PeerPool PPv2 PPv2 2. PeerPool 2.1 PeerPool PeerPool PoolGW

<4D F736F F D EC959F90B781758BA38B5A82A982E982BD82CC897282DD82C982A882AF82E B837D EFC CC93C192A D34392E646F63>

indd

Microsoft Word - toyoshima-deim2011.doc


ISSN ISBN C3033 The Institute for Economic Studies Seijo University , Seijo, Setagaya Tokyo , Japan

スポーツ選手における日常的トレーニングが味覚に及ぼす影響について

06_仲野恵美.indd


320 Nippon Shokuhin Kagaku Kogaku Kaishi Vol. /., No.1, -,* -,/ (,**1) 8 * ** *** * ** *** E#ect of Superheated Steam Treatment on the Preservation an

Financial Reporting Standard 17 FRS17 FAS87 87 Financial Accounting Standard 87 FAS87 International Accounting Standard Board IASB 19 Internat




直売所

いしずえ134.indd


NINJAL Project Review Vol.3 No.3

Aries,

IPSJ SIG Technical Report Vol.2016-CE-137 No /12/ e β /α α β β / α A judgment method of difficulty of task for a learner using simple

SNS GIS Abstract The Tourism-based Country Promotion Basic Act was enacted in Japan over a decade ago. Tourism is expected to be the primary contribut


1

ID 3) 9 4) 5) ID 2 ID 2 ID 2 Bluetooth ID 2 SRCid1 DSTid2 2 id1 id2 ID SRC DST SRC 2 2 ID 2 2 QR 6) 8) 6) QR QR QR QR

DEIM Forum 2009 B4-6, Str

CJL NEWS VOL JANUARY contents

”Y‰Æ”ЛïŸ_‘W40−ª3/ ’¼„´

22.Q06.Q

63 Author s Address: A Study on the Activities and Characteristics of Johnny s fans in china WEI Ran, LU Yijing Foreign Lang

2 2.1 NPCMJ ( (Santorini, 2010) (NPCMJ, 2016) (1) (, 2016) (1) (2) (1) ( (IP-MAT (CONJ ) (PP (NP (D ) (N )) (P )) (NP-SBJ *

Oda

jpaper : 2017/4/17(17:52),,.,,,.,.,.,, Improvement in Domain Specific Word Segmentation by Symbol Grounding suzushi tomori, hirotaka kameko, takashi n

07九州工業大学.indd

Transcription:

Journal ofchinese Language and Computing, l3 (2) 12l-158

Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, Xuefeng Zhu, Bin Swen, Baobao Chang bu4 de2dao4 duol falzhan3 gan3shan4 guo2jial jingljia jiu4 ke3neng2 min2zu2 ren2min2 ru2guo3 shenglhuo2 shui3ping2 ti2gaol tuan?jie2 ye3 yi lge4 zan2men5 zhe4me5 zhonglguo2

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, Xuefeng Zhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

ShiwenYu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZht, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Specifrcation for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZhu,Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, Xuefeng Zhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, Xuefeng Zhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

ShiwenYu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang (w pos:"c" pinyin="he2''>f,[</w> <w pos:"wj'). </w>

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang

Specification for Corpus Processing at Peking University

Shiwen Yu, Huiming Duan, XuefengZhu, Bin Swen, Baobao Chang Specification for Corpus Processing at Peking University: Word Segmentationo POS Tagging and Phonetic Notation Shiwen Yu, Huiming Duan, Xuefeng Zht,Bin Swen, Baobao Chang Institute of Computati onal Li nguistics, Peking University, Beijing, 100871, China yusw@pku. edu. cn ; duenhm@pku. edu. cn; bswen@pku. edu. cn; chbb@pku. edu. cn AbStraCt: The Institute of Computational Linguistics, Peking University made a specification for the word segmentation and POS tagging of its People's Daily corpus (over 26 million Chinese characters) fhereinafter: Specification 2001, which was published in the Journal of Chinese Information Processing (lssue 5 & Issued 6, 2002), entitled The Basb Prrcessing of Conternporary Chfuese Corpw ct Peking ljnivercity - Specifuatbnl. In additbn arnther specificatbn was nude for building the phonaicalty tmrntaled cotpus (l million Chinese charar:ters). Based on these two specftcatbns, we hercby prcset the latest Specifuatianfor Corpu Prcrcessing U Peking IJniversity: Word Segmentation, POS Tagging and Phonetic Notation fhereinafter: Specification 20031. With the newly added ones, the togset now includes more than 100 tags. Following Specification 2003, the Institute of Computational Linguistics wiii go on with more corpora of high quality and in-depth processing. Keyword: Contemporary Chinese; Corpus; Word Segmentation; POS Tagging; P hone tic N otqtion ; Sp e cift cati on