ACORPUS based Education and Self learning with Full Text Search Tool capable of Regular Expression Takanori Sato, Donald C. Wood, Masayuki Katahira and Akira Nakamura Abstract This paper introduces a hardwareindependent fulltext search tool for any browser, named SCORPSelfish CORPuswhich can be operated online as well as stand alone with fullysupported Regular Expression. The system provides the service from computers running FreeBSD or any other UNIXcompatible OS. Corpus linguistics can provide effective English learning for students aswell as invaluable English writing assistance for researchers in need of accurate and characteristic expressions for specialized topics or fields of research, by referring them to a collection of texts, including transcriptions of conversations or speeches. The system enables users to create their own corpus according to their own interests. We also elucidated some considerations for conducting SCORPbased academic classestechnical terms like character code or end of line code are not serious problems, but individual users must be aware and respectful ofthe sensitive nature of copyrighted materials. Keywords: English Education, Corpus, Information Processing, Internet, Copyrights 1 1 2003 EAPEnglish for Academic Purposes 010 8543 1 1 1 Contact to EAP 1 MS DOSPC DOS MicroConcord MCONCORD 2 Corpus 3 MCON- CORD 4 Corpus EAP SCORPSCORP Corpus 18 VOL.20 2006
1 URL 5 URL Web Page 6a SCORP selfish corpus URL 5 1 2 2 2 URL CORPUS Charles Dickens 7 Fig. 1 for going on to overflowing with 8 Dickens Fig. 2 to overflowing with Dickens The Battle oflife 387 Fig. 3 collocation regular expression 7 Web SCORP 5 SCORP URL 6a SCORP Package Download 6b Hardware1 OS 8a 2Web Server Software 8b 3CGI 8c 3 1FreeBSD 9a 2Apache 9b 3Perl 9c Mac OS X 3 x86 PC Install Disk Install Guide Web Site 10 Files Guide Web Page 9b SCORP VOL.20 2006 19
FreeBSD Perl Version Perl 5.8.x 2 byte character code 1 Unicode Unicode Perl 5.8.x 2 byte EUCExtended UniCodeSCORP ASCII EUC 3 Apache httpd.conf 1SCORP Corpus Directory2CGI 3 11 Apache Basic Corpus 12 Directory.htaccess Fig. 4 Apache.htpasswdUser ID Password.htpasswd Fig. 5 FreeBSD Console apachectl startfig. 6 Web Page 6b apachectl restart httpd.conf Mac OS X Tag Web Web Server Fig. 7 Windows OS Cygwin, Apache for Windows, Active Perl 13 SCORP DirectoryFile Fig. 8Apache Basic Internet Web Browser 20 VOL.20 2006
Browser Editor Web Page URL.htaccess SCORP CORPUS Directory SCORP CGI Perl SCORP Package Package Source Script 16 Perl Script Fig. 9 1 17 Perl end of line code, EOL code OS 14 FreeBSD LFline feed UNIX OS 1 EUC2 LF 15 SCORP paragraph VOL.20 2006 21
Perl Table 1Table 2Table 1 Table 2 18 2 1 II2 GIO General Instructive Object GIO Corpus Table 3 SBOsSpecific Behavioral Objectives SBOs 1 6 Internet Corpus SCORP Essay Essay Corpus Es- say II SCORP GIO SBOs 1SCORP 2 CORPORA 2 Mini Research Project Project 1 2 Corpus 3SCORP 4 SCORP 19 Corpus Table 4obviously clearly written spoken Corpus CORPORA SCORP 2 2 Corpus 22 VOL.20 2006
Internet Corpus 23 Corpus 2 RDBRelational Database Table 2 SCORP EAP Script Unicode Corpus SCORP Corpus Corpus jargon SCORP GNU GPLGeneral Public License 20 2 English Native SCORP SCORP SCORP VOL.20 2006 23
24 VOL.20 2006