[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

Similar documents
& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

2006 [3] Scratch Squeak PEN [4] PenFlowchart 2 3 PenFlowchart 4 PenFlowchart PEN xdncl PEN [5] PEN xdncl DNCL 1 1 [6] 1 PEN Fig. 1 The PEN

IPSJ SIG Technical Report Secret Tap Secret Tap Secret Flick 1 An Examination of Icon-based User Authentication Method Using Flick Input for

GPGPU

Studies of Foot Form for Footwear Design (Part 9) : Characteristics of the Foot Form of Young and Elder Women Based on their Sizes of Ball Joint Girth

DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

1: A/B/C/D Fig. 1 Modeling Based on Difference in Agitation Method artisoc[7] A D 2017 Information Processing

Fig. 1. Example of characters superimposed on delivery slip.

Vol.53 No (Mar. 2012) 1, 1,a) 1, 2 1 1, , Musical Interaction System Based on Stage Metaphor Seiko Myojin 1, 1,a

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

Vol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information

28 Horizontal angle correction using straight line detection in an equirectangular image

IPSJ SIG Technical Report Vol.2012-IS-119 No /3/ Web A Multi-story e-picture Book with the Degree-of-interest Extraction Function

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe

& Vol.2 No (Mar. 2012) 1,a) , Bluetooth A Health Management Service by Cell Phones and Its Us

1_26.dvi

JOURNAL OF THE JAPANESE ASSOCIATION FOR PETROLEUM TECHNOLOGY VOL. 66, NO. 6 (Nov., 2001) (Received August 10, 2001; accepted November 9, 2001) Alterna

Input image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L

17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System

4.1 % 7.5 %

Microsoft Word - toyoshima-deim2011.doc

2). 3) 4) 1.2 NICTNICT DCRA Dihedral Corner Reflector micro-arraysdcra DCRA DCRA DCRA 3D DCRA PC USB PC PC ON / OFF Velleman K8055 K8055 K8055

3 1 Table 1 1 Feature classification of frames included in a comic magazine Type A Type B Type C Others 81.5% 10.3% 5.0% 3.2% Fig. 1 A co

Fig. 3 3 Types considered when detecting pattern violations 9)12) 8)9) 2 5 methodx close C Java C Java 3 Java 1 JDT Core 7) ) S P S

( ) [1] [4] ( ) 2. [5] [6] Piano Tutor[7] [1], [2], [8], [9] Radiobaton[10] Two Finger Piano[11] Coloring-in Piano[12] ism[13] MIDI MIDI 1 Fig. 1 Syst

計量国語学 アーカイブ ID KK 種別 特集 招待論文 A タイトル Webコーパスの概念と種類, 利用価値 語史研究の情報源としてのWebコーパス Title The Concept, Types and Utility of Web Corpora: Web Corpora as

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s

28 TCG SURF Card recognition using SURF in TCG play video

B HNS 7)8) HNS ( ( ) 7)8) (SOA) HNS HNS 4) HNS ( ) ( ) 1 TV power, channel, volume power true( ON) false( OFF) boolean channel volume int

IPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-

IPSJ SIG Technical Report Vol.2011-DBS-153 No /11/3 Wikipedia Wikipedia Wikipedia Extracting Difference Information from Multilingual Wiki

大学における原価計算教育の現状と課題

16_.....E...._.I.v2006

Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels).

0801391,繊維学会ファイバ12月号/報文-01-西川

第62巻 第1号 平成24年4月/石こうを用いた木材ペレット

ID 3) 9 4) 5) ID 2 ID 2 ID 2 Bluetooth ID 2 SRCid1 DSTid2 2 id1 id2 ID SRC DST SRC 2 2 ID 2 2 QR 6) 8) 6) QR QR QR QR

IPSJ SIG Technical Report Pitman-Yor 1 1 Pitman-Yor n-gram A proposal of the melody generation method using hierarchical pitman-yor language model Aki

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came

Visual Evaluation of Polka-dot Patterns Yoojin LEE and Nobuko NARUSE * Granduate School of Bunka Women's University, and * Faculty of Fashion Science,

Tf dvi

, IT.,.,..,.. i

4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

HIS-CCBASEver2

Appropriate Disaster Preparedness Education in Classrooms According to Students Grade, from Kindergarten through High School Contrivance of an Educati

Vol. 45 No Web ) 3) ),5) 1 Fig. 1 The Official Gazette. WTO A

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2017-CG-166 No /3/ HUNTEXHUNTER1 NARUTO44 Dr.SLUMP1,,, Jito Hiroki Satoru MORITA The

IT,, i

FUJII, M. and KOSAKA, M. 2. J J [7] Fig. 1 J Fig. 2: Motivation and Skill improvement Model of J Orchestra Fig. 1: Motivating factors for a

IPSJ SIG Technical Report Vol.2012-HCI-149 No /7/20 1 1,2 1 (HMD: Head Mounted Display) HMD HMD,,,, An Information Presentation Method for Weara

1 Table 1: Identification by color of voxel Voxel Mode of expression Nothing Other 1 Orange 2 Blue 3 Yellow 4 SSL Humanoid SSL-Vision 3 3 [, 21] 8 325

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2015-GI-34 No /7/ % Selections of Discarding Mahjong Piece Using Neural Network Matsui

1 7.35% 74.0% linefeed point c 200 Information Processing Society of Japan

7,, i

The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). The material has been made available on the website

On the Wireless Beam of Short Electric Waves. (VII) (A New Electric Wave Projector.) By S. UDA, Member (Tohoku Imperial University.) Abstract. A new e

1 DHT Fig. 1 Example of DHT 2 Successor Fig. 2 Example of Successor 2.1 Distributed Hash Table key key value O(1) DHT DHT 1 DHT 1 ID key ID IP value D

1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,

IPSJ SIG Technical Report Vol.2014-CG-155 No /6/28 1,a) 1,2,3 1 3,4 CG An Interpolation Method of Different Flow Fields using Polar Inter

IPSJ SIG Technical Report Vol.2013-GN-86 No.35 Vol.2013-CDS-6 No /1/17 1,a) 2,b) (1) (2) (3) Development of Mobile Multilingual Medical

soturon.dvi

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.

A Nutritional Study of Anemia in Pregnancy Hematologic Characteristics in Pregnancy (Part 1) Keizo Shiraki, Fumiko Hisaoka Department of Nutrition, Sc

3D UbiCode (Ubiquitous+Code) RFID ResBe (Remote entertainment space Behavior evaluation) 2 UbiCode Fig. 2 UbiCode 2. UbiCode 2. 1 UbiCode UbiCode 2. 2

DEIM Forum 2010 A Web Abstract Classification Method for Revie

,,,,., C Java,,.,,.,., ,,.,, i

..,,,, , ( ) 3.,., 3.,., 500, 233.,, 3,,.,, i

<30315F836D815B83675F95D08BCB8E812E696E6464>

23 Fig. 2: hwmodulev2 3. Reconfigurable HPC 3.1 hw/sw hw/sw hw/sw FPGA PC FPGA PC FPGA HPC FPGA FPGA hw/sw hw/sw hw- Module FPGA hwmodule hw/sw FPGA h

013858,繊維学会誌ファイバー1月/報文-02-古金谷

2 The Bulletin of Meiji University of Integrative Medicine 3, Yamashita 10 11


0801297,繊維学会ファイバ11月号/報文-01-青山

HP cafe HP of A A B of C C Map on N th Floor coupon A cafe coupon B Poster A Poster A Poster B Poster B Case 1 Show HP of each company on a user scree

3_39.dvi

IPSJ SIG Technical Report Vol.2015-CVIM-196 No /3/6 1,a) 1,b) 1,c) U,,,, The Camera Position Alignment on a Gimbal Head for Fixed Viewpoint Swi

1 p.27 Fig. 1 Example of a koto score. [1] 1 1 [1] A 2. Rogers [4] Zhang [5] [6] [7] Löchtefeld [8] Xiao [

2007/8 Vol. J90 D No. 8 Stauffer [7] 2 2 I 1 I 2 2 (I 1(x),I 2(x)) 2 [13] I 2 = CI 1 (C >0) (I 1,I 2) (I 1,I 2) Field Monitoring Server

IT i

EQUIVALENT TRANSFORMATION TECHNIQUE FOR ISLANDING DETECTION METHODS OF SYNCHRONOUS GENERATOR -REACTIVE POWER PERTURBATION METHODS USING AVR OR SVC- Ju

(a) Picking up of six components (b) Picking up of three simultaneously. components simultaneously. Fig. 2 An example of the simultaneous pickup. 6 /

,4) 1 P% P%P=2.5 5%!%! (1) = (2) l l Figure 1 A compilation flow of the proposing sampling based architecture simulation

Vol.11-HCI-15 No. 11//1 Xangle 5 Xangle 7. 5 Ubi-WA Finger-Mount 9 Digitrack 11 1 Fig. 1 Pointing operations with our method Xangle Xa

2. CABAC CABAC CABAC 1 1 CABAC Figure 1 Overview of CABAC 2 DCT 2 0/ /1 CABAC [3] 3. 2 値化部 コンテキスト計算部 2 値算術符号化部 CABAC CABAC

01_梅村佳代_紀要_2007最終

知能と情報, Vol.30, No.5, pp

IIC Proposal of Range Extension Control System by Drive and Regeneration Distribution Based on Efficiency Characteristic of Motors for Electric

johnny-paper2nd.dvi


pp DC 2,

IPSJ SIG Technical Report An Evaluation Method for the Degree of Strain of an Action Scene Mao Kuroda, 1 Takeshi Takai 1 and Takashi Matsuyama 1

日本感性工学会論文誌

,,.,.,,.,.,.,.,,.,..,,,, i

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St

Study on Application of the cos a Method to Neutron Stress Measurement Toshihiko SASAKI*3 and Yukio HIROSE Department of Materials Science and Enginee

Fig. 1 Schematic construction of a PWS vehicle Fig. 2 Main power circuit of an inverter system for two motors drive

m City Lights 1931 DIE 3 GROSCHEN-OPER G.W Blackmail 1929 DVD M M 1931Vampyr 1932

Transcription:

1,a) 1,b) 1,c) 2012 11 8 2012 12 18, 2013 1 27 WEB Ruby Removal Filters Using Genetic Programming for Early-modern Japanese Printed Books Taeka Awazu 1,a) Masami Takata 1,b) Kazuki Joe 1,c) Received: November 8, 2012, Revised: December 18, 2012, Accepted: January 27, 2013 Abstract: In National Diet Library, books which are possessed in library as the digital library from meiji era are open to the public on WEB. Since these are shown as image data and cannot search using document contents, an automatic text conversion is needed. However, ruby is a disturbing text conversion. Since existing techniques of linearly removing ruby had developed for books of the current standard, the techniques are inapplicable to early-modern Japanese books, which have a specific characteristic different from characters of current books. In this paper, we propose a method to remove ruby from early-modern Japanese books using Genetic Programming. Keywords: ruby remove, early-modern printed books, genetic programming, character segmentation, transforming text, histogram, recognition of characters 1. 57 [1] 1 Nara Women s University, Nara 630 8506, Japan a) awazu-taeka0802@ics.nara-wu.ac.jp b) takata@ics.nara-wu.ac.jp c) joe@ics.nara-wu.ac.jp WEB c 2013 Information Processing Society of Japan 53

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] 2 3 4 5 2. 1 Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing type. 3 Fig. 3 The ruby in the present books. DTP JIS 3 1/2 3 1 3 1 2 3 c 2013 Information Processing Society of Japan 54

Fig. 4 4 Ruby of the early-modern printed books. Fig. 6 6 Projection histogram by black pixels. 5 Fig. 5 An example of distorted lines. Fig. 7 7 Black pixel projection histogram of connected characters. 4 5 3. 3.1 6 7 8 1 Fig. 8 Left: A character divided by small rectangles, Right: A character unified to a rectangle. 1 3.2 8 1 8 1 c 2013 Information Processing Society of Japan 55

9 3 Fig. 9 A character divided by three rectangles. 10 Fig. 10 A dividing rectangle includes the ruby. 9 10 4. 4.1 11 (1) (2) (1) (a) (b) (1) (c) (d) (e) (f) (g) (h) (2c) 11 Fig. 11 Flow of the proposed method. (3) (1) (2) (1) x y y =( ) sin cos 1 9 π (1) x x =0 12 x 1 x x =0 1 13 x =0 (2a) N (2b) c 2013 Information Processing Society of Japan 56

Fig. 14 14 Range of the original image for fitness calculation. K 1 a X a y =(1/2) Y a X a Y a S a S a o a(x,y) S a 12 x Fig. 12 Variable x as termination element for GP. c a(x,y) S a t a(x,y) x y S a B a (o a(x,y) ) E a (o a(x,y),c a(x,y),t a(x,y) ) (1) (2) 1 (o a(x,y) =0) B a (o a(x,y) )= 0 (o a(x,y) 0) (1) E a (o a(x,y),c a(x,y),t a(x,y) ) 1 ( (o a(x,y) =0) (c a(x,y) = t a(x,y) )) = 0 ( (o a(x,y) =0) (c a(x,y) = t a(x,y) )) (2) i f i f i (3) f i = 1 K K X a Y a a=1 x=0 y=0 E a (o a(x,y),c a(x,y),t a(x,y) ) B a (o a(x,y) ) (3) 13 x Fig. 13 Variable x in a case of ruby in front of a parent character. 14 1 (2c) 1 (2d) i p i (4) p i = f i N k=1 f k (4) 1 c 2013 Information Processing Society of Japan 57

(5) ( 1+ 1 ) 4 15 Fig. 15 Isolated points. (2e) (2f) (2b) (2g) (3) 8 10 15 4.2 1 1 (5) x =0 1 1 5. 5.1 PGM 3 1883 1897 1898 1912 1912 1925 3 10 50 100 200 300 400 100 100 1 10 10 100 1,000 5,000 1,000 3,000 3,000 200 0.8 0.2 300 c 2013 Information Processing Society of Japan 58

Table 1 1 10 The number of appearances of curves and straight lines, average and the maximum values of fitness in 10 times. 7 0.9878 0.9881 3 0.9870 0.9874 8 0.9896 0.9893 2 0.9869 0.9876 9 0.9875 0.9887 1 0.9874 0.9874 7 0.9752 0.9797 3 0.9757 0.9785 3 0.9822 0.9845 7 0.9836 0.9845 10 0.9751 0.9753 - - - 7 0.9843 0.9849 3 0.9838 0.9846 9 0.9857 0.9857 1 0.9851 0.9851 9 0.9848 0.9842 1 0.9830 0.9830 5.2 Intel Xeon Processor 8GB 3 10 1 2 91.3% sin cos sin cos 2 4.1 (2b) 2 99% 2 2 % Table 2 The coincidence rate by publisher and era. 99.67 99.64 99.32 99.33 99.60 99.54 99.67 99.77 99.75 16 (6) Fig. 16 The curve denoted by (6) and the result. (6) (7) 16 (6) 17 (7) x x =0 y y =0 y = ((8/3) + (( (cos((2 π x/(((4 (cos ((2 π x/((sin((2 π x/(((5 + 3)/2)) π))/2)) π/2))/1))/2)) π/2))/(8/3))) (cos((2 π x /((( +4)/2)) π/2))/(7/5)))) (6) c 2013 Information Processing Society of Japan 59

17 Fig. 17 (7) The curve denoted by (7) and result. 3 % Table 3 Removal success rate of the existing and the proposal method. 82.3 79.0 99.0 92.7 81.7 99.3 90.7 62.7 96.7 84.3 76.7 97.3 86.0 82.0 99.3 95.7 88.3 99.0 96.3 93.3 99.0 93.3 91.7 99.0 94.3 91.0 98.7 18 Fig. 18 Example of ruby removal failure. y =(( cos((2 π x/(((x (cos((2 π x /(((1 (x (((8 + 7)/((5 (( (6 + ( )) )/( )) 8)))/2)) π/2)) 8))/2)) π/2))) (7) 300 2 1 10 200 10 2 3 3 2 18 19 19 19 Fig. 19 Digital data of Digital Library from the Meiji Era. 1 20 (a) (b) (6) (c) 21 (a) (b) (7) (c) 20 21 c 2013 Information Processing Society of Japan 60

99% 300 2 a b c 20 (6) Fig. 20 The original image and the result by applying (6) for ruby removal. C 21500237 a b c 21 (7) Fig. 21 The original image and the result by applying (7) for ruby removal. 6. 100 [1] (online), http://www.ndl.go.jp/ 2012-11-8. [2] C 21500237 (2009 2011). [3] Ishikawa, C., Ashida, N., Enomoto, Y., Takata, M., Kimesawa, T. and Joe, K.: Recognition of Multi-Fonts Character in Early-Modern Printed Books, Proc. 2009 International Conference on Parallel and Distributed Processing Technologies and Applications (PDPTA 2009 ), Vol.II, pp.728 734 (2009). [4] Fukuo, M., Enomoto, Y., Yoshii, N., Takata, M., Kimesawa, T. and Joe, K.: Evaluation of the SVM based Multi-Fonts Kanji Character Recognition Method for Early-Modern Japanese Printed Books, Proc. 2011 International Conference on Parallel and Distributed Processing Technologies and Applications (PDPTA 2011 ), Vol.II, pp.727 732 (2011). [5] Vol.2012- MPS-90, No.26 (2012). [6] OCR SS Vol.100, No.678, pp.17 22 (2001). [7] D Vol.J67-D, No.10, pp.1194 1201 (1984). [8] D Vol.J68-D, No.12, pp.2123 2131 (1985). [9] PRU Vol.94, No.242, pp.49 56 (1994). [10] (2001). c 2013 Information Processing Society of Japan 61

2012 2013 2013 2004 2004 JST 2006 2007 2013 DEC ATR DEC 1993 1996 1997 1998 1999 c 2013 Information Processing Society of Japan 62