IPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26


Correcting read errors on DNA sequences determined by Pyrosequencing

Youhei Namiki 1 and Yutaka Akiyama 1

Pyrosequencing, one of the DNA sequencing technologies, determines the order of nucleotides in a large amount of DNA at a time. However, when long DNA sequences are determined, the resulting sequences tend to contain particular kinds of read errors. In this study, we developed a method for correcting read errors in DNA sequences determined by Pyrosequencing. The method repeatedly runs a simple pyrosequencing simulator and chooses the corrected sequence whose simulated pyrogram is most similar to the real experimental record.

1 Graduate School of Information Science and Engineering, Tokyo Institute of Technology

© 2009 Information Processing Society of Japan

1. [...]

[...] 454 Life Sciences [...] GS20 [...]

2. [...]

2.1 Pyrosequencing method

[...] (sequencing-by-synthesis) [...] (Fig. 1) [...] 1990 [...] Mostafa Ronaghi [...] 1)2).

( 1 ) [...]
( 2 ) [...] A, T, G, C [...]

[...] 2005 [...] 454 Life Sciences [...] 100 [...]

Fig. 1 Pyrosequencing method.

Fig. 2 Pyrogram.

[...] ATP [...]
( 3 ) [...] (Fig. 2)
( 4 ) [...] (2) [...] (3) [...]

2.2 [...] 3)

( 1 ) [...]
( 2 ) [...]

2.2.1 Incomplete-Hybridization

[...] (delay) [...]

2.2.2 Miss-Washing

[...] (gain) [...]

3. PyroSequencing Simulator

[...] 2007 [...] 4) [...]
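As a rough illustration of what a pyrogram records, the following Python sketch computes an idealized, error-free pyrogram for a given sequence. It is not the simulator of reference 4): the function name `ideal_pyrogram`, the cyclic dispensing order A, T, G, C, and the assumption that each signal equals the number of bases incorporated in that flow are all illustrative assumptions, and the delay and gain effects of section 2.2 are not modeled.

```python
def ideal_pyrogram(seq, flow_order="ATGC"):
    """Return an idealized pyrogram (one signal per nucleotide flow).

    Hypothetical sketch: nucleotides are dispensed cyclically in
    `flow_order`, and the signal of a flow is assumed to equal the
    length of the homopolymer run extended in that flow.
    `seq` is treated as the strand being synthesized.
    """
    unknown = set(seq) - set(flow_order)
    if unknown:
        # Without this check a base outside the flow order would
        # never be incorporated and the loop below would not end.
        raise ValueError(f"bases not in flow order: {sorted(unknown)}")

    signals = []
    pos = 0   # next base of seq still to be incorporated
    flow = 0  # index of the current nucleotide flow
    while pos < len(seq):
        base = flow_order[flow % len(flow_order)]
        run = 0
        # Incorporate the whole homopolymer run matching this flow.
        while pos < len(seq) and seq[pos] == base:
            run += 1
            pos += 1
        signals.append(float(run))
        flow += 1
    return signals
```

Under these assumptions, a homopolymer such as "AA" appears as a single peak of height 2, and a flow whose nucleotide does not match the next base contributes a zero signal.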

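4. [...]

4.1 [...]

( 1 ) [...] s_1 [...]
( 2 ) [...] s_1 [...] p_0 [...]

Here s_0 is the DNA sequence being sequenced, s_1 is the sequence reported by the sequencer, p_0 is the measured pyrogram, and \hat{s}_0 is the corrected sequence:

$$ s_0 \xrightarrow{\text{sequencing}} (s_1, p_0) \tag{1} $$

$$ (s_1, p_0) \xrightarrow{\text{error correction}} \hat{s}_0 \tag{2} $$

4.2 [...] (Fig. 3)

( 1 ) [...] s_1 [...] \hat{s}_0 [...]
( 2 ) [...]
( 3 ) [...]
( 4 ) [...] (1) [...] (3) [...] \hat{s}_0 [...]

Fig. 3 Error correction.

4.3 [...]

( 1 ) Method 1 (AS: All-neighbor Search method)
( 2 ) Method 2 (SS: Sequential Search method)
( 3 ) Method 3 (TSS: Threshold Sequential Search method)

[...] \hat{s}_0 [...] (Fig. 4)

Fig. 4 Candidate sequences enumeration method.

4.4 Method 1 (AS: All-neighbor Search method)

( 1 ) [...]
( 2 ) [...]
( 3 ) [...]
( 4 ) [...]
( 5 ) [...]
( 6 ) [...] (2) [...] (5) [...]

4.4.1 [...] S

[...] s_1 [...] S [...]

$$ S = \{ s_1 \} \tag{3} $$

4.4.2 [...] C

[...] s (∈ S) [...] (edit) [...] C [...]

( 1 ) [...] s [...] 1 [...] (insertion)
( 2 ) [...] s [...] 1 [...] (mutation)
( 3 ) [...] s [...] 1 [...] (deletion)

$$ C = S + \mathrm{edit}(s) \tag{4} $$

For a sequence s of length L, the edits yield 4L insertion, 3L mutation, and L deletion candidates, 8L in total.

4.4.3 [...]

[...] c (∈ C) [...] p_c [...]

4.4.4 [...]

Let x_i be the i-th value of p_0 and x_{c,i} the i-th value of p_c, let M_0 and M_c be the lengths of p_0 and p_c, and let M = max{M_0, M_c}:

$$ p_0 = (x_1, x_2, \ldots, x_{M_0}) \tag{5} $$

$$ p_c = (x_{c,1}, x_{c,2}, \ldots, x_{c,M_c}) \tag{6} $$

$$ d_c = \sum_{i=1}^{M} (x_i - x_{c,i})^2 \tag{7} $$

where x_i = 0 for i > M_0 and x_{c,i} = 0 for i > M_c.

[...] d_c [...] p_0 [...] c [...]

4.4.5 [...]

[...] 4.4.4 [...] (d_c) [...] c [...] N [...] S [...] 4.4.2 [...] S [...] 4.4.2 [...] 4.4.5 [...] S [...] s_0 [...] s_1 [...] 4.4.2 [...] 4.4.5 [...]

4.5 Method 2 (SS: Sequential Search method)

( 1 ) [...]
( 2 ) [...]
( 3 ) [...]
( 4 ) [...]
( 5 ) [...]
( 6 ) [...] (2) [...] (5) [...]

4.5.1 [...] S

[...] s_1 [...] S [...] i = 1 [...]

$$ S = \{ s_1 \} \tag{8} $$

4.5.2 [...] C

[...] s (∈ S) [...] (edit) [...] C [...]

( 1 ) [...] s [...] i [...] 1 [...] (insertion)
( 2 ) [...] s [...] i [...] (mutation)
( 3 ) [...] s [...] i [...] (deletion)

$$ C = S + \mathrm{edit}(s, i) \tag{9} $$

For each s, the edits at position i yield 4 insertion, 3 mutation, and 1 deletion candidates, 8 in total.

4.5.3 [...]

4.5.4 [...]

4.5.5 [...]

[...] 4.5.4 [...] (d_c) [...] c [...] N [...] S [...] i [...] i [...] 1 [...] 4.5.2 [...] 4.5.5 [...] i [...] s (∈ S) [...] S [...]

4.6 Method 3 (TSS: Threshold Sequential Search method)

( 1 ) [...]
( 2 ) [...]
( 3 ) [...] i [...]
( 4 ) [...]
( 5 ) [...]
( 6 ) [...]
( 7 ) [...]
( 8 ) [...] (2) [...] (7) [...]

4.6.1 [...] S

[...] s_1 [...] S [...] θ [...] i = 1 [...]

$$ S = \{ s_1 \} \tag{10} $$

4.6.2 [...] i [...] C [...]

For each s (∈ S), [...] p_s = (x_{s,1}, x_{s,2}, \ldots, x_{s,M_s}) and M = max{M_0, M_s}. The difference d_{s,i} between p_0 and p_s at position i (Fig. 5, Fig. 6) is

$$ d_{s,i} = (x_i - x_{s,i})^2 \tag{11} $$

where x_i = 0 for i > M_0 and x_{s,i} = 0 for i > M_s.

[...] d_{s,i} [...] θ [...] s [...] C [...] d_{s,i} [...] θ [...] s [...] ( C = [...] ) [...] i [...] 1 [...] 4.6.2 [...]

Fig. 5 Calculation of the difference between two pyrograms.

Fig. 6 The difference between two pyrograms.

4.6.3 [...] i [...]

[...] c (∈ C) [...] i [...]

4.6.4 [...] C

[...] C = S [...] c (∈ C) [...] i [...] b_i [...] insertion, deletion [...] C [...] i [...] b_i [...]

$$ C = S + \mathrm{edit}(c, i, b_i) \tag{12} $$

For each c, the edits yield 1 insertion and 1 deletion candidate, 2 in total.

4.6.5 [...]

4.6.6 [...]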
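The distance of equation (7) and the single-base candidate enumeration of 4.4.2 and 4.5.2 can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the authors' Perl implementation: all function names are hypothetical, insertions are assumed to be placed before position i, positions are 0-based, and the pyrogram simulator is passed in as a `simulate` callable because its details are not given here.

```python
from itertools import zip_longest

BASES = "ATGC"


def pyrogram_distance(p0, pc):
    """Equation (7): sum of squared differences, zero-padding the shorter pyrogram."""
    return sum((x - y) ** 2 for x, y in zip_longest(p0, pc, fillvalue=0.0))


def edits_at(s, i):
    """Single-base edits of s at 0-based position i (assumed convention).

    Gives 4 insertions (before s[i]), 3 mutations and 1 deletion,
    i.e. the 8 candidates per position counted in 4.5.2.
    """
    insertions = [s[:i] + b + s[i:] for b in BASES]
    mutations = [s[:i] + b + s[i + 1:] for b in BASES if b != s[i]]
    deletions = [s[:i] + s[i + 1:]]
    return insertions + mutations + deletions


def all_neighbors(s):
    """All single-base edits of s: 8L candidates for length L, as in 4.4.2.

    Duplicates are kept so the count matches 8L; appending a base
    after the last position is not enumerated in this sketch.
    """
    candidates = []
    for i in range(len(s)):
        candidates.extend(edits_at(s, i))
    return candidates


def best_candidates(seqs, p0, simulate, n):
    """One round of an all-neighbor style search (hypothetical helper).

    Builds C from S and its single-base edits, scores each candidate by
    the distance between p0 and its simulated pyrogram, and returns the
    n closest sequences (ties broken alphabetically for determinism).
    """
    candidates = set(seqs)
    for s in seqs:
        candidates.update(all_neighbors(s))
    return sorted(
        candidates,
        key=lambda c: (pyrogram_distance(p0, simulate(c)), c),
    )[:n]
```

Calling `best_candidates` repeatedly, feeding its output back in as the next S, gives a loop of the kind outlined for method 1; restricting the edits to `edits_at(s, i)` for increasing i corresponds to the sequential variant.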

4.6.7 [...]

[...] 4.6.6 [...] (d_c) [...] c [...] N [...] S [...] i [...] i [...] 1 [...] 4.6.2 [...]

5. [...]

The three methods (AS, SS, TSS) [...] Perl [...]

5.1 [...]

[...] (GSIC) [...] TSUBAME [...]

5.2 [...]

[...] UCSC Genome Bioinformatics [...] Web [...] 1 [...] 50 (50b) [...] 75 (75b) [...] 500 [...] N (A, T, G, C [...]) [...]

5.3 [...]

[...] s_i [...] s_i [...] p_i [...] 2007 [...]

[...] 50b: 500 [...] 352 [...]; 75b: 500 [...] 490 [...]; 50b, 75b [...] 200 [...] s_i [...] p_i [...] D [...]

$$ D = \{\, (s_i, p_i) \mid [\ldots]\; s_i \;[\ldots]\; s_i \;[\ldots] \,\} \tag{13} $$

5.4 [...]

Table 1 The result of method 1 (AS).

Length  match             weak match
50b     169/200 (84.5%)   200/200 (100%)
75b     161/200 (80.5%)   199/200 (99.5%)

Table 2 The result of method 2 (SS).

Length  match             weak match
50b     168/200 (84%)     196/200 (98%)
75b     161/200 (81.5%)   185/200 (92.5%)

Table 3 The result of method 3 (TSS).

Length  θ      match             weak match
50b     0.2    183/200 (91.5%)   183/200 (91.5%)
50b     0.1    186/200 (93%)     186/200 (93%)
50b     0.05   186/200 (93%)     186/200 (93%)
50b     0.01   183/200 (91.5%)   188/200 (94%)
50b     0.005  179/200 (89.5%)   192/200 (96%)
75b     0.2    141/200 (70.5%)   141/200 (70.5%)
75b     0.1    176/200 (88%)     176/200 (88%)
75b     0.05   179/200 (89.5%)   180/200 (90%)
75b     0.01   170/200 (85%)     180/200 (90%)
75b     0.005  171/200 (85.5%)   182/200 (91%)

[...] D [...] s_i [...] p_i [...] s_i [...] p_i [...] TSUBAME [...] 200 [...] 1 CPU [...]

5.5 [...]

[...] methods 1 to 3 [...] (200) [...] Fig. 7, Fig. 8 [...] Labels have the form M{method number}_T{θ for method 3} (M1 = method 1; M3_T0.01 = method 3, θ = 0.01). [...] (match) [...] (weak match) [...]

Fig. 7 The numbers of matches and weak matches (50b).

Fig. 8 The numbers of matches and weak matches (75b).

[...] 50b [...] 75b [...] method 3 [...] θ [...] 0.05 [...] 90% [...] 0.05 [...] 0.1 [...] 50b [...] 75b [...] method 3 [...] θ = 0.05 [...]

5.6 [...]

[...] Tables 4 to 6 [...] Fig. 9, Fig. 10 [...]

Table 4 The runtime (sec) of method 1 (AS).

Length  Min.   1st Qu.  Median  Mean    3rd Qu.  Max.
50b     309    1,852    2,832   3,523   4,411    12,830
75b     2,584  8,226    11,970  12,220  15,580   26,800

Table 5 The runtime (sec) of method 2 (SS).

Length  Min.  1st Qu.  Median  Mean   3rd Qu.  Max.
50b     176   309      439     502    626      1,469
75b     439   980      1,329   1,618  1,950    9,067

Table 6 The runtime (sec) of method 3 (TSS).

Length  θ      Min.  1st Qu.  Median  Mean  3rd Qu.  Max.
50b     0.2    0.4   1.5      3.2     5.9   8.8      31
50b     0.1    0.4   1.5      3.1     5.5   7.2      25
50b     0.05   0.6   1.8      4.4     10    11       92
50b     0.01   0.6   3.6      8.7     19    22       209
50b     0.005  0.5   4.4      8.7     13    18       116
75b     0.2    1.9   11       21      24    36       64
75b     0.1    1.8   13       22      27    39       113
75b     0.05   1.7   16       31      43    48       253
75b     0.01   1.8   29       58      75    94       570
75b     0.005  4.0   30       51      61    84       233

Fig. 9 The deviation of the runtime (50b).

Fig. 10 The deviation of the runtime (75b).

6. [...]

[...] method 3 (Threshold Sequential Search method) [...] θ = 0.05 [...] 50b [...] 93% [...] 75b [...] 90% [...] 454 Life Sciences [...]

References

1) M. Ronaghi, M. Uhlen, P. Nyren: A Sequencing Method Based on Real-Time Pyrophosphate, Science 281:363-365 (1998).
2) Mostafa Ronaghi: Pyrosequencing Sheds Light on DNA Sequencing, Genome Research, 11:3-11 (2001).
3) Helmy Eltoukhy, Abbas El Gamal: Modeling and Base-Calling for DNA Sequencing-By-Synthesis, Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, 2:II-II (2006).
4) Yutaka Akiyama: SimPyro: Pyrosequencing simulation software for analyzing random process with millions of reactions. Poster presentation, The 2nd International Workshop on Approaches to Single-Cell Analysis, Tokyo (2007).