2009
Perl + α. : DNA, mrna,,
DNA
..
DNA A C G T DNA 2 A-T, C-G
DNA NH 2 NH 2 O - O O N P O - O CH 2 O N N O - O P O CH 2 O N O - O O P O NH 2 O - O - N CH 2 O N O OH OH OH
DNA or RNA (U) (A) (G) (C) (T)
(A) (G) AGTC (T) (C) AGTC TCAG
ggagctgcagcccgaccgcggggaggacgccatcgccgcctgcttcctcatcaactgcct ctacgagcagaacttcgtgtgcaagttcgcgcccagggagggcttcatcaactacctcac gagggaagtgtaccgctcctaccgccagctgcggacccagggctttggagggtctgggat ccccaaggcctgggcaggcatagacttgaaggtacaaccccaggaacccctggtgctgaa ggatgtggaaaacacagattggcgcctactgcggggtgacacggatgtcagggtagagag gaaagacccaaaccaggtggaactgtggggactcaaggaaggcacctacctgttccagct gacagtgactagctcagaccacccagaggacacggccaacgtcacagtcactgtgctgtc caccaagcagacagaagactactgcctcgcatccaacaaggtgggtcgctgccggggctc tttcccacgctggtactatgaccccacggagcagatctgcaagagtttcgtttatggagg ctgcttgggcaacaagaacaactaccttcgggaagaagagtgcattctagcctgtcgggg tgtgcaaggcccctccatggaaaggcgccatccagtgtgctctggcacctgtcagcccac ccagttccgctgcagcaatggctgctgcatcgacagtttcctggagtgtgacgacacccc caactgccccgacgcctccgacgaggctgcctgtgaaaaatacacgagtggctttgacga gctccagcgcatccatttccccagtgacaaagggcactgcgtggacctgccagacacagg actctgcaaggagagcatcccgcgctggtactacaaccccttcagcgaacactgcgcccg ctttacctatggtggttgttatggcaacaagaacaactttgaggaagagcagcagtgcct cgagtcttgtcgcggcatctccaagaaggatgtgtttggcctgaggcgggaaatccccat
DNA ( etc.)
.? (%) 70 1 3 0.5 0.5 15 DNA 0.5 RNA 6 2 2
α
DNA
TAA mrna UG UAA
trna mrna ACC ACGAGUACA UGCUCAUGUUGG
UUU Phe (F) UCU Ser (S) UAU Tyr (Y) UGU Cys (C) UUC Phe (F) UCC Ser (S) UAC Tyr (Y) UGC Cys (C) UUA Leu (L) UCA Ser (S) UAA * UGA * UUG Leu (L) UCG Ser (S) UAG * UGG Trp (W) CUU Leu (L) CCU Pro (P) CAU His (H) CGU Arg (R) CUC Leu (L) CCC Pro (P) CAC His (H) CGC Arg (R) CUA Leu (L) CCA Pro (P) CAA Gln (Q) CGA Arg (R) CUG Leu (L) CCG Pro (P) CAG Gln (Q) CGG Arg (R) AUU Ile (I) ACU Thr (T) AAU Asn (N) AGU Ser (S) AUC Ile (I) ACC Thr (T) AAC Asn (N) AGC Ser (S) AUA Ile (I) ACA Thr (T) AAA Lys (K) AGA Arg (R) AUG Met (M) ACG Thr (T) AAG Lys (K) AGG Arg (R) GUU Val (V) GCU Ala (A) GAU Asp (D) GGU Gly (G) GUC Val (V) GCC Ala (A) GAC Asp (D) GGC Gly (G) GUA Val (V) GCA Ala (A) GAA Glu (E) GGA Gly (G) GUG Val (V) GCG Ala (A) GAG Glu (E) GGG Gly (G)
DNA
(DNA) (DNA) 4.6M 4,000 15M 6,000 100M 14,000 170M 12,000 3,000M 25,000
22
Nature Feb.15, 2001
ATTCCTACGA..
( ) 0 10000 20000 30000 40000 50000
A C C B A B B A C B A C DNA RNA A B C
97% 97% < 3 DNA? 50% ) LINE1 ALU LINE1 ALU LINE1 LINE1
RNA? RNA
RNA?
RNA RNA 62.5% RNA RNA
cdna 8,331
AAAAA # # # # AAAAA >12,000 >5,000 (Zhang et al. 2004)
? # # # ## AAAAA...
(1) mrna Makorin1-p1 ( ) mrna mrna Makorin1 mrna (Hirotsune et al. 2003) Makorin1
(2) No Makorin1-p1 Makorin1 mrna mrna mrna mrna is degraded (Hirotsune et al. 2003) Makorin1-p1 RNA
RNA # # # # # # # # # # # # # # # # # # # # # # # # # # 7.8% 6.2%
GenBank, etc. ATTCCTACGA..
GenBank exon, intron NCBI
GenBank LOCUS DEFINITION E.coli peptide SOURCE ORGANISM E.coli CDS 2..16 ORIGIN 1 catgatgtac atctaataga 21 acgagtgagg //
Perl
GC
Perl (1 6 ) Perl WEB (7 12 ) TA/SA (13
WEB page http://www.bioinfo.sfc.keio.ac.jp/class/genpro
A (30%) (30%) (40%) 20% B (30%) (30%) (40%) B C
(0-7) etc. (0-4) etc. (0-3) etc. (0-3) (0-3) α etc.
TA SA