28DDBJing_nagasaki.pdf

Similar documents
長崎は遺伝研 大量遺伝情報研究室の所属です 国立遺伝学研究所 生命情報研究センター 3F 2F 欧州EBIと米国NCBIと密接に協力しながら DDBJ/EMBL/GenBank国際塩基配列データ ベースを構築しています 私たちは 塩基配列登録を支援するシステムづくり 登録データの活用するシステムづく

AJACS18_ ppt

JC オンライン投稿の操作方法について(mac) 2011_9 FINAL

PowerPoint プレゼンテーション

Microsoft Word - CATNewsVol2No7Text.doc

Microsoft Word - Meta70_Preferences.doc

Microsoft Word - Live Meeting Help.docx

Microsoft Word - Win-Outlook.docx

Cleaner XL 1.5 クイックインストールガイド

2 I I / 61

Microsoft Word - D JP.docx

Introduction Purpose This training course describes the configuration and session features of the High-performance Embedded Workshop (HEW), a key tool

シーケンサー利用技術講習会 第10回 サンプルQC、RNAseqライブラリー作製/データ解析実習講習会

Maser - User Operation Manual

Step 1 Feature Extraction Featuer Extraction Feature Extraction Featuer Extraction Image Analysis Start>Programs>Agilent-Life Sciences>Feature Extract

1 I EViews View Proc Freeze

ProVisionaire Control V3.0セットアップガイド

Page 1 of 6 B (The World of Mathematics) November 20, 2006 Final Exam 2006 Division: ID#: Name: 1. p, q, r (Let p, q, r are propositions. ) (10pts) (a

Łñ“’‘‚2004

プリント


MOTIF XF 取扱説明書

バクテリアゲノム解析


CLC Genomics Workbench ウェブトレーニングセミナー: 変異解析編

DICOM UG_JPN_P book

untitled

自動シャットタ<3099>ウンクイックインストールカ<3099>イト<3099>.indb

25 II :30 16:00 (1),. Do not open this problem booklet until the start of the examination is announced. (2) 3.. Answer the following 3 proble

LM35 高精度・摂氏直読温度センサIC

AuthorManual_JSTP.ppt

untitled

[2] , [3] 2. 2 [4] 2. 3 BABOK BABOK(Business Analysis Body of Knowledge) BABOK IIBA(International Institute of Business Analysis) BABOK 7

Introduction Purpose This training course demonstrates the use of the High-performance Embedded Workshop (HEW), a key tool for developing software for

fx-9860G Manager PLUS_J

Microsoft Word - MetaFluor70取扱説明.doc

Introduction Purpose This course explains how to use Mapview, a utility program for the Highperformance Embedded Workshop (HEW) development environmen

New version (2.15.1) of Specview is now available Dismiss Windows Specview.bat set spv= Specview set jhome= JAVA (C:\Program Files\Java\jre<version>\

: (EQS) /EQUATIONS V1 = 30*V F1 + E1; V2 = 25*V *F1 + E2; V3 = 16*V *F1 + E3; V4 = 10*V F2 + E4; V5 = 19*V99

cover1.indd

soturon.dvi

AtCoder Regular Contest 073 Editorial Kohei Morita(yosupo) A: Shiritori if python3 a, b, c = input().split() if a[len(a)-1] == b[0] and b[len(

Plan of Talk CAS CAS 2 CAS Single Sign On CAS CAS 2 CAS Aug. 19, 2005 NII p. 2/32


L3 Japanese (90570) 2008

Microsoft PowerPoint - Lecture_2

Sequencher 4.9 Confidence score Clustal Clustal ClustalW Sequencher ClustalW Windows Macintosh motif confidence Sequencher V4.9 Trim Ends Without Prev

RNA-seq

help gem gem gem my help

Microsoft PowerPoint - Lecture_3

DocuWide 2051/2051MF 補足説明書

~~~~~~~~~~~~~~~~~~ wait Call CPU time 1, latch: library cache 7, latch: library cache lock 4, job scheduler co

スライド 1

Microsoft Word - RMD_75.doc

名称未設定

dvi

1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,

RX600 & RX200シリーズ アプリケーションノート RX用仮想EEPROM

A-GAGE High - Resolution MINI ARRAY Instruction Manual Printed in Japan J20005M

Microsoft Word - PrivateAccess_UM.docx

,, create table drop table alter table

SCREENOS NAT ScreenOS J-Series(JUNOS9.5 ) NAT ScreenOS J-Series(JUNOS9.5 ) NAT : Destination NAT Zone NAT Pool DIP IF NAT Pool Egress IF Loopback Grou

Visual Evaluation of Polka-dot Patterns Yoojin LEE and Nobuko NARUSE * Granduate School of Bunka Women's University, and * Faculty of Fashion Science,

™…

自分の天職をつかめ

kubostat2017b p.1 agenda I 2017 (b) probability distribution and maximum likelihood estimation :

by CASIO W61CA For Those Requiring an English/Chinese Instruction

StarLogoテキスト(4匹).PDF


<30315F985F95B65F90B490852E696E6464>

TeraTerm Pro V.2.32の利用法

1


Transcription:

de novo de novo

http://p.ddbj.nig.ac.jp http://p.ddbj.nig.ac.jp

http://p.ddbj.nig.ac.jp http://p.ddbj.nig.ac.jp

http://p-galaxy.ddbj.nig.ac.jp de novo http://p-galaxy.ddbj.nig.ac.jp (http://genome.ucsc.edu/cgi-bin/hggateway)

http://p-galaxy.ddbj.nig.ac.jp de novo a b c Read set Overlap linear sequences by overlaps of k 1 to build graph components... ATTCG TTCGC TCGCA CGCAA... CTTCG De Bruijn graph (k = 5)... k 1 CAATG GCAAT CAATC T G C A C A Extend in k-mer space and break ties T G C A T A T C A T C G T C! T G T* C k 1 k 1 k 1 k 1 AATGA ATGAT TGATC GATCG... A ATCGG TCGGA CGGAT AATCA ATCAT TCATC CATCG C TTCGCAA...T G... C Compacting Compact graph............ ATCGGAT... >a121:len = 5,845 >a122:len = 2,560 >a123:len = 4,443 >a124:len = 48 >a126:len = 66 Linear sequences... A G... C TTCGCAA...T C ATCGGAT... Finding paths Compact graph with reads Extracting sequences...cttcgcaa...tgatcggat......attcgcaa...tcatcggat... Transcripts Figure 1 Overview of Trinity. (a) Inchworm assembles the read data set (short black lines, top) by greedily searching for paths in a k-mer graph (middle), resulting in a collection of linear contigs (color lines, bottom), with each k-mer present only once in the contigs. (b) Chrysalis pools contigs (colored lines) if they share at least one k 1-mer and if reads span the junction between contigs, and then it builds individual de Bruijn graphs from each pool. (c) Butterfly takes each de Bruijn graph from Chrysalis (top), and trims spurious edges and compacts linear paths (middle). It then reconciles the graph with reads (dashed colored arrows, bottom) and pairs (not shown), and outputs one linear sequence for each splice form and/or paralogous transcript represented in the graph (bottom, colored sequences).

Tet raodon nigroviridis 2013 Aug;20(4):383-90.

DRA: http://trace.ddbj.nig.ac.jp/dra RNAseq DRAweb DRASearchweb Organism:Tetraodon nigroviridis Search SRA012701 Pipeline http://www.ddbj.nig.ac.jp/ http://p.ddbj.nig.ac.jp/

DDBJ Import public DRA Input DRA/ERA/SRA Accession Number SRA012701 Add my DRA entry Confirmation Send a mail when completed importing import OK importimport public DRA web DRAqueueddone

Trinity QV Preprocessing Private DRA entry SRAA012701 FTP Tetraodon_nigroviridis_RNA-Seq Experimental ACCESION20122 NEXT

Trinity QV NEXT Trinity QV Run Preprocessing denovo Assemblly / mapping ID View

Trinity QV Count of QS Count 0.0e+00 2.0e+07 4.0e+07 6.0e+07 8.0e+07 1.0e+08 1.2e+08 0 10 20 30 40 Phred Quality Score

FASTQ/FASTADDBJ FTP FTP upload web HTTP upload DRA Private DRA entry Preprocessing Preprocessing Preprocessing Preprocessing Start Preprocessing PreprocesingID e.fastq.bz2 ID NEXT denovo Assembly Trinity NEXT

confirm NEXT Trinity.pl --seqtype fq --JM 100G --bflyheapspacemax 4G --bflygcthreads 1 --CPU 4 --single <> --output <> --min_contig_length 201

>m.565 g.565 ORF g.565 m.565 type:internal len:207 (-)... DLEMQIEGLKEELIFLKKNHEEELLAMRAQMSGQVHVEVEAAPAEDLTKVMADIREHYES ITAKNQKELETWFNSKSEALNKEMMTQTVTLQTSRSEVTEVKRSLQALQIELESLLGMKA SLEGTLQDTQNRYSMMLAGYQQQVTSLEQQLVQLRADLVRQGQDYQMLLDIKTRLELEIA EYRRLLEGEAAASSSTSSTSSTKTRRL >m.566 g.566 ORF g.566 m.566 type:complete len:216 (+)... MAQSVPVVMFKLVLVGDGGTGKTTFVKRHLTGEFEKKYVATLGVEVHPLFFNTNRGNVKF NVWDTAGQEKFGGLRDGYYIQAQCAIIMFDVTSRVTYKNVPNWHRDLVRVCENIPIVLCG NKVDIKDRKVKAKSIVFHRKKNLQYYDISAKSNYNFEKPFLWLARKLIGDPNLEFVEMPA LAPPEVTMDPALAVQYEKELHVASQTALPDDEDDL* >m.568 g.568 ORF g.568 m.568 type:internal len:227 (-)... GDRFKEDRKAKRLPEKSIDMIILLTDGDPNSGESRIPVIQENVKAAIGGQMSLFSLGFGN DVKYPFLDVMSRENNGLARRIYEGSDAALQLQGFYDEVSSPLLLDVDLRYPDNAVDSLTT NQFSQLFNGSEIVVAGRLKDNDIDNFPVEVFGQGLNDFSEQGQFSVLDWSGMYPDDDYIF GDFTERLWAYLTIQQLLDKSKTGDAEEKANASAEALDMSLRYSFVTP >m.571 g.571 ORF g.571 m.571 type:5prime_partial len:394... ASGGEGTHSSCGSWFNAGAKDFPSVPYSYLDFNDYKCKTSSGEIESYHDVHQVRDCRLVS LLDLALEKDYVRGKVADYMNRLVDMGVAGFRVDACKHMWPGDLSAVYGRLNNLNTKWFPE GSRPFIFQEVIDLGGEAISYTVYVHLGRVTEFKYGAKLGTVFRKWNNEKLMYTKNWGEGW GFMPNGNAVVFIDNHDNQRGHGAGGAAIVTFWDSRLHKMAVAYMLAHPYGVTRVMSSFRW NRHIVNGKDQNDWMGPPSHPDGSTKSVPINPDETCGDGWVCEHRWRQIKNMVIFRNVVNG QPHSNWWDNNSNQVAFGRGNRGFIIFNNDDWDLDVTLNTGLPAGTYCDVISGQKEAGRCT GKQIHVGSDGRAHFRISNRDEDPFVAIHVESKL* >m.573 g.573 ORF g.573 m.573 type:5prime_partial len:224... WEPSWPWQVSLQEYTGFHFCGGSLINENWVVTAAHCNVRTSHRVILGEHDRSSNNENIQV MQVGQVFKHPNYNSYTINNDITLIKLASPAQLNIRVSPVCVAETSDVFPGGMKCVTSGWG LTRYNAPDTPPRLQQVALPLLTNEECRKHWGSKITDLMVCAGASGASSCMGDSGGPLVCE KAGAWTLVGIVSWGSGFCSVSSPGVYARVTMLRAWMDQIIAAN* # --- full sequence ---- --- best 1 domain ---- --- domain number estimation ---- # target name accession query name accession E-value score bias E-value score bias exp reg clu ov env dom rep inc description of target #------------------- ---------- -------------------- ---------- --------- ------ ----- --------- ------ ----- --- --- --- --- --- --- --- --- --------------------- Actin PF00022.14 m.1-2.8e-162 539.5 0.0 3.2e-162 539.3 0.0 1.0 1 0 0 1 1 1 1 Actin Apolipoprotein PF01442.13 m.3-1.1e-38 132.6 10.6 1.1e-38 132.6 7.3 1.8 2 0 0 2 2 2 2 Apolipoprotein A1/A4/E domain

http://p.ddbj.nig.ac.jp/ DDBJ(http://p.ddbj.nig.ac.jp/) New account UserID Registration e 1.FTP Upload 4

1.FTP clientpc DDBJFTP 2. FTP client Cyberduck 1.http://cyberduck.ch/ 2.

1.Cyberduck 2. 3. FTP-SSL(Explicit AUTH TLS) 4.(133.39.116.60) (21) 5.Pipeline guest 6. Query file Upload 1.Query submission DRA000001 sample Bacillus subtilis subsp. natto BEST195 without plasmid pbest195l Read : 9,977,388 Read length : 36 2.Upload & 3.UploadPipeline

Query file Upload 1 1.Pipeline UploadSingle-end 2.Select a FASTA/FASTQ file UploadPaired-end 2.Select a FASTA/FASTQ file 3.Single-end 3.Paired-end 4.read 4.read1 file 5. 5.read1 fileread2 file 6. 1. 2.Study title 3. 4.Assembly/Mapping

Query file Upload Upload 1.Upload FASTA/FASTQ(FTP client) 2. 3.