1 1.1 Web DNA (Dayhoff et al., 1976) Web (van den Berg et al., 2010) Application Programming Interface (API) API Web (Fernandez-Suarez and Galperin, 2



Similar documents
橡dbweb2002-sato.PDF

計算機生命科学の基礎II_


AJACS18_ ppt

生物工学会誌 第95巻 第5号 バイオインフォマティクスを使い尽くす秘訣教えます!【第5回】

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

ウェブサービスとは WWWを介してデータの取得 解析などをサー バ側で行うサービス 人が直接使うことは意図されていない プログラム等を使って大量に処理できる(単純) 作業を意図している SOAP, REST

遺伝子ネットワーク解析


スライド 1

プレゼンテーション2.ppt

JP1/Integrated Management - Service Support 操作ガイド

Wiki Wiki Wiki...

untitled

日本感性工学会論文誌

FileMaker Server Getting Started Guide

IPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp

1. [1, 2, 3] (PDF ) [4] API API [5] ( ) PDF Web Web Annotate[6] Digital Library for Earth System Education(DLESE)[7] Web PDF Text, Link, FreeTe

Web SOAP Internet Web REST SOAP REST 3 REST SOAP 4

ルール&マナー集_社内版)_修正版.PDF

WebRTC P2P,. Web,. WebRTC. WebRTC, P2P, i

DEIM Forum 2009 C8-4 QA NTT QA QA QA 2 QA Abstract Questions Recomme

Genome-Wide Genetic Analysis More flexibility. More content. CGH CGH CGH CGH Comparative Genomic Hybridization CGH2 DNA KaryotypingFISH CGH BACBacteri

KEGG.ppt

PubMedとは

国立遺伝学研究所におけるDNAデータバンク:DDBJ

morita.PDF

HTML5無料セミナ.key

2. blog SNS (1) 2009 Data.gov 2010 data.gov.uk data.gouv.fr ) IT ) a) b) c) G8 3) data.go

はじめに

Web Microsoft 2008 R2 Database Database!! Database 04 08

NBDC-DBCLS-共同研究計画.key

Dublin Core Metadata Element Set DCMES DCMES interoperability Interoperability DL DL [3] 2001 Joint Conference on Digital Libraries JCDL 2001

ÿþ

KNOB Knoppix for Bio Itoshi NIKAIDO

_02-5.ppt

ESA_UI_1110.PDF

XMLを基盤とするビジネスプロトコルの動向

b2bintro.book

Mining Social Network of Conference Participants from the Web

fiš„v5.dvi

LWW EJ on Ovid LWW Ovid Online (Ovid Web Gateway) Ovid Online LWW tutorial Ovid Online Refresh Ovid Online LWW Ovid Medline, Cinahl, EBMR, Ovid

2 A. Ramón y Cajal projectional brain map barrel Statview I I II II I I II II

Oracle Identity Managementの概要およびアーキテクチャ

自然言語処理16_2_45

IBM Software Group DB2 Information Management Software DB2 V8 XML SQL/XML 2 XML XML UDF XMLExtender XML XML XMLCollection, XMLColumn XML UDF Informati

matsuda.dvi

Transcription:

Web Web Web Web Web G-Links Web G-Links ID URL http://link.g-language.org/ Web

1 1.1 Web DNA (Dayhoff et al., 1976) Web (van den Berg et al., 2010) Application Programming Interface (API) API Web (Fernandez-Suarez and Galperin, 2013) 1200 Web (Brazas et al., 2012) Web (Bhagat et al., 2010) 1. 2. 3. 3 Web Web Uniform Resource Identifier (URI) Web (Stein, 2002, 2008) 1.2 Federated Query (Jacso, 2004) SOAP Web API (Wilkinson et al., 2003) BioMoby (Wilkinson et al., 2008) mygrid 1

Web BioMoart (Kasprzyk, 2011) 1.3 ID ID Link Linked Data Relational Database (RDB) (Codd, 1969) Linked Data Link Link ID ID Linked Data ID ID ID SOURCE (Diehn et al., 2003) Protein Identifier Cross-Referencing (PICR) (Cote et al., 2007) ID biodbnet (Mudunuri et al., 2009) ID ID Link ID Link ID ID ID Link Link Link Link 1.4 Semantic Web Tim Berners-Lee World Wide Web (WWW) Semantic Web Semantic Web URI Link Web Ontology Language (OWL) WWW Semantic Web Resource Description Framework (RDF) Semantic Web Link RDF Semantic Web 2

2 Web Link Linked Data ID Linked Data ID ID ID ID ID URI ID ID ID 3

3 3.1 G-Links ID G-Links Primary Key Linked Data Primary Key ID UniProt ID UniProt (The UniProt Consortium, 2012) Linked Data Link ID mapping (Huang et al., 2011) Primary Key G-Links biodbnet UniProt ID ID UniProt ID 2 Perl MySQL 5.0 RDB UniProt 1 G-Links 3.1 3.2 ID G-Links ID ID ID / ID 3 ID UniProt ID KEGG Orthology ID UniProt ID UniProt ID ID ID UniProt ID UniProt Taxonomy Search (http://www.uniprot.org/taxonomy/) ID NCBI Taxonomy (Federhen, 2012) RefSeq (Pruitt 4

Outputs RefSeq SMART ID conversion Name Pfam RefSeq EMBL Sequence Web Services GeneID GO slim OMA KEGG Reaction KEGG Pathway 3.1: G-Links G-Links ID UniProt ID ID UniProt ID ID Web URL et al., 2012) G-Links UniProt ID European Molecular Biology Open Software Suite (EMBOSS) (Rice et al., 2000) transeq BLAST Like Alignment Tool (BLAT) (Kent, 2002) Swiss-Prot +0, +1, +2 3 Watson Click 6 BLAT E-value Identity ID G-Links UniProt ID E-value Identity UniProt ID G-Links URL UniProt ID ID 3.3 ID UniProt ID G-Links UniProt ID ID UniProt Link GO slim (Harris et al., 2004) G-Links ID ID UniProt ID ID 5

Perl Storable 3.4 G-Links G-Links RESTful URL ID ID ID URL 3 G-Links Programmable Human-Readable Semantic Web 3 Programable JSON Tabular Human-Readable HTML 1 ID UniProt KEGG Pathway COXPRESdb (Obayashi et al., 2013) PHP: Hypertext Preprocessor (PHP) ImageFlow (http://imageflow.finnrudolph.de/) ID JavaScrip tablesorter (http://tablesorter.com/docs/) Semantic Web RDF/XML Notation 3 Notation 3 Perl RDF/XML RDF::Notation3 Notation 3 RDF G-Links EDAM Ontology UniProt Core Ontology 4 4.1 G-Links RESTful ID ID / URL http://link.g-language.org/ http://g-language.org/wiki/glinks https://github.com/cory-ko/g-links 1 G-Links [] () 6

G-Links Syntax (1) ID ID http://link.g-language.org/[gene or GENE SET ID] (/filter=[filter])(/extract=[extract])(/format=[format]) (2) http://link.g-language.org/[sequence] (/evalue=[e-value])(/identity=[identity])(/direct=[0 or 1]) G-Links 85 205,829,185 ID (205,811,947 ID 17,238 ID) / 132 315,481,016 ID ID ID http://link.g-language.org/input_list http://link.g-language.org/output_list 4.2 G-Links REST ID http://link.g-language.org/ [GENEID] ID URL G-Links URL Homo Sapiens BRCA1 (Serova et al., 1997) UniProt BRCA1 HUMAN http://link.g-language.org/brca1_human 4.1 4.1: G-Links ID 0.03 (TSV), 1.98 (HTML) 25 (KEGG Pathway, PDB, COXPRESdb ) 184 (48 ) 443 (68 )?? http://link.g-language.org/brca1_human G- Links 1URL 4.3 G-Links ID 1URL ID 7

ID ID UCSC ID uc003hui GeneID 93986 http: //link.g-language.org/uc003hui,93986 KEGG Orthology ID ID ID UniProt taxonomy 4.4 G-Links G-Links 4.2 4.2: G-Links tsv slim URL json JSON html HTML rdf RDF/XML n3 Notation3 G-Links format 6 BRCA1 JSON http://link.g-language.org/brca1_human/format=json JSON HTML tsv G-Links 3 HTML Humanreadable ID Programmatic JSON Tab Separated Value (TSV) UNIX URL G-Links Web Semantic Web RDF/XML Notation3 RDF Semantic Web RDF URL G-Links RDF EDAM Ontology UniProt Ontology EDAM Ontology Web 8

4.5 G-Link G-Links 2 filter filter DISEASE G-Links 2 : : DISEASE filter=disease filter=:cancer DISEASE filter=disease:cancer filter ( ) filter G-Links AND extract DISEASE extract=disease extract filter extract OR URL filter extract Homo Sapiens http://link.g-language.org/9606/format=tsv/filter=disease:cancer SNP http://link.g-language.org/9606/format=tsv/filter=disease:cancer /filter=:breast :ovarian dbsnp SNPedia http://link.g-language.org/9606/format=tsv/filter=disease:cancer /filter=:breast :ovarian :snps :polymorphisms /extract=dbsnp SNPedia filter extract G-Links filter extract Homo Sapiens SNP dbsnp SNPedia URL 9

5 Web G-Links ID ID URL 130 Web ID ID ID / 2 URL G-Links G-language EMBOSS REST KBWS REST URL G-Links KBWS G-language Project 10

Bhagat, J., Tanoh, F., Nzuobontane, E., Laurent, T., Orlowski, J., Roos, M., Wolstencroft, K., Aleksejevs, S., Stevens, R., Pettifer, S., Lopez, R., and Goble, C. A. (2010). BioCatalogue: a universal catalogue of web services for the life sciences. Nucleic Acids Res., 38(Web Server issue), W689 694. Brazas, M. D., Yim, D., Yeung, W., and Ouellette, B. F. (2012). A decade of Web Server updates at the Bioinformatics Links Directory: 2003-2012. Nucleic Acids Res., 40(Web Server issue), W3 W12. Codd, E. F. (1969). Derivability, redundancy and consistency of relations stored in large data banks. IBM Research Report, San Jose, California, RJ599. Cote, R. G., Jones, P., Martens, L., Kerrien, S., Reisinger, F., Lin, Q., Leinonen, R., Apweiler, R., and Hermjakob, H. (2007). The Protein Identifier Cross-Referencing (PICR) service: reconciling protein identifiers across multiple source databases. BMC Bioinformatics, 8, 401. Dayhoff, M. O., Barker, W. C., Schwartz, R. M., Orcutt, B. C., and Hunt, L. T. (1976). Data base for protein sequences. In Proceedings of the June 7-10, 1976, national computer conference and exposition, AFIPS 76,261 266,NewYork,NY,USA. ACM. Diehn, M., Sherlock, G., Binkley, G., Jin, H., Matese, J. C., Hernandez-Boussard, T., Rees, C. A., Cherry, J. M., Botstein, D., Brown, P. O., and Alizadeh, A. A. (2003). SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data. Nucleic Acids Res., 31(1), 219 223. Federhen, S. (2012). The NCBI Taxonomy database. Nucleic Acids Res., 40(Database issue), D136 143. Fernandez-Suarez, X. M. and Galperin, M. Y. (2013). The 2013 Nucleic Acids Research Database Issue and the online Molecular Biology Database Collection. Nucleic Acids Res., 41(D1), 1 7. Harris, M. A., Clark, J., Ireland, A., Lomax, J., Ashburner, M., Foulger, R., Eilbeck, K., Lewis, S., Marshall, B., Mungall, C., Richter, J., Rubin, G. M., Blake, J. A., Bult, C., Dolan, M., Drabkin, H., Eppig, J. T., Hill, D. P., Ni, L., Ringwald, M., Balakrishnan, R., Cherry, J. M., Christie, K. R., Costanzo, M. C., Dwight, S. S., Engel, S., Fisk, D. G., Hirschman, J. E., Hong, E. L., Nash, R. S., Sethuraman, A., Theesfeld, C. L., Botstein, D., Dolinski, K., Feierbach, B., Berardini, T., Mundodi, S., Rhee, S. Y., Apweiler, R., Barrell, D., Camon, E., Dimmer, E., Lee, V., Chisholm, R., Gaudet, P., Kibbe, W., Kishore, R., Schwarz, E. M., Sternberg, P., Gwinn, M., Hannick, L., Wortman, J., Berriman, M., Wood, V., de la Cruz, N., Tonellato, P., Jaiswal, P., Seigfried, T., and White, R. (2004). The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res., 32(Database issue), D258 261. Huang, H., McGarvey, P. B., Suzek, B. E., Mazumder, R., Zhang, J., Chen, Y., and Wu, C. H. (2011). protein-centric ID mapping service for molecular data integration. Bioinformatics, 27(8), 1190 1191. A comprehensive Jacso, P. (2004). Thoughts about federated searching. Information Today, 21(9), 17 20. Kasprzyk, A. (2011). BioMart: driving a paradigm change in biological data management. Database, 2011, bar049. Kent, W. J. (2002). BLAT the BLAST-like alignment tool. Genome Res., 12(4), 656 664. Mudunuri, U., Che, A., Yi, M., and Stephens, R. M. (2009). biodbnet: the biological database network. Bioinformatics, 25(4), 555 556. Obayashi, T., Okamura, Y., Ito, S., Tadaka, S., Motoike, I. N., and Kinoshita, K. (2013). COXPRESdb: a database of comparative gene coexpression networks of eleven species for mammals. Nucleic Acids Res., 41(D1), D1014 1020. Pruitt, K. D., Tatusova, T., Brown, G. R., and Maglott, D. R. (2012). NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy. Nucleic Acids Res., 40(Database issue), D130 135. Rice, P., Longden, I., and Bleasby, A. (2000). EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet., 16(6), 276 277. Serova, O. M., Mazoyer, S., Puget, N., Dubois, V., Tonin, P., Shugart, Y. Y., Goldgar, D., Narod, S. A., Lynch, H. T., and Lenoir, G. M. (1997). Mutations in BRCA1 and BRCA2 in breast cancer families: are there more breast cancer-susceptibility genes? Am. J. Hum. Genet., 60(3), 486 495. Stein, L. (2002). Creating a bioinformatics nation. Nature, 417(6885), 119 120. Stein, L. D. (2008). Towards a cyberinfrastructure for the biological sciences: progress, visions and challenges. Nat. Rev. Genet., 9(9), 678 688. 11

The UniProt Consortium (2012). Update on activities at the Universal Protein Resource (UniProt) in 2013. Nucleic Acids Res. van den Berg, B. H., McCarthy, F. M., Lamont, S. J., and Burgess, S. C. (2010). Re-annotation is an essential step in systems biology modeling of functional genomics data. PLoS ONE, 5(5), e10642. Wilkinson, M., Gessler, D., Farmer, A., and Stein, L. (2003). The BioMOBY Project Explores Open-Source, Simple, Extensible Protocols for Enabling Biological Database Interoperability. Wilkinson, M. D., Senger, M., Kawas, E., Bruskiewich, R., Gouzy, J., Noirot, C., Bardou, P., Ng, A., Haase, D., Saiz, E. d. e. A., Wang, D., Gibbons, F., Gordon, P. M., Sensen, C. W., Carrasco, J. M., Fernandez, J. M., Shen, L., Links, M., Ng, M., Opushneva, N., Neerincx, P. B., Leunissen, J. A., Ernst, R., Twigger, S., Usadel, B., Good, B., Wong, Y., Stein, L., Crosby, W., Karlsson, J., Royo, R., Parraga, I., Ramirez, S., Gelpi, J. L., Trelles, O., Pisano, D. G., Jimenez, N., Kerhornou, A., Rosset, R., Zamacola, L., Tarraga, J., Huerta-Cepas, J., Carazo, J. M., Dopazo, J., Guigo, R., Navarro, A., Orozco, M., Valencia, A., Claros, M. G., Perez, A. J., Aldana, J., Rojano, M., Fernandez-Santa Cruz, R., Navas, I., Schiltz, G., Farmer, A., Gessler, D., Schoof, H., and Groscurth, A. (2008). Interoperability with Moby 1.0 it s better than sharing your toothbrush! Brief. Bioinformatics, 9(3), 220 231. 12