DEIM Forum 2012 C2-6 Hadoop Web Hadoop Distributed File System Hadoop I/O I/O Hadoo

Similar documents
2 Hadoop MapReduce Hadoop, MapReduce Apache Hadoop Project Open Source Software Hadoop common MapReduce Hadoop Distributed File System( HDFS)

,., ping - RTT,., [2],RTT TCP [3] [4] Android.Android,.,,. LAN ACK. [5].. 3., 1.,. 3 AI.,,Amazon, (NN),, 1..NN,, (RNN) RNN

[1] [2] [3] (RTT) 2. Android OS Android OS Google OS 69.7% [4] 1 Android Linux [5] Linux OS Android Runtime Dalvik Dalvik UI Application(Home,T

IPSJ-HPC

yamamoto_hadoop.pptx

DEIM Forum 2017 H2-2 Android LAN Android 1 Android LAN

2 JSON., 2. JSON,, JSON Jaql [9] Spark Streaming [8], Spark [7].,, 2, 3 4, JSON [3], Jaql [9], Spark [7] Spark Streaming [8] JSON JSON [

P P P P P P P OS... P P P P P P

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2013-HPC-139 No /5/29 Gfarm/Pwrake NICT NICT 10TB 100TB CPU I/O HPC I/O NICT Gf

HP ProLiant Gen8とRed Hatで始めるHadoop™ ~Hadoop™スタートアップ支援サービス~

Android LAN 1 1,, Google Android. Android, Android,. Android x86 CPU,,,. A study of performance improvement of a wireless LAN bases on Android termina

Introduction

2

i Ceph

Microsoft PowerPoint _Hadoop.pptx

PowerPoint Presentation

DEIM Forum 2019 H2-2 SuperSQL SuperSQL SQL SuperSQL Web SuperSQL DBMS Pi

PC Development of Distributed PC Grid System,,,, Junji Umemoto, Hiroyuki Ebara, Katsumi Onishi, Hiroaki Morikawa, and Bunryu U PC WAN PC PC WAN PC 1 P

1 Web DTN DTN 2. 2 DTN DTN Epidemic [5] Spray and Wait [6] DTN Android Twitter [7] 2 2 DTN 10km 50m % %Epidemic 99% 13.4% 10km DTN [8] 2

IPSJ SIG Technical Report Vol.2009-DPS-141 No.23 Vol.2009-GN-73 No.23 Vol.2009-EIP-46 No /11/27 t-room t-room 2 Development of

Microsoft PowerPoint - Sep2110_桜井.ppt

2011年11月10日 クラウドサービスのためのSINET 学認説明会 九州地区説明会 九州大学キャンパス クラウドシステムの導入 伊東栄典 情報基盤研究開発センター 1

Hadoop Introduction

HBase Phoenix API Mars GPU MapReduce GPU Hadoop Hadoop Hadoop MapReduce : (1) MapReduce (2)JobTracker 1 Hadoop CPU GPU Fig. 1 The overview of CPU-GPU

AV 1000 BASE-T LAN 90 IEEE ac USB (3 ) LAN (IEEE 802.1X ) LAN AWS (Amazon Web Services) AP 3 USB wget iperf3 wget 40 MBytes 2 wget 40 MByt

ITソリューション フロンティア2012年4月号

WLAN WLAN AP WLAN WLAN WLAN AP- WLAN SINR WLAN WLAN CE WLAN WLAN WLAN CE 2 3 WLAN 4 WLAN 2. WLAN [10] AP CE [11] AP CE CE [12] CE AP AP AP WLAN WLAN A

Hadoopの全て

ビッグデータアナリティクス - 第3回: 分散処理とApache Spark

先進的計算基盤システムシンポジウム SACSIS2012 Symposium on Advanced Computing Systems and Infrastructures SACSIS /5/17 Android LAN TCP Android. TCP A Proposal

無料セミナー資料:ビッグデータ管理基盤ソフトウェアHadoop入門

untitled

2

nakayama15icm01_l7filter.pptx

Amazon EC2 IaaS (Infrastructure as a Service) HPCI HPCI ( VM) VM VM HPCI VM OS VM HPCI HPC HPCI RENKEI-PoP 2 HPCI HPCI 1 HPCI HPCI HPC CS

Dockerの商用サービスでの利用事例紹介

分散ストレージシステム (4) (5) (6) 書き込み 書き込み 読み出し 読み出し (2) コーディネータ 1 Fig. 1 Image of distributed storage system. 2 Fig. 2 Process flow of ( 1 ) ( 2 ) ( 3 )

.A.N.Z.X36..PDF

2

操作1 <設問作成>

卒業論文

untitled

1

DEIM Forum 2017 H ,

DEIM Forum 2009 B4-6, Str

Microsoft PowerPoint - SWoPP2010_Shirahata

クララパンフレット2011冬1P-P40


(Microsoft PowerPoint - Hadoop\225\224\211\357.ppt)

Shonan Institute of Technology MEMOIRS OF SHONAN INSTITUTE OF TECHNOLOGY Vol. 41, No. 1, 2007 Ships1 * ** ** ** Development of a Small-Mid Range Paral

スライド 1

スライド 1

1. 1 DBMS Unix (USP ) ( )[3] 20 UNIX [2] KISS UNIX 1. 2 (Tukubai ) Unix OS Unix USP Tukubai Tukubai 1. 3 Unix SQL Tukubai usp Tukubai Open usp Tukubai

FIT2015( 第 14 回情報科学技術フォーラム ) RC-003 ファイル格納位置制御による Hadoop MapReduce ジョブの性能の向上 藤島永太山口実靖 工学院大学大学院工学研究科電気 電子工学専攻工学院大学工学部情報通信工学科 1. はじめに近年, 世界中の情報量が爆発的に増加し

~~~~~~~~~~~~~~~~~~ wait Call CPU time 1, latch: library cache 7, latch: library cache lock 4, job scheduler co


SWoPP BOF BOF-1 8/3 19:10 BoF SWoPP : BOF-2 8/5 17:00 19:00 HW/SW 15 x5 SimMips/MieruPC M-Core/SimMc FPGA S

2015: Moodle 1,2, 2, 1, 2, Moodle Moodle SCO(Sharable Content Object) Moodle (Conditional Activities)

TheRecord.indd

本組/根間弘海

Title

月刊「家」11月号hp2.indd

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2015-DPS-163 No.17 Vol.2015-MBL-75 No /5/28 Hadoop MapReduce の Reduce 処理の I/O 高速化 藤島永太山口実靖工学院大学大学院工学研究

new_emc_panf_Hyoushi_0818

20 Covert Channel

MDS Patient Text-Japan.indd

ProLiant BL20p Generation 4 システム構成図

OSS 体験セミナー Hadoop の概要 高スケーラブルな分散管理基盤 2 つのコア機能 分散ファイルシステム (HDFS) 分散処理フレームワーク (Map/Reduce) BigData の管理基盤として注目 分散処理基盤 (Map/Reduce) Hadoop 分散ファイルシステム (HDF

untitled

26 FPGA FPGA (Field Programmable Gate Array) ASIC (Application Specific Integrated Circuit) FPGA FPGA FPGA FPGA Linux FreeDOS skewed way L1

MATLAB® における並列・分散コンピューティング ~ Parallel Computing Toolbox™ & MATLAB Distributed Computing Server™ ~

1 Microsoft Windows Server 2012 Windows Server Windows Azure Hyper-V Windows Server 2012 Datacenter/Standard Hyper-V Windows Server Windo

AWSSummitTokyo2018

Cell/B.E. BlockLib

EMC-greenplum-SG s-1p

AJACS18_ ppt

08 IPSJ/SIGSE Software Engineering Symposium (SES08) duce [] Assembly [6] Script 0 64 % 4 8% BBVC BBVC.. VC: Volunteer Computing VC LAN VC VC VC LAN V

Linux勉強会 ~Hadoopと高可用性~ Hadoop入門


untitled

CONTENTS N T

template.dvi

IPSJ SIG Technical Report Vol.2014-DBS-159 No.6 Vol.2014-IFAT-115 No /8/1 1,a) 1 1 1,, 1. ([1]) ([2], [3]) A B 1 ([4]) 1 Graduate School of Info

Leveraging Cloud Computing to launch Python apps

IPSJ SIG Technical Report Vol.2011-ARC-195 No.23 Vol.2011-OS-117 No /4/14 1. Cassandra CMS CMS 100 PC Cassandra Cassandra CMS Design of S

main.dvi

PowerPoint プレゼンテーション

Hadoopによる バッチ処理検証

2

VXPRO R1400® ご提案資料

Oracle Application Server 10g(9

「ネットワークを渡り歩けるコンピュータ」の

HPE Moonshot System ~ビッグデータ分析&モバイルワークプレイスを新たなステージへ~

strtok-count.eps

カスペルスキー アンチウイルス 2011 for Mac

21 A contents organization method for information sharing systems

Agenda Scalability Availability CAP Theorem Scalability Availability Consistency BASE Transaction

MAC root Linux 1 OS Linux 2.6 Linux Security Modules LSM [1] Security-Enhanced Linux SELinux [2] AppArmor[3] OS OS OS LSM LSM Performance Monitor LSMP

102

Transcription:

DEIM Forum 12 C2-6 Hadoop 112-86 2-1-1 E-mail: momo@ogl.is.ocha.ac.jp, oguchi@computer.org Web Hadoop Distributed File System Hadoop I/O I/O Hadoop A Study about the Remote Data Access Control for Hadoop Distributed File System Asuka MOMOSE and Masato OGUCHI Ochanomizu University 2-1-1 Otsuka, Bunkyouku Tokyo 112-86 JAPAN E-mail: momo@ogl.is.ocha.ac.jp, oguchi@computer.org 1. [1] Hadoop 1

Distributed File System HDFS [2] 2. Web Web 2. 1 Hadoop Distributed File System Hadoop Distributed File System HDFS [2] Hadoop Apache Software Foundation Hadoop Common HDFS MapReduce hbase 1 HDFS Hadoop MapReduce 1 Hadoop 2. 2 Hadoop HDFS Namenode JobTracker TastTracker 2 Hadoop MapReduce Map Reduce Mapper Reducer TaskTracker Hadoop 3. Rocks [3] Hadoop-..2 Hadoop 1. 2.MB Hadoop JobTracker TaskTracker Master Slave 2 Namenode TaskTracker TaskTracker Slave Slave Task Hadoop TestDFSIO Map/Reduce URL, Map N-gram URL Reduce N-gram n= 4. 4. 1 dummynet Namenode1 3 1 3 2 LocalArea Namenode 3 dummynet 1 OS CPU Main Memory Master node Linux 2.6.9-..2. Elsmp(CentOS 4) Intel(R) Xeon(R) @3.6GHz 4.GB Slave node Linux 2.6.9-..2. Elsmp(CentOS 4) Quad-Core Intel(R) Xeon(R) @1.6GHz 2.GB 2

4. 2 TestDFSIO TestDFSIO MB I/O Throughput ( RTT) msec msec 1 4 1 T hroughput ( M B/se c ) T hroughput (MB/sec ) 2 4 6 8 12 14 16 18 4 Write 1Replica 2Replica 3Replica 2 4 6 8 12 14 16 18 Read 1Replica 2Replica 3Replica 1 4. 3 HDFS 12 RTT 6 HDFS exec time(sec) 9 8 7 6 4 2 4 6 8 12 14 16 18 6 4. 4 Hadoop RandomWrite MB, 1GB RandomWrite RandomWrite Sort RTT ( 7 8) TestDFS I/O Sort Test exec time (sec) msec 2msec msec msec RTT 7 RandomWrite. Hadoop 3

Test exec time (sec) 7 6 4 msec 2msec msec msec RTT Namenode 4 4 3 Local node 1 Local node 2 8 Sort write1 write2 write3 write4 write. 1 Wireshark [4] 3 Namenode Wireshark Namenode. 2 1 9 3 RTT HDFS 3 write Local node 1 Local node 2 read1 read2 read3 read4 read 11 read 9 8 7 Local node 1 6 Local node 2 4 Total 6 16 9 1 11 RTT=msec 3 1 Namenode 1 Namenode % %. 3 HDFS 6. HDFS Hadoop 6. 1 Hadoop 12 4

rack Namenode 12 rack dummynet rack1 Remote 13 HDFS RTTmsec 1.9% % % % % % -% 6. 2 Hadoop (i)1st (ii)2nd Remote rack Local rack HDFS 2Remote rack % Hadoop HDFS 2 (Remote rack : Local rack) Math.random.,. : % - %.4,.6 9 : 11 4% - %.3,.7 8 : 12 4% - 6%.2,.8 7 : 13 3% - 6%.1,.9 6 : 14 % - 7% 6. 3 HDFS 3 Math.Random.8 2 13:7 3 (Remote rack : Local rack) Rack Node Blocks rack 1 122 rack 2 127 rack1 3 62 rack1 4 6 7. HDFS 7. 1 HDFS HDFS time(sec) xec e 3 13 % 3% 4% 4% % 4 [1] HDFS 14 optimized rack MB/sec) ( optimized rack simple rack 2 4 6 8 12 14 16 18 14 Write HDFS HDFS %

MB/sec) ( optimized rack simple rack 2 4 6 8 12 14 16 18 8. 2 HDFS HDFS HDFS 9. Read 7. 2 1.9% RTTmsec RTTmsec RTTmsec 22.9% RTTmsec - RTTmsec - 8. 8. 1 Web Hadoop Distributed File System Hadoop I/O I/O [1] Hadoop DE &PRMU ( ) 6 Vol.111 No.76 pp.19-24 11 6. [2] Dhruba Borthakur HDFS Architecture 8 The Apache Software Foundation [3] Rocks Cluster http://www.rocksclusters.org/ [4] Wireshark http://www.wireshark.org/ [] Sanjay Ghemawat Howard Gobioff and Shun-Tak Leung The Google File System ACM SIGOPS Operating Systems Review, Vol.37, No., pp.29-43, December 3 [6] Tom WhiteHadoop O Reilly Japan, Inc [7] Jason Venner Pro Hadoop 9 Apress [8] Gfarm v2 7-HPC-113 pp.7-12 7 12 [9] Gfarm MapReduce SWopp Vol.-HPC-126 8 6