MapTask 678 Map 関数 バッファ管理モジュール リングバッファ 45#$% *+,-./ 0123!"#$% &'() 外部記憶装置 1 MapReduce IFIle IFIle MapReduce 25% MapReduce 2 MapReduce OS

Similar documents
2 JSON., 2. JSON,, JSON Jaql [9] Spark Streaming [8], Spark [7].,, 2, 3 4, JSON [3], Jaql [9], Spark [7] Spark Streaming [8] JSON JSON [

IPSJ-HPC

DEIM Forum 2019 H2-2 SuperSQL SuperSQL SQL SuperSQL Web SuperSQL DBMS Pi

DEIM Forum 2015 E4-5 DSMS DSMS DSMS 32% 46% RTOS Priority Inversion Time

HBase Phoenix API Mars GPU MapReduce GPU Hadoop Hadoop Hadoop MapReduce : (1) MapReduce (2)JobTracker 1 Hadoop CPU GPU Fig. 1 The overview of CPU-GPU

DEIM Forum 2014 D3-5 DSMS DSMS DSMS 2.13% RTOS Realtime-Aware Efficient Query Processing for Automotiv

DEIM Forum 2012 C2-6 Hadoop Web Hadoop Distributed File System Hadoop I/O I/O Hadoo

DEIM Forum 2009 B4-6, Str

untitled

Publish/Subscribe KiZUNA P2P 2 Publish/Subscribe KiZUNA 2. KiZUNA 1 Skip Graph BF Skip Graph BF Skip Graph Skip Graph Skip Graph DDLL 2.1 Skip Graph S

DEIM Forum 2017 E Netflix (Video on Demand) IP 4K [1] Video on D

スライド 1

先進的計算基盤システムシンポジウム Shuffle KVP KVP MapReduce KVP 7) Jimmy PageRank MapReduce.69 Jimmy KVP Jimmy key KVP value KVP MapReduce 3 PageRank 4 Jimmy M

IPSJ SIG Technical Report Vol.2014-DBS-159 No.6 Vol.2014-IFAT-115 No /8/1 1,a) 1 1 1,, 1. ([1]) ([2], [3]) A B 1 ([4]) 1 Graduate School of Info

OS,,, Abstract OS LibOS LibOS OS OS OS LibOS Elasticty LibOS LibOS Li

3.1 Thalmic Lab Myo * Bluetooth PC Myo 8 RMS RMS t RMS(t) i (i = 1, 2,, 8) 8 SVM libsvm *2 ν-svm 1 Myo 2 8 RMS 3.2 Myo (Root

BOK body of knowledge, BOK BOK BOK 1 CC2001 computing curricula 2001 [1] BOK IT BOK 2008 ITBOK [2] social infomatics SI BOK BOK BOK WikiBOK BO

分散ストレージシステム (4) (5) (6) 書き込み 書き込み 読み出し 読み出し (2) コーディネータ 1 Fig. 1 Image of distributed storage system. 2 Fig. 2 Process flow of ( 1 ) ( 2 ) ( 3 )

([ ]!) name1 name2 : [Name]! name SuperSQL,,,,,,, (@) < >@{ < > } =,,., 200,., TFE,, 1 2.,, 4, 3.,,,, Web EGG [5] SSVisual [6], Java SSedit( ss

/ Apache Cassandra 3)4) Apache HBase 5) Yahoo Sherpa 6) sharded MySQL 7) (MySQL sharding ) MyCassandra 8) MyCassandra MyCassandra Cluster My- C

Microsoft Word - toyoshima-deim2011.doc

untitled

08 IPSJ/SIGSE Software Engineering Symposium (SES08) duce [] Assembly [6] Script 0 64 % 4 8% BBVC BBVC.. VC: Volunteer Computing VC LAN VC VC VC LAN V

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

Run-Based Trieから構成される 決定木の枝刈り法

bit bit bit VAST N d i d 1 <d 2 <...<d k <...<d N d k VAST d k 3 d k 3 d k 2 d k 1 d k 4 w w=4 ) HW HW 32bit γ δ [4] PForDelta [3] HW CPU VAST VAST VA

[1] [3]. SQL SELECT GENERATE< media >< T F E > GENERATE. < media > HTML PDF < T F E > Target Form Expression ( ), 3.. (,). : Name, Tel name tel

IPSJ SIG Technical Report Vol.2011-ARC-195 No.23 Vol.2011-OS-117 No /4/14 1. Cassandra CMS CMS 100 PC Cassandra Cassandra CMS Design of S

2 Hadoop MapReduce Hadoop, MapReduce Apache Hadoop Project Open Source Software Hadoop common MapReduce Hadoop Distributed File System( HDFS)

独立行政法人情報通信研究機構 Development of the Information Analysis System WISDOM KIDAWARA Yutaka NICT Knowledge Clustered Group researched and developed the infor

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. TRECVID2012 Instance Search {sak


IPSJ SIG Technical Report Vol.2016-ARC-221 No /8/9 GC 1 1 GC GC GC GC DalvikVM GC 12.4% 5.7% 1. Garbage Collection: GC GC Java GC GC GC GC Dalv

, [! [, ]! ]!,,., ([ ],). : [Name], name1 name2 name10 ([ ]!). name1 name2 : [Name]! name SuperSQL,,,,,,, < < > } =.,

Fig. 3 3 Types considered when detecting pattern violations 9)12) 8)9) 2 5 methodx close C Java C Java 3 Java 1 JDT Core 7) ) S P S

IPSJ SIG Technical Report Vol.2013-ICS-172 No /11/12 1,a), 1,b) Anomaly Detection 1. 1 Nagoya Institute of Technology 1 Presently with Nagoya In

,4) 1 P% P%P=2.5 5%!%! (1) = (2) l l Figure 1 A compilation flow of the proposing sampling based architecture simulation

Lyra X Y X Y ivis Designer Lyra ivisdesigner Lyra ivisdesigner 2 ( 1 ) ( 2 ) ( 3 ) ( 4 ) ( 5 ) (1) (2) (3) (4) (5) Iv Studio [8] 3 (5) (4) (1) (

paper

main.dvi

1 Web DTN DTN 2. 2 DTN DTN Epidemic [5] Spray and Wait [6] DTN Android Twitter [7] 2 2 DTN 10km 50m % %Epidemic 99% 13.4% 10km DTN [8] 2

データグラフ ( 外部記憶 ) 主記憶 S 1 S 2 読み込んだ部分グラフ 部分解を格納 解と成り得る 部分解 集合 A さん C さん c 近況 Bさん u 動画 u s D さん E さん s c 写真 u c s (b) 出力 完全解集合 (a) Facebook 3: データグラフ ( 外

8 P2P P2P (Peer-to-Peer) P2P P2P As Internet access line bandwidth has increased, peer-to-peer applications have been increasing and have great impact

IPSJ SIG Technical Report Vol.2014-MBL-70 No.20 Vol.2014-UBI-41 No /3/14 1,a) Yuko Hirabe 1,a) Mai Tsuda 1 Yutaka Arakawa 1 Keiichi Yasum

情報の構造とデータ処理

Leveraging Cloud Computing to launch Python apps

知能と情報, Vol.29, No.6, pp

PC Development of Distributed PC Grid System,,,, Junji Umemoto, Hiroyuki Ebara, Katsumi Onishi, Hiroaki Morikawa, and Bunryu U PC WAN PC PC WAN PC 1 P

Hadoop Introduction

Introduction

main.dvi

Twitter Twitter [5] ANPI NLP 5 [6] Lee [7] Lee [8] Twitter Flickr FreeWiFi FreeWiFi Flickr FreeWiFi 2. 2 Mikolov [9] [10] word2vec word2vec word2vec k

2

FINAL PROGRAM 25th Annual Workshop SWoPP / / 2012 Tottori Summer United Workshops on Parallel, Distributed, and Cooperative Processing 2012

Gnutella Peer-to-Peer(P2P) P2P Linux P2P

IPSJ SIG Technical Report Vol.2010-GN-74 No /1/ , 3 Disaster Training Supporting System Based on Electronic Triage HIROAKI KOJIMA, 1 KU

DEIM Forum 2017 H ,

ICDE2013study.ppt

[1] Excel Excel... [3]. CSV RDF. [4] LinkedData. [5] LinkedData 1 RDF. OLAP. OLAP. [6] RDBMS. Excel CSV. CSV JSON RDF. Excel RDF. RDF RDF..

([ ],), : [Name], name1 name2 name10 4, 2 SuperSQL, ([ ]!), name1 name2 : [Name]! name SuperSQL,,,,,,, < < > } =,

fiš„v8.dvi

: Name, Tel name tel (! ) name : Name! Tel tel ( % ) 3. HTML. : Name % Tel name tel 2. 2,., [ ]!, [ ]!, [ ]!,. [! [, ]! ]!,,. ( [ ], ),. : [Name], nam

Wikipedia YahooQA MAD 4)5) MAD Web 6) 3. YAMAHA 7) 8) Vocaloid PV YouTube 1 minato minato ussy 3D MAD F EDis ussy

3 2 2 (1) (2) (3) (4) 4 4 AdaBoost 2. [11] Onishi&Yoda [8] Iwashita&Stoica [5] 4 [3] 3. 3 (1) (2) (3)

PowerPoint Presentation


Hadoopの全て

,, WIX. 3. Web Index 3. 1 WIX WIX XML URL, 1., keyword, URL target., WIX, header,, WIX. 1 entry keyword 1 target 1 keyword target., entry, 1 1. WIX [2

MATLAB® における並列・分散コンピューティング ~ Parallel Computing Toolbox™ & MATLAB Distributed Computing Server™ ~

Computational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate catego

Vol.57 No (Mar. 2016) 1,a) , L3 CG VDI VDI A Migration to a Cloud-based Information Infrastructure to Support

IPSJ SIG Technical Report Vol.2009-HCI-134 No /7/17 1. RDB Wiki Wiki RDB SQL Wiki Wiki RDB Wiki RDB Wiki A Wiki System Enhanced by Visibl

HASC2012corpus HASC Challenge 2010,2011 HASC2011corpus( 116, 4898), HASC2012corpus( 136, 7668) HASC2012corpus HASC2012corpus

Web STEPS Web Web Form Cookie HTTP STEPS Web

WISS 2018 [2 4] [5,6] Query-by-Dancing Query-by- Dancing Cao [1] OpenPose 2 Ghias [7] Query by humming Chen [8] Query by rhythm Jang [9] Query-by-tapp

IPSJ SIG Technical Report Vol.2017-MUS-116 No /8/24 MachineDancing: 1,a) 1,b) 3 MachineDancing MachineDancing MachineDancing 1 MachineDan

B HNS 7)8) HNS ( ( ) 7)8) (SOA) HNS HNS 4) HNS ( ) ( ) 1 TV power, channel, volume power true( ON) false( OFF) boolean channel volume int

Microsoft PowerPoint - shudo-NoSQL-data-model ppt

2 2.1 SNS web Facebook Google+ SNS web SNS web HITS ANT(Auction Network Trust) web [4] SNS WEB PageRank HITS HITS web authorities, hubs Pagerank web S

日立評論2007年3月号 : ソフトウェア開発への

Agenda Hadoop Sahara Kilo Q&A Copyright 2015 Mirantis, Inc. All rights reserved Page 2

Core1 FabScalar VerilogHDL Cache Cache FabScalar 1 CoreConnect[2] Wishbone[3] AMBA[4] AMBA 1 AMBA ARM L2 AMBA2.0 AMBA2.0 FabScalar AHB APB AHB AMBA2.0

yamamoto_hadoop.pptx

Gray [6] cross tabulation CUBE, ROLL UP Johnson [7] pivoting SQL 3. SuperSQL SuperSQL SuperSQL SQL [1] [2] SQL SELECT GENERATE <media> <TFE> GENER- AT

3 4 SAP HANA 5 6 SAP HANA Xeon E7 v3 SAP HANA 6 8 OLTP OLAP 1 9 SAP S/4HANA SAP HANA Studio 13 14

325 In this research, we created smartphone cases attaching a dimple or a wedge shaped object in order to improve eyes-free and single-handed touch ac

IPSJ SIG Technical Report Vol.2018-SE-200 No /12/ Proposal of test description support environment for request acquisition in web appli

IPSJ SIG Technical Report Vol.2014-HCI-158 No /5/22 1,a) 2 2 3,b) Development of visualization technique expressing rainfall changing conditions

untitled

「ウイルスセキュリティZERO」ユーザーズガイド

"-./0%. "-%!"#$#% $%&'(%)*+,%.!"#+$,$% &'()*% $%&'-(.(/%+,% $%&'0%12*+,'% 1 RMX.. grade gradetype= integer grade[

JAXA-RR ICT ICT (Virtual Observatory = VO) JVO (Japanese Virtual Observatory) 1,2,3,4) 1 VO 1 Google Sky API (JVOSky) 1 VO Hadoop

Title 中國宗教文獻研究國際シンポジウム報告書 ( 大規模佛教文獻群に對する確率統計的分析の試み / 師茂樹 ) Author(s) Citation (2004) Issue Date URL Right Typ

Joint Content Development Proposal Tech Docs and Curriculum

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2013-HPC-139 No /5/29 Gfarm/Pwrake NICT NICT 10TB 100TB CPU I/O HPC I/O NICT Gf

Duplicate Near Duplicate Intact Partial Copy Original Image Near Partial Copy Near Partial Copy with a background (a) (b) 2 1 [6] SIFT SIFT SIF

1. 1 DBMS Unix (USP ) ( )[3] 20 UNIX [2] KISS UNIX 1. 2 (Tukubai ) Unix OS Unix USP Tukubai Tukubai 1. 3 Unix SQL Tukubai usp Tukubai Open usp Tukubai

DRAM L2 L2 DRAM L2 DRAM L2 RAM DRAM 3 DRAM 3. 1 DRAM SRAM/DRAM 2. SRAM/DRAM DRAM LLC Last Level Cache 2 2) DRAM 1(A) (B) LLC L2 DRAM DRAM L2 SRAM DRAM

GUI(Graphical User Interface) GUI CLI(Command Line Interface) GUI

Transcription:

DEIM Forum 2014 D1-3 MapReduce 180 8585 3 9 11 E-mail: {ozawa.tsuyoshi,oikawa.kazuki,onizuka.makoto,honjo.toshimori}@lab.ntt.co.jp MapReduce 1 Google, Facebook, Yahoo! MapReduce MapReduce MapReduce MapReduce 1. MapReduce, Hadoop, MapReduce 1 Google, Facebook, Yahoo! [2], [6], [16], [18] MapReduce Map Map Reduce [6] MapReduce ETL RDBMS ( [4], [13]) MapReduce 2 1 Mapper 1 Reducer Mapper Map MapTask Reducer Reduce ReduceTask MapReduce Reducer 1 WordCount MapTask 1 Reducer Reducer 2 1 Map- Task Reducer IO MapReduce 1 Hadoop [1] gzip bzip2 LZO LZ4 Snappy 2 MapReduce Reducer [12] : 1 Map inmapper combining 2 stripes 3 Map memory-backed join Map MapTask Map MapReduce [12] 1 Hadoop [17] 2 MapReduce MapReduce

MapTask 678 9:;<<=>?>@;A> Map 関数 バッファ管理モジュール リングバッファ 45#$% *+,-./ 0123!"#$% &'() 外部記憶装置 1 MapReduce IFIle IFIle MapReduce 25% MapReduce 2 MapReduce OSS Hadoop 3 4 5 6 2. 3 2 Hadoop IFile Key1の長さ (1-5バイト) (Key1, Value1) Value1の長さ (1-5バイト) (Key2, Value2) (Key2, Value3) Key1 のバイト列 (Key3, Value4) Value1 のバイト列 Hadoop Shuffle (IFile) 2. 1 MapReduce 1 MapReduce MapReduce Map Reduce 2 Map D Key/Value (K 1, V 1 ) Map Key/Value (K 2, V 2 ) Reduce Key Shuffle Reduce Key/Value (K 3, V 3 ) D map(k 1, V 1 ) {K 2, V 2 } shuffle({k 2, V 2 }) {K 2, {V 2 }} reduce(k 2, {V 2 }) (K 3, V 3 ) MapReduce DFS MapReduce InputSplit InputSplit MapTask MapTask Mapper Map Key/Value Key shuffle ReduceTask Key ReduceTask Reducer Reduce Key/Value DFS 2. 2 Hadoop Shuffle MapReduce MapTask MapReduce Hadoop 2

Hadoop Map Key Value Key Value Key Value IFile IFile 3 IFile Key Key Value 1 Key Value Key Value MapReduce IFile Writer/Reader Key Value MapReduce [17] 2. 3 MapReduce Task MapTask ReduceTask 2 MapTask MapTask ReduceTask MapTask ReduceTask MapTask MapTask/ReduceTask Hadoop Shuffle MapTask/ReduceTask MapTask [11] Hadoop MapTask 3. 3. 1 OLAP C-Store MonetDB [14], [17] MapTask Map 関数 型 1 用のバッファ (Key, Value) Keyの一部 バッファ管理モジュール 列指向バッファ Valueの一部 Valueの一部 型 2 用のバッファ Keyの一部 型 3 用のバッファ 型 4 用のバッファ 外部記憶装置 CIFile CIFile 4 numtypes:1バイト ( 型の個数 ) CIFile Type1:1バイト ( 含まれている型 1) ヘッダ型 1 用のバッファ Type2:1バイト ( 含まれている型 2) Type3:1バイト 型 2 用のバッファ型 3 用のバッファ ( 含まれている型 3) lentype1: 4バイト ( 型 1のバッファ長 ) lentype2: 4バイト ( 型 2のバッファ長 ) lentype2: 4バイト ( 型 3のバッファ長 ) 5 (CIFile) [22] 4 MapTask CIFile(Columnar IFile) 3. 2 CIFile CIFile 5 CIFile CIFile 1 numtypes 1

numtypes 4 numtypes CIFile Key Value 4. CIFile IFile Key Value MapTask MapTask [12] 2 Key Value MapTask TextInputFromat Key Value Raw An apple is red An apple apple is is red 1 An apple is, apple is red Mapper Key Value 1 Reducer Reducer MapReduce Hadoop IFile.Writer CIFile DataOutput Amazon EC2 m2.4xlarge 4. 1 IFile CIFile bzip2 Snappy bzip2 Hadoop Bzip2Codec Snappy snappy-java [15] Brisk [5] SnappyCodec PUMA Benchmark Suite [7] Wikipedia 50GB (file87) 8355840 28 1 IFile CIFile IFile Key Value CIFile CIFile Snappy 25% 4. 2 CIFile CIFile PUMA Benchmark Suite Wikipedia 50GB (file87) 2 CIFile MapTask Raw p4delta [23] 4. 3 CIFile Map- Task IFile CIFile Snappy PUMA Benchmark Suite Wikipedia 50GB 8K IO IFile CIFile 3 CIFile IFile 21% IO 33% IO MapReduce 5. Spark [21] MapReduce (DAG) DSL Spark RDD [20] DSL Shark [19] Spark SQL DB Spark Shark RDD Spark/Shark CIF [8] [10] IO IO MapReduce CIF

1 IFile CIFile ( ) bzip2 ( ) Snappy ( ) IFile Raw 9206558 1361111 3554898 CIFile Raw 9025188(2% ) 1214726(11% ) 3283301(8% ) IFile 18806076 1819063 6095471 CIFile 17923310(5% ) 1633830(11% ) 4591160(25% ) 2 CIFile ( ) bzip2 ( ) Snappy ( ) Raw 99917 32749((67% ) 54374(45% ) Raw 8274496 915891(88% ) 2850701(65% ) 442138 173477(60% ) 296318(32% ) 13957109 1448904(89% ) 4128572(70% ) 3 IFile CIFile ( ) Snappy ( ) IFile 4307 36799837380 CIFile 2880(33% ) 29365516361(21% ) co-location RCFile [9] ORCFile [3] MapReduce MapReduce MapReduce blob 2 1:1 CIF RCFile ORCFile CIFile CIF RCFile ORCFile CIF RCFile ORCFile Shuffle CIFile MapReduce 6. MapReduce Shuffle CIFile Hadoop MapReduce Hadoop Hive [1] : Apache Hadoop, http://hadoop.apache.org/. [2] : Apache Hadoop Wiki, http://wiki.apache.org/hadoop/ PoweredBy. [3] : Create a new Optimized Row Columnar file format for Hive, https://issues.apache.org/jira/browse/ HIVE-3874 (2013). [4] : Treasure Data s Plazma: Columnar Cloud Storage http://blog.treasure-data.com/post/53534943282/ treasure-datas-plazma-columnar-cloud-storage (2013). [5] DataStax: Brisk, https://github.com/riptano/brisk. [6] Dean, J. and Ghemawat, S.: MapReduce: simplified data processing on large clusters, Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6, OSDI 04, Berkeley, CA, USA, USENIX Association, pp. 10 10 (2004). [7] Faraz Ahmad, Seyong Lee, M. T.: PUMA: Purdue MapReduce Benchmarks Suite. [8] Floratou, A., Patel, J. M., Shekita, E. J. and Tata, S.: Column-oriented Storage Techniques for MapReduce, Proc. VLDB Endow., Vol. 4, No. 7, pp. 419 429 (2011). [9] He, Y., Lee, R., Huai, Y., Shao, Z., Jain, N., Zhang, X. and Xu, Z.: RCFile: A Fast and Space-efficient Data Placement Structure in MapReduce-based Warehouse Systems, Proceedings of the 2011 IEEE 27th International Conference on Data Engineering, ICDE 11, Washington, DC, USA, IEEE Computer Society, pp. 1199 1208 (2011). [10] Kaldewey, T., Shekita, E. J. and Tata, S.: Clydesdale: Structured Data Processing on MapReduce, Proceedings of the 15th International Conference on Extending Database Technology, EDBT 12, New York, NY, USA, ACM, pp. 15 25 (2012). [11] Li, B., Mazur, E., Diao, Y., McGregor, A. and Shenoy, P.: A platform for scalable one-pass analytics using MapReduce, SIGMOD 11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of data, New York, NY, USA, ACM, pp. 985 996 (2011). [12] Lin, J. and Dyer, C.: Data-Intensive Text Processing with MapReduce, Morgan and Claypool Publishers (2010). [13] Ohta, K.: Hadoop meets Cloud with Multi-Tenancy, http: //www.slideshare.net/treasure-data/hadoop-meets-cloud-with-multitena (2013). [14] Peter Boncz, Marcin Zukowski, N. N.: MonetDB/X100: Hyper-Pipelining Query Execution, Conference on Innovative Data Systems Research 2005 (2005). [15] Saito, T. L.: snappy-java, https://code.google.com/p/

snappy-java/. [16] Silberstein, A. E., Sears, R., Zhou, W. and Cooper, B. F.: A batch of PNUTS: experiences connecting cloud batch and serving systems, Proceedings of the 2011 ACM SIG- MOD International Conference on Management of data, SIGMOD 11, New York, NY, USA, ACM, pp. 1101 1112 (2011). [17] Stonebraker, M., Abadi, D. J., Batkin, A., Chen, X., Cherniack, M., Ferreira, M., Lau, E., Lin, A., Madden, S., O Neil, E., O Neil, P., Rasin, A., Tran, N. and Zdonik, S.: C-store: A Column-oriented DBMS, Proceedings of the 31st International Conference on Very Large Data Bases, VLDB 05, VLDB Endowment, pp. 553 564 (2005). [18] Thusoo, A., Shao, Z., Anthony, S., Borthakur, D., Jain, N., Sen Sarma, J., Murthy, R. and Liu, H.: Data warehousing and analytics infrastructure at facebook, Proceedings of the 2010 ACM SIGMOD International Conference on Management of data, SIGMOD 10, New York, NY, USA, ACM, pp. 1013 1020 (2010). [19] Xin, R. S., Rosen, J., Zaharia, M., Franklin, M. J., Shenker, S. and Stoica, I.: Shark: SQL and Rich Analytics at Scale, Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, SIGMOD 13, New York, NY, USA, ACM, pp. 13 24 (2013). [20] Zaharia, M., Chowdhury, M., Das, T., Dave, A., Ma, J., McCauley, M., Franklin, M. J., Shenker, S. and Stoica, I.: Resilient Distributed Datasets: A Fault-tolerant Abstraction for In-memory Cluster Computing, Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation, NSDI 12, Berkeley, CA, USA, USENIX Association, pp. 2 2 (2012). [21] Zaharia, M., Chowdhury, M., Franklin, M. J., Shenker, S. and Stoica, I.: Spark: Cluster Computing with Working Sets, Proceedings of the 2Nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud 10, Berkeley, CA, USA, USENIX Association, pp. 10 10 (2010). [22] Zukowski, M.: Balancing Vectorized Query Execution with Bandwidth Optimized Storage, PhD thesis, Universiteit van Amsterdam (2009). [23] Zukowski, M., Heman, S., Nes, N. and Boncz, P.: Super- Scalar RAM-CPU Cache Compression, Proceedings of the 22Nd International Conference on Data Engineering, ICDE 06, Washington, DC, USA, IEEE Computer Society, pp. 59 (2006).