AWSSummitTokyo2018

Similar documents
PowerPoint Presentation

Dockerの商用サービスでの利用事例紹介

Joint Content Development Proposal Tech Docs and Curriculum

VMware View Persona Management

Hortonworks Kitase

DEIM Forum 2019 H2-2 SuperSQL SuperSQL SQL SuperSQL Web SuperSQL DBMS Pi

Agenda Hadoop Sahara Kilo Q&A Copyright 2015 Mirantis, Inc. All rights reserved Page 2

AWS における ベストパートナーを見つける 7 つの方法 相澤恵奏アマゾンウェブサービスジャパンアライアンス技術本部テクニカルイネーブルメント部部長パートナーソリューションアーキテクト #AWSInnovate 2019, Amazon Web Services, Inc. or its affi

サンプル株式会社 御中 システム導入のご提案

test

Startup_on_AWS_usecases_StartupDay

1

Part 1 IT CPU IT IT 1998 Windows NT Server 4.0, Terminal Server Edition 1 Windows Based Terminal WBT Windows CE 1 100Mbps 1Gbps LAN OS 1 PC 1 OS 2

30

MATLAB® における並列・分散コンピューティング ~ Parallel Computing Toolbox™ & MATLAB Distributed Computing Server™ ~

スライド 1

Hadoop Introduction


DEIM Forum 2012 C2-6 Hadoop Web Hadoop Distributed File System Hadoop I/O I/O Hadoo

Elastic stack Jun Ohtani 1

FreakOut における AWS 上での機械学習活用事例 株式会社フリークアウト 西口 次郎 CTO 小浜 翔太郎 Software Engineer FreakOut

2 JSON., 2. JSON,, JSON Jaql [9] Spark Streaming [8], Spark [7].,, 2, 3 4, JSON [3], Jaql [9], Spark [7] Spark Streaming [8] JSON JSON [

untitled

yamamoto_hadoop.pptx

16soukatsu_p1_40.ai

Microsoft Azure Azure

PowerPoint Presentation

Cloud connect the world as a Glue

Web Web Web Web i

データセンターの効率的な資源活用のためのデータ収集・照会システムの設計

最新アップデート AWS IoT Solution 〜 事例とサービスアップデート 〜

Cisco® ASA シリーズルーター向けDigiCert® 統合ガイド


Microsoft Azure Azure Microsoft Web Azure Microsoft Azure Azure IT Web (IoT) OS Docker Linux JavaScript Python.NET PHP Java Node.js Ruby ios Android W

s

ビッグデータアナリティクス - 第3回: 分散処理とApache Spark

グリーの様々なサービスを支えるクラウド運用およびデータ分析基盤

PowerPoint Presentation

How to Automate Using PowerShell-JP

FileMaker Cloud App FileMaker Pro FileMaker Go FileMaker WebDirect App FileMaker Cloud Amazon Web Services (AWS) Marketplace AWS FileMaker Server File

Microsoft Word - SYNCNELサーバ操作マニュアル(管理者編)_v3.0.0_ docx

Leveraging Cloud Computing to launch Python apps


1 2 3 ( ) ( ) SNS SNS Facebook %[g]( %[ ]) [ ] IT LNS (Life Networking Service) LNS LNS LNS SNS SNS 3. LNS (Life Networking S

1 Microsoft Windows Server 2012 Windows Server Windows Azure Hyper-V Windows Server 2012 Datacenter/Standard Hyper-V Windows Server Windo

2. Twitter Twitter 2.1 Twitter Twitter( ) Twitter Twitter ( 1 ) RT ReTweet RT ReTweet RT ( 2 ) URL Twitter Twitter 140 URL URL URL 140 URL URL

PDF


1. 52

扉 序文 目次DVD用 .indd

すぐできる冬の省エネ・節電ガイド

”Žfi¶‰s‚ÒŒh”~”ŒŠá‘WŁ\”ƒ



Microsoft Word J.^...O.|Word.i10...j.doc

2

,, create table drop table alter table


PowerPoint プレゼンテーション

Introduction Purpose This course explains how to use Mapview, a utility program for the Highperformance Embedded Workshop (HEW) development environmen

和文タイトル

ファイル アップロード

PowerPoint Presentation

Microsoft PowerPoint - shudo-NoSQL-data-model ppt

_02_3.ppt

PDF_J

fiš„v3.dvi

AWS Client VPN - ユーザーガイド

PowerPoint プレゼンテーション

X Window System X X &

KWCR3.0 instration

QConTokyo2013_DocDatabase_agile_atWare

1 SQL Server SQL Oracle SQL SQL* Plus PL/SQL 2 SQL Server SQL Server SQL Oracle SQL SQL*Plus SQL Server GUI 1-1 osql 1-1 Transact- SQL SELECTFROM 058

IIJ Technical WEEK Cloudbusting Machine(CBM)

Mobilelron® Virtual Smartphone Platform 向けDigiCert® 統合ガイド

Abstract Journal of Agricultural Science 2

SORACOM Beam-Funnel-Endorse

Oracle XML DB によるスケーラビリティおよびパフォーマンス検証 - MML v.3.0

外部SQLソース入門

Introduction Purpose This training course demonstrates the use of the High-performance Embedded Workshop (HEW), a key tool for developing software for

untitled

南山会報88入稿.indd

"CAS を利用した Single Sign On 環境の構築"

AWS 認定 DevOps エンジニア - プロフェッショナルサンプル試験問題 1) あなたは Amazon EBS ボリュームを使用する Amazon EC2 上で実行されているアプリケーションサーバ ー向けに 自動データバックアップソリューションを導入する業務を担当しています 単一障害点を回避し

AWS およびパートナーサービスを使った、データの集約および活用設計パターン

PowerPoint Presentation

FINAL FANTASY XV POCKET EDITION を支える AWS サーバレス技術 LOGO ILLUSTRATION: 2016 YOSHITAKA AMANO 2018 SQUARE ENIX CO., LTD. All Rights Reserved.

7,, i

untitled

凡友83号.indd

凡友86号.indd

WIDE 1


PDF_T

untitled

honbun.indd

CAS Yale Open Source software Authentication Authorization (nu-cas) Backend Database Authentication Authorization Powered by A

H8.6 P

2 BIG-IP 800 LTM v HF2 V LTM L L L IP GUI VLAN.

Transcription:

AWS Gunosy AWS Summit Tokyo 2018/06/01

自己紹介 - 米田 武 / Takeshi Yoneda / マスタケ - Github/Twitter: @mathetake - 2017/03/31: - MSc. in Mathematics at Osaka University - 2017/04/01~ - Machine learning engineer at Gunosy Inc. - Apply mathematics to - Recommendation System - Machine learning - Optimization problems - data engineering

-- AWS

10,000/day+ AWS

Examples: etc

Amazon s3 AWS Lambda Amazon DynamoDB Amazon Kinesis NoSQL DynamoDB Accelerator(DAX) Amazon EMR DynamoDB Hadoop

1.

Local Popularity() / => => #1.

Local Popularity() / => => #1.

* * #1.

Articles Title: Content: Title: Content: Title: Content: #1.

Articles Title: Content: Title: Content: Title: Content: #1.

Articles Title: Content: Title: Content: Title: Content: #1.

#1.

M #1.

lpush/rtrim #1.

u1 u2 u3 u6 u5 #1.

u1 u2 u3 u6 u5 #1.

u1 u2 u3 u5 u6 #1.

u1 u2 u3 u5 u6 #1.

#1.

Trigger Update Trigger Put Click logs stream [kinesis stream] Click logger [lambda] Click logs table [DynamoDB] UserVectorizer [lambda] UserVetor table [DynamoDB] BatchGet Trigger Put Crawler [EC2] ArticleVectorizer [EC2] ArticleVector table [DynamoDB] [DAX] : Gunosy, Gunosy -1- http://tech.gunosy.io/entry/realtime-vectorization-with-dynamodb #1.

Trigger Update Trigger Put Click logs stream [kinesis stream] Click logger [lambda] Click logs table [DynamoDB] UserVectorizer [lambda] UserVetor table [DynamoDB] BatchGet Trigger Put Crawler [EC2] ArticleVectorizer [EC2] ArticleVector table [DynamoDB] [DAX] : Gunosy, Gunosy -1- http://tech.gunosy.io/entry/realtime-vectorization-with-dynamodb #1.

Trigger Update Trigger Put Click logs stream [kinesis stream] Click logger [lambda] Click logs table [DynamoDB] UserVectorizer [lambda] UserVetor table [DynamoDB] BatchGet Trigger Put Crawler [EC2] ArticleVectorizer [EC2] ArticleVector table [DynamoDB] [DAX] : Gunosy, Gunosy -1- http://tech.gunosy.io/entry/realtime-vectorization-with-dynamodb #1.

Client app Log stream[kinesis stream] fluentd Server[EC2] fluentd Server Click logs stream[kinesis stream] filteramazon kinesis stream Trigger Update Log Stream [kinesis stream] Fluentd Server [EC2] Click logs stream [kinesis stream] Click logger [lambda] Click logs table [DynamoDB] #1.

Click logs stream Click logger[aws lambda] Kinesis stream Click logger Click logs table[dynamodb] updatem Trigger Update Log Stream [kinesis stream] Fluentd Server [EC2] Click logs stream [kinesis stream] Click logger [lambda] Click logs table [DynamoDB] #1.

Click logs stream Click logger[aws lambda] Kinesis stream Click logger Click logs table[dynamodb] updatem remove (ltrim) Log 1 Log 2 Log 3 Log M-1 Log M list append (rpush) #1.

Trigger Update Trigger Put Click logs stream [kinesis stream] Click logger [lambda] Click logs table [DynamoDB] UserVectorizer [lambda] UserVetor table [DynamoDB] BatchGet Trigger Put Crawler [EC2] ArticleVectorizer [EC2] ArticleVector table [DynamoDB] [DAX] : Gunosy, Gunosy -1- http://tech.gunosy.io/entry/realtime-vectorization-with-dynamodb #1.

Click logs table -> UserVectorizer[AWS lambda] DynamoDBStream PUT Trigger Put Click logs table [DynamoDB] UserVectorizer [lambda] UserVetor table [DynamoDB] BatchGet ArticleVector table [DynamoDB] [DAX] #1.

Click logs table -> UserVectorizer[AWS lambda] DynamoDBStream PUT Lambda Trigger Put Click logs table [DynamoDB] UserVectorizer [lambda] UserVetor table [DynamoDB] BatchGet ArticleVector table [DynamoDB] [DAX] #1.

UserVectorizer (DAX) > ArticleVector table[dynamodb] DynamoDBGET DAX UserVectorizer Trigger Put Click logs table [DynamoDB] UserVectorizer [lambda] UserVetor table [DynamoDB] BatchGet ArticleVector table [DynamoDB] [DAX] #1.

DAX(Amazon DynamoDB Accelerator) DynamoDB() ReadHeavy SDKDAXTCP Lambda DAX DynamoDB Trigger Put Click logs table [DynamoDB] UserVectorizer [lambda] UserVetor table [DynamoDB] BatchGet ArticleVector table [DynamoDB] [DAX] #1.

Trigger Update Trigger Put Click logs stream [kinesis stream] Click logger [lambda] Click logs table [DynamoDB] UserVectorizer [lambda] UserVetor table [DynamoDB] BatchGet Trigger Put Crawler [EC2] ArticleVectorizer [EC2] ArticleVector table [DynamoDB] [DAX] Crawler ArticleVectorizer[EC2]ArticleVector tableput #1.

Trigger Update Trigger Put Click logs stream [kinesis stream] Click logger [lambda] Click logs table [DynamoDB] UserVectorizer [lambda] UserVetor table [DynamoDB] BatchGet Trigger Put Crawler [EC2] ArticleVectorizer [EC2] ArticleVector table [DynamoDB] [DAX] () #1.

2.

Kinesis / DynamoDB / DAX / lambda / #2.

= #2.

= #2.

= #2.

(, etc.) t a - - b #2.

t a - - b #2.

t a - - b #2.

t ab a - - b #2.

t ba a - - b #2.

t ba a - - b #2.

50msec or die. https://tenshoku.mynavi.jp/it-engineer/knowhow/naoya_sushi/05

in 50 msec Background UserVector table GET user vector 0 50 (msec) #2.

in 50 msec Background UserVector table GET user vector 0 50 (msec) DynamoDB #2.

in 50 msec Background UserVector table GET user vector 0 50 (msec) DynamoDB #2.

in 50 msec Background UserVector table GET user vector 0 50 (msec) DynamoDB #2.

in 50 msec Background UserVector table GET user vector 0 50 (msec) #2.

in 50 msec Background UserVector table GET user vector 0 50 (msec) #2.

in 50 msec Background UserVector table GET user vector 0 50 (msec) * 25msec / request *Application Load Balancer `TargetResponseTime` #2.

in 50 msec Background UserVector table GET user vector 0 50 (msec) #2.

API DataLake API[EC2] Hive Metastore ~MB digdag #2.

API DataLake API[EC2] Hive Metastore ~MB digdag #2.

API s3(mb) DataLake API[EC2] Hive Metastore ~MB digdag #2.

API DataLake API[EC2] Hive Metastore ~MB digdag #2.

API s3 DataLake API[EC2] Hive Metastore ~MB digdag #2.

API s3 DataLake API[EC2] Hive Metastore ~MB digdag #2.

API DataLake API[EC2] Hive Metastore ~MB digdag #2.

API EMR(Elastic MapReduce) DataLake API[EC2] Hive Metastore ~MB digdag #2.

DataLake Hive Metastore ~MB digdag RDS digdag airflow #2.

RemoteHive Metastore DataLake Hive Metastore ~MB digdag RDS digdag airflow #2.

RemoteHive Metastore DataLake Hive Metastore ~MB digdag RDS digdag airflow #2.

(Remote) Hive Metastore HDFS* (Spark/Hive/Presto)Metastore Amazon RDS for MySQL CREATE EXTERNAL TABLE `users`( `id` bigint, `enabled` boolean, `admin_enabled` boolean, `created_at` string, `updated_at` string ) STORED AS PARQUET LOCATION 's3://hogefuga-log/hive/user.db/users' TBLPROPERTIES ( parquet.compress'='snappy'); *HDFS = Hadoop Distributed File System. s3file SystemHadoop S3 filesystem s3hdfs #2.

Spark on EMRMetaStore SparkQL DataLake Hive Metastore ~MB digdag RDS digdag airflow #2.

Hive MetastorePresto DataLake Hive Metastore ~MB digdag RDS digdag airflow #2.

Airflow DataLake Hive Metastore ~MB digdag RDS digdag airflow #2.

/s3 Metastore DataLake Hive Metastore ~MB digdag RDS digdag airflow #2.

S3 + Hive Metastore = Awesome! HQL > Drop Table SELECT column1, column2 FROM hive.db.hoge WHERE dt = 20180601 S3 < write(=s3put) EMR Hive MetastoreEMR,,, #2.

Lambda, Kinesis stream, DynamoDB, DAX DAX API DynamoDB & s3 S3 + Hive Metastore & S3&

We are hiring! 募集職種例 データ分析エンジニア サービスのKPI等の統計情報の設計 / 収集 / 分析 機械学習 自然言語処理エンジニア 上記技術を含め数理モデルを駆使したアルゴリズムの開発 https://gunosy.co.jp/recruit/requirements/engineer/ アプリ開発エンジニア ios / Android アプリの開発 サーバーサイドエンジニア 各プロダクトのサーバーサイド開発 Gunosy 採用