IPSJ SIG Technical Report Vol.2016-GI-35 No /3/9 StarCraft AI Deep Q-Network StarCraft: BroodWar Blizzard Entertainment AI Competition AI Convo
|
|
- ともなり さどひら
- 4 years ago
- Views:
Transcription
1 StarCraft AI Deep Q-Network StarCraft: BroodWar Blizzard Entertainment AI Competition AI Convolutional Neural Network(CNN) Q Deep Q-Network(DQN) CNN DQN,,, 1. StarCraft: Brood War *1 Blizzard Entertainment ( RTS ) RTS 2010 StarCraft AI Competition * AI BWAPI *3 API StarCraft () () TerranZargProtoss 3 1 HP HP 0 HP 0 StarCraft *1 *2 *3 AI AI Convolutional Neural Network(CNN) Q Deep Q-Network(DQN) AI StarCraft DQN Starcraft 3 Starcraft
2 2. StarCraft StarCraft () () TerranZargProtoss 3 1 Terran SCV Zarg Drone Zarg Terran Creep Creep Protoss Zarg HP HP Pylon HP HP 0 HP Terran Medic Heal Terran Vessel Defensive Matrix 1 HP250 1 StarCraft StarCraft AI StarCraft [1] StarCraft AI 1 AI AI 2
3 α w(z) = Ue iα z w(z) = Q log z(q > 0) w(z) = iγ log z AI [2] 3 AI HP 2 HP r m r t+1 = enemy health it enemy health it+1 i=1 (agent health t agent healt t+1 ) (1) Terran Vulture Marine 6 Vulture 1000 AI 100% RTS AI Kiting 3.4 UCB UCT(UCB applied to Tree) [4] UCT UCB(Upper Confidence Bound) i UCB 3.3 Kiting [3] Kiting one-step Q-learning Watkins s Q(λ)one-step Sarsa Sarsa(λ) UCB(i) = Q i + C ln N N i (2) Q i i C N i N i i C UCB 3
4 i j q (j) i q (j) i = ω 1 HP + ω 2 DM + ω 3 CP + ω 4 EG (3) HP DMCP EG ω n Q i q i 3.5 Deep Q-Network Deep Q-Network Atari 2600 [5] Deep Q-Network Q Q(s, a) CNN Q Experience Replay Replay Memory CNN 2 2 Atari x84 epsilon-greedy Deep Q-Network ( 1 ) Replay-Memory D N ( 2 ) ( 3 ) ( a ) s 1 = {x 1 } ϕ 1 = ϕ(s 1 ) ( b ) t = 1 T ( s T ) ( i ) ϵ a a t ( ii ) Q (ϕ(s t ), a; θ) a t ( iii ) a t r t x t+1 ( iv )s t, a t, x t+1 ϕ(t + 1) ( v ) (ϕ t, a t, r t, ϕ t+1 ) D ( vi )D 1 minibatch 4 (ϕ j, a j, r j, ϕ j+1 ) ( vii )minibatch y j Q r j () y j = b() Deep Q-Network 1000 BreakoutPongEnduro Space Invaders Deep Q-Network *4 StarCraft RTS Deep Q-Network HP DQN CNN CNN *4 distributed-deep-reinforcement-learning/ 4
5 地形 情報 CNN ユニット情報 3 DQN Q 学習 DQN 行動 ( 1 ) 32x32 1 8x8 () ( 2 ) CNN ( 3 ) HP Q 9 1 DQN 8 t i reward(i, t) cause damage(i, t) i t unit health(i, t) i t HP unit reward(i, t) =cause damage(i, t) {unit health(i, t) unit health(i, t + 1)} (4) reward(i, t) = 2 unit reward(i, t) unit reward(j, t) (5) 3 j i epsilon-greedy AI 1 DQN 2 1 DQN 4 D enemy (x, y) D enemy (x, y) D enemy (x, y) (D enemy (x, y) ) A* (D enemy (x, y) ) Marine 8 Marine Intel Corei7 6700KPalit NE5XTIX015KB- PG600F (GTX TITAN X 12GB) Windows 10 Starcraft C++ BWAPI 5
6 Python Chainer * 5 MessagePack-RPC * *5 * HP HP AI 10 6
7 StarCraft DQN DQN AI 2 [1] StarCraft AI 10 (2015). [2] Tung, N., Kien, N. and Ruck, T.: Potential flow for unit positioning during combat in StarCraft, IEEE 2nd Global Conference on Consumer Electronics (GCCE 2013), IEEE, pp (2013). [3] Wender, S. and Watson, I.: Applying reinforcement learning to small scale combat in the real-time strategy game starcraft: broodwar, IEEE Conference on Computational Inteligence and Games (CIG 2012),, IEEE, pp (2012). [4] Zhe W., Kien Quang N., Ruck T., Frank R.: MONTE- CARLO PLANNING FOR UNIT CONTROL IN STAR- CRAFT, The 1st IEEE Global Conference on Consumer Electronics 2012, pp (2012). [5] Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D. and Riedmiller, M.: Playing Atari With Deep Reinforcement Learning, NIPS Deep Learning Workshop (2013). 7
2015 3
JAIST Reposi https://dspace.j Title ターン制ストラテジーゲームにおける候補手の抽象化 によるゲーム木探索の効率化 Author(s) 村山, 公志朗 Citation Issue Date 2015-03 Type Thesis or Dissertation Text version author URL http://hdl.handle.net/10119/12652
More information1 StarCraft esportsleague WallPlayed.org 200 StarCraft Benzene StarCraft 3 Terran Zerg Protoss Terran Terran Terran 3 Terran Zerg Zerg Worker D
StarCraft AI 1,a) 1 1 StarCraft (RTS) AI StarCraft AI 2012 2014 AI AI 12 StarCraft AI AI StarCraftBWAPI Introducing ranks of troops to StarCraft AI Abstract: StarCraft is a popular series of Real Time
More informationi ii iii iv v vi vii ( ー ー ) ( ) ( ) ( ) ( ) ー ( ) ( ) ー ー ( ) ( ) ( ) ( ) ( ) 13 202 24122783 3622316 (1) (2) (3) (4) 2483 (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) 11 11 2483 13
More informationVol. 52 No (Dec. 2011) Ms. Pac-Man IEEE CIG Ms. Pac-Man Ms. Pac-Man AI AI Ms. Pac-Man Ms. Pac-Man Competition Ms. Pac-Man Monte
Vol. 52 No. 12 3817 3827 (Dec. 2011) Ms. Pac-Man 1 2 2007 IEEE CIG Ms. Pac-Man Ms. Pac-Man AI AI Ms. Pac-Man Ms. Pac-Man Competition Ms. Pac-Man Monte-Carlo Tree Search in Ms. Pac-Man Nozomu Ikehata 1
More informationUCT探索を用いた大貧民クライアント
UCT.. ( ) UCT 1 / 34 1 2 UEC 2012 3 4 UCT UCB1 UCB1-Tuned 5 ( ) UCT 2 / 34 1 http://uguisu.skr.jp/othello/ http://matome.naver.jp/odai/2128989764455845801 ( ) UCT 3 / 34 1 : (1997) : (1997) : (2010) :
More informationTerran( テラン ) Terran のユニットは人間に近い見た目をしている. 施設建設の際に他の種族と異なり場所の制約が無く, 指定した場所に建設出来る. 更に, 建設後に施設を浮遊させて移動することも可能である. また,Terran のユニットや施設はダメージを受けた際に修理が可能である.3
深層学習を用いた StarCraft の敵作戦予測 鎌田徹朗 1 橋本剛 1 概要 :StarCraft は RTS(Real-Time Strategy) ゲームの中でも特に人気のシリーズであり, 多くのプロが存在している. 現在までに, プロより強い AI 開発を目標として様々な研究が行われているが,AI はプロに対して 0 勝 15 敗と惨敗しており, プロのレベルには遠い. 本研究ではより効率的な
More information,.,., ( ).,., A, B. A, B,.,...,., Python Long Short Term Memory(LSTM), Unity., Asynchronous method, Deep Q-Network(DQN), LSTM, TORCS. Asynchronous met
2016 Future University Hakodate 2016 System Information Science Practice Group Report AI Project Name AI love Deep Learning TORCS Deep Learning Group Name TORCS Deep Learning /Project No. 14-B /Project
More information[1] AI [2] Pac-Man Ms. Pac-Man Ms. Pac-Man Pac-Man Ms. Pac-Man IEEE AI Ms. Pac-Man AI [3] AI 2011 UCT[4] [5] 58,990 Ms. Pac-Man AI Ms. Pac-Man 921,360
TD(λ) Ms. Pac-Man AI 1,a) 2 3 3 Ms. Pac-Man AI Ms. Pac-Man UCT (Upper Confidence Bounds applied to Trees) TD(λ) UCT UCT Progressive bias Progressive bias UCT UCT Ms. Pac-Man UCT Progressive bias TD(λ)
More informationDQN Pathak Intrinsic Curiosity Module (ICM) () [2] Pathak VizDoom Super Mario Bros Mnih A3C [3] ICM Burda ICM Atari 2600 [4] Seijen Hybrid Reward Arch
Hybrid Reward Architecture 1,a) 1 AI RPG (Rogue-like games) AI AI A3C ICM ICM Deep Reinforcement Learning of Roguelike Games Using Internal Rewards and Hybrid Reward Architecture Yukio Kano 1,a) Yoshimasa
More information2
1 2 3 4 5 6 7 8 9 10 I II III 11 IV 12 V 13 VI VII 14 VIII. 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 _ 33 _ 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 VII 51 52 53 54 55 56 57 58 59
More informationuntitled
i ii iii iv v 43 43 vi 43 vii T+1 T+2 1 viii 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 a) ( ) b) ( ) 51
More informationi
14 i ii iii iv v vi 14 13 86 13 12 28 14 16 14 15 31 (1) 13 12 28 20 (2) (3) 2 (4) (5) 14 14 50 48 3 11 11 22 14 15 10 14 20 21 20 (1) 14 (2) 14 4 (3) (4) (5) 12 12 (6) 14 15 5 6 7 8 9 10 7
More informationDL_UCT
Deep Learning for Real- Time Atari Game Play Using Offline Monte- Carlo Tree Search Planning Guo, X., Singh, S., Lee, H., Lewis, R. L., & Wang, X. (2014). InAdvances in Neural Information Processing Systems
More information1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 () - 1 - - 2 - - 3 - - 4 - - 5 - 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57
More informationJAIST Reposi https://dspace.j Title ゲームの主目的達成を意図しない人間らしい行動の分 類と模倣 Author(s) 中川, 絢太 Citation Issue Date 2017-03 Type Thesis or Dissertation Text version author URL http://hdl.handle.net/10119/14157 Rights
More informationuntitled
I...1 II...2...2 III...3...3...7 IV...15...15...20 V...23...23...24...25 VI...31...31...32...33...40...47 VII...62...62...67 VIII...70 1 2 3 4 m 3 m 3 m 3 m 3 m 3 m 3 5 6 () 17 18 7 () 17 () 17 8 9 ()
More information入門ガイド
ii iii iv NEC Corporation 1998 v P A R 1 P A R 2 P A R 3 T T T vi P A R T 4 P A R T 5 P A R T 6 P A R T 7 vii 1P A R T 1 2 2 1 3 1 4 1 1 5 2 3 6 4 1 7 1 2 3 8 1 1 2 3 9 1 2 10 1 1 2 11 3 12 1 2 1 3 4 13
More informationMastering the Game of Go without Human Knowledge ( ) AI 3 1 AI 1 rev.1 (2017/11/26) 1 6 2
6 2 6.1........................................... 3 6.2....................... 5 6.2.1........................... 5 6.2.2........................... 9 6.2.3................. 11 6.3.......................
More information<4D6963726F736F667420506F776572506F696E74202D208376838C835B83938365815B835683878393312E707074205B8CDD8AB78382815B83685D>
i i vi ii iii iv v vi vii viii ix 2 3 4 5 6 7 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
More informationSC-85X2取説
I II III IV V VI .................. VII VIII IX X 1-1 1-2 1-3 1-4 ( ) 1-5 1-6 2-1 2-2 3-1 3-2 3-3 8 3-4 3-5 3-6 3-7 ) ) - - 3-8 3-9 4-1 4-2 4-3 4-4 4-5 4-6 5-1 5-2 5-3 5-4 5-5 5-6 5-7 5-8 5-9 5-10 5-11
More information活用ガイド (ソフトウェア編)
(Windows 95 ) ii iii iv NEC Corporation 1999 v P A R T 1 vi P A R T 2 vii P A R T 3 P A R T 4 viii P A R T 5 ix x P A R T 1 2 3 1 1 2 4 1 2 3 4 5 1 1 2 3 4 6 5 6 7 7 1 1 2 8 1 9 1 1 2 3 4 5 6 1 2 3 4
More information「産業上利用することができる発明」の審査の運用指針(案)
1 1.... 2 1.1... 2 2.... 4 2.1... 4 3.... 6 4.... 6 1 1 29 1 29 1 1 1. 2 1 1.1 (1) (2) (3) 1 (4) 2 4 1 2 2 3 4 31 12 5 7 2.2 (5) ( a ) ( b ) 1 3 2 ( c ) (6) 2. 2.1 2.1 (1) 4 ( i ) ( ii ) ( iii ) ( iv)
More informationConvolutional Neural Network A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolution
Convolutional Neural Network 2014 3 A Graduation Thesis of College of Engineering, Chubu University Investigation of feature extraction by Convolutional Neural Network Fukui Hiroshi 1940 1980 [1] 90 3
More information活用ガイド (ソフトウェア編)
(Windows 98 ) ii iii iv v NEC Corporation 1999 vi P A R T 1 P A R T 2 vii P A R T 3 viii P A R T 4 ix P A R T 5 x P A R T 1 2 3 1 1 2 4 1 2 3 4 5 1 1 2 3 4 5 6 6 7 7 1 1 2 8 1 9 1 1 2 3 4 5 6 1 2 3 10
More informationo 2o 3o 3 1. I o 3. 1o 2o 31. I 3o PDF Adobe Reader 4o 2 1o I 2o 3o 4o 5o 6o 7o 2197/ o 1o 1 1o
78 2 78... 2 22201011... 4... 9... 7... 29 1 1214 2 7 1 8 2 2 3 1 2 1o 2o 3o 3 1. I 1124 4o 3. 1o 2o 31. I 3o PDF Adobe Reader 4o 2 1o 72 1. I 2o 3o 4o 5o 6o 7o 2197/6 9. 9 8o 1o 1 1o 2o / 3o 4o 5o 6o
More information262014 3 1 1 6 3 2 198810 2/ 198810 2 1 3 4 http://www.pref.hiroshima.lg.jp/site/monjokan/ 1... 1... 1... 2... 2... 4... 5... 9... 9... 10... 10... 10... 10... 13 2... 13 3... 15... 15... 15... 16 4...
More informationこれわかWord2010_第1部_100710.indd
i 1 1 2 3 6 6 7 8 10 10 11 12 12 12 13 2 15 15 16 17 17 18 19 20 20 21 ii CONTENTS 25 26 26 28 28 29 30 30 31 32 35 35 35 36 37 40 42 44 44 45 46 49 50 50 51 iii 52 52 52 53 55 56 56 57 58 58 60 60 iv
More informationパワポカバー入稿用.indd
i 1 1 2 2 3 3 4 4 4 5 7 8 8 9 9 10 11 13 14 15 16 17 19 ii CONTENTS 2 21 21 22 25 26 32 37 38 39 39 41 41 43 43 43 44 45 46 47 47 49 52 54 56 56 iii 57 59 62 64 64 66 67 68 71 72 72 73 74 74 77 79 81 84
More informationこれでわかるAccess2010
i 1 1 1 2 2 2 3 4 4 5 6 7 7 9 10 11 12 13 14 15 17 ii CONTENTS 2 19 19 20 23 24 25 25 26 29 29 31 31 33 35 36 36 39 39 41 44 45 46 48 iii 50 50 52 54 55 57 57 59 61 63 64 66 66 67 70 70 73 74 74 77 77
More information1... 1... 1... 3 2... 4... 4... 4... 4... 4... 6... 10... 11... 15... 30
1 2420128 1 6 3 2 199103 189/1 1991031891 3 4 5 JISJIS X 0208, 1997 1 http://www.pref.hiroshima.lg.jp/site/monjokan/ 1... 1... 1... 3 2... 4... 4... 4... 4... 4... 6... 10... 11... 15... 30 1 3 5 7 6 7
More information平成18年版 男女共同参画白書
i ii iii iv v vi vii viii ix 3 4 5 6 7 8 9 Column 10 11 12 13 14 15 Column 16 17 18 19 20 21 22 23 24 25 26 Column 27 28 29 30 Column 31 32 33 34 35 36 Column 37 Column 38 39 40 Column 41 42 43 44 45
More information178 5 I 1 ( ) ( ) 10 3 13 3 1 8891 8 3023 6317 ( 10 1914 7152 ) 16 5 1 ( ) 6 13 3 13 3 8575 3896 8 1715 779 6 (1) 2 7 4 ( 2 ) 13 11 26 12 21 14 11 21
I 178 II 180 III ( ) 181 IV 183 V 185 VI 186 178 5 I 1 ( ) ( ) 10 3 13 3 1 8891 8 3023 6317 ( 10 1914 7152 ) 16 5 1 ( ) 6 13 3 13 3 8575 3896 8 1715 779 6 (1) 2 7 4 ( 2 ) 13 11 26 12 21 14 11 21 4 10 (
More information44 4 I (1) ( ) (10 15 ) ( 17 ) ( 3 1 ) (2)
(1) I 44 II 45 III 47 IV 52 44 4 I (1) ( ) 1945 8 9 (10 15 ) ( 17 ) ( 3 1 ) (2) 45 II 1 (3) 511 ( 451 1 ) ( ) 365 1 2 512 1 2 365 1 2 363 2 ( ) 3 ( ) ( 451 2 ( 314 1 ) ( 339 1 4 ) 337 2 3 ) 363 (4) 46
More informationi ii i iii iv 1 3 3 10 14 17 17 18 22 23 28 29 31 36 37 39 40 43 48 59 70 75 75 77 90 95 102 107 109 110 118 125 128 130 132 134 48 43 43 51 52 61 61 64 62 124 70 58 3 10 17 29 78 82 85 102 95 109 iii
More informationAI
JAIST Reposi https://dspace.j Title プレイヤの意図や価値観を学習し行動選択するチーム プレイ AI の構成 Author(s) 吉谷, 慧 Citation Issue Date 2013-03 Type Thesis or Dissertation Text version author URL http://hdl.handle.net/10119/11300
More informationIII
III 1 1 2 1 2 3 1 3 4 1 3 1 4 1 3 2 4 1 3 3 6 1 4 6 1 4 1 6 1 4 2 8 1 4 3 9 1 5 10 1 5 1 10 1 5 2 12 1 5 3 12 1 5 4 13 1 6 15 2 1 18 2 1 1 18 2 1 2 19 2 2 20 2 3 22 2 3 1 22 2 3 2 24 2 4 25 2 4 1 25 2
More informationiii iv v vi vii viii ix 1 1-1 1-2 1-3 2 2-1 3 3-1 3-2 3-3 3-4 4 4-1 4-2 5 5-1 5-2 5-3 5-4 5-5 5-6 5-7 6 6-1 6-2 6-3 6-4 6-5 6 6-1 6-2 6-3 6-4 6-5 7 7-1 7-2 7-3 7-4 7-5 7-6 7-7 7-8 7-9 7-10 7-11 8 8-1
More informationエクセルカバー入稿用.indd
i 1 1 2 3 5 5 6 7 7 8 9 9 10 11 11 11 12 2 13 13 14 15 15 16 17 17 ii CONTENTS 18 18 21 22 22 24 25 26 27 27 28 29 30 31 32 36 37 40 40 42 43 44 44 46 47 48 iii 48 50 51 52 54 55 59 61 62 64 65 66 67 68
More information活用ガイド (ハードウェア編)
(Windows 98) 808-877675-122-A ii iii iv NEC Corporation 1999 v vi PART 1 vii viii PART 2 PART 3 ix x xi xii P A R T 1 2 1 3 4 1 5 6 1 7 8 1 9 10 11 1 12 1 1 2 3 13 1 2 3 14 4 5 1 15 1 1 16 1 17 18 1 19
More information01_.g.r..
I II III IV V VI VII VIII IX X XI I II III IV V I I I II II II I I YS-1 I YS-2 I YS-3 I YS-4 I YS-5 I YS-6 I YS-7 II II YS-1 II YS-2 II YS-3 II YS-4 II YS-5 II YS-6 II YS-7 III III YS-1 III YS-2
More informationii iii iv CON T E N T S iii iv v Chapter1 Chapter2 Chapter 1 002 1.1 004 1.2 004 1.2.1 007 1.2.2 009 1.3 009 1.3.1 010 1.3.2 012 1.4 012 1.4.1 014 1.4.2 015 1.5 Chapter3 Chapter4 Chapter5 Chapter6 Chapter7
More informationM41 JP Manual.indd
i ii iii iv v vi vii 1 No / A-B EQ 2 MIC REC REC00001.WAV Stereo CH:01 0:00:00 1:50:00 3 4 5 6 7 8 9 10 11 12 1 1 F F A A 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 Φ 35 36 37 38
More information() 3 3 2 5 3 6 4 2 5 4 2 (; ) () 8 2 4 0 0 2 ex. 3 n n =, 2,, 20 : 3 2 : 9 3 : 27 4 : 8 5 : 243 6 : 729 7 : 287 8 : 656 9 : 9683 0 : 59049 : 7747 2 : 5344 3 : 594323 4 : 4782969 5 : 4348907 6 : 4304672
More informationAccessflÌfl—−ÇŠš1
ACCESS ACCESS i ii ACCESS iii iv ACCESS v vi ACCESS CONTENTS ACCESS CONTENTS ACCESS 1 ACCESS 1 2 ACCESS 3 1 4 ACCESS 5 1 6 ACCESS 7 1 8 9 ACCESS 10 1 ACCESS 11 1 12 ACCESS 13 1 14 ACCESS 15 1 v 16 ACCESS
More information知能科学:ニューラルネットワーク
2 3 4 (Neural Network) (Deep Learning) (Deep Learning) ( x x = ax + b x x x ? x x x w σ b = σ(wx + b) x w b w b .2.8.6 σ(x) = + e x.4.2 -.2 - -5 5 x w x2 w2 σ x3 w3 b = σ(w x + w 2 x 2 + w 3 x 3 + b) x,
More information知能科学:ニューラルネットワーク
2 3 4 (Neural Network) (Deep Learning) (Deep Learning) ( x x = ax + b x x x ? x x x w σ b = σ(wx + b) x w b w b .2.8.6 σ(x) = + e x.4.2 -.2 - -5 5 x w x2 w2 σ x3 w3 b = σ(w x + w 2 x 2 + w 3 x 3 + b) x,
More information3 5 18 3 5000 1 2 7 8 120 1 9 1954 29 18 12 30 700 4km 1.5 100 50 6 13 5 99 93 34 17 2 2002 04 14 16 6000 12 57 60 1986 55 3 3 3 500 350 4 5 250 18 19 1590 1591 250 100 500 20 800 20 55 3 3 3 18 19 1590
More information困ったときのQ&A
ii iii iv NEC Corporation 1997 v P A R T 1 vi vii P A R T 2 viii P A R T 3 ix x xi 1P A R T 2 1 3 4 1 5 6 1 7 8 1 9 1 2 3 4 10 1 11 12 1 13 14 1 1 2 15 16 1 2 1 1 2 3 4 5 17 18 1 2 3 1 19 20 1 21 22 1
More informationこれからの強化学習 サンプルページ この本の定価 判型などは, 以下の URL からご覧いただけます. このサンプルページの内容は, 初版 1 刷発行時のものです.
これからの強化学習 サンプルページ この本の定価 判型などは, 以下の URL からご覧いただけます. http://www.morikita.co.jp/books/mid/088031 このサンプルページの内容は, 初版 1 刷発行時のものです. i ii Sutton Barto 20 1 2 3 4 1 Richard S. Sutton and Andrew G. Barto. Reinforcement
More informationi
i ii iii iv v vi vii viii ix x xi ( ) 854.3 700.9 10 200 3,126.9 162.3 100.6 18.3 26.5 5.6/s ( ) ( ) 1949 8 12 () () ア イ ウ ) ) () () () () BC () () (
More information(報告書まとめ 2004/03/ )
- i - ii iii iv v vi vii viii ix x xi 1 Shock G( Invention) (Property rule) (Liability rule) Impact flow 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 (
More information86 7 I ( 13 ) II ( )
10 I 86 II 86 III 89 IV 92 V 2001 93 VI 95 86 7 I 2001 6 12 10 2001 ( 13 ) 10 66 2000 2001 4 100 1 3000 II 1988 1990 1991 ( ) 500 1994 2 87 1 1994 2 1000 1000 1000 2 1994 12 21 1000 700 5 800 ( 97 ) 1000
More informationCRS4
I... 1 II... 1 A... 1 B... 1 C... 1 D... 2 E... 3 III... 3 A... 3 B... 4 C... 5 IV... 8 A... 8 B... 8 C... 9 D... 10 V... 11 A... 11 B... 11 C... 12 VI... 12 A... 12 B... 12 C... 12 VII... 13 ii I II A
More information医系の統計入門第 2 版 サンプルページ この本の定価 判型などは, 以下の URL からご覧いただけます. このサンプルページの内容は, 第 2 版 1 刷発行時のものです.
医系の統計入門第 2 版 サンプルページ この本の定価 判型などは, 以下の URL からご覧いただけます. http://www.morikita.co.jp/books/mid/009192 このサンプルページの内容は, 第 2 版 1 刷発行時のものです. i 2 t 1. 2. 3 2 3. 6 4. 7 5. n 2 ν 6. 2 7. 2003 ii 2 2013 10 iii 1987
More information活用ガイド (ソフトウェア編)
ii iii iv NEC Corporation 1998 v vi PA RT 1 vii PA RT 2 viii PA RT 3 PA RT 4 ix P A R T 1 2 3 1 4 5 1 1 2 1 2 3 4 6 1 2 3 4 5 7 1 6 7 8 1 9 1 10 1 2 3 4 5 6 7 8 9 10 11 11 1 12 12 1 13 1 1 14 2 3 4 5 1
More informationパソコン機能ガイド
PART12 ii iii iv v 1 2 3 4 5 vi vii viii ix P A R T 1 x P A R T 2 xi P A R T 3 xii xiii P A R T 1 2 3 1 4 5 1 6 1 1 2 7 1 2 8 1 9 10 1 11 12 1 13 1 2 3 4 14 1 15 1 2 3 16 4 1 1 2 3 17 18 1 19 20 1 1
More informationパソコン機能ガイド
PART2 iii ii iv v 1 2 3 4 5 vi vii viii ix P A R T 1 x P A R T 2 xi P A R T 3 xii xiii P A R T 1 2 1 3 4 1 5 6 1 2 1 1 2 7 8 9 1 10 1 11 12 1 13 1 2 3 14 4 1 1 2 3 15 16 1 17 1 18 1 1 2 19 20 1 21 1 22
More informationResearch on decision making in multi-player games with imperfect information
Research on decision making in multi-player games with imperfect information 37-086521 22 2 9 UCT UCT 46 % 60000 9 % 1 1 1.1........................................ 1 1.2.....................................
More information1... 1 2... 1 1... 1 2... 2 3... 2 4... 4 5... 4 6... 4 7... 22 8... 22 3... 22 1... 22 2... 23 3... 23 4... 24 5... 24 6... 25 7... 31 8... 32 9... 3
3 2620149 3 6 3 2 198812 21/ 198812 21 1 3 4 5 JISJIS X 0208 : 1997 JIS 4 JIS X 0213:2004 http://www.pref.hiroshima.lg.jp/site/monjokan/ 1... 1 2... 1 1... 1 2... 2 3... 2 4... 4 5... 4 6... 4 7... 22
More information1
10 200 15 20 50 (1) (2) 45 A4 JICA 15 WS 1 [] a. b. 10 A 30 15 15 NGO PC 5 15 15 15 15 NGO 1948 1970 10 NGO 90 AB 40 40 WS 1 NGO 40 WS Q 43 63 73 15 9 8 5 5 4 63 17 9 8 6 6 4 2000 14 15 100 2000 1
More informationuntitled
23 12 10 12:55 ~ 18:45 KKR Tel0557-85-2000 FAX0557-85-6604 12:55~13:00 13:00~13:38 I 1) 13:00~13:12 2) 13:13~13:25 3) 13:26~13:38 13:39~14:17 II 4) 13:39~13:51 5) 13:52 ~ 14:04 6) 14:05 ~ 14:17 14:18 ~
More information松竹映画ファンド重要事項説明書
2004 11 30 2004 11 2005 2 1 2004 11 30 1. IV. (5) 26 10 10,000 1,350 2,625 232,025 133,006 9,500 1,320 2,520 224,005 131,001 9,000 1,290 2,415 215,985 128,996 8,500 1,260 2,310 207,965 126,991 8,000 1,230
More informationThe 23rd Game Programming Workshop ,a) 2,3,b) Deep Q-Network Atari2600 Minecraft AI Minecraft hg-dagger/q Imitation Learning and Reinforcement L
1,a) 2,3,b) Deep Q-Network Atari2600 Minecraft AI Minecraft hg-dagger/q Imitation Learning and Reinforcement Learning using Hierarchical Structure Yutaro Fujimura 1,a) Tomoyuki Kaneko 2,3,b) Abstract:
More information第1部 一般的コメント
(( 2000 11 24 2003 12 31 3122 94 2332 508 26 a () () i ii iii iv (i) (ii) (i) (ii) (iii) (iv) (a) (b)(c)(d) a) / (i) (ii) (iii) (iv) 1996 7 1996 12
More informationN cos s s cos ψ e e e e 3 3 e e 3 e 3 e
3 3 5 5 5 3 3 7 5 33 5 33 9 5 8 > e > f U f U u u > u ue u e u ue u ue u e u e u u e u u e u N cos s s cos ψ e e e e 3 3 e e 3 e 3 e 3 > A A > A E A f A A f A [ ] f A A e > > A e[ ] > f A E A < < f ; >
More information表1票4.qx4
iii iv v 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 22 23 10 11 24 25 26 27 10 56 28 11 29 30 12 13 14 15 16 17 18 19 2010 2111 22 23 2412 2513 14 31 17 32 18 33 19 34 20 35 21 36 24 37 25 38 2614
More information第1章 国民年金における無年金
1 2 3 4 ILO ILO 5 i ii 6 7 8 9 10 ( ) 3 2 ( ) 3 2 2 2 11 20 60 12 1 2 3 4 5 6 7 8 9 10 11 12 13 13 14 15 16 17 14 15 8 16 2003 1 17 18 iii 19 iv 20 21 22 23 24 25 ,,, 26 27 28 29 30 (1) (2) (3) 31 1 20
More information長崎県地域防災計画
i ii iii iv v vi vii viii ix - 1 - - 2 - - 3 - - 4 - - 5 - - 6 - - 7 - - 8 - - 9 - 玢 - 10 - - 11 - - 12 - - 13 - - 14 - - 15 - - 16 - - 17 - - 18 - - 19 - - 20 - - 21 - - 22 - - 23 - - 24 - - 25 - -
More informationONLINE_MANUAL
JPN ii iii iv v 6 vi vii viii 1 CHAPTER 1-1 1 2 1-2 1 2 3 4 5 1-3 6 7 1-4 2 CHAPTER 2-1 2-2 2-3 1 2 3 4 5 2-4 6 7 8 2-5 9 10 2-6 11 2-7 1 2 2-8 3 (A) 4 5 6 2-9 1 2-10 2 3 2-11 4 5 2-12 1 2 2-13 3 4 5
More informationONLINE_MANUAL
JPN ii iii iv v vi 6 vii viii 1 CHAPTER 1-1 1 2 1-2 1 2 3 1-3 4 5 6 7 1-4 2 CHAPTER 2-1 2-2 2-3 1 2 3 4 5 2-4 6 7 8 2-5 9 10 2-6 11 2-7 1 2 2-8 3 (A) 4 5 6 2-9 1 2-10 2 3 2-11 4 5 2-12 1 2 2-13 3 4 5
More information,255 7, ,355 4,452 3,420 3,736 8,206 4, , ,992 6, ,646 4,
30 8 IT 28 1,260 3 1 11. 1101. 1102. 1103. 1 3 1,368.3 3 1,109.8 p.5,p.7 2 9,646 4,291 14.5% 10,p.11 3 3,521 8 p.13 45-49 40-44 50-54 019 5 3 1 2,891 3 6 1 3 95 1 1101 1102 1103 1101 1102 1103 1 6,255
More information85 4
85 4 86 Copright c 005 Kumanekosha 4.1 ( ) ( t ) t, t 4.1.1 t Step! (Step 1) (, 0) (Step ) ±V t (, t) I Check! P P V t π 54 t = 0 + V (, t) π θ : = θ : π ) θ = π ± sin ± cos t = 0 (, 0) = sin π V + t +V
More informationSendai Urban Research Forum ...1...2...4...5...6 4...8...14...26...29...48...68...69...71...74...80...83...85...88...98 21...100...101...108...122...132...137 1960 3 15 24 4 13 4 2 4 4 10 4 4 15 i 2,000
More information_314I01BM浅谷2.indd
587 ネットワークの表現学習 1 1 1 1 Deep Learning [1] Google [2] Deep Learning [3] [4] 2014 Deepwalk [5] 1 2 [6] [7] [8] 1 2 1 word2vec[9] word2vec 1 http://www.ai-gakkai.or.jp/my-bookmark_vol31-no4 588 31 4 2016
More information