IPSJ SIG Technical Report Vol.2015-MUS-107 No /5/23 HARK-Binaural Raspberry Pi 2 1,a) ( ) HARK 2 HARK-Binaural A/D Raspberry Pi 2 1.


Transcription:

HARK-Binaural Raspberry Pi 2

1,a) 1 1 1 2 3

1 Kyoto Univ., Sakyo, Kyoto, 606-8501, Japan
2 HRI-JP, Wako, Saitama, 351-0188, Japan
3 Waseda Univ., Shinjuku, Tokyo, 169-0072, Japan
a) yoshiaki@kuis.kyoto-u.ac.jp

() HARK 2 HARK-Binaural A/D Raspberry Pi 2

1. [1,2] [2-5] () HARK (Honda Research Institute Japan audition for robots with Kyoto University) *1 GUI ( 1) Python ROS [5] HARK HARK-Binaural [6] A/D HARK HARK-Binaural Raspberry Pi 2 Rapiro.

*1 http://www.hark.jp/

2. HARK rospeex Two!Ears rospeex [4] ROS 1) 2) 3) 3 Google API rospeex ROS HARK HARK

Figure 1: HARK

Two!Ears *2

*2 http://twoears.aipa.tu-berlin.de/

3. HARK HARK HARK [3] HARK [1]

3.1 HARK 1) 2) 2 [3]. HARK FlowDesigner [7] batchflow wios harktool Microsoft Kinect PlayStation Eye HARK 1 GUI (HARK Designer) Linux apt Windows

3.2 MUSIC (MUltiple SIgnal Classification) [8] MUSIC HARK GEVD-MUSIC GSVD-MUSIC [9] GHDSS-AS (Geometric High-order Decorrelation Source Separation with Adaptive Stepsize) [10] GHDSS-AS GHDSS-AS (Adaptive Stepsize) MFT (Missing Feature Theory) [11] MFCC MSLS HARK-MUSIC Kinect OpenCV HARK-Kinect, HARK-OpenCV

Table 1: HARK-Binaural
SourceSeparation
SpeechEnhancement
BinauralMultisourceLocalization
BinauralMultisourceTracker
VoiceActivityDetection
SpectrumVisualization
SSLVisualization
VADVisualization
WaveVisualization

4. HARK-Binaural HARK-Binaural 1 HARK-Binaural

4.1 HARK-Binaural 1) 2) HARK 1) HARK MUSIC [9] A/D A/D A/D Armadillo 2) HARK-Binaural HARK HARK BinauralMultisourceTracker HARK-SSS Beamforming ()

4.2 BinauralMultisourceLocalization GCC-PHAT [12] HARK MUSIC 1 1 1 dynamic K-means [6] BinauralMultisourceTracker

Figure 2: SSLVisualization

3 BinauralMultisourceLocalization 1 VoiceActivityDetection [13] HARK-Binaural ( 1 ) 2-30, 0, 60 BinauralMultisourceLocalization SSLVisualization

5. Raspberry Pi 2 Raspberry Pi 2 Rapiro ( 3) HARK-Binaural Raspberry Pi 2 900 MHz ARM Cortex-A7 4 Debian Raspbian HARK Raspbian HARK MUSIC HARK-Binaural BinauralMultisourceLocalization 60 Raspberry Pi 2 17.6 2 () http://winnie.kuis.kyoto-u.ac.jp/members/yoshiaki/demo/sigmus107/
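Section 4.2 names GCC-PHAT [12] as the basis of BinauralMultisourceLocalization. The following is a minimal sketch of that general technique in plain Python, not HARK-Binaural's actual implementation. The function names, the naive DFT (used only to avoid external dependencies), and the free-field far-field angle model (which ignores the head between the two microphones) are all assumptions made for illustration.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform, kept dependency-free."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def gcc_phat_delay(left, right, max_lag):
    """Return the lag in samples by which `right` trails `left`.

    A negative result means `right` leads `left`.
    """
    # Zero-pad so the circular correlation behaves like a linear one.
    n = len(left) + len(right)
    spec_l = dft(list(left) + [0.0] * (n - len(left)))
    spec_r = dft(list(right) + [0.0] * (n - len(right)))

    # PHAT weighting: keep only the phase of the cross-spectrum.
    cross = []
    for a, b in zip(spec_l, spec_r):
        c = b * a.conjugate()
        mag = abs(c)
        cross.append(c / mag if mag > 1e-12 else 0j)

    # Back to the lag domain; the peak marks the time difference of arrival.
    corr = [v.real for v in dft(cross, inverse=True)]
    lags = range(-max_lag, max_lag + 1)
    return max(lags, key=lambda lag: corr[lag % n])


def delay_to_azimuth_deg(delay_samples, sample_rate_hz, mic_distance_m,
                         speed_of_sound=343.0):
    """Free-field, far-field model (assumed): sin(theta) = c * tau / d."""
    s = speed_of_sound * (delay_samples / sample_rate_hz) / mic_distance_m
    s = max(-1.0, min(1.0, s))  # clamp against estimates outside the model
    return math.degrees(math.asin(s))
```

A two-channel frame would be passed as `gcc_phat_delay(left, right, max_lag)`, and the resulting lag converted with `delay_to_azimuth_deg`. A practical system would replace the naive DFT with an FFT and, for microphones mounted on a robot head, a head-aware delay model.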

Figure 3: HARK-Binaural

Figure 4:

Figure 5:

Table 2:
HARK: AudioStreamFromMic, MultiFFT, SourceTracker, SourceIntervalExtender
HARK-Binaural: BinauralMultisourceLocalization, BinauralMultisourceTracker
HARK-ROS: HarkMsgsStreamFromRos, HarkMsgsSubscriber, RosHarkMsgsPublisher
HARK-SSS: Beamforming

5.1 4 2-ch 3 2 USB Raspberry Pi 2

5.2 5 ROS HARK HARK-ROS ROS CCD ROS HARK-ROS 1) 2) 3) () 3

6. HARK-Binaural Raspberry Pi 2 A/D HARK-Binaural Armadillo Raspberry Pi 2

(S) No.24220006

[1] Rosenthal, D. F. et al.: Computational Auditory Scene Analysis, Lawrence Erlbaum (1998).
[2] Bradski, G. et al.: Learning OpenCV: Computer vision with the OpenCV library, O'Reilly Media, Inc. (2008).
[3] HARK 15 pp. 1712-1716 (2014).
[4] rospeex ROS (CNR2013-10) Vol. 113, pp. 7-10 (2013).
[5] Quigley, M. et al.: ROS: an open-source Robot Operating System, ICRA workshop on open source software, Vol. 3, No. 3.2, p. 5 (2009).
[6] Kim, U.-H. et al.: Improved binaural sound localization and tracking for unknown time-varying number of speakers, Advanced Robotics, Vol. 27, No. 15, pp. 1161-1173 (2013).
[7] Cote, C. et al.: Code reusability tools for programming mobile robots, Proc. of IEEE/RSJ IROS 2004, Vol. 2, pp. 1820-1825 (2004).
[8] Asano, F. et al.: Real-time sound source localization and separation system and its application to automatic speech recognition, Proc. of Interspeech 2001, pp. 1013-1016 (2001).
[9] Nakamura, K. et al.: Intelligent sound source localization and its application to multimodal human tracking, IEEE/RSJ IROS 2011, pp. 143-148 (2011).
[10] Nakajima, H. et al.: Blind Source Separation with Parameter-Free Adaptive Step-Size Method for Robot Audition, IEEE TASLP, Vol. 18, No. 6, pp. 1476-1485 (2010).
[11] Okuno, H. G. et al.: Robot Audition: Missing Feature Theory Approach and Active Audition, Robotics Research, Springer, pp. 227-244 (2011).
[12] Knapp, C. et al.: The generalized correlation method for estimation of time delay, IEEE Transactions on Acoustics, Speech and Signal Processing (ASSP), Vol. 24, No. 4, pp. 320-327 (1976).
[13] Sohn, J. et al.: A statistical model-based voice activity detection, IEEE Signal Processing Letters, Vol. 6, No. 1, pp. 1-3 (1999).