SCOPUS 정보 검색 플랫폼

Proceedings - IEEE International Conference on Robotics and Automation

Volumn , Issue , 2014, Pages 6285-6292

Sound representation and classification benchmark for domestic robots

(4) Maxime, Janvier a Alameda Pineda, Xavier a Girin, Laurent a,b,c Horaud, Radu c

a INRIA RHÔNE ALPES (France)

b GIPSA LAB (France)

c UNIV GRENOBLE ALPES (France)

Author keywords

[No Author keywords available]

Indexed keywords

ANTHROPOMORPHIC ROBOTS; ROBOTICS;

BACKGROUND NOISE; COMPARATIVE STUDIES; COMPUTATION TIME; DOMESTIC ROBOTS; HUMANOID ROBOT NAO; MEMORY REQUIREMENTS; REALISTIC CONDITIONS; SOUND SOURCE;

CLASSIFICATION (OF INFORMATION);

EID: 84929207631 PISSN: 10504729 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICRA.2014.6907786 Document Type: Conference Paper

Times cited : (34)

References (33)

1
- 0003684441
- MIT Press
- A. S. Bregman, Auditory scene analysis: The perceptual organization of sound. MIT Press, 1994.
- (1994) Auditory Scene Analysis: The Perceptual Organization of Sound
- Bregman, A.S.¹

2
- 0033617899
- A model-based sound localization system and its application to robot navigation
- J. Huang, T. Supaongprapa, I. Terakura, F. Wang, N. Ohnishi, and N. Sugie, "A model-based sound localization system and its application to robot navigation, " Robotics and Autonomous Systems, vol. 27, no. 4, pp. 199-209, 1999.
- (1999) Robotics and Autonomous Systems , vol.27 , Issue.4 , pp. 199-209
- Huang, J.¹ Supaongprapa, T.² Terakura, I.³ Wang, F.⁴ Ohnishi, N.⁵ Sugie, N.⁶

3
- 85009286773
- Real-Time sound source localization and separation for robot audition
- K. Nakadai, H. G. Okuno, and H. Kitano, "Real-Time sound source localization and separation for robot audition, " in Int. Conf. on Spoken Language Processing, 2002, pp. 193-196.
- (2002) Int. Conf. on Spoken Language Processing , pp. 193-196
- Nakadai, K.¹ Okuno, H.G.² Kitano, H.³

4
- 34250652551
- Real-Time robot audition system that recognizes simultaneous speech in the real world
- S. Yamamoto, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, K. Komatani, T. Ogata, and H. G. Okuno, "Real-Time robot audition system that recognizes simultaneous speech in the real world, " in Int. Conf. on Intell. Rob. and Syst., 2006, pp. 5333-5338.
- (2006) Int. Conf. on Intell. Rob. and Syst , pp. 5333-5338
- Yamamoto, S.¹ Nakadai, K.² Nakano, M.³ Tsujino, H.⁴ Valin, J.-M.⁵ Komatani, K.⁶ Ogata, T.⁷ Okuno, H.G.⁸

5
- 63549118078
- An open source software system for robot audition hark and its evaluation
- K. Nakadai, H. G. Okuno, H. Nakajima, Y. Hasegawa, and H. Tsujino, "An open source software system for robot audition hark and its evaluation, " in Int. Conf. on Humanoid Robots, 2008, pp. 561-566.
- (2008) Int. Conf. on Humanoid Robots , pp. 561-566
- Nakadai, K.¹ Okuno, H.G.² Nakajima, H.³ Hasegawa, Y.⁴ Tsujino, H.⁵

6
- 0036452426
- The intelligent ASIMO: System overview and integration
- Y. Sakagami, R. Watanabe, C. Aoyama, S. Matsunaga, N. Higaki, and K. Fujimura, "The intelligent ASIMO: System overview and integration, " in Int. Conf. on Intell. Rob. and Syst., 2002, pp. 2478-2483.
- (2002) Int. Conf. on Intell. Rob. and Syst , pp. 2478-2483
- Sakagami, Y.¹ Watanabe, R.² Aoyama, C.³ Matsunaga, S.⁴ Higaki, N.⁵ Fujimura, K.⁶

7
- 34247578210
- Where am I? Scene recognition for mobile robots using audio features
- S. Chu, S. Narayanan, C.-C. Kuo, and M. J. Mataric, "Where am I? scene recognition for mobile robots using audio features, " in Int. Conf. on Multimedia and Expo, 2006, pp. 885-888.
- (2006) Int. Conf. on Multimedia and Expo , pp. 885-888
- Chu, S.¹ Narayanan, S.² Kuo, C.-C.³ Mataric, M.J.⁴

8
- 76249114818
- Daily sound recognition using pitch-cluster-maps for mobile robot audition
- Y. Sasaki, M. Kaneyoshi, S. Kagami, H. Mizoguchi, and T. Enomoto, "Daily sound recognition using pitch-cluster-maps for mobile robot audition, " in Int. Conf. on Intell. Rob. and Syst., 2009, pp. 2724-2729.
- (2009) Int. Conf. on Intell. Rob. and Syst , pp. 2724-2729
- Sasaki, Y.¹ Kaneyoshi, M.² Kagami, S.³ Mizoguchi, H.⁴ Enomoto, T.⁵

9
- 79960540012
- Environmental sound recognition for robot audition using matchingpursuit
- Springer
- N. Yamakawa, T. Takahashi, T. Kitahara, T. Ogata, and H. G. Okuno, "Environmental sound recognition for robot audition using matchingpursuit, " in Modern Approaches in Applied Intelligence, ser. Lecture Notes in Computer Science. Springer, 2011, pp. 1-10.
- (2011) Modern Approaches in Applied Intelligence, Ser. Lecture Notes in Computer Science , pp. 1-10
- Yamakawa, N.¹ Takahashi, T.² Kitahara, T.³ Ogata, T.⁴ Okuno, H.G.⁵

10
- 84870828922
- Audio-based human activity recognition using non-markovian ensemble voting
- J. Stork, L. Spinello, J. Silva, and K. Arras, "Audio-based human activity recognition using non-markovian ensemble voting, " in International Symp. on Robots and Human Interactive Communications, 2012, pp. 509-514.
- (2012) International Symp. on Robots and Human Interactive Communications , pp. 509-514
- Stork, J.¹ Spinello, L.² Silva, J.³ Arras, K.⁴

11
- 84891055505
- Soundevent recognition with a companion humanoid
- M. Janvier, X. Alameda-Pineda, L. Girin, and R. P. Horaud, "Soundevent recognition with a companion humanoid, " in IEEE Int. Conf. on Humanoid Robotics, 2012.
- (2012) IEEE Int. Conf. on Humanoid Robotics
- Janvier, M.¹ Alameda-Pineda, X.² Girin, L.³ Horaud, R.P.⁴

12
- 0024610919
- A tutorial on hidden markov models and selected applications in speech recognition
- L. R. Rabiner, "A tutorial on Hidden Markov Models and selected applications in speech recognition, " Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, 1989.
- (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.R.¹

13
- 79957687384
- Sound event recognition with probabilistic distance SVMs
- H. D. Tran and H. Li, "Sound event recognition with probabilistic distance SVMs, " IEEE Transactions on Speech Audio Processing, vol. 19, no. 6, pp. 1556-1568, 2011.
- (2011) IEEE Transactions on Speech Audio Processing , vol.19 , Issue.6 , pp. 1556-1568
- Tran, H.D.¹ Li, H.²

14
- 9744249570
- Environmental sound recognition by multilayered Neural Networks
- Y. Toyoda, J. Huang, S. Ding, and Y. Liu, "Environmental sound recognition by multilayered Neural Networks, " in Int. Conf. on Computer and Information Technology, 2004, pp. 123-127.
- (2004) Int. Conf. on Computer and Information Technology , pp. 123-127
- Toyoda, Y.¹ Huang, J.² Ding, S.³ Liu, Y.⁴

15
- 0037279492
- Content-based audio classification and retrieval by support vector machines
- G. Guo and S. Z. Li, "Content-based audio classification and retrieval by support vector machines, " IEEE Transactions on Neural Networks, vol. 14, no. 1, pp. 209-215, 2003.
- (2003) IEEE Transactions on Neural Networks , vol.14 , Issue.1 , pp. 209-215
- Guo, G.¹ Li, S.Z.²

16
- 80051625348
- Continuous audio analytics by HMM and Viterbi decoding
- V. Ramasubramanian, R. Karthik, S. Thiyagarajan, and S. Cherla, "Continuous audio analytics by HMM and Viterbi decoding, " in Int. Conf. Acoust., Speech, Sig. Process., 2011, pp. 2396-2399.
- (2011) Int. Conf. Acoust., Speech, Sig. Process , pp. 2396-2399
- Ramasubramanian, V.¹ Karthik, R.² Thiyagarajan, S.³ Cherla, S.⁴

17
- 0042830801
- Comparison of techniques for environmental Table v memory needed to store the trained classifiers (IN KB)
- kNN QNN GMM-1 GMM-T HMM SVM TFF 370 6 39 130 520 MFCC 430 6 50 180 1100 540 MFCC+TFF 460 4 46 200 1300 680 Wavelets 350 10 58 65 430 330 MFCC+BoW 38 11.2 10.6 271 MFCC+TFF+BoW 31 16.5 52.1 206 MFCC+Interp 5300 715 2100 MFCC+TFF+Interp 6100 967 3800 SAI 4230 593 5550 TABLE VI FEATURE COMPUTATION TIME (IN MS). Feature BoW Interpolation K-means Histo. TTFF 3 MFCC 2.4 12.3 0.8 2.3 MFCC+TTFF 5.4 13.3 0.9 2.7 Wavelets 9.6 SAI 350 sound recognition
- M. Cowling and R. Sitte, "Comparison of techniques for environmental TABLE V MEMORY NEEDED TO STORE THE TRAINED CLASSIFIERS (IN KB). kNN QNN GMM-1 GMM-T HMM SVM TFF 370 6 39 130 520 MFCC 430 6 50 180 1100 540 MFCC+TFF 460 4 46 200 1300 680 Wavelets 350 10 58 65 430 330 MFCC+BoW 38 11.2 10.6 271 MFCC+TFF+BoW 31 16.5 52.1 206 MFCC+Interp 5300 715 2100 MFCC+TFF+Interp 6100 967 3800 SAI 4230 593 5550 TABLE VI FEATURE COMPUTATION TIME (IN MS). Feature BoW Interpolation K-means Histo. TTFF 3 MFCC 2.4 12.3 0.8 2.3 MFCC+TTFF 5.4 13.3 0.9 2.7 Wavelets 9.6 SAI 350 sound recognition, " Patt. Rec. Lett., vol. 24, no. 15, pp. 2895-2907, 2003.
- (2003) Patt. Rec. Lett , vol.24 , Issue.15 , pp. 2895-2907
- Cowling, M.¹ Sitte, R.²

18
- 33644626634
- G. Peeters, "A large set of audio features for sound description (similarity and classification) in the cuidado project, " 2004.
- (2004) A Large Set of Audio Features for Sound Description (Similarity and Classification) in the Cuidado Project
- Peeters, G.¹

19
- 0029765670
- Real-Time discrimination of broadcast speech/music
- J. Saunders, "Real-Time discrimination of broadcast speech/music, " in Int. Conf. Acoust., Speech, Sig. Process., 1996, pp. 993-996.
- (1996) Int. Conf. Acoust., Speech, Sig. Process , pp. 993-996
- Saunders, J.¹

20
- 0030648077
- Construction and evaluation of a robust multifeature speech/music discriminator
- E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator, " in Int. Conf. Acoust., Speech, Sig. Process., 1997, pp. 1331-1334.
- (1997) Int. Conf. Acoust., Speech, Sig. Process , pp. 1331-1334
- Scheirer, E.¹ Slaney, M.²

21
- 85013703178
- A wavelet tour of signal processing
- S. Mallat, A wavelet tour of signal processing. Elsevier, 1999.
- (1999) Elsevier
- Mallat, S.¹

22
- 85008016199
- Audio classification and categorization based on wavelets and support vector machine
- C. Lin, S. Chen, T. Truong, and Y. Chang, "Audio classification and categorization based on wavelets and support vector machine, " IEEE Transactions on Speech and Audio Processing, vol. 13, no. 5, pp. 644-651, 2005.
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.5 , pp. 644-651
- Lin, C.¹ Chen, S.² Truong, T.³ Chang, Y.⁴

23
- 0010051198
- Audio analysis using the discrete wavelet transform
- G. Tzanetakis, G. Essl, and P. Cook, "Audio analysis using the discrete wavelet transform, " in Conf. in Acoust. and Music Theory App., 2001.
- (2001) Conf. in Acoust. and Music Theory App
- Tzanetakis, G.¹ Essl, G.² Cook, P.³

24
- 84867596477
- Ph.D. dissertation University of Cambridge
- T. C. Walter, "Auditory-based processing of communication sounds, " Ph.D. dissertation, University of Cambridge, 2011.
- (2011) Auditory-based Processing of Communication Sounds
- Walter, T.C.¹

25
- 78149304826
- Sound retrieval and ranking using sparse auditory representations
- R. F. Lyon, M. Rehn, S. Bengio, T. C .Walters, and G. Chechik, "Sound retrieval and ranking using sparse auditory representations, " Neural Computation, vol. 22, no. 9, pp. 2390-2416, 2010.
- (2010) Neural Computation , vol.22 , Issue.9 , pp. 2390-2416
- Lyon, R.F.¹ Rehn, M.² Bengio, S.³ Walters, T.C.⁴ Chechik, G.⁵

26
- 84890450957
- T. Walters and W. van Engen. (2012) AIMC: A C++ implementation of the auditory image model. [Online]. Available: https://code.google. com/p/aimc/
- (2012) AIMC: A C++ Implementation of the Auditory Image Model
- Walters, T.¹ Van W.Engen.²

27
- 84873597271
- Yaafe, an easy to use and efficient audio feature extraction software
- B. Mathieu, S. Essid, T. Fillon, J. Prado, and G. Richard, "Yaafe, an easy to use and efficient audio feature extraction software, " in Int. Conf. for Music Information Retrieval (ISMIR), 2010.
- (2010) Int. Conf. for Music Information Retrieval (ISMIR)
- Mathieu, B.¹ Essid, S.² Fillon, T.³ Prado, J.⁴ Richard, G.⁵

28
- 33846516584
- Springer
- C. M. Bishop, Pattern recognition and machine learning. Springer, 2006.
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.M.¹

29
- 32044455069
- Classification of acoustic events using SVM-based clustering schemes
- A. Temko and C. Nadeu, "Classification of acoustic events using SVM-based clustering schemes, " Pattern Recognition, vol. 39, no. 4, pp. 682-694, 2006.
- (2006) Pattern Recognition , vol.39 , Issue.4 , pp. 682-694
- Temko, A.¹ Nadeu, C.²

30
- 48849107856
- One-class SVMs challenges in audio detection and classification applications
- R. Asma, K. Hachem, L. Zied, and E. Noureddine, "One-class SVMs challenges in audio detection and classification applications, " EURASIP Journal on Advances in Signal Processing, 2008.
- (2008) EURASIP Journal on Advances in Signal Processing
- Asma, R.¹ Hachem, K.² Zied, L.³ Noureddine, E.⁴

31
- 84857466151
- MIT Press
- K. P. Murphy, Machine learning: A probabilistic perspective. MIT Press, 2012.
- (2012) Machine Learning: A Probabilistic Perspective
- Murphy, K.P.¹

32
- 79955702502
- LIBSVM: A library for support vector machines
- C.-C. Chang and C.-J. Lin, "LIBSVM: A library for support vector machines, " ACM Transactions on Intelligent Systems and Technology, vol. 2, pp. 27:1-27:27, 2011.
- (2011) ACM Transactions on Intelligent Systems and Technology , vol.2 , pp. 271-2727
- Chang, C.-C.¹ Lin, C.-J.²

33
- 33745184197
- Internal noise suppression for speech recognition by small robots
- A. Ito, T. Kanayama, M. Suzuki, and S. Makino, "Internal noise suppression for speech recognition by small robots, " in European Conf. on Speech Communication and Technology, 2005.
- (2005) European Conf. on Speech Communication and Technology
- Ito, A.¹ Kanayama, T.² Suzuki, M.³ Makino, S.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.