메뉴 건너뛰기




Volumn , Issue , 2014, Pages 6285-6292

Sound representation and classification benchmark for domestic robots

Author keywords

[No Author keywords available]

Indexed keywords

ANTHROPOMORPHIC ROBOTS; ROBOTICS;

EID: 84929207631     PISSN: 10504729     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICRA.2014.6907786     Document Type: Conference Paper
Times cited : (34)

References (33)
  • 12
    • 0024610919 scopus 로고
    • A tutorial on hidden markov models and selected applications in speech recognition
    • L. R. Rabiner, "A tutorial on Hidden Markov Models and selected applications in speech recognition, " Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, 1989.
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 13
    • 79957687384 scopus 로고    scopus 로고
    • Sound event recognition with probabilistic distance SVMs
    • H. D. Tran and H. Li, "Sound event recognition with probabilistic distance SVMs, " IEEE Transactions on Speech Audio Processing, vol. 19, no. 6, pp. 1556-1568, 2011.
    • (2011) IEEE Transactions on Speech Audio Processing , vol.19 , Issue.6 , pp. 1556-1568
    • Tran, H.D.1    Li, H.2
  • 15
    • 0037279492 scopus 로고    scopus 로고
    • Content-based audio classification and retrieval by support vector machines
    • G. Guo and S. Z. Li, "Content-based audio classification and retrieval by support vector machines, " IEEE Transactions on Neural Networks, vol. 14, no. 1, pp. 209-215, 2003.
    • (2003) IEEE Transactions on Neural Networks , vol.14 , Issue.1 , pp. 209-215
    • Guo, G.1    Li, S.Z.2
  • 17
    • 0042830801 scopus 로고    scopus 로고
    • Comparison of techniques for environmental Table v memory needed to store the trained classifiers (IN KB)
    • kNN QNN GMM-1 GMM-T HMM SVM TFF 370 6 39 130 520 MFCC 430 6 50 180 1100 540 MFCC+TFF 460 4 46 200 1300 680 Wavelets 350 10 58 65 430 330 MFCC+BoW 38 11.2 10.6 271 MFCC+TFF+BoW 31 16.5 52.1 206 MFCC+Interp 5300 715 2100 MFCC+TFF+Interp 6100 967 3800 SAI 4230 593 5550 TABLE VI FEATURE COMPUTATION TIME (IN MS). Feature BoW Interpolation K-means Histo. TTFF 3 MFCC 2.4 12.3 0.8 2.3 MFCC+TTFF 5.4 13.3 0.9 2.7 Wavelets 9.6 SAI 350 sound recognition
    • M. Cowling and R. Sitte, "Comparison of techniques for environmental TABLE V MEMORY NEEDED TO STORE THE TRAINED CLASSIFIERS (IN KB). kNN QNN GMM-1 GMM-T HMM SVM TFF 370 6 39 130 520 MFCC 430 6 50 180 1100 540 MFCC+TFF 460 4 46 200 1300 680 Wavelets 350 10 58 65 430 330 MFCC+BoW 38 11.2 10.6 271 MFCC+TFF+BoW 31 16.5 52.1 206 MFCC+Interp 5300 715 2100 MFCC+TFF+Interp 6100 967 3800 SAI 4230 593 5550 TABLE VI FEATURE COMPUTATION TIME (IN MS). Feature BoW Interpolation K-means Histo. TTFF 3 MFCC 2.4 12.3 0.8 2.3 MFCC+TTFF 5.4 13.3 0.9 2.7 Wavelets 9.6 SAI 350 sound recognition, " Patt. Rec. Lett., vol. 24, no. 15, pp. 2895-2907, 2003.
    • (2003) Patt. Rec. Lett , vol.24 , Issue.15 , pp. 2895-2907
    • Cowling, M.1    Sitte, R.2
  • 19
    • 0029765670 scopus 로고    scopus 로고
    • Real-Time discrimination of broadcast speech/music
    • J. Saunders, "Real-Time discrimination of broadcast speech/music, " in Int. Conf. Acoust., Speech, Sig. Process., 1996, pp. 993-996.
    • (1996) Int. Conf. Acoust., Speech, Sig. Process , pp. 993-996
    • Saunders, J.1
  • 20
    • 0030648077 scopus 로고    scopus 로고
    • Construction and evaluation of a robust multifeature speech/music discriminator
    • E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator, " in Int. Conf. Acoust., Speech, Sig. Process., 1997, pp. 1331-1334.
    • (1997) Int. Conf. Acoust., Speech, Sig. Process , pp. 1331-1334
    • Scheirer, E.1    Slaney, M.2
  • 21
    • 85013703178 scopus 로고    scopus 로고
    • A wavelet tour of signal processing
    • S. Mallat, A wavelet tour of signal processing. Elsevier, 1999.
    • (1999) Elsevier
    • Mallat, S.1
  • 22
    • 85008016199 scopus 로고    scopus 로고
    • Audio classification and categorization based on wavelets and support vector machine
    • C. Lin, S. Chen, T. Truong, and Y. Chang, "Audio classification and categorization based on wavelets and support vector machine, " IEEE Transactions on Speech and Audio Processing, vol. 13, no. 5, pp. 644-651, 2005.
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.5 , pp. 644-651
    • Lin, C.1    Chen, S.2    Truong, T.3    Chang, Y.4
  • 25
    • 78149304826 scopus 로고    scopus 로고
    • Sound retrieval and ranking using sparse auditory representations
    • R. F. Lyon, M. Rehn, S. Bengio, T. C .Walters, and G. Chechik, "Sound retrieval and ranking using sparse auditory representations, " Neural Computation, vol. 22, no. 9, pp. 2390-2416, 2010.
    • (2010) Neural Computation , vol.22 , Issue.9 , pp. 2390-2416
    • Lyon, R.F.1    Rehn, M.2    Bengio, S.3    Walters, T.C.4    Chechik, G.5
  • 29
    • 32044455069 scopus 로고    scopus 로고
    • Classification of acoustic events using SVM-based clustering schemes
    • A. Temko and C. Nadeu, "Classification of acoustic events using SVM-based clustering schemes, " Pattern Recognition, vol. 39, no. 4, pp. 682-694, 2006.
    • (2006) Pattern Recognition , vol.39 , Issue.4 , pp. 682-694
    • Temko, A.1    Nadeu, C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.