메뉴 건너뛰기




Volumn 12, Issue 1, 2006, Pages 55-67

Machine-learning based classification of speech and music

Author keywords

Audio features; Audio signal processing; Fuzzy c means clustering; Hidden Markov Models; Neural networks; Speech music classification

Indexed keywords

CLASSIFICATION (OF INFORMATION); FUZZY SETS; INFORMATION RETRIEVAL SYSTEMS; LEARNING SYSTEMS; MARKOV PROCESSES; MULTIMEDIA SYSTEMS; NEURAL NETWORKS;

EID: 33746879922     PISSN: 09424962     EISSN: None     Source Type: Journal    
DOI: 10.1007/s00530-006-0034-0     Document Type: Article
Times cited : (42)

References (37)
  • 15
    • 20444472966 scopus 로고    scopus 로고
    • A speech/music discriminator based on rms and zero-crossings
    • Panagiotakis, C., Tziritas, G.: A speech/music discriminator based on rms and zero-crossings. IEEE Trans. Multimedia (2004)
    • (2004) IEEE Trans. Multimedia
    • Panagiotakis, C.1    Tziritas, G.2
  • 19
    • 4243919627 scopus 로고    scopus 로고
    • Master's thesis, Department of Information Technology, Tampere University of Technology, Finland
    • Vesa Peltonen: Computational auditory scene recognition. Master's thesis, Department of Information Technology, Tampere University of Technology, Finland (2001)
    • (2001) Computational Auditory Scene Recognition
    • Peltonen, V.1
  • 21
    • 0036648502 scopus 로고    scopus 로고
    • Musical genre classification of audio signals
    • Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Trans. Speech Audio Proc. 10(5), 293-302 (2002)
    • (2002) IEEE Trans. Speech Audio Proc. , vol.10 , Issue.5 , pp. 293-302
    • Tzanetakis, G.1    Cook, P.2
  • 22
    • 0037708486 scopus 로고    scopus 로고
    • Content-based audio classification and segmentation by using support vector machines
    • Lu, L., Zhang, H.-J., Li, S.Z.: Content-based audio classification and segmentation by using support vector machines. ACM Mult. Sys. J. 8(6), 482-492 (2003)
    • (2003) ACM Mult. Sys. J. , vol.8 , Issue.6 , pp. 482-492
    • Lu, L.1    Zhang, H.-J.2    Li, S.Z.3
  • 23
    • 0036556701 scopus 로고    scopus 로고
    • Audio classification in speech and music: A comparison between a statistical and a neural approach
    • Bugatti, A., Flammini, A., Migliorati, P.: Audio classification in speech and music: A comparison between a statistical and a neural approach. EURASIP J. Appl. Sig. Proc. 4, 372-378 (2002)
    • (2002) EURASIP J. Appl. Sig. Proc. , vol.4 , pp. 372-378
    • Bugatti, A.1    Flammini, A.2    Migliorati, P.3
  • 25
    • 0036816475 scopus 로고    scopus 로고
    • Content analysis for audio classification and segmentation
    • Lu, L., Zhang, H.-J., Jiang, H.: Content analysis for audio classification and segmentation. IEEE Trans. Speech Audio Proc. 10(7), 504-516 (2002)
    • (2002) IEEE Trans. Speech Audio Proc. , vol.10 , Issue.7 , pp. 504-516
    • Lu, L.1    Zhang, H.-J.2    Jiang, H.3
  • 28
    • 0035308233 scopus 로고    scopus 로고
    • Classification of general audio data for content-based retrieval
    • Li, D., Sethi, I.K., Dimitrova, N., McGee, T.: Classification of general audio data for content-based retrieval. Patt. Recog. Lett. 22(5), 533-544 (2001)
    • (2001) Patt. Recog. Lett. , vol.22 , Issue.5 , pp. 533-544
    • Li, D.1    Sethi, I.K.2    Dimitrova, N.3    McGee, T.4
  • 29
    • 84889075620 scopus 로고    scopus 로고
    • A framework for audio analysis based on classification and temporal segmentation
    • IEEE
    • Tzanetakis, G., Cook, P.: A framework for audio analysis based on classification and temporal segmentation. In: EUROMICRO Workshop on Music Technology and Audio Processing, IEEE, Vol. 2, pp. 61-67 (1999)
    • (1999) EUROMICRO Workshop on Music Technology and Audio Processing , vol.2 , pp. 61-67
    • Tzanetakis, G.1    Cook, P.2
  • 35
    • 0024861871 scopus 로고
    • Approximation by superpositions of a sigmoidal function
    • Cybenko, G.: Approximation by superpositions of a sigmoidal function. Math. Con. Sig. Sys. 2(4), 303-314 (1989)
    • (1989) Math. Con. Sig. Sys. , vol.2 , Issue.4 , pp. 303-314
    • Cybenko, G.1
  • 36
    • 33746883995 scopus 로고
    • Artificial neural networks for speech and vision
    • Mammone, R.J. (ed.): Chapman & Hall, London
    • Mammone, R.J. (ed.): Artificial neural networks for speech and vision. Chapman & Hall Neural Computing, 1st edn. Chapman & Hall, London (1994)
    • (1994) Chapman & Hall Neural Computing, 1st Edn.
  • 37
    • 0022594196 scopus 로고
    • An introduction to hidden markov models
    • Rabiner, L.R., Juang, B.H.: An introduction to hidden markov models. IEEE ASSP Magazine 3(1), 4-16 (1986)
    • (1986) IEEE ASSP Magazine , vol.3 , Issue.1 , pp. 4-16
    • Rabiner, L.R.1    Juang, B.H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.