메뉴 건너뛰기




Volumn 34, Issue 3, 2007, Pages 375-395

A general audio classifier based on human perception motivated model

Author keywords

Audio classification; Content based audio indexing; Gender identification; Highlights detection; Music genre recognition; Perceptually motivated features; Piecewise Gaussian Modelling

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; CLASSIFICATION (OF INFORMATION); MATHEMATICAL MODELS; PIECEWISE LINEAR TECHNIQUES; PROBLEM SOLVING; PSYCHOPHYSIOLOGY;

EID: 34547284188     PISSN: 13807501     EISSN: 15737721     Source Type: Journal    
DOI: 10.1007/s11042-007-0108-9     Document Type: Article
Times cited : (17)

References (54)
  • 1
    • 0037401304 scopus 로고    scopus 로고
    • Speech/Music discrimination using entropy and dynamism features in a HMM classification framework
    • Ajmera J, McCowan I, Bourlard H (2003) Speech/Music discrimination using entropy and dynamism features in a HMM classification framework. Speech Commun 40(3):351-363
    • (2003) Speech Commun , vol.40 , Issue.3 , pp. 351-363
    • Ajmera, J.1    McCowan, I.2    Bourlard, H.3
  • 4
    • 0028919718 scopus 로고
    • Auditory event-related potentials dissociate early and late memory processes
    • Elsevier
    • Chao L, Nielsen-Bohlman L, Knight R (1995) Auditory event-related potentials dissociate early and late memory processes. Electroencephalogr Clin Neurophysiol 96:157-168, Elsevier
    • (1995) Electroencephalogr Clin Neurophysiol , vol.96 , pp. 157-168
    • Chao, L.1    Nielsen-Bohlman, L.2    Knight, R.3
  • 7
    • 0035308233 scopus 로고    scopus 로고
    • Classification of general audio data for content-based retrieval
    • Elsevier
    • Dongge L et al (2001) Classification of general audio data for content-based retrieval. Pattern Recogn Lett 22:533-544, Elsevier
    • (2001) Pattern Recogn Lett , vol.22 , pp. 533-544
    • Dongge, L.1
  • 10
    • 85128356454 scopus 로고    scopus 로고
    • Partitioning and transcription of broadcast news data
    • Gauvain J-L, Lamel L, Adda G (1998) Partitioning and transcription of broadcast news data. Proc. ICSLP'98 5:1335-1338
    • (1998) Proc. ICSLP'98 , vol.5 , pp. 1335-1338
    • Gauvain, J.-L.1    Lamel, L.2    Adda, G.3
  • 18
    • 0026374868 scopus 로고
    • Improved acoustic modeling with the SPHINX speech recognition system
    • ICASSP-91
    • Huang XD, Lee KF, Hon HW, Hwang MY (1991) Improved acoustic modeling with the SPHINX speech recognition system. Proceedings of the IEEE ICASSP-91, 1:345-348
    • (1991) Proceedings of the IEEE
    • Huang, X.D.1    Lee, K.F.2    Hon, H.W.3    Hwang, M.Y.4
  • 20
    • 17444377070 scopus 로고    scopus 로고
    • Jung E, Schwarzbacher A, Lawlor R (2002) Implementation of real-time AMDF pitch-detection for voice gender nonnalization. Proceedings of the 14th international conference on digital signal processing. DSP 2002 2:827-830
    • Jung E, Schwarzbacher A, Lawlor R (2002) Implementation of real-time AMDF pitch-detection for voice gender nonnalization. Proceedings of the 14th international conference on digital signal processing. DSP 2002 2:827-830
  • 22
    • 34547288647 scopus 로고    scopus 로고
    • Kiranyaz S, Aubazac M, Gabbouj M (2003) Unsupervised segmentation and classification over MP3 and AAC audio bitstreams. In the Proc. of the 4th European workshop on image analysis for multimedia interactive services WIAMIS 03, World Scientific, London UK
    • Kiranyaz S, Aubazac M, Gabbouj M (2003) Unsupervised segmentation and classification over MP3 and AAC audio bitstreams. In the Proc. of the 4th European workshop on image analysis for multimedia interactive services WIAMIS 03, World Scientific, London UK
  • 23
    • 34547336240 scopus 로고    scopus 로고
    • Konig Y, Morgan N (1992) GDNN a gender dependent neural network for continuous speech recognition. Proceedings, international joint conference on neural networks, IJCNN, 2, 7-11 2:332-337
    • Konig Y, Morgan N (1992) GDNN a gender dependent neural network for continuous speech recognition. Proceedings, international joint conference on neural networks, IJCNN, Volume: 2, 7-11 2:332-337
  • 24
    • 0034273520 scopus 로고    scopus 로고
    • Content-based classification and retrieval of audio using the nearest feature line method
    • Li S (2000) Content-based classification and retrieval of audio using the nearest feature line method. IEEE Trans Speech Audio Process 8:619-625
    • (2000) IEEE Trans Speech Audio Process , vol.8 , pp. 619-625
    • Li, S.1
  • 28
    • 20444469135 scopus 로고    scopus 로고
    • Improving accuracy in behaviour identification for content-based retrieval by using audio and video information
    • Miyamori H (2002) Improving accuracy in behaviour identification for content-based retrieval by using audio and video information. Proceedings of IEEE ICPR02, 2:826-830
    • (2002) Proceedings of IEEE ICPR02 , vol.2 , pp. 826-830
    • Miyamori, H.1
  • 30
    • 34547322429 scopus 로고    scopus 로고
    • Moore, BCJ (ed) (1995), Hearing. Academic, Toronto
    • Moore, BCJ (ed) (1995), Hearing. Academic, Toronto
  • 34
    • 0010020774 scopus 로고    scopus 로고
    • Scanning the dial: An exploration of factors in the identification of musical style
    • Society for Music Perception and Cognition
    • Perrot, D, Gjerdigen, RO Scanning the dial: an exploration of factors in the identification of musical style. Proceedings, the 1999 Society for Music Perception and Cognition
    • (1999) Proceedings, the
    • Perrot, D.1    Gjerdigen, R.O.2
  • 38
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Reynolds DA, Rose RC (1995) Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans Speech Audio Process 3(1):72-83
    • (1995) IEEE Trans Speech Audio Process , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 40
    • 0030365534 scopus 로고    scopus 로고
    • Robust gender-dependent acoustic-phonetic modelling in continuous speech recognition based on a new automatic male female classification
    • 2, Oct
    • Rivarol V, Farhal A, O'Shaughnessy D (1996) Robust gender-dependent acoustic-phonetic modelling in continuous speech recognition based on a new automatic male female classification. Proceedings, fourth international conference on spoken language, ICSLP 96, Volume: 2 3-6 2:1081-1084 (Oct)
    • (1996) Proceedings, fourth international conference on spoken language, ICSLP , vol.96
    • Rivarol, V.1    Farhal, A.2    O'Shaughnessy, D.3
  • 41
    • 0029765670 scopus 로고    scopus 로고
    • Real time discrimination of broadcast speech/music
    • Saunders J (1996) Real time discrimination of broadcast speech/music, Proc. Of ICASSP96 2: 993-996
    • (1996) Proc. Of ICASSP96 , vol.2 , pp. 993-996
    • Saunders, J.1
  • 42
    • 0030648077 scopus 로고    scopus 로고
    • Construction and evaluation of a robust multifeature speech/music discriminator
    • Munich, Germany April
    • Scheirer E, Slaney M (1997) Construction and evaluation of a robust multifeature speech/music discriminator. Proceedings of IEEE ICASSP'97, Munich, Germany (April)
    • (1997) Proceedings of IEEE ICASSP'97
    • Scheirer, E.1    Slaney, M.2
  • 43
    • 0034843119 scopus 로고    scopus 로고
    • Experiments on speech tracking in audio documents using Gaussian mixture modeling
    • Seek M, Magrin-Chagnolleau I, Bimbot F (2001) Experiments on speech tracking in audio documents using Gaussian mixture modeling. Proceedings of IEEE ICASSP01, 1:601-604
    • (2001) Proceedings of IEEE ICASSP01 , vol.1 , pp. 601-604
    • Seek, M.1    Magrin-Chagnolleau, I.2    Bimbot, F.3
  • 47
    • 0036648502 scopus 로고    scopus 로고
    • Musical genre classification of audio signals
    • Tzanetakis G, Cook P (2002) Musical genre classification of audio signals. IEEE Trans Speech Audio Process 10(5):293-302
    • (2002) IEEE Trans Speech Audio Process , vol.10 , Issue.5 , pp. 293-302
    • Tzanetakis, G.1    Cook, P.2
  • 49
    • 85032751556 scopus 로고    scopus 로고
    • Multimedia content analysis using both audio and visual cues
    • Wang Y, Liu Z, Huang J-C (2000) Multimedia content analysis using both audio and visual cues. IEEE Signal Process Mag 116:12-36
    • (2000) IEEE Signal Process Mag , vol.116 , pp. 12-36
    • Wang, Y.1    Liu, Z.2    Huang, J.-C.3
  • 50
    • 77958036231 scopus 로고    scopus 로고
    • Speech/music discrimination based on posterior probability features
    • Williams G, Ellis D (1999) Speech/music discrimination based on posterior probability features. Proceedings of Eurospeech
    • (1999) Proceedings of Eurospeech
    • Williams, G.1    Ellis, D.2
  • 51
    • 0030242072 scopus 로고    scopus 로고
    • Content-based classification search and retrieval of audio
    • Wold E, Blum T, Keislar D, Wheaton J (1996) Content-based classification search and retrieval of audio. IEEE Multimedia Magazine 3(3):27-36
    • (1996) IEEE Multimedia Magazine , vol.3 , Issue.3 , pp. 27-36
    • Wold, E.1    Blum, T.2    Keislar, D.3    Wheaton, J.4
  • 52
    • 0035815506 scopus 로고    scopus 로고
    • Organizing sound sequences in the human brain: The interplay of auditory streaming and temporal integration
    • Elsevier
    • Yabe H et al (2001) Organizing sound sequences in the human brain: the interplay of auditory streaming and temporal integration. Brain Res 897:222-227, Elsevier
    • (2001) Brain Res , vol.897 , pp. 222-227
    • Yabe, H.1
  • 53
    • 0035340677 scopus 로고    scopus 로고
    • Audio content analysis for on-line audiovisual data segmentation
    • Zhang T, Jay Kuo C-C (2001) Audio content analysis for on-line audiovisual data segmentation. IEEE Trans Speech Audio Process 9(4):441-457
    • (2001) IEEE Trans Speech Audio Process , vol.9 , Issue.4 , pp. 441-457
    • Zhang, T.1    Jay Kuo, C.-C.2
  • 54
    • 0036888031 scopus 로고    scopus 로고
    • Zhou W, Dao S, Jay Kuo C-C (2002) On line knowledge and rule-based video classification system for video indexing and dissemination. Inf Sys 27:559-586, Elsevier
    • Zhou W, Dao S, Jay Kuo C-C (2002) On line knowledge and rule-based video classification system for video indexing and dissemination. Inf Sys 27:559-586, Elsevier


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.