메뉴 건너뛰기




Volumn 31, Issue 12, 2010, Pages 1543-1551

Real-world acoustic event detection

Author keywords

Acoustic Event Detection; Artificial neural network; Feature selection; Gaussian mixture model supervector; Hidden markov model; Tandem model

Indexed keywords

ACOUSTIC EVENTS; ARTIFICIAL NEURAL NETWORK FEATURES; GAUSSIAN MIXTURE MODEL; GAUSSIAN MIXTURE MODEL SUPERVECTOR; SUPERVECTOR;

EID: 77955558847     PISSN: 01678655     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patrec.2010.02.005     Document Type: Article
Times cited : (141)

References (39)
  • 1
    • 33845333816 scopus 로고    scopus 로고
    • Audio based event detection for multimedia surveillance
    • Atrey, P.K.; Maddage.; N.C.; Kankanhalli, M.S.; 2006. Audio based event detection for multimedia surveillance. In: ICASSP06.
    • (2006) ICASSP06
    • Atrey, P.K.1    Maddage, N.C.2    Kankanhalli, M.S.3
  • 2
    • 24644439101 scopus 로고    scopus 로고
    • Audio-based event detection for sports video
    • M. Baillie, and J. Jose Audio-based event detection for sports video Lecture Notes Comput. Sci. 2728 2003 61 65
    • (2003) Lecture Notes Comput. Sci. , vol.2728 , pp. 61-65
    • Baillie, M.1    Jose, J.2
  • 3
    • 0028531926 scopus 로고
    • Computational auditory scene analysis
    • G.J. Brown, and M. Cooke Computational auditory scene analysis Comput. Speech Lang. 8 1994 297 336
    • (1994) Comput. Speech Lang. , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.2
  • 4
    • 33947696754 scopus 로고    scopus 로고
    • SVM based speaker verification using a GMM supervector kernel and nap variability compensation
    • IEEE
    • W. Campbell, D. Sturim, D. Reynolds, and A. Solomonoff SVM based speaker verification using a GMM supervector kernel and nap variability compensation ICASSP 2006 vol. 1 2006 IEEE 97 100
    • (2006) ICASSP 2006 , vol.1 , pp. 97-100
    • Campbell, W.1    Sturim, D.2    Reynolds, D.3    Solomonoff, A.4
  • 7
    • 11244272075 scopus 로고    scopus 로고
    • Highlight sound effects detection in audio stream
    • Cui, R.; Lu, L.; Zhung, H.-J.; Cai, L.-H.; 2003a. Highlight sound effects detection in audio stream. In: ICME03, pp. III: 37-40.
    • (2003) ICME03 , pp. 37-40
    • Cui, R.1    Lu, L.2    Zhung . H, -J.3    Cai, L.-H.4
  • 8
    • 11244272075 scopus 로고    scopus 로고
    • Highlight sound effects detection in audio stream
    • Cui, R.; Lu, L.; Zhung, H.-J.; Cai, L.-H.; 2003b. Highlight sound effects detection in audio stream. In: ICME03, pp. III: 37-40.
    • (2003) ICME03 , pp. 37-40
    • Cui, R.1    Lu, L.2    Zhung . H, -J.3    Cai, L.-H.4
  • 11
    • 85009135386 scopus 로고    scopus 로고
    • Investigations into tandem acoustic modeling for the aurora task
    • Ellis, D.; Gomez, M.R.; 2001. Investigations into tandem acoustic modeling for the aurora task. In: Proc. Eurospeech-01. ISCA, pp. 189-192.
    • (2001) Proc. Eurospeech-01. ISCA , pp. 189-192
    • Ellis, D.1    Gomez, M.R.2
  • 13
    • 0015346024 scopus 로고
    • Maximum-likelihood sequence estimation of digital sequences in the presence of intersymbol interference
    • G.D. Forney Maximum-likelihood sequence estimation of digital sequences in the presence of intersymbol interference IEEE Trans. Inform. Theory 18 3 1972 363 378
    • (1972) IEEE Trans. Inform. Theory , vol.18 , Issue.3 , pp. 363-378
    • Forney, G.D.1
  • 15
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markovchains
    • J.-L. Gauvain, and C.-H. Lee Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markovchains IEEE Trans. Speech Audio Process. 2 1994 291 298
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 17
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature stream extraction for conventional HMM systems
    • IEEE
    • H. Hermansky, D. Ellis, and S. Sharma Tandem connectionist feature stream extraction for conventional HMM systems ICASSP 2000 vol. III 2000 IEEE 1635 1638
    • (2000) ICASSP 2000 , vol.3 , pp. 1635-1638
    • Hermansky, H.1    Ellis, D.2    Sharma, S.3
  • 18
    • 70349213420 scopus 로고    scopus 로고
    • Long-time span acoustic activity analysis from far-field sensors in smart homes
    • IEEE
    • J. Huang, X. Zhuang, V. Libal, and G. Potamianos Long-time span acoustic activity analysis from far-field sensors in smart homes ICASSP 2009 2009 IEEE
    • (2009) ICASSP 2009
    • Huang, J.1    Zhuang, X.2    Libal, V.3    Potamianos, G.4
  • 19
    • 0031078007 scopus 로고    scopus 로고
    • Feature selection: Evaluation, application, and small sample performance
    • A. Jain, and D. Zongker Feature selection: Evaluation, application, and small sample performance IEEE Trans. Pattern Anal. Machine Intell. 19 1997 153 158
    • (1997) IEEE Trans. Pattern Anal. Machine Intell. , vol.19 , pp. 153-158
    • Jain, A.1    Zongker, D.2
  • 20
    • 0026142334 scopus 로고
    • A study on speaker adaptation of the parameters of continuous density hidden markov models
    • C.-H. Lee, C.-H. Lin, and B.-H. Juang A study on speaker adaptation of the parameters of continuous density hidden markov models IEEE Trans. Signal Process. 39 4 1991 806 814
    • (1991) IEEE Trans. Signal Process. , vol.39 , Issue.4 , pp. 806-814
    • Lee, C.-H.1    Lin, C.-H.2    Juang, B.-H.3
  • 22
    • 0034296009 scopus 로고    scopus 로고
    • Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
    • L. Mangu, E. Brill, and A. Stolcke Finding consensus in speech recognition: Word error minimization and other applications of confusion networks Comput. Speech Lang. 14 4 2000 373 400
    • (2000) Comput. Speech Lang. , vol.14 , Issue.4 , pp. 373-400
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 23
    • 84908294933 scopus 로고    scopus 로고
    • Duration dependent input output markov models for audio-visual event detection
    • Naphade, M.R.; Garg, A.; Huang, T.; 2001. Duration dependent input output markov models for audio-visual event detection. In: ICME01, p. 65.
    • (2001) ICME01 , pp. 65
    • Naphade, M.R.1    Garg, A.2    Huang, T.3
  • 27
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D. Reynolds, and R. Rose Robust text-independent speaker identification using Gaussian mixture speaker models IEEE Trans. Speech Audio Process. 3 1 1995 72 83
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.1    Rose, R.2
  • 30
    • 34147210906 scopus 로고    scopus 로고
    • Establishing a gold standard for manual cough counting: Video versus digital audio recordings
    • J.A. Smith, J.E. Earis, and A.A. Woodcock Establishing a gold standard for manual cough counting: Video versus digital audio recordings Cough 2 6 2006 1 6
    • (2006) Cough , vol.2 , Issue.6 , pp. 1-6
    • Smith, J.A.1    Earis, J.E.2    Woodcock, A.A.3
  • 31
    • 0033220764 scopus 로고    scopus 로고
    • Adaptive floating search methods in feature selection
    • Somol, P.; Pudil, P.; Novoviová, J.; Paclik, P.; 1999. Adaptive floating search methods in feature selection. Pattern Recognition Lett. 20 (11-13), 1157-1163.
    • (1999) Pattern Recognition Lett. , vol.20 , Issue.11-13 , pp. 1157-1163
    • Somol, P.1
  • 33
    • 33646794668 scopus 로고    scopus 로고
    • Classification of meeting-room acoustic events with support vector machines and variable-feature-set clustering
    • Temko, A.; Nadeu, C.; 2005. Classification of meeting-room acoustic events with support vector machines and variable-feature-set clustering. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing, vol. V. pp. 505-508.
    • (2005) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing , vol.5 , pp. 505-508
    • Temko, A.1    Nadeu, C.2
  • 34
  • 36
    • 0035340677 scopus 로고    scopus 로고
    • Audio content analysis for online audiovisual data segmentation and classification
    • T. Zhang, and C.-C.J. Kuo Audio content analysis for online audiovisual data segmentation and classification IEEE Trans. Speech Audio Process. 9 4 2001 441 457
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.4 , pp. 441-457
    • Zhang, T.1    Kuo, C.-C.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.