SCOPUS 정보 검색 플랫폼

Volumn 31, Issue 12, 2010, Pages 1543-1551

Real-world acoustic event detection

(4) Zhuang, Xiaodan a Zhou, Xi a Hasegawa Johnson, Mark A a Huang, Thomas S a

a University of Illinois at Urbana Champaign (United States)

Author keywords

Acoustic Event Detection; Artificial neural network; Feature selection; Gaussian mixture model supervector; Hidden markov model; Tandem model

Indexed keywords

ACOUSTIC EVENTS; ARTIFICIAL NEURAL NETWORK FEATURES; GAUSSIAN MIXTURE MODEL; GAUSSIAN MIXTURE MODEL SUPERVECTOR; SUPERVECTOR;

AUDIO ACOUSTICS; CHEMICAL DETECTION; FEATURE EXTRACTION; HIDDEN MARKOV MODELS; IMAGE SEGMENTATION; INFORMATION RETRIEVAL SYSTEMS; OBJECT RECOGNITION;

NEURAL NETWORKS;

EID: 77955558847 PISSN: 01678655 EISSN: None Source Type: Journal
DOI: 10.1016/j.patrec.2010.02.005 Document Type: Article

Times cited : (141)

References (39)

1
- 33845333816
- Audio based event detection for multimedia surveillance
- Atrey, P.K.; Maddage.; N.C.; Kankanhalli, M.S.; 2006. Audio based event detection for multimedia surveillance. In: ICASSP06.
- (2006) ICASSP06
- Atrey, P.K.¹ Maddage, N.C.² Kankanhalli, M.S.³

2
- 24644439101
- Audio-based event detection for sports video
- M. Baillie, and J. Jose Audio-based event detection for sports video Lecture Notes Comput. Sci. 2728 2003 61 65
- (2003) Lecture Notes Comput. Sci. , vol.2728 , pp. 61-65
- Baillie, M.¹ Jose, J.²

3
- 0028531926
- Computational auditory scene analysis
- G.J. Brown, and M. Cooke Computational auditory scene analysis Comput. Speech Lang. 8 1994 297 336
- (1994) Comput. Speech Lang. , vol.8 , pp. 297-336
- Brown, G.J.¹ Cooke, M.²

4
- 33947696754
- SVM based speaker verification using a GMM supervector kernel and nap variability compensation
- IEEE
- W. Campbell, D. Sturim, D. Reynolds, and A. Solomonoff SVM based speaker verification using a GMM supervector kernel and nap variability compensation ICASSP 2006 vol. 1 2006 IEEE 97 100
- (2006) ICASSP 2006 , vol.1 , pp. 97-100
- Campbell, W.¹ Sturim, D.² Reynolds, D.³ Solomonoff, A.⁴

5
- 0003710380
- Chang, C.-C.; Lin, C.-J.; 2001. LIBSVM: A library for support vector machines. Software available at .
- (2001) LIBSVM: A Library for Support Vector Machines
- Chang . C, -C.¹ Lin, C.-J.²

6
- 33750548608
- Events detection for an audio-based surveillance system
- Clavel, C.; Ehrette, T.; Richard, G.; 2005. Events detection for an audio-based surveillance system. In: IEEE Internat. Conf. on Multimedia and Expo.; pp. 1306-1309.
- (2005) IEEE Internat. Conf. on Multimedia and Expo. , pp. 1306-1309
- Clavel, C.¹ Ehrette, T.² Richard, G.³

7
- 11244272075
- Highlight sound effects detection in audio stream
- Cui, R.; Lu, L.; Zhung, H.-J.; Cai, L.-H.; 2003a. Highlight sound effects detection in audio stream. In: ICME03, pp. III: 37-40.
- (2003) ICME03 , pp. 37-40
- Cui, R.¹ Lu, L.² Zhung . H, -J.³ Cai, L.-H.⁴

8
- 11244272075
- Highlight sound effects detection in audio stream
- Cui, R.; Lu, L.; Zhung, H.-J.; Cai, L.-H.; 2003b. Highlight sound effects detection in audio stream. In: ICME03, pp. III: 37-40.
- (2003) ICME03 , pp. 37-40
- Cui, R.¹ Lu, L.² Zhung . H, -J.³ Cai, L.-H.⁴

9
- 0003922190
- John Wiley & Sons New York
- R.O. Duda, P.E. Hart, and D.G. Stork Pattern Classification 2001 John Wiley & Sons New York
- (2001) Pattern Classification
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

10
- 0003794341
- Ph.D. Thesis, MIT
- Ellis, D.; 1996. Prediction-driven computational auditory scene analysis. Ph.D. Thesis, MIT.
- (1996) Prediction-driven Computational Auditory Scene Analysis
- Ellis, D.¹

11
- 85009135386
- Investigations into tandem acoustic modeling for the aurora task
- Ellis, D.; Gomez, M.R.; 2001. Investigations into tandem acoustic modeling for the aurora task. In: Proc. Eurospeech-01. ISCA, pp. 189-192.
- (2001) Proc. Eurospeech-01. ISCA , pp. 189-192
- Ellis, D.¹ Gomez, M.R.²

12
- 0034848926
- Tandem acoustic modeling in large-vocabulary recognition
- Ellis, D.; Singh, R.; Sivadas, S.; 2001. Tandem acoustic modeling in large-vocabulary recognition. In: Proc. IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing, 2001 (ICASSP '01), vol. 1. pp. 517-520.
- (2001) Proc. IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing, 2001 (ICASSP '01) , vol.1 , pp. 517-520
- Ellis, D.¹ Singh, R.² Sivadas, S.³

13
- 0015346024
- Maximum-likelihood sequence estimation of digital sequences in the presence of intersymbol interference
- G.D. Forney Maximum-likelihood sequence estimation of digital sequences in the presence of intersymbol interference IEEE Trans. Inform. Theory 18 3 1972 363 378
- (1972) IEEE Trans. Inform. Theory , vol.18 , Issue.3 , pp. 363-378
- Forney, G.D.¹

14
- 0001963082
- A short introduction to boosting
- Y. Freund, and R.E. Schapire A short introduction to boosting J. Japanese Soc. Artif. Intell. 14 5 1999 771 780
- (1999) J. Japanese Soc. Artif. Intell. , vol.14 , Issue.5 , pp. 771-780
- Freund, Y.¹ Schapire, R.E.²

15
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markovchains
- J.-L. Gauvain, and C.-H. Lee Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markovchains IEEE Trans. Speech Audio Process. 2 1994 291 298
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

16
- 33746259957
- Mel cepstrum, deltas, double deltas, - What else is new?
- Hermansky, H.; 1999. Mel cepstrum, deltas, double deltas, - what else is new? In: Proc. Robust Methods for Speech Recognition in Adverse Condition.
- (1999) Proc. Robust Methods for Speech Recognition in Adverse Condition
- Hermansky, H.¹

17
- 0033709098
- Tandem connectionist feature stream extraction for conventional HMM systems
- IEEE
- H. Hermansky, D. Ellis, and S. Sharma Tandem connectionist feature stream extraction for conventional HMM systems ICASSP 2000 vol. III 2000 IEEE 1635 1638
- (2000) ICASSP 2000 , vol.3 , pp. 1635-1638
- Hermansky, H.¹ Ellis, D.² Sharma, S.³

18
- 70349213420
- Long-time span acoustic activity analysis from far-field sensors in smart homes
- IEEE
- J. Huang, X. Zhuang, V. Libal, and G. Potamianos Long-time span acoustic activity analysis from far-field sensors in smart homes ICASSP 2009 2009 IEEE
- (2009) ICASSP 2009
- Huang, J.¹ Zhuang, X.² Libal, V.³ Potamianos, G.⁴

19
- 0031078007
- Feature selection: Evaluation, application, and small sample performance
- A. Jain, and D. Zongker Feature selection: Evaluation, application, and small sample performance IEEE Trans. Pattern Anal. Machine Intell. 19 1997 153 158
- (1997) IEEE Trans. Pattern Anal. Machine Intell. , vol.19 , pp. 153-158
- Jain, A.¹ Zongker, D.²

20
- 0026142334
- A study on speaker adaptation of the parameters of continuous density hidden markov models
- C.-H. Lee, C.-H. Lin, and B.-H. Juang A study on speaker adaptation of the parameters of continuous density hidden markov models IEEE Trans. Signal Process. 39 4 1991 806 814
- (1991) IEEE Trans. Signal Process. , vol.39 , Issue.4 , pp. 806-814
- Lee, C.-H.¹ Lin, C.-H.² Juang, B.-H.³

21
- 0009985115
- Mel frequency Cepstral coefficients for music modeling
- Logan, B.; 2000. Mel frequency Cepstral coefficients for music modeling. In: Proc. Internat. Conf. on Music Information Retrieval.
- (2000) Proc. Internat. Conf. on Music Information Retrieval
- Logan, B.¹

22
- 0034296009
- Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
- L. Mangu, E. Brill, and A. Stolcke Finding consensus in speech recognition: Word error minimization and other applications of confusion networks Comput. Speech Lang. 14 4 2000 373 400
- (2000) Comput. Speech Lang. , vol.14 , Issue.4 , pp. 373-400
- Mangu, L.¹ Brill, E.² Stolcke, A.³

23
- 84908294933
- Duration dependent input output markov models for audio-visual event detection
- Naphade, M.R.; Garg, A.; Huang, T.; 2001. Duration dependent input output markov models for audio-visual event detection. In: ICME01, p. 65.
- (2001) ICME01 , pp. 65
- Naphade, M.R.¹ Garg, A.² Huang, T.³

24
- 85009291610
- Robust speech/ music classification in audio document
- 2005-2008
- Pinquier, J.; 2002. Robust speech/ music classification in audio document. In: Proc. Internat. Conf. on Spoken Language Processing (ICSLP), pp. III: 2005-2008.
- (2002) Proc. Internat. Conf. on Spoken Language Processing (ICSLP)
- Pinquier, J.¹

25
- 85115260483
- Floating search methods for feature selection with nonmonotonic criterion functions
- Pudil, P.; Ferri, F.; Novovicova, J.; Kittler, J.; 1994. Floating search methods for feature selection with nonmonotonic criterion functions. In: Proc. 12th IAPR Internat. Conf. on Pattern Recognition, 1994. Conference B: Computer Vision and Image Processing, vol. 2. pp. 279-283.
- (1994) Proc. 12th IAPR Internat. Conf. on Pattern Recognition, 1994. Conference B: Computer Vision and Image Processing , vol.2 , pp. 279-283
- Pudil, P.¹ Ferri, F.² Novovicova, J.³ Kittler, J.⁴

26
- 0342502195
- Soft margins for AdaBoost
- G. Ratsch, T. Onoda, and K.-R. Muller Soft margins for AdaBoost IEEE Trans. Signal Process. 42 2001 287 320
- (2001) IEEE Trans. Signal Process. , vol.42 , pp. 287-320
- Ratsch, G.¹ Onoda, T.² Muller, K.-R.³

27
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- D. Reynolds, and R. Rose Robust text-independent speaker identification using Gaussian mixture speaker models IEEE Trans. Speech Audio Process. 3 1 1995 72 83
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.¹ Rose, R.²

28
- 77955560781
- Tech. Rep. 491, MIT Media Laboratory Perceptual Computing Section
- Scheirer, E.D.; 1999. Sound scene segmentation by dynamic detection of correlogram comodulation. Tech. Rep. 491, MIT Media Laboratory Perceptual Computing Section.
- (1999) Sound Scene Segmentation by Dynamic Detection of Correlogram Comodulation
- Scheirer, E.D.¹

29
- 0004094721
- MIT Press Cambridge, MA, US
- B. Schlkopf, and A. Smola Learning with Kernels 2002 MIT Press Cambridge, MA, US
- (2002) Learning with Kernels
- Schlkopf, B.¹ Smola, A.²

30
- 34147210906
- Establishing a gold standard for manual cough counting: Video versus digital audio recordings
- J.A. Smith, J.E. Earis, and A.A. Woodcock Establishing a gold standard for manual cough counting: Video versus digital audio recordings Cough 2 6 2006 1 6
- (2006) Cough , vol.2 , Issue.6 , pp. 1-6
- Smith, J.A.¹ Earis, J.E.² Woodcock, A.A.³

31
- 0033220764
- Adaptive floating search methods in feature selection
- Somol, P.; Pudil, P.; Novoviová, J.; Paclik, P.; 1999. Adaptive floating search methods in feature selection. Pattern Recognition Lett. 20 (11-13), 1157-1163.
- (1999) Pattern Recognition Lett. , vol.20 , Issue.11-13 , pp. 1157-1163
- Somol, P.¹

32
- 77955550639
- Temko, A.; 2007. CLEAR 2007 AED evaluation plan and workshop. .
- (2007) CLEAR 2007 AED Evaluation Plan and Workshop
- Temko, A.¹

33
- 33646794668
- Classification of meeting-room acoustic events with support vector machines and variable-feature-set clustering
- Temko, A.; Nadeu, C.; 2005. Classification of meeting-room acoustic events with support vector machines and variable-feature-set clustering. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing, vol. V. pp. 505-508.
- (2005) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing , vol.5 , pp. 505-508
- Temko, A.¹ Nadeu, C.²

34
- 47749089085
- Acoustic event detection and classification in smart-room environments: Evaluation of CHIL project systems
- November
- Temko, A.; Malkin, R.; Zieger, C.; Macho, D.; Nadeu, C.; Omologo, M.; 2006. Acoustic event detection and classification in smart-room environments: Evaluation of CHIL project systems. IV Jornadas en Tecnologia del Habla November.
- (2006) IV Jornadas en Tecnologia Del Habla
- Temko, A.¹ Malkin, R.² Zieger, C.³ MacHo, D.⁴ Nadeu, C.⁵ Omologo, M.⁶

35
- 77955558772
- Tidigits
- Tidigits, 1993. Linguistic Data Consortium Catalog No. LDC93S10.
- (1993) Linguistic Data Consortium Catalog No. LDC93S10

36
- 0035340677
- Audio content analysis for online audiovisual data segmentation and classification
- T. Zhang, and C.-C.J. Kuo Audio content analysis for online audiovisual data segmentation and classification IEEE Trans. Speech Audio Process. 9 4 2001 441 457
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.4 , pp. 441-457
- Zhang, T.¹ Kuo, C.-C.J.²

37
- 47749116035
- HMM-based acoustic event detection with AdaBoost feature selection
- Zhou, X.; Zhuang, X.; Liu, M.; Tang, H.; Hasegawa-Johnson, M.; Huang, T.; 2007. HMM-based acoustic event detection with AdaBoost feature selection. In: Classification of Events, Activities and Relationships Evaluation and Workshop, pp. 345-353.
- (2007) Classification of Events, Activities and Relationships Evaluation and Workshop , pp. 345-353
- Zhou, X.¹ Zhuang, X.² Liu, M.³ Tang, H.⁴ Hasegawa-Johnson, M.⁵ Huang, T.⁶

38
- 51449090402
- Intersession variability compensation for language detection
- Zhou, X.; Navrátil, J.; Pelecanos, J.W.; Ramaswamy, G.N.; Huang, T.S.; 2008. Intersession variability compensation for language detection. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing.
- (2008) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing
- Zhou, X.¹ Navrátil, J.² Pelecanos . J, W.³ Ramaswamy . G, N.⁴ Huang, T.S.⁵

39
- 51449101221
- Feature analysis and selection for acoustic event detection
- (ICASSP '08)
- Zhuang, X.; Zhou, X.; Huang, T.S.; Hasegawa-Johnson, M.; 2008. Feature analysis and selection for acoustic event detection. In: Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing, 2008 (ICASSP '08), pp.17-20.
- (2008) Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing , pp. 17-20
- Zhuang, X.¹ Zhou, X.² Huang . T, S.³ Hasegawa-Johnson, M.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.