Volume 18, Issue 3, 2010, Pages 688-707

Segmentation, indexing, and retrieval for environmental and natural sounds

Author keywords

Acoustic signal analysis; Acoustic signal detection; Bayes procedures; Clustering methods; Database query processing

Indexed keywords

ACOUSTIC SIGNAL ANALYSIS; ACOUSTIC SIGNAL DETECTION; BAYES PROCEDURE; CLUSTERING METHODS; DATABASE QUERY PROCESSING

EID: 76949085351     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2041384     Document Type: Article
Times cited: 61

References (50)
  • 1
    • 34548068861
    • Rochester VT: Destiny Books
    • R. Schafer, The Soundscape. Rochester, VT: Destiny Books, 1968.
    • (1968) The Soundscape
    • Schafer, R.1
  • 4
    • 33750550452
    • Automatic surveillance of the acoustic activity in our living environment
    • Amsterdam, The Netherlands Jul.
    • A. Harma, M. F. McKinney, and J. Skowronek, "Automatic surveillance of the acoustic activity in our living environment," in IEEE Int. Conf. Multimedia and Expo, Amsterdam, The Netherlands, Jul. 2005.
    • (2005) IEEE Int. Conf. Multimedia and Expo
    • Harma, A.1    McKinney, M.F.2    Skowronek, J.3
  • 7
    • 33745204826
    • MyLifeBits: A personal database for everything
    • J. Gemmell, G. Bell, and R. Lueder, "MyLifeBits: A personal database for everything," Commun. ACM, vol.49, no.1, pp. 88-95, 2006.
    • (2006) Commun. ACM , vol.49 , Issue.1 , pp. 88-95
    • Gemmell, J.1    Bell, G.2    Lueder, R.3
  • 9
    • 0023831656
    • A new statistical approach for automatic segmentation of continuous speech signals
    • Jan.
    • R. Andre-Obrecht, "A new statistical approach for automatic segmentation of continuous speech signals," IEEE Trans. Acoust., Speech, Signal Process., vol.36, no.1, pp. 29-40, Jan. 1988.
    • (1988) IEEE Trans. Acoust., Speech, Signal Process. , vol.36 , Issue.1 , pp. 29-40
    • Andre-Obrecht, R.1
  • 10
    • 31844447985
    • Ph.D. dissertation, Radboud Univ. of Nijmegen, Nijmegen, The Netherlands
    • A. T. Cemgil, "Bayesian Music Transcription," Ph.D. dissertation, Radboud Univ. of Nijmegen, Nijmegen, The Netherlands, 2004.
    • (2004) Bayesian Music Transcription
    • Cemgil, A.T.1
  • 14
    • 33846227904
    • Automatic meeting segmentation using dynamic Bayesian networks
    • Jan.
    • A. Dielmann and S. Renals, "Automatic meeting segmentation using dynamic Bayesian networks," IEEE Trans. Multimedia, vol.9, no.1, pp. 25-36, Jan. 2007.
    • (2007) IEEE Trans. Multimedia , vol.9 , Issue.1 , pp. 25-36
    • Dielmann, A.1    Renals, S.2
  • 15
    • 33646748325
    • Modeling individual and group actions in meetings with layered HMMs
    • May
    • D. Zhang, D. Gatica-Perez, S. Bengio, and I. McCowan, "Modeling individual and group actions in meetings with layered HMMs," IEEE Trans. Multimedia, vol.8, no.3, pp. 509-520, May 2006.
    • (2006) IEEE Trans. Multimedia , vol.8 , Issue.3 , pp. 509-520
    • Zhang, D.1    Gatica-Perez, D.2    Bengio, S.3    McCowan, I.4
  • 20
    • 0030242072
    • Content-based classification, search, and retrieval of audio
    • E. Wold, T. Blum, D. Keislar, and J. Wheaton, "Content-based classification, search, and retrieval of audio," IEEE Multimedia, vol.3, no.3, pp. 27-36, 1996.
    • (1996) IEEE Multimedia , vol.3 , Issue.3 , pp. 27-36
    • Wold, E.1    Blum, T.2    Keislar, D.3    Wheaton, J.4
  • 22
    • 0036648502
    • Musical genre classification of audio signals
    • Jul.
    • G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," IEEE Trans. Speech Audio Process., vol.10, no.5, pp. 293-302, Jul. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.5 , pp. 293-302
    • Tzanetakis, G.1    Cook, P.2
  • 25
    • 0037418225
    • Optimally sparse representation in general (nonorthogonal) dictionaries via ℓ1 minimization
    • D. L. Donoho and M. Elad, "Optimally sparse representation in general (nonorthogonal) dictionaries via ℓ1 minimization," in Proc. National Academy Sci., 2003, vol. 100, no. 5, pp. 2197-2202.
    • (2003) Proc. National Academy Sci. , vol.100 , Issue.5 , pp. 2197-2202
    • Donoho, D.L.1    Elad, M.2
  • 26
    • 76949109252
    • Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms, ETSI ES 201 108 v1.1.3 (2003-2009), 2003, E.T.S.I. standard document
    • Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms, ETSI ES 201 108 v1.1.3 (2003-2009), 2003, E.T.S.I. standard document.
  • 27
    • 0036214787
    • Yin, a fundamental frequency estimator for speech and music
    • A. de Cheveigne and H. Kawahara, "Yin, a fundamental frequency estimator for speech and music," J. Acoust. Soc. Amer., vol.111, no.4, pp. 1917-1930, 2002.
    • (2002) J. Acoust. Soc. Amer. , vol.111 , Issue.4 , pp. 1917-1930
    • De Cheveigne, A.1    Kawahara, H.2
  • 28
    • 0015756315
    • An optimum processor theory for the central formation of the pitch of complex tones
    • J. L. Goldstein, "An optimum processor theory for the central formation of the pitch of complex tones," J. Acoust. Soc. Amer., vol.54, no.6, pp. 1496-1516, 1973.
    • (1973) J. Acoust. Soc. Amer. , vol.54 , Issue.6 , pp. 1496-1516
    • Goldstein, J.L.1
  • 29
    • 23944465437
    • A new probabilistic spectral pitch estimator: Exact and MCMC-approximate strategies
    • U. K. Wiil, Ed. New York: Springer-Verlag
    • H. Thornburg and R. J. Leistikow, "A new probabilistic spectral pitch estimator: Exact and MCMC-approximate strategies," in Lecture Notes in Computer Science 3310, U. K. Wiil, Ed. New York: Springer-Verlag, 2005.
    • (2005) Lecture Notes in Computer Science 3310
    • Thornburg, H.1    Leistikow, R.J.2
  • 30
    • 47649111947
    • Melody extraction and musical onset detection via probabilistic models of STFT peak data
    • May
    • H. Thornburg, R. Leistikow, and J. Berger, "Melody extraction and musical onset detection via probabilistic models of STFT peak data," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.4, pp. 1257-1272, May 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1257-1272
    • Thornburg, H.1    Leistikow, R.2    Berger, J.3
  • 32
    • 46749144025
    • A dynamic Bayesian network approach to tracking learned switching dynamic models
    • Pittsburgh, PA
    • V. Pavlovic, J. M. Rehg, and T. Cham, "A dynamic Bayesian network approach to tracking learned switching dynamic models," in Proc. Int. Workshop Hybrid Syst., Pittsburgh, PA, 2000.
    • (2000) Proc. Int. Workshop Hybrid Syst.
    • Pavlovic, V.1    Rehg, J.M.2    Cham, T.3
  • 34
    • 33750392431
    • Accessing minimal-impact personal audio archives
    • Jul.
    • D. Ellis and K. Lee, "Accessing minimal-impact personal audio archives," IEEE Multimedia, vol.13, no.4, pp. 30-38, Jul. 2006.
    • (2006) IEEE Multimedia , vol.13 , Issue.4 , pp. 30-38
    • Ellis, D.1    Lee, K.2
  • 35
    • 70349471166
    • Multi-channel audio segmentation for continuous observation and archival of large spaces
    • Taipei, Taiwan
    • G. Wichern, H. Thornburg, and A. Spanias, "Multi-channel audio segmentation for continuous observation and archival of large spaces," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Taipei, Taiwan, 2009, pp. 237-240.
    • (2009) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 237-240
    • Wichern, G.1    Thornburg, H.2    Spanias, A.3
  • 37
    • 0016355478
    • A new look at the statistical model identification
    • Mar.
    • H. Akaike, "A new look at the statistical model identification," IEEE Trans. Autom. Control, vol.AC-19, no.6, pp. 716-723, Mar. 1974.
    • (1974) IEEE Trans. Autom. Control , vol.AC-19 , Issue.6 , pp. 716-723
    • Akaike, H.1
  • 38
    • 0042553279
    • Smoothing and differentiation of data by simplified least squares procedures
    • A. Savitzky and M. J. Golay, "Smoothing and differentiation of data by simplified least squares procedures," Anal. Chem., vol.36, no.8, pp. 1627-1639, 1964.
    • (1964) Anal. Chem. , vol.36 , Issue.8 , pp. 1627-1639
    • Savitzky, A.1    Golay, M.J.2
  • 39
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol.77, no.2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 40
    • 84899013108
    • On spectral clustering: Analysis and an algorithm
    • Vancouver, BC, Canada
    • A. Y. Ng, M. Jordan, and Y. Weiss, "On spectral clustering: Analysis and an algorithm," in Adv. Neural Inf. Process. Syst., Vancouver, BC, Canada, 2002.
    • (2002) Adv. Neural Inf. Process. Syst.
    • Ng, A.Y.1    Jordan, M.2    Weiss, Y.3
  • 41
    • 0022018101
    • A probabilistic distance measure for hidden Markov models
    • B. H. Huang and L. R. Rabiner, "A probabilistic distance measure for hidden Markov models," AT&T Tech. J., vol.64, no.2, pp. 1251-1270, 1985.
    • (1985) AT&T Tech. J. , vol.64 , Issue.2 , pp. 1251-1270
    • Huang, B.H.1    Rabiner, L.R.2
  • 42
  • 44
    • 0000120766
    • Estimating the dimension of a model
    • G. Schwarz, "Estimating the dimension of a model," Ann. Statist., vol.6, no.2, pp. 461-464, 1978.
    • (1978) Ann. Statist. , vol.6 , Issue.2 , pp. 461-464
    • Schwarz, G.1
  • 49
    • 8644267670
    • Conceptnet: A practical commonsense reasoning toolkit
    • H. Liu and P. Singh, "Conceptnet: A practical commonsense reasoning toolkit," BT Technol. J., vol.22, no.4, pp. 211-226, 2004.
    • (2004) BT Technol. J. , vol.22 , Issue.4 , pp. 211-226
    • Liu, H.1    Singh, P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.