Volume 14, Issue 6, 2006, Pages 2252-2263

Parametric representations of bird sounds for automatic species recognition

Author keywords

Bird song; Dynamic time warping (DTW); Feature extraction; Gaussian mixture model (GMM); Hidden Markov model (HMM); Sinusoidal modeling

Indexed keywords

AUDIO CLASSIFICATIONS; AUTOMATIC RECOGNITION; BIRD SPECIES; CEPSTRUM; CLASSIFICATION AND RECOGNITION; DYNAMIC TIME WARPING (DTW); GAUSSIAN MIXTURE MODEL (GMM); HIDDEN MARKOV MODEL (HMM); PARAMETRIC REPRESENTATIONS; SIGNAL PROCESSING TECHNIQUES; SINUSOIDAL MODELING; SPECIES RECOGNITION

EID: 34347345718     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.872624     Document Type: Article
Times cited: 180

References (37)
  • 2
    • 0004272205
    • Cambridge, U.K, Cambridge Univ. Press
    • W. H. Thorpe, Bird Song. Cambridge, U.K.: Cambridge Univ. Press, 1961.
    • (1961) Bird Song
    • Thorpe, W.H.1
  • 4
    • 34249651500
    • Song structure and microgeographic variation in a population of the Grey-cheeked Fulvetta (Alcippe morrisonia) at Shoushan nature park, southern Taiwan
    • B.-S. Shieh, "Song structure and microgeographic variation in a population of the Grey-cheeked Fulvetta (Alcippe morrisonia) at Shoushan nature park, southern Taiwan," Zool. Stud., vol. 43, no. 1, pp. 132-141, 2004.
    • (2004) Zool. Stud , vol.43 , Issue.1 , pp. 132-141
    • Shieh, B.-S.1
  • 5
    • 1242345173
    • Chickadee song structure is individually distinctive over long-broadcast distances
    • P. J. Christie, D. J. Mennill, and L. M. Ratcliffe, "Chickadee song structure is individually distinctive over long-broadcast distances," Behaviour, vol. 141, no. 1, pp. 101-124, 2004.
    • (2004) Behaviour , vol.141 , Issue.1 , pp. 101-124
    • Christie, P.J.1    Mennill, D.J.2    Ratcliffe, L.M.3
  • 6
    • 0033858382
    • A procedure for an automated measurement of song similarity
    • O. Tchernichovski, F. Nottebohm, C. E. Ho, B. Pesaran, and P. P. Mitra, "A procedure for an automated measurement of song similarity," Animal Beh., vol. 59, pp. 1167-1176, 2000.
    • (2000) Animal Beh , vol.59 , pp. 1167-1176
    • Tchernichovski, O.1    Nottebohm, F.2    Ho, C.E.3    Pesaran, B.4    Mitra, P.P.5
  • 7
    • 0001225311
    • Individual recognition of male Tawny owls (Strix aluco) using spectrograms of their territorial calls
    • P. Galeotti and G. Pavan, "Individual recognition of male Tawny owls (Strix aluco) using spectrograms of their territorial calls," Ethology, Ecology, Evol., vol. 3, no. 2, pp. 113-126, 1991.
    • (1991) Ethology, Ecology, Evol , vol.3 , Issue.2 , pp. 113-126
    • Galeotti, P.1    Pavan, G.2
  • 8
    • 0029774171
    • Application of dynamic programming matching to classification of budgerigar contact calls
    • Dec
    • K. Ito, K. Mori, and S. Iwasaki, "Application of dynamic programming matching to classification of budgerigar contact calls," J. Acoust. Soc. Amer., vol. 100, no. 6, pp. 3947-3956, Dec. 1996.
    • (1996) J. Acoust. Soc. Amer , vol.100 , Issue.6 , pp. 3947-3956
    • Ito, K.1    Mori, K.2    Iwasaki, S.3
  • 10
    • 0036578790
    • A method for parameterization of time-varying sounds
    • May
    • A. Härmä and M. Juntunen, "A method for parameterization of time-varying sounds," IEEE Signal Process. Lett., vol. 9, no. 5, pp. 151-153, May 2002.
    • (2002) IEEE Signal Process. Lett , vol.9 , Issue.5 , pp. 151-153
    • Härmä, A.1    Juntunen, M.2
  • 11
    • 0031959193
    • Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: A comparative study
    • Apr
    • J. A. Kogan and D. Margoliash, "Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparative study," J. Acoust. Soc. Amer., vol. 103, no. 4, pp. 2185-2196, Apr. 1998.
    • (1998) J. Acoust. Soc. Amer , vol.103 , Issue.4 , pp. 2185-2196
    • Kogan, J.A.1    Margoliash, D.2
  • 12
    • 0029830701
    • Template-based automatic recognition of birdsong syllables from continuous recordings
    • Aug
    • S. E. Anderson, A. S. Dave, and D. Margoliash, "Template-based automatic recognition of birdsong syllables from continuous recordings," J. Acoust. Soc. Amer., vol. 100, no. 2, pp. 1209-1219, Aug. 1996.
    • (1996) J. Acoust. Soc. Amer , vol.100 , Issue.2 , pp. 1209-1219
    • Anderson, S.E.1    Dave, A.S.2    Margoliash, D.3
  • 14
    • 0031268932
    • Birdsong recognition using backpropagation and multivariate statistics
    • Nov
    • A. L. McIlraith and H. C. Card, "Birdsong recognition using backpropagation and multivariate statistics," IEEE Trans. Signal Process., vol. 45, no. 11, pp. 2740-2748, Nov. 1997.
    • (1997) IEEE Trans. Signal Process , vol.45 , Issue.11 , pp. 2740-2748
    • McIlraith, A.L.1    Card, H.C.2
  • 15
    • 0141855203
    • Automatic recognition of bird species based on sinusoidal modeling of syllables
    • Hong Kong, China, Apr
    • A. Härmä, "Automatic recognition of bird species based on sinusoidal modeling of syllables," in IEEE Int. Conf. Acoust., Speech, Signal Processing (ICASSP), Hong Kong, China, Apr. 2003, pp. 545-548.
    • (2003) IEEE Int. Conf. Acoust., Speech, Signal Processing (ICASSP) , pp. 545-548
    • Härmä, A.1
  • 18
    • 0006287236
    • Larynx and Trachea
    • A. S. King and J. McLelland, Eds, New York: Academic
    • A. S. King and J. McLelland, Eds., "Larynx and Trachea," in Form and Function in Birds. New York: Academic, 1989, vol. 4, pp. 69-103.
    • (1989) Form and Function in Birds , vol.4 , pp. 69-103
  • 19
    • 33750309873
    • Automatic Recognition of Bird Species by Their Sounds
    • M.S. thesis, Helsinki Univ. Technol, Espoo, Finland
    • S. Fagerlund, "Automatic Recognition of Bird Species by Their Sounds," M.S. thesis, Helsinki Univ. Technol., Espoo, Finland, 2004.
    • (2004)
    • Fagerlund, S.1
  • 20
    • 0035308233
    • Classification of general audio data for content-based retrieval
    • D. Li, I. K. Sethi, N. Dimitrova, and T. McGee, "Classification of general audio data for content-based retrieval," Pattern Recognition Lett., vol. 22, pp. 533-544, 2001.
    • (2001) Pattern Recognition Lett , vol.22 , pp. 533-544
    • Li, D.1    Sethi, I.K.2    Dimitrova, N.3    McGee, T.4
  • 21
    • 0031232722
    • Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model
    • Sep
    • B. George and M. J. T. Smith, "Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model," IEEE Trans. Speech Audio Process., vol. 5, no. 5, pp. 389-406, Sep. 1997.
    • (1997) IEEE Trans. Speech Audio Process , vol.5 , Issue.5 , pp. 389-406
    • George, B.1    Smith, M.J.T.2
  • 22
    • 0002161311
    • The frequency analysis of time series for echoes: Cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking
    • Istanbul, Turkey, Jun. 5-9
    • B. P. Bogert, M. J. R. Healy, and J. W. Tukey, "The frequency analysis of time series for echoes: cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking," in Proc. Symp. Time Series Analysis, Istanbul, Turkey, Jun. 5-9, 1963, pp. 209-243.
    • (1963) Proc. Symp. Time Series Analysis , pp. 209-243
    • Bogert, B.P.1    Healy, M.J.R.2    Tukey, J.W.3
  • 24
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug
    • S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 25
    • 0029345416
    • A comparison of signal processing front ends for automatic word recognition
    • Jul
    • C. R. Jankowski, Jr., H.-D. H. Vo, and R. P. Lippmann, "A comparison of signal processing front ends for automatic word recognition," IEEE Trans. Speech Audio Process., vol. 3, no. 4, pp. 286-292, Jul. 1995.
    • (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.4 , pp. 286-292
    • Jankowski Jr., C.R.1    Vo, H.-D.H.2    Lippmann, R.P.3
  • 26
    • 0017930815
    • Dynamic programming algorithm optimization for spoken word recognition
    • Feb
    • H. Sakoe and S. Chiba, "Dynamic programming algorithm optimization for spoken word recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-26, no. 1, pp. 43-49, Feb. 1978.
    • (1978) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-26 , Issue.1 , pp. 43-49
    • Sakoe, H.1    Chiba, S.2
  • 27
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 28
    • 84935113569
    • Error bounds for convolutional codes and an asymptotically optimal decoding algorithm
    • A. J. Viterbi, "Error bounds for convolutional codes and an asymptotically optimal decoding algorithm," IEEE Trans. Inform. Theory, vol. IT-13, no. 2, pp. 260-269, Apr. 1967.
  • 29
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. Roy. Statist. Soc., Series B, vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. Roy. Statist. Soc., Series B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 31
    • 0018918171
    • An algorithm for vector quantizer design
    • Jan
    • Y. Linde, A. Buzo, and R. Gray, "An algorithm for vector quantizer design," IEEE Trans. Commun., vol. C-28, no. 1, pp. 84-95, Jan. 1980.
    • (1980) IEEE Trans. Commun , vol.C-28 , Issue.1 , pp. 84-95
    • Linde, Y.1    Buzo, A.2    Gray, R.3
  • 32
    • 0021412027
    • Vector quantization
    • Feb
    • R. Gray, "Vector quantization," IEEE Acoust., Speech, Signal Process. Mag., vol. 1, no. 2, pp. 4-29, Feb. 1984.
    • (1984) IEEE Acoust., Speech, Signal Process. Mag , vol.1 , Issue.2 , pp. 4-29
    • Gray, R.1
  • 34
    • 0025493667
    • The segmental K-means algorithm for estimating parameters of hidden Markov models
    • Sep
    • B.-H. Juang and L. R. Rabiner, "The segmental K-means algorithm for estimating parameters of hidden Markov models," IEEE Trans. Acoust., Speech, Signal Process., vol. 38, no. 9, pp. 1639-1641, Sep. 1990.
    • (1990) IEEE Trans. Acoust., Speech, Signal Process , vol.38 , Issue.9 , pp. 1639-1641
    • Juang, B.-H.1    Rabiner, L.R.2
  • 35
    • 0033694067
    • Speech recognition using temporally connected kernels in mixture density hidden Markov models
    • Istanbul, Turkey, Jun. 5-9
    • P. Somervuo, "Speech recognition using temporally connected kernels in mixture density hidden Markov models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Istanbul, Turkey, Jun. 5-9, 2000, pp. 3434-3437.
    • (2000) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 3434-3437
    • Somervuo, P.1
  • 36
    • 84869277924
    • HTK, the Hidden Markov Model Toolkit, Version 3.0
    • P. Woodland, S. Young, and G. Evermann, HTK, the Hidden Markov Model Toolkit, Version 3.0 [Online]. Available: http://htk.eng.cam.ac.uk/, 2002.
  • 37
    • 0013530135
    • A. S. King and J. McLelland, Eds, New York: Academic
    • A. S. King and J. McLelland, Eds., Form and Function in Birds. New York: Academic, 1989, vol. 4.
    • (1989) Form and Function in Birds , vol.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.