



Volume 4629 LNAI, 2007, Pages 406-414

Speech/music discrimination using mel-cepstrum modulation energy

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER MUSIC; ERROR ANALYSIS; FEATURE EXTRACTION; FREQUENCY BANDS;

EID: 38049129048     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-74628-7_53     Document Type: Conference Paper
Times cited: 3

References (18)
  • 1
    • 0030648077
    • Construction and evaluation of a robust multifeature speech/music discriminator
    • Scheirer, E., Slaney, M.: Construction and evaluation of a robust multifeature speech/music discriminator. In: Proc. ICASSP-97, vol. 2, pp. 1331-1334 (1997)
    • (1997) Proc. ICASSP-97, vol. 2, pp. 1331-1334
    • Scheirer, E.; Slaney, M.
  • 2
    • 0034792569
    • A robust audio classification and segmentation method
    • Lu, L., Jiang, H., et al.: A robust audio classification and segmentation method. In: Proc. 9th ACM Multimedia, pp. 203-211 (2001)
    • (2001) Proc. 9th ACM Multimedia, pp. 203-211
    • Lu, L.; Jiang, H.
  • 3
    • 0029765670
    • Real-time discrimination of broadcast speech/music
    • Saunders, J.: Real-time discrimination of broadcast speech/music. In: Proc. ICASSP-96, vol. 2, pp. 993-996 (1996)
    • (1996) Proc. ICASSP-96, vol. 2, pp. 993-996
    • Saunders, J.
  • 4
    • 0015553712
    • The modulation transfer function in room acoustics as a predictor of speech intelligibility
    • Houtgast, T., Steeneken, H.J.M.: The modulation transfer function in room acoustics as a predictor of speech intelligibility. Acustica 28, 66-73 (1973)
    • (1973) Acustica, vol. 28, pp. 66-73
    • Houtgast, T.; Steeneken, H.J.M.
  • 5
    • 0006507306
    • Segmentation and classification of auditory scenes in time domain
    • Asano, T., Sugiyama, M.: Segmentation and classification of auditory scenes in time domain. In: Proc. IWHIT98, pp. 13-18 (1998)
    • (1998) Proc. IWHIT98, pp. 13-18
    • Asano, T.; Sugiyama, M.
  • 6
    • 0037401304
    • Speech/music discrimination using entropy and dynamism features in a HMM classification framework
    • Ajmera, J., McCowan, I., et al.: Speech/music discrimination using entropy and dynamism features in a HMM classification framework. Speech Communication 40(3), 259-430 (2003)
    • (2003) Speech Communication, vol. 40, issue 3, pp. 259-430
    • Ajmera, J.; McCowan, I.
  • 8
    • 55349105952
    • A fusion study in speech/music classification
    • Pinquier, J., Rouas, J.L.: A fusion study in speech/music classification. In: Proc. ICME '03, vol. 1, pp. 1-409-412 (2003)
  • 9
    • 40849091837
    • Discrimination between speech and music based on a low frequency modulation feature
    • Karnebäck, S.: Discrimination between speech and music based on a low frequency modulation feature. In: Proc. Eurospeech-2001, pp. 1891-1894 (2001)
    • (2001) Proc. Eurospeech, pp. 1891-1894
    • Karnebäck, S.
  • 10
    • 0033690881
    • Musical instrument recognition using cepstral coefficients and temporal features
    • Eronen, A., Klapuri, A.: Musical instrument recognition using cepstral coefficients and temporal features. In: Proc. ICASSP '00, vol. 2, pp. II753-II756 (2000)
    • (2000) Proc. ICASSP '00, vol. 2, pp. II753-II756
    • Eronen, A.; Klapuri, A.
  • 11
    • 0032136330
    • Robust speech recognition using the modulation spectrogram
    • Kingsbury, B.E.D., Morgan, N., et al.: Robust speech recognition using the modulation spectrogram. Speech Communication 25(1-3) (1998)
    • (1998) Speech Communication, vol. 25, issue 1-3
    • Kingsbury, B.E.D.; Morgan, N.
  • 12
    • 85009212153
    • On Factorizing Spectral Dynamics for Robust Speech Recognition
    • Tyagi, V., McCowan, I., et al.: On Factorizing Spectral Dynamics for Robust Speech Recognition. In: Proc. Eurospeech-2003, pp. 981-984 (2003)
    • (2003) Proc. Eurospeech, pp. 981-984
    • Tyagi, V.; McCowan, I.
  • 13
    • 84946750132
    • Mel-cepstrum modulation spectrum (MCMS) features for robust ASR
    • Tyagi, V., McCowan, I., et al.: Mel-cepstrum modulation spectrum (MCMS) features for robust ASR. In: Proc. ASRU '03, pp. 399-404 (2003)
    • (2003) Proc. ASRU '03, pp. 399-404
    • Tyagi, V.; McCowan, I.
  • 14
    • 38049118898
    • SiTEC (Speech Information Technology and Industry Promotion Center)
    • SiTEC (Speech Information Technology and Industry Promotion Center), http://www.sitec.or.kr
  • 15
    • 38049109299
    • TIMIT Acoustic-Phonetic Continuous Speech Corpus
    • Garofolo, J.S., et al.: TIMIT Acoustic-Phonetic Continuous Speech Corpus, Linguistic Data Consortium (LDC), Philadelphia, USA
  • 16
    • 38049099699
    • Development of the RWC Music Database
    • Goto, M.: Development of the RWC Music Database. In: Puntonet, C.G., Prieto, A.G. (eds.) ICA 2004. LNCS, vol. 3195, Springer, Heidelberg (2004)
  • 17
    • 38049122527
    • KBS (Korea Broadcasting System)
    • KBS (Korea Broadcasting System), http://www.kbs.co.kr
  • 18
    • 38049154617
    • HTK (Hidden Markov Model Toolkit)
    • HTK (Hidden Markov Model Toolkit), http://htk.eng.cam.ac.uk


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.