SCOPUS 정보 검색 플랫폼

Volumn 4629 LNAI, Issue , 2007, Pages 406-414

Speech/music discrimination using mel-cepstrum modulation energy

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER MUSIC; ERROR ANALYSIS; FEATURE EXTRACTION; FREQUENCY BANDS;

CEPSTRAL COEFFICIENTS; CEPSTRAL FLUX (CF); ERROR REDUCTION; MEL-CEPSTRUM MODULATION ENERGY (MCME); MODULATION FREQUENCY;

SPEECH PROCESSING;

EID: 38049129048 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-540-74628-7_53 Document Type: Conference Paper

Times cited : (3)

References (18)

1
- 0030648077
- Construction and evaluation of a robust multifeature speech/music discriminator
- Scheirer, E., Slaney, M.: Construction and evaluation of a robust multifeature speech/music discriminator. In: Proc. ICASSP-97, vol. 2, pp. 1331-1334 (1997)
- (1997) Proc. ICASSP-97 , vol.2 , pp. 1331-1334
- Scheirer, E.¹ Slaney, M.²

2
- 0034792569
- A robust audio classification and segmentation method
- Lu, L., Jiang, H., et al.: A robust audio classification and segmentation method. In: Proc. 9th ACM Multimedia, pp. 203-211 (2001)
- (2001) Proc. 9th ACM Multimedia , pp. 203-211
- Lu, L.¹ Jiang, H.²

3
- 0029765670
- Real-time discrimination of broadcast speech/music In
- Saunders, J.: Real-time discrimination of broadcast speech/music In: Proc ICASSP-96, vol. 2, pp. 993-996 (1996)
- (1996) Proc ICASSP-96 , vol.2 , pp. 993-996
- Saunders, J.¹

4
- 0015553712
- The modulation transfer function in room acoustics as a predictor of speech intelligibility
- Houtgast, T., Steeneken, H.J.M.: The modulation transfer function in room acoustics as a predictor of speech intelligibility. Acoustica 28, 66-73 (1973)
- (1973) Acoustica , vol.28 , pp. 66-73
- Houtgast, T.¹ Steeneken, H.J.M.²

5
- 0006507306
- Segmentation and classification of auditory scenes in time domain
- Asano, T., Sugiyama, M.: Segmentation and classification of auditory scenes in time domain. In: Proc. IWHIT98, pp. 13-18 (1998)
- (1998) Proc. IWHIT98 , pp. 13-18
- Asano, T.¹ Sugiyama, M.²

6
- 0037401304
- Speech/music discrimination using entropy and dynamism features in a HMM classification framework
- Ajmera, J., McCowan, I., et al.: Speech/music discrimination using entropy and dynamism features in a HMM classification framework. Speech communication 40(3), 259-430 (2003)
- (2003) Speech communication , vol.40 , Issue.3 , pp. 259-430
- Ajmera, J.¹ McCowan, I.²

7
- 33646375426
- Speech/non-speech segmentation based on phoneme recognition features
- Žibert, J., Pavešić, N., Mihelič, F.: Speech/non-speech segmentation based on phoneme recognition features. EURASIP Journal on Applied Signal Processing, Article ID 90945 2006(6), 1-13 (2006)
- (2006) EURASIP Journal on Applied Signal Processing, Article ID 90945 , Issue.6 , pp. 1-13
- Žibert, J.¹ Pavešić, N.² Mihelič, F.³

8
- 55349105952
- Pinquier, J., Rouas, J.L.: A fusion study in speech/music classification. In: Proc. ICME '03, 1, pp. 1-409-412 (2003)
- Pinquier, J., Rouas, J.L.: A fusion study in speech/music classification. In: Proc. ICME '03, vol. 1, pp. 1-409-412 (2003)

10
- 0033690881
- Musical instrument recognition using cepstral coefficients and temporal features
- Eronen, A., Klapuri, A.: Musical instrument recognition using cepstral coefficients and temporal features. In: Proc. ICASSP '00, vol. 2, pp. II753-II756 (2000)
- (2000) Proc. ICASSP '00 , vol.2
- Eronen, A.¹ Klapuri, A.²

11
- 0032136330
- Robust speech recognition using the modulation spectrogram
- Kingsbury, B.E.D., Morgan, N., et al.: Robust speech recognition using the modulation spectrogram, Speech Communication, 25(1-3) (1998)
- (1998) Speech Communication , vol.25 , Issue.1-3
- Kingsbury, B.E.D.¹ Morgan, N.²

13
- 84946750132
- Mel-cepstrum modulation spectrum (MCMS) features for robust ASR
- Tyagi, V., McCowan, I., et al.: Mel-cepstrum modulation spectrum (MCMS) features for robust ASR. In: Proc. ASRU '03, pp. 399-404 (2003)
- (2003) Proc. ASRU '03 , pp. 399-404
- Tyagi, V.¹ McCowan, I.²

14
- 38049118898
- SiTEC Speech Information Technology and Industry Promotion Center
- SiTEC (Speech Information Technology and Industry Promotion Center), http://www.sitec.or.kr

15
- 38049109299
- Garofolo, John, S, et al, TIMIT Acoustic-Phonetic Continuous Speech Corpus, Linguistic Data Consortium LDC, Philadelphia, USA
- Garofolo, John, S., et al.: TIMIT Acoustic-Phonetic Continuous Speech Corpus, Linguistic Data Consortium (LDC), Philadelphia, USA

16
- 38049099699
- Goto, M.: Development of the RWC Music Database. In: Puntonet, CG., Prieto, A.G. (eds.) ICA 2004. LNCS, 3195, Springer, Heidelberg (2004)
- Goto, M.: Development of the RWC Music Database. In: Puntonet, CG., Prieto, A.G. (eds.) ICA 2004. LNCS, vol. 3195, Springer, Heidelberg (2004)

17
- 38049122527
- KBS Korea Broadcasting System
- KBS (Korea Broadcasting System), http://www.kbs.co.kr

18
- 38049154617
- HTK Hidden Markov Model Toolkit
- HTK (Hidden Markov Model Toolkit), http://htk.eng.cam.ac.uk

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.