-
1
-
-
0030648077
-
Construction and evaluation of a robust multifeature speech/music discriminator
-
Scheirer, E., Slaney, M.: Construction and evaluation of a robust multifeature speech/music discriminator. In: Proc. ICASSP-97, vol. 2, pp. 1331-1334 (1997)
-
(1997)
Proc. ICASSP-97
, vol.2
, pp. 1331-1334
-
-
Scheirer, E.1
Slaney, M.2
-
2
-
-
0034792569
-
A robust audio classification and segmentation method
-
Lu, L., Jiang, H., et al.: A robust audio classification and segmentation method. In: Proc. 9th ACM Multimedia, pp. 203-211 (2001)
-
(2001)
Proc. 9th ACM Multimedia
, pp. 203-211
-
-
Lu, L.1
Jiang, H.2
-
3
-
-
0029765670
-
Real-time discrimination of broadcast speech/music In
-
Saunders, J.: Real-time discrimination of broadcast speech/music In: Proc ICASSP-96, vol. 2, pp. 993-996 (1996)
-
(1996)
Proc ICASSP-96
, vol.2
, pp. 993-996
-
-
Saunders, J.1
-
4
-
-
0015553712
-
The modulation transfer function in room acoustics as a predictor of speech intelligibility
-
Houtgast, T., Steeneken, H.J.M.: The modulation transfer function in room acoustics as a predictor of speech intelligibility. Acoustica 28, 66-73 (1973)
-
(1973)
Acoustica
, vol.28
, pp. 66-73
-
-
Houtgast, T.1
Steeneken, H.J.M.2
-
5
-
-
0006507306
-
Segmentation and classification of auditory scenes in time domain
-
Asano, T., Sugiyama, M.: Segmentation and classification of auditory scenes in time domain. In: Proc. IWHIT98, pp. 13-18 (1998)
-
(1998)
Proc. IWHIT98
, pp. 13-18
-
-
Asano, T.1
Sugiyama, M.2
-
6
-
-
0037401304
-
Speech/music discrimination using entropy and dynamism features in a HMM classification framework
-
Ajmera, J., McCowan, I., et al.: Speech/music discrimination using entropy and dynamism features in a HMM classification framework. Speech communication 40(3), 259-430 (2003)
-
(2003)
Speech communication
, vol.40
, Issue.3
, pp. 259-430
-
-
Ajmera, J.1
McCowan, I.2
-
7
-
-
33646375426
-
Speech/non-speech segmentation based on phoneme recognition features
-
Žibert, J., Pavešić, N., Mihelič, F.: Speech/non-speech segmentation based on phoneme recognition features. EURASIP Journal on Applied Signal Processing, Article ID 90945 2006(6), 1-13 (2006)
-
(2006)
EURASIP Journal on Applied Signal Processing, Article ID 90945
, Issue.6
, pp. 1-13
-
-
Žibert, J.1
Pavešić, N.2
Mihelič, F.3
-
8
-
-
55349105952
-
-
Pinquier, J., Rouas, J.L.: A fusion study in speech/music classification. In: Proc. ICME '03, 1, pp. 1-409-412 (2003)
-
Pinquier, J., Rouas, J.L.: A fusion study in speech/music classification. In: Proc. ICME '03, vol. 1, pp. 1-409-412 (2003)
-
-
-
-
9
-
-
40849091837
-
Discrimination between speech and music based on a low frequency modulation feature
-
2001
-
Karnebäck, S.: Discrimination between speech and music based on a low frequency modulation feature. In: Proc Eurospeech-2001, pp. 1891-1894 (2001)
-
(2001)
Proc Eurospeech
, pp. 1891-1894
-
-
Karnebäck, S.1
-
10
-
-
0033690881
-
Musical instrument recognition using cepstral coefficients and temporal features
-
Eronen, A., Klapuri, A.: Musical instrument recognition using cepstral coefficients and temporal features. In: Proc. ICASSP '00, vol. 2, pp. II753-II756 (2000)
-
(2000)
Proc. ICASSP '00
, vol.2
-
-
Eronen, A.1
Klapuri, A.2
-
11
-
-
0032136330
-
Robust speech recognition using the modulation spectrogram
-
Kingsbury, B.E.D., Morgan, N., et al.: Robust speech recognition using the modulation spectrogram, Speech Communication, 25(1-3) (1998)
-
(1998)
Speech Communication
, vol.25
, Issue.1-3
-
-
Kingsbury, B.E.D.1
Morgan, N.2
-
12
-
-
85009212153
-
On Factorizing Spectral Dynamics for Robust Speech Recognition
-
2003
-
Tyagi, V., McCowan, I., et al.: On Factorizing Spectral Dynamics for Robust Speech Recognition. In: Proc. Eurospeech-2003, pp. 981-984 (2003)
-
(2003)
Proc. Eurospeech
, pp. 981-984
-
-
Tyagi, V.1
McCowan, I.2
-
13
-
-
84946750132
-
Mel-cepstrum modulation spectrum (MCMS) features for robust ASR
-
Tyagi, V., McCowan, I., et al.: Mel-cepstrum modulation spectrum (MCMS) features for robust ASR. In: Proc. ASRU '03, pp. 399-404 (2003)
-
(2003)
Proc. ASRU '03
, pp. 399-404
-
-
Tyagi, V.1
McCowan, I.2
-
14
-
-
38049118898
-
-
SiTEC Speech Information Technology and Industry Promotion Center
-
SiTEC (Speech Information Technology and Industry Promotion Center), http://www.sitec.or.kr
-
-
-
-
15
-
-
38049109299
-
-
Garofolo, John, S, et al, TIMIT Acoustic-Phonetic Continuous Speech Corpus, Linguistic Data Consortium LDC, Philadelphia, USA
-
Garofolo, John, S., et al.: TIMIT Acoustic-Phonetic Continuous Speech Corpus, Linguistic Data Consortium (LDC), Philadelphia, USA
-
-
-
-
16
-
-
38049099699
-
-
Goto, M.: Development of the RWC Music Database. In: Puntonet, CG., Prieto, A.G. (eds.) ICA 2004. LNCS, 3195, Springer, Heidelberg (2004)
-
Goto, M.: Development of the RWC Music Database. In: Puntonet, CG., Prieto, A.G. (eds.) ICA 2004. LNCS, vol. 3195, Springer, Heidelberg (2004)
-
-
-
-
17
-
-
38049122527
-
-
KBS Korea Broadcasting System
-
KBS (Korea Broadcasting System), http://www.kbs.co.kr
-
-
-
-
18
-
-
38049154617
-
-
HTK Hidden Markov Model Toolkit
-
HTK (Hidden Markov Model Toolkit), http://htk.eng.cam.ac.uk
-
-
-
|