-
1
-
-
0030648077
-
Construction and evaluation of a robust multifeature speech/music discriminator
-
IEEE
-
Scheirer, E., Slaney, M.: Construction and evaluation of a robust multifeature speech/music discriminator. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'97, IEEE), Vol. 2, pp. 1331-1334 (1997)
-
(1997)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'97)
, vol.2
, pp. 1331-1334
-
-
Scheirer, E.1
Slaney, M.2
-
2
-
-
33746879653
-
A multifeature speech/music discrimination system
-
IEEE
-
Saad, E.M., El-Adawy, M.I., Abu-El-Wata, M.E., Wahba, A.A.: A multifeature speech/music discrimination system. In: Proceedings of the 19th National Radio Science Conference (NRSC'02, IEEE), pp. 208-213 (2002)
-
(2002)
Proceedings of the 19th National Radio Science Conference (NRSC'02)
, pp. 208-213
-
-
Saad, E.M.1
El-Adawy, M.I.2
Abu-El-Wata, M.E.3
Wahba, A.A.4
-
3
-
-
0029765670
-
Real-time discrimination of broadcast speech/music
-
IEEE
-
Saunders, J.: Real-time discrimination of broadcast speech/music. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96, IEEE), Vol. 2, pp. 993-996 (1996)
-
(1996)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96)
, vol.2
, pp. 993-996
-
-
Saunders, J.1
-
4
-
-
0032638667
-
A comparison of features for speech, music discrimination
-
IEEE
-
Carey, M.J., Parris, E.S., Lloyd-Thomas, H.: A comparison of features for speech, music discrimination. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'99, IEEE), Vol. 1, pp. 149-152 (1999)
-
(1999)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'99)
, vol.1
, pp. 149-152
-
-
Carey, M.J.1
Parris, E.S.2
Lloyd-Thomas, H.3
-
5
-
-
70350279492
-
Feature fusion for music detection
-
Parris, E.S., Carey, M.J., Lloyd-Thomas, H.: Feature fusion for music detection. In: Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH'99), pp. 2191-2194 (1999)
-
(1999)
Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH'99)
, pp. 2191-2194
-
-
Parris, E.S.1
Carey, M.J.2
Lloyd-Thomas, H.3
-
6
-
-
0034853025
-
Robust singing detection in speech/music discriminator design
-
IEEE
-
Chou, W., Gu, L.: Robust singing detection in speech/music discriminator design. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'01, IEEE), Vol. 2, pp. 865-868 (2001)
-
(2001)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'01)
, vol.2
, pp. 865-868
-
-
Chou, W.1
Gu, L.2
-
7
-
-
0036288612
-
Speech and music classification in audio documents
-
IEEE
-
Pinquier, J., Sénac, C., André-Obrecht, R.: Speech and music classification in audio documents. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'02, IEEE), Vol. 4, pp. 4164-4164 (2002)
-
(2002)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'02)
, vol.4
, pp. 4164-4164
-
-
Pinquier, J.1
Sénac, C.2
André-Obrecht, R.3
-
8
-
-
85009291610
-
Robust speech/music classification in audio documents
-
Pinquier, J., Rouas, J.-L., André-Obrecht, R.: Robust speech/music classification in audio documents. In: Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP'02), Vol. 3, pp. 2005-2008 (2002)
-
(2002)
Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP'02)
, vol.3
, pp. 2005-2008
-
-
Pinquier, J.1
Rouas, J.-L.2
André-Obrecht, R.3
-
9
-
-
0141590391
-
A fusion study in speech/music classification
-
IEEE
-
Pinquier, J., Rouas, J.L., André-Obrecht, R.: A fusion study in speech/music classification. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'03, IEEE), Vol. 2, pp. II-17-II-20 (2003)
-
(2003)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'03)
, vol.2
-
-
Pinquier, J.1
Rouas, J.L.2
André-Obrecht, R.3
-
10
-
-
84872720209
-
Robust speech music discrimination using spectrum's first order statistics and neural networks
-
IEEE
-
Harb, H., Chen, L.: Robust speech music discrimination using spectrum's first order statistics and neural networks. In: Proceedings of the 7th International Symposium on Signal Processing and its Applications, IEEE, Vol. 2, pp. 125-128 (2003)
-
(2003)
Proceedings of the 7th International Symposium on Signal Processing and Its Applications
, vol.2
, pp. 125-128
-
-
Harb, H.1
Chen, L.2
-
11
-
-
20444469089
-
Speech/music/silence and gender detection algorithm
-
Harb, H., Chen, L., Auloge, J.Y.: Speech/music/silence and gender detection algorithm. In: Proceedings of the 7th International Conference on Distributed Multimedia Systems (DMS'01), pp. 257-262 (2001)
-
(2001)
Proceedings of the 7th International Conference on Distributed Multimedia Systems (DMS'01)
, pp. 257-262
-
-
Harb, H.1
Chen, L.2
Auloge, J.Y.3
-
13
-
-
51449085287
-
A fast and robust speech/music discrimination approach
-
IEEE
-
Wang, W.Q., Gao, W., Ying, D.W.: A fast and robust speech/music discrimination approach. In: Proceedings of the Information, Communications & Signal Processing (ICICS-PCM'03, IEEE), Vol. 3, pp. 1325-1329 (2003)
-
(2003)
Proceedings of the Information, Communications & Signal Processing (ICICS-PCM'03)
, vol.3
, pp. 1325-1329
-
-
Wang, W.Q.1
Gao, W.2
Ying, D.W.3
-
14
-
-
0033705976
-
Speech/music discrimination for multimedia applications
-
IEEE
-
El-Maleh, K., Klein, M., Petrucci, G., Kabal, P.: Speech/music discrimination for multimedia applications. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'00, IEEE), Vol. 4, pp. 2445-2448 (2000)
-
(2000)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'00)
, vol.4
, pp. 2445-2448
-
-
El-Maleh, K.1
Klein, M.2
Petrucci, G.3
Kabal, P.4
-
16
-
-
78649537792
-
Applying neural network on content-based audio classification
-
IEEE
-
Shao, X., Xu, C., Kankanhalli, M.S.: Applying neural network on content-based audio classification. In: Proceedings of the Fourth International Conference on Information, Communications and Signal Processing, IEEE, Vol. 3, pp. 1823-1825 (2003)
-
(2003)
Proceedings of the Fourth International Conference on Information, Communications and Signal Processing
, vol.3
, pp. 1823-1825
-
-
Shao, X.1
Xu, C.2
Kankanhalli, M.S.3
-
17
-
-
4544345094
-
A comparison of human and automatic musical genre classification
-
IEEE
-
Lippens, S., Martens, J.P., De Mulder, T., Tzanetakis, G.: A comparison of human and automatic musical genre classification. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'04, IEEE), Vol. 4, pp. IV-233-IV-236 (2004)
-
(2004)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'04)
, vol.4
-
-
Lippens, S.1
Martens, J.P.2
De Mulder, T.3
Tzanetakis, G.4
-
18
-
-
4544304284
-
Harmonicity and dynamics-based features for audio
-
IEEE
-
Srinivasan, S.H., Kankanhalli, M.: Harmonicity and dynamics-based features for audio. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'04, IEEE), Vol. 4, pp. IV-321-IV-324 (2004)
-
(2004)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'04)
, vol.4
-
-
Srinivasan, S.H.1
Kankanhalli, M.2
-
19
-
-
4243919627
-
-
Master's thesis, Department of Information Technology, Tampere University of Technology, Finland
-
Peltonen, V.: Computational auditory scene recognition. Master's thesis, Department of Information Technology, Tampere University of Technology, Finland (2001)
-
(2001)
Computational Auditory Scene Recognition
-
-
Peltonen, V.1
-
20
-
-
0010053023
-
Automatic musical genre classification of audio signals
-
Tzanetakis, G., Essl, G., Cook, P.: Automatic musical genre classification of audio signals. In: Proceedings of the International Symposium on Music Information Retrieval (ISMIR'01), pp. 205-210 (2001)
-
(2001)
Proceedings of the International Symposium on Music Information Retrieval (ISMIR'01)
, pp. 205-210
-
-
Tzanetakis, G.1
Essl, G.2
Cook, P.3
-
21
-
-
0036648502
-
Musical genre classification of audio signals
-
Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Trans. Speech Audio Proc. 10(5), 293-302 (2002)
-
(2002)
IEEE Trans. Speech Audio Proc.
, vol.10
, Issue.5
, pp. 293-302
-
-
Tzanetakis, G.1
Cook, P.2
-
22
-
-
0037708486
-
Content-based audio classification and segmentation by using support vector machines
-
Lu, L., Zhang, H.-J., Li, S.Z.: Content-based audio classification and segmentation by using support vector machines. ACM Mult. Sys. J. 8(6), 482-492 (2003)
-
(2003)
ACM Mult. Sys. J.
, vol.8
, Issue.6
, pp. 482-492
-
-
Lu, L.1
Zhang, H.-J.2
Li, S.Z.3
-
23
-
-
0036556701
-
Audio classification in speech and music: A comparison between a statistical and a neural approach
-
Bugatti, A., Flammini, A., Migliorati, P.: Audio classification in speech and music: A comparison between a statistical and a neural approach. EURASIP J. Appl. Sig. Proc. 4, 372-378 (2002)
-
(2002)
EURASIP J. Appl. Sig. Proc.
, vol.4
, pp. 372-378
-
-
Bugatti, A.1
Flammini, A.2
Migliorati, P.3
-
24
-
-
0034792569
-
A robust audio classification and segmentation method
-
ACM
-
Lu, L., Jiang, H., Zhang, H.-J.: A robust audio classification and segmentation method. In: Proceedings of the 9th ACM International Conference on Multimedia (MM'01, ACM), pp. 203-211 (2001)
-
(2001)
Proceedings of the 9th ACM International Conference on Multimedia (MM'01)
, pp. 203-211
-
-
Lu, L.1
Jiang, H.2
Zhang, H.-J.3
-
25
-
-
0036816475
-
Content analysis for audio classification and segmentation
-
Lu, L., Zhang, H.-J., Jiang, H.: Content analysis for audio classification and segmentation. IEEE Trans. Speech Audio Proc. 10(7), 504-516 (2002)
-
(2002)
IEEE Trans. Speech Audio Proc.
, vol.10
, Issue.7
, pp. 504-516
-
-
Lu, L.1
Zhang, H.-J.2
Jiang, H.3
-
26
-
-
10044253046
-
Speech music discrimination using class-specific features
-
IEEE
-
Beierholm, T., Baggenstoss, P.M.: Speech music discrimination using class-specific features. In: Proceedings of the 17th International Conference on Pattern Recognition (ICPR'04, IEEE), Vol. 2, pp. 379-382 (2004)
-
(2004)
Proceedings of the 17th International Conference on Pattern Recognition (ICPR'04)
, vol.2
, pp. 379-382
-
-
Beierholm, T.1
Baggenstoss, P.M.2
-
27
-
-
0028727261
-
Detection of human speech in structured noise
-
IEEE
-
Hoyt, J.D., Wechsler, H.: Detection of human speech in structured noise. In: Proceedings of the International Conference on Neural Networks, IEEE, Vol. 7, pp. 4493-4496 (1994)
-
(1994)
Proceedings of the International Conference on Neural Networks
, vol.7
, pp. 4493-4496
-
-
Hoyt, J.D.1
Wechsler, H.2
-
28
-
-
0035308233
-
Classification of general audio data for content-based retrieval
-
Li, D., Sethi, I.K., Dimitrova, N., McGee, T.: Classification of general audio data for content-based retrieval. Patt. Recog. Lett. 22(5), 533-544 (2001)
-
(2001)
Patt. Recog. Lett.
, vol.22
, Issue.5
, pp. 533-544
-
-
Li, D.1
Sethi, I.K.2
Dimitrova, N.3
McGee, T.4
-
29
-
-
84889075620
-
A framework for audio analysis based on classification and temporal segmentation
-
IEEE
-
Tzanetakis, G., Cook, P.: A framework for audio analysis based on classification and temporal segmentation. In: EUROMICRO Workshop on Music Technology and Audio Processing, IEEE, Vol. 2, pp. 61-67 (1999)
-
(1999)
EUROMICRO Workshop on Music Technology and Audio Processing
, vol.2
, pp. 61-67
-
-
Tzanetakis, G.1
Cook, P.2
-
30
-
-
0031624374
-
Classification of audio signals using statistical features on time and wavelet transform domains
-
IEEE
-
Lambrou, T., Kudumakis, P., Speller, R., Sandler, M., Linney, A.: Classification of audio signals using statistical features on time and wavelet transform domains. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'98, IEEE), Vol. 6, pp. 3621-3624 (1998)
-
(1998)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'98)
, vol.6
, pp. 3621-3624
-
-
Lambrou, T.1
Kudumakis, P.2
Speller, R.3
Sandler, M.4
Linney, A.5
-
31
-
-
0031619927
-
Classification of transient time-varying signals using DFT and wavelet packet based methods
-
IEEE
-
Delfs, C., Jondral, R.: Classification of transient time-varying signals using DFT and wavelet packet based methods. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'98, IEEE), Vol. 3, pp. 1569-1572 (1998)
-
(1998)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'98)
, vol.3
, pp. 1569-1572
-
-
Delfs, C.1
Jondral, R.2
-
33
-
-
0003922190
-
-
Wiley, New York
-
Duda, R.O., Stork, D.G., Hart, P.E.: Pattern classification, 2nd edn. Wiley, New York (2001)
-
(2001)
Pattern Classification, 2nd Edn.
-
-
Duda, R.O.1
Stork, D.G.2
Hart, P.E.3
-
35
-
-
0024861871
-
Approximation by superpositions of a sigmoidal function
-
Cybenko, G.: Approximation by superpositions of a sigmoidal function. Math. Con. Sig. Sys. 2(4), 303-314 (1989)
-
(1989)
Math. Con. Sig. Sys.
, vol.2
, Issue.4
, pp. 303-314
-
-
Cybenko, G.1
-
36
-
-
33746883995
-
Artificial neural networks for speech and vision
-
Mammone, R.J. (ed.): Chapman & Hall, London
-
Mammone, R.J. (ed.): Artificial neural networks for speech and vision. Chapman & Hall Neural Computing, 1st edn. Chapman & Hall, London (1994)
-
(1994)
Chapman & Hall Neural Computing, 1st Edn.
-
-
-
37
-
-
0022594196
-
An introduction to hidden Markov models
-
Rabiner, L.R., Juang, B.H.: An introduction to hidden Markov models. IEEE ASSP Magazine 3(1), 4-16 (1986)
-
(1986)
IEEE ASSP Magazine
, vol.3
, Issue.1
, pp. 4-16
-
-
Rabiner, L.R.1
Juang, B.H.2