-
1
-
-
0037401304
-
Speech/Music discrimination using entropy and dynamism features in a HMM classification framework
-
Ajmera J, McCowan I, Bourlard H (2003) Speech/Music discrimination using entropy and dynamism features in a HMM classification framework. Speech Commun 40(3):351-363
-
(2003)
Speech Commun
, vol.40
, Issue.3
, pp. 351-363
-
-
Ajmera, J.1
McCowan, I.2
Bourlard, H.3
-
3
-
-
0029716457
-
Integrated image and speech analysis for content-based video indexing
-
Chang Y-L, Zeng W, Kamel I, Alonso R (1996) Integrated image and speech analysis for content-based video indexing. Proceedings, the third IEEE international conference on multimedia computing and systems, pp 306-313
-
(1996)
Proceedings, the third IEEE international conference on multimedia computing and systems
, pp. 306-313
-
-
Chang, Y.-L.1
Zeng, W.2
Kamel, I.3
Alonso, R.4
-
4
-
-
0028919718
-
Auditory event-related potentials dissociate early and late memory processes
-
Elsevier
-
Chao L, Nielsen-Bohlman L, Knight R (1995) Auditory event-related potentials dissociate early and late memory processes. Electroencephalogr Clin Neurophysiol 96:157-168, Elsevier
-
(1995)
Electroencephalogr Clin Neurophysiol
, vol.96
, pp. 157-168
-
-
Chao, L.1
Nielsen-Bohlman, L.2
Knight, R.3
-
7
-
-
0035308233
-
Classification of general audio data for content-based retrieval
-
Elsevier
-
Dongge L et al (2001) Classification of general audio data for content-based retrieval. Pattern Recogn Lett 22:533-544, Elsevier
-
(2001)
Pattern Recogn Lett
, vol.22
, pp. 533-544
-
-
Dongge, L.1
-
10
-
-
85128356454
-
Partitioning and transcription of broadcast news data
-
Gauvain J-L, Lamel L, Adda G (1998) Partitioning and transcription of broadcast news data. Proc. ICSLP'98 5:1335-1338
-
(1998)
Proc. ICSLP'98
, vol.5
, pp. 1335-1338
-
-
Gauvain, J.-L.1
Lamel, L.2
Adda, G.3
-
11
-
-
0141623871
-
RWC music database: Popular, classical, and jazz music databases
-
Goto M, Hashiguchi H, Nishimura T, Oka R (2002) RWC music database: popular, classical, and jazz music databases. Proceedings, the 3rd international conference on music information retrieval (ISMIR02), pp 287-288
-
(2002)
Proceedings, the 3rd international conference on music information retrieval (ISMIR02)
, pp. 287-288
-
-
Goto, M.1
Hashiguchi, H.2
Nishimura, T.3
Oka, R.4
-
12
-
-
34547368583
-
Recognition of music types
-
ICASSP
-
Hagen S, Tanja S, Martin W (1998) Recognition of music types. Proceedings, the 1998 IEEE international conference on acoustics, speech and signal processing, ICASSP
-
(1998)
Proceedings, the 1998 IEEE international conference on acoustics, speech and signal processing
-
-
Hagen, S.1
Tanja, S.2
Martin, W.3
-
13
-
-
0002751623
-
Segment generation and clustering in the HTK broadcast news transcription system
-
Hain T, Johnson SE, Tuerk A, Woodland PC, Young SJ (1998) Segment generation and clustering in the HTK broadcast news transcription system. Proc. 1998 DARPA broadcast news transcription and understanding workshop, pp 133-137
-
(1998)
Proc. 1998 DARPA broadcast news transcription and understanding workshop
, pp. 133-137
-
-
Hain, T.1
Johnson, S.E.2
Tuerk, A.3
Woodland, P.C.4
Young, S.J.5
-
19
-
-
84908322734
-
Music type classification by spectral contrast features
-
Jiang D-N, Lu L, Zhang H-J, Cai L-H, Tao J-H (2002) Music type classification by spectral contrast features. Proceedings, IEEE international conference on multimedia and expo (ICME02)
-
(2002)
Proceedings, IEEE international conference on multimedia and expo (ICME02)
-
-
Jiang, D.-N.1
Lu, L.2
Zhang, H.-J.3
Cai, L.-H.4
Tao, J.-H.5
-
20
-
-
17444377070
-
-
Jung E, Schwarzbacher A, Lawlor R (2002) Implementation of real-time AMDF pitch-detection for voice gender normalization. Proceedings of the 14th international conference on digital signal processing. DSP 2002 2:827-830
-
-
-
-
22
-
-
34547288647
-
-
Kiranyaz S, Aubazac M, Gabbouj M (2003) Unsupervised segmentation and classification over MP3 and AAC audio bitstreams. In the Proc. of the 4th European workshop on image analysis for multimedia interactive services WIAMIS 03, World Scientific, London UK
-
-
-
-
23
-
-
34547336240
-
-
Konig Y, Morgan N (1992) GDNN: a gender dependent neural network for continuous speech recognition. Proceedings, international joint conference on neural networks, IJCNN, 2:332-337
-
-
-
-
24
-
-
0034273520
-
Content-based classification and retrieval of audio using the nearest feature line method
-
Li S (2000) Content-based classification and retrieval of audio using the nearest feature line method. IEEE Trans Speech Audio Process 8:619-625
-
(2000)
IEEE Trans Speech Audio Process
, vol.8
, pp. 619-625
-
-
Li, S.1
-
28
-
-
20444469135
-
Improving accuracy in behaviour identification for content-based retrieval by using audio and video information
-
Miyamori H (2002) Improving accuracy in behaviour identification for content-based retrieval by using audio and video information. Proceedings of IEEE ICPR02, 2:826-830
-
(2002)
Proceedings of IEEE ICPR02
, vol.2
, pp. 826-830
-
-
Miyamori, H.1
-
30
-
-
34547322429
-
-
Moore BCJ (ed) (1995) Hearing. Academic, Toronto
-
-
-
-
32
-
-
0036331298
-
-
Elsevier
-
Noppeney U, Price CJ (2002) Retrieval of visual, auditory, and abstract semantics. NeuroImage 15:917-926, Elsevier
-
(2002)
NeuroImage
, vol.15
, pp. 917-926
-
-
Noppeney, U.1
Price, C.J.2
-
34
-
-
0010020774
-
Scanning the dial: An exploration of factors in the identification of musical style
-
Society for Music Perception and Cognition
-
Perrott D, Gjerdingen RO (1999) Scanning the dial: an exploration of factors in the identification of musical style. Proceedings, the 1999 Society for Music Perception and Cognition
-
(1999)
Proceedings, the
-
-
Perrott, D.1
Gjerdingen, R.O.2
-
37
-
-
0033693368
-
Content-based methods for the management of digital music
-
Pye D (2000) Content-based methods for the management of digital music. Proceedings, IEEE international conference on acoustics, speech, and signal processing, ICASSP'00, 4:2437-2440
-
(2000)
Proceedings, IEEE international conference on acoustics, speech, and signal processing, ICASSP'00
, vol.4
, pp. 2437-2440
-
-
Pye, D.1
-
38
-
-
0029209272
-
Robust text-independent speaker identification using Gaussian mixture speaker models
-
Reynolds DA, Rose RC (1995) Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans Speech Audio Process 3(1):72-83
-
(1995)
IEEE Trans Speech Audio Process
, vol.3
, Issue.1
, pp. 72-83
-
-
Reynolds, D.A.1
Rose, R.C.2
-
40
-
-
0030365534
-
Robust gender-dependent acoustic-phonetic modelling in continuous speech recognition based on a new automatic male female classification
-
2, Oct
-
Rivarol V, Farhal A, O'Shaughnessy D (1996) Robust gender-dependent acoustic-phonetic modelling in continuous speech recognition based on a new automatic male female classification. Proceedings, fourth international conference on spoken language, ICSLP 96, 2:1081-1084 (Oct)
-
(1996)
Proceedings, fourth international conference on spoken language, ICSLP 96
, vol.2
, pp. 1081-1084
-
-
Rivarol, V.1
Farhal, A.2
O'Shaughnessy, D.3
-
41
-
-
0029765670
-
Real time discrimination of broadcast speech/music
-
Saunders J (1996) Real time discrimination of broadcast speech/music. Proc. of ICASSP96 2:993-996
-
(1996)
Proc. of ICASSP96
, vol.2
, pp. 993-996
-
-
Saunders, J.1
-
42
-
-
0030648077
-
Construction and evaluation of a robust multifeature speech/music discriminator
-
Munich, Germany April
-
Scheirer E, Slaney M (1997) Construction and evaluation of a robust multifeature speech/music discriminator. Proceedings of IEEE ICASSP'97, Munich, Germany (April)
-
(1997)
Proceedings of IEEE ICASSP'97
-
-
Scheirer, E.1
Slaney, M.2
-
43
-
-
0034843119
-
Experiments on speech tracking in audio documents using Gaussian mixture modeling
-
Seck M, Magrin-Chagnolleau I, Bimbot F (2001) Experiments on speech tracking in audio documents using Gaussian mixture modeling. Proceedings of IEEE ICASSP01, 1:601-604
-
(2001)
Proceedings of IEEE ICASSP01
, vol.1
, pp. 601-604
-
-
Seck, M.1
Magrin-Chagnolleau, I.2
Bimbot, F.3
-
47
-
-
0036648502
-
Musical genre classification of audio signals
-
Tzanetakis G, Cook P (2002) Musical genre classification of audio signals. IEEE Trans Speech Audio Process 10(5):293-302
-
(2002)
IEEE Trans Speech Audio Process
, vol.10
, Issue.5
, pp. 293-302
-
-
Tzanetakis, G.1
Cook, P.2
-
49
-
-
85032751556
-
Multimedia content analysis using both audio and visual cues
-
Wang Y, Liu Z, Huang J-C (2000) Multimedia content analysis using both audio and visual cues. IEEE Signal Process Mag 17(6):12-36
-
(2000)
IEEE Signal Process Mag
, vol.17
, Issue.6
, pp. 12-36
-
-
Wang, Y.1
Liu, Z.2
Huang, J.-C.3
-
50
-
-
77958036231
-
Speech/music discrimination based on posterior probability features
-
Williams G, Ellis D (1999) Speech/music discrimination based on posterior probability features. Proceedings of Eurospeech
-
(1999)
Proceedings of Eurospeech
-
-
Williams, G.1
Ellis, D.2
-
51
-
-
0030242072
-
Content-based classification search and retrieval of audio
-
Wold E, Blum T, Keislar D, Wheaton J (1996) Content-based classification search and retrieval of audio. IEEE Multimedia Magazine 3(3):27-36
-
(1996)
IEEE Multimedia Magazine
, vol.3
, Issue.3
, pp. 27-36
-
-
Wold, E.1
Blum, T.2
Keislar, D.3
Wheaton, J.4
-
52
-
-
0035815506
-
Organizing sound sequences in the human brain: The interplay of auditory streaming and temporal integration
-
Elsevier
-
Yabe H et al (2001) Organizing sound sequences in the human brain: the interplay of auditory streaming and temporal integration. Brain Res 897:222-227, Elsevier
-
(2001)
Brain Res
, vol.897
, pp. 222-227
-
-
Yabe, H.1
-
53
-
-
0035340677
-
Audio content analysis for on-line audiovisual data segmentation
-
Zhang T, Jay Kuo C-C (2001) Audio content analysis for on-line audiovisual data segmentation. IEEE Trans Speech Audio Process 9(4):441-457
-
(2001)
IEEE Trans Speech Audio Process
, vol.9
, Issue.4
, pp. 441-457
-
-
Zhang, T.1
Jay Kuo, C.-C.2
-
54
-
-
0036888031
-
-
Zhou W, Dao S, Jay Kuo C-C (2002) On line knowledge and rule-based video classification system for video indexing and dissemination. Inf Sys 27:559-586, Elsevier
-
-
-