-
1
-
-
0036816475
-
Content analysis for audio classification and segmentation
-
L. Lu, H.-J. Zhang, and H. Jiang, "Content analysis for audio classification and segmentation," IEEE Trans. Speech Audio Process., vol. 10, pp. 504-516, 2002.
-
(2002)
IEEE Trans. Speech Audio Process
, vol.10
, pp. 504-516
-
-
Lu, L.1
Zhang, H.-J.2
Jiang, H.3
-
2
-
-
0035340677
-
Audio content analysis for online audio-visual data segmentation and classification
-
T. Zhang and C.-C. J. Kuo, "Audio content analysis for online audio-visual data segmentation and classification," IEEE Trans. Speech Audio Process., vol. 9, pp. 441-457, 2001.
-
(2001)
IEEE Trans. Speech Audio Process
, vol.9
, pp. 441-457
-
-
Zhang, T.1
Kuo, C.-C.J.2
-
3
-
-
85032751556
-
Multimedia content analysis using audio and visual information
-
Jun
-
Y. Wang, Z. Liu, and J. Huang, "Multimedia content analysis using audio and visual information," IEEE Signal Processing Mag., vol. 17, no. 6, pp. 12-36, Jun. 2000.
-
(2000)
IEEE Signal Processing Mag
, vol.17
, Issue.6
, pp. 12-36
-
-
Wang, Y.1
Liu, Z.2
Huang, J.3
-
4
-
-
0034273195
-
DISTBIC: A speaker-based segmentation for audio data indexing
-
P. Delacourt and C. J. Wellekens, "DISTBIC: a speaker-based segmentation for audio data indexing," Speech Commun., vol. 32, pp. 111-126, 2000.
-
(2000)
Speech Commun
, vol.32
, pp. 111-126
-
-
Delacourt, P.1
Wellekens, C.J.2
-
5
-
-
85046873967
-
The DET curve in assessment of detection task performance
-
Rhodes, Greece, Sept
-
A. Martin, G. Doddington, T. Kamm, M. Ordowski, and M. Przybocki, "The DET curve in assessment of detection task performance," in Proc. Eurospeech-97, Rhodes, Greece, Sept. 1997, pp. 1895-1898.
-
(1997)
Proc. Eurospeech-97
, pp. 1895-1898
-
-
Martin, A.1
Doddington, G.2
Kamm, T.3
Ordowski, M.4
Przybocki, M.5
-
6
-
-
85119434191
-
Fast speaker change detection for broadcast news transcription and indexing
-
D. Liu and F. Kubala, "Fast speaker change detection for broadcast news transcription and indexing," in Proc. Eurospeech'99, vol. 3, pp. 1031-1034.
-
Proc. Eurospeech'99
, vol.3
, pp. 1031-1034
-
-
Liu, D.1
Kubala, F.2
-
7
-
-
0002782496
-
Automatic segmentation, classification and clustering of broadcast news audio
-
M. A. Siegler, U. Jain, B. Raj, and R. M. Stern, "Automatic segmentation, classification and clustering of broadcast news audio," in Proc. DARPA Speech Recognition Workshop, 1997, pp. 97-99.
-
(1997)
Proc. DARPA Speech Recognition Workshop
, pp. 97-99
-
-
Siegler, M.A.1
Jain, U.2
Raj, B.3
Stern, R.M.4
-
8
-
-
0036567736
-
Large vocabulary continuous speech recognition of broadcast news-The Philips/RWTH approach
-
P. Beyerlein, X. Aubert, R. Haeb-Umbach, M. Harris, D. Klakow, A. Wendemuth, S. Molau, H. Key, M. Pitz, and A. Sixtus, "Large vocabulary continuous speech recognition of broadcast news-The Philips/RWTH approach," Speech Commun., vol. 37, pp. 109-131, 2002.
-
(2002)
Speech Commun
, vol.37
, pp. 109-131
-
-
Beyerlein, P.1
Aubert, X.2
Haeb-Umbach, R.3
Harris, M.4
Klakow, D.5
Wendemuth, A.6
Molau, S.7
Key, H.8
Pitz, M.9
Sixtus, A.10
-
9
-
-
0002595416
-
Speaker, environment and channel change detection and clustering via the Bayesian information criterion
-
Landsdowne, VA
-
S. S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the Bayesian information criterion," in Proc. DARPA Broadcast News Transcription Understanding Workshop, Landsdowne, VA, 1998.
-
(1998)
Proc. DARPA Broadcast News Transcription Understanding Workshop
-
-
Chen, S.S.1
Gopalakrishnan, P.S.2
-
10
-
-
0141519364
-
Efficient audio segmentation algorithms based on the BIC
-
M. Cettolo and M. Vescovi, "Efficient audio segmentation algorithms based on the BIC," in Proc. ICASSP'03, 2003.
-
(2003)
Proc. ICASSP'03
-
-
Cettolo, M.1
Vescovi, M.2
-
11
-
-
0032070101
-
Optimal segmentation of random process
-
May
-
M. Lavielle, "Optimal segmentation of random process," IEEE Trans. Signal Processing, vol. 46, no. 5, pp. 1365-1373, May 1998.
-
(1998)
IEEE Trans. Signal Processing
, vol.46
, Issue.5
, pp. 1365-1373
-
-
Lavielle, M.1
-
12
-
-
0034792569
-
A robust audio classification and segmentation method
-
Z. Liu, Y. Wang, and T. Chen, "A robust audio classification and segmentation method," in Proc. 9th ACM Int. Conf. Multimedia, 2001, pp. 203-221.
-
(2001)
Proc. 9th ACM Int. Conf. Multimedia
, pp. 203-221
-
-
Liu, Z.1
Wang, Y.2
Chen, T.3
-
13
-
-
0002085217
-
Detecting "disorder" in multidimensional random process
-
L. J. Vostrikova, "Detecting "disorder" in multidimensional random process," Soviet Math. Doklady, vol. 24, pp. 55-59, 1981.
-
(1981)
Soviet Math. Doklady
, vol.24
, pp. 55-59
-
-
Vostrikova, L.J.1
-
14
-
-
4544345457
-
Model selection criteria for acoustic segmentation
-
Paris, France
-
M. Cettolo and M. Federico, "Model selection criteria for acoustic segmentation," in Proc. ISCAITRWASR '00 Automatic Speech Recognition, Paris, France, 2000, pp. 221-227.
-
(2000)
Proc. ISCAITRWASR '00 Automatic Speech Recognition
, pp. 221-227
-
-
Cettolo, M.1
Federico, M.2
-
15
-
-
0032183995
-
The minimum description length principle in coding and modeling
-
A. Barron, J. Rissanen, and B. Yu, "The minimum description length principle in coding and modeling," IEEE Trans. Inform. Theory, vol. 44, no. 6, pp. 2743-2760, 1998.
-
(1998)
IEEE Trans. Inform. Theory
, vol.44
, Issue.6
, pp. 2743-2760
-
-
Barron, A.1
Rissanen, J.2
Yu, B.3
-
17
-
-
0000208683
-
Stochastic complexity
-
_, "Stochastic complexity," J. R. Statist. Soc. B, vol. 49, pp. 223-239, 1987.
-
(1987)
J. R. Statist. Soc. B
, vol.49
, pp. 223-239
-
-
Rissanen, J.1
-
18
-
-
0036567784
-
Automatic transcription of broadcast news
-
S. S. Chen, E. Edie, M. J. F. Gales, R. A. Gopinath, D. Kanvesky, and P. Olsen, "Automatic transcription of broadcast news," Speech Commun., vol. 37, pp. 69-87, 2002.
-
(2002)
Speech Commun
, vol.37
, pp. 69-87
-
-
Chen, S.S.1
Edie, E.2
Gales, M.J.F.3
Gopinath, R.A.4
Kanvesky, D.5
Olsen, P.6
-
19
-
-
0002082022
-
The 1996 BBN Byblos Hub-4 transcription system
-
F. Kubala et al., "The 1996 BBN Byblos Hub-4 transcription system," in Proc. Speech Recognition Workshop, 1997, pp. 90-93.
-
(1997)
Proc. Speech Recognition Workshop
, pp. 90-93
-
-
Kubala, F.1
-
20
-
-
0042256392
-
The development of the 1996 HTK broadcast news transcription system
-
P. Woodland, M. Gales, D. Pye, and S. Young, "The development of the 1996 HTK broadcast news transcription system," in Proc. Speech Recognition Workshop, 1997, pp. 73-78.
-
(1997)
Proc. Speech Recognition Workshop
, pp. 73-78
-
-
Woodland, P.1
Gales, M.2
Pye, D.3
Young, S.4
-
24
-
-
79952385877
-
-
L. Wilcox, F. Chen, D. Kimber, and V. Balasubramanian, Segmentation of speech using speaker identification, in Proc. ICASSP'94, S1, 1994, pp. 161-164.
-
L. Wilcox, F. Chen, D. Kimber, and V. Balasubramanian, "Segmentation of speech using speaker identification," in Proc. ICASSP'94, vol. S1, 1994, pp. 161-164.
-
-
-
|