SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 14, Issue 2, 2006, Pages 647-657

Multiple change-point audio segmentation and classification using an MDL-based Gaussian model

(2) Wu, Chung Hsien a,b Hsieh, Chia Hsin a,b

a IEEE (Taiwan)

b NATIONAL CHENG KUNG UNIVERSITY (Taiwan)

Author keywords

Audio classification; Audio segmentation; Minimum description length; Multiple change points

Indexed keywords

AUDIO CLASSIFICATION; AUDIO SEGMENTATION; MINIMUM DESCRIPTION LENGTH; MULTIPLE CHANGE-POINTS;

AUDIO SYSTEMS; CLASSIFICATION (OF INFORMATION); HEURISTIC METHODS; MATHEMATICAL MODELS; STATISTICAL METHODS;

SPEECH PROCESSING;

EID: 33947127409 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.852988 Document Type: Article

Times cited : (34)

References (26)

1
- 0036816475
- Content analysis for audio classification and segmentation
- L. Lu, H.-J. Zhang, and H. Jiang, "Content analysis for audio classification and segmentation," IEEE Trans. Speech Audio Process., vol. 10, pp. 504-516, 2002.
- (2002) IEEE Trans. Speech Audio Process , vol.10 , pp. 504-516
- Lu, L.¹ Zhang, H.-J.² Jiang, H.³

2
- 0035340677
- Audio content analysis for online audio-visual data segmentation and classification
- T. Zhang and C.-C. J. Kuo, "Audio content analysis for online audio-visual data segmentation and classification," IEEE Trans. Speech Audio Process., vol. 9, pp. 441-457, 2001.
- (2001) IEEE Trans. Speech Audio Process , vol.9 , pp. 441-457
- Zhang, T.¹ Kuo, C.-C.J.²

3
- 85032751556
- Multimedia content analysis using audio and visual information
- Jun
- Y. Wang, Z. Liu, and J. Huang, "Multimedia content analysis using audio and visual information," IEEE Signal Processing Mag., vol. 17, no. 6, pp. 12-36, Jun. 2000.
- (2000) IEEE Signal Processing Mag , vol.17 , Issue.6 , pp. 12-36
- Wang, Y.¹ Liu, Z.² Huang, J.³

4
- 0034273195
- DISTBIC: A speaker-based segmentation for audio data indexing
- P. Delacourt and C. J. Wellekens, "DISTBIC: a speaker-based segmentation for audio data indexing," Speech Commun., vol. 32, pp. 111-126, 2000.
- (2000) Speech Commun , vol.32 , pp. 111-126
- Delacourt, P.¹ Wellekens, C.J.²

5
- 85046873967
- The DET curve in assessment of detection task performance
- Rhodes, Greece, Sept
- A. Martin, G. Doddington, T. Kamm, M. Ordowski, and M. Przybocki, "The DET curve in assessment of detection task performance," in Proc. Eurospeech-97, Rhodes, Greece, Sept. 1997, pp. 1895-1898.
- (1997) Proc. Eurospeech-97 , pp. 1895-1898
- Martin, A.¹ Doddington, G.² Kamm, T.³ Ordowski, M.⁴ Przybocki, M.⁵

6
- 85119434191
- Fast speaker change detection for broadcast news transcription and indexing
- D. Liu and F. Kubala, "Fast speaker change detection for broadcast news transcription and indexing," in Proc. Eurospeech'99, vol. 3, pp. 1031-1034.
- Proc. Eurospeech'99 , vol.3 , pp. 1031-1034
- Liu, D.¹ Kubala, F.²

7
- 0002782496
- Automatic segmentation, classification and clustering of broadcast news audio
- M. A. Siegler, U. Jain, B. Raj, and R. M. Stern, "Automatic segmentation, classification and clustering of broadcast news audio," in Proc. DARPA Speech Recognition Workshop, 1997, pp. 97-99.
- (1997) Proc. DARPA Speech Recognition Workshop , pp. 97-99
- Siegler, M.A.¹ Jain, U.² Raj, B.³ Stern, R.M.⁴

8
- 0036567736
- Large vocabulary continuous speech recognition of broadcast news-The Philips/RWTH approach
- P. Beyerlein, X. Aubert, R. Haeb-Umbach, M. Harris, D. Klakow, A. Wendemuth, S. Molau, H. Key, M. Pitz, and A. Sixtus, "Large vocabulary continuous speech recognition of broadcast news-The Philips/RWTH approach," Speech Commun., vol. 37, pp. 109-131, 2002.
- (2002) Speech Commun , vol.37 , pp. 109-131
- Beyerlein, P.¹ Aubert, X.² Haeb-Umbach, R.³ Harris, M.⁴ Klakow, D.⁵ Wendemuth, A.⁶ Molau, S.⁷ Key, H.⁸ Pitz, M.⁹ Sixtus, A.¹⁰

9
- 0002595416
- Speaker, environment and channel change detection and clustering via the Bayesian information criterion
- Landsdowne, VA
- S. S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the Bayesian information criterion," in Proc. DARPA Broadcast News Transcription Understanding Workshop, Landsdowne, VA, 1998.
- (1998) Proc. DARPA Broadcast News Transcription Understanding Workshop
- Chen, S.S.¹ Gopalakrishnan, P.S.²

10
- 0141519364
- Efficient audio segmentation algorithms based on the BIC
- M. Cettolo and M. Vescovi, "Efficient audio segmentation algorithms based on the BIC," in Proc. ICASSP'03, 2003.
- (2003) Proc. ICASSP'03
- Cettolo, M.¹ Vescovi, M.²

11
- 0032070101
- Optimal segmentation of random process
- May
- M. Lavielle, "Optimal segmentation of random process," IEEE Trans. Signal Processing, vol. 46, no. 5, pp. 1365-1373, May 1998.
- (1998) IEEE Trans. Signal Processing , vol.46 , Issue.5 , pp. 1365-1373
- Lavielle, M.¹

12
- 0034792569
- A robust audio classification and segmentation method
- Z. Liu, Y. Wang, and T. Chen, "A robust audio classification and segmentation method," in Proc. 9th ACM Int. Conf. Multimedia, 2001, pp. 203-221.
- (2001) Proc. 9th ACM Int. Conf. Multimedia , pp. 203-221
- Liu, Z.¹ Wang, Y.² Chen, T.³

13
- 0002085217
- Detecting "disorder" in multidimensional random process
- L. J. Vostrikova, "Detecting "disorder" in multidimensional random process," Soviet Math. Doklady, vol. 24, pp. 55-59, 1981.
- (1981) Soviet Math. Doklady , vol.24 , pp. 55-59
- Vostrikova, L.J.¹

14
- 4544345457
- Model selection criteria for acoustic segmentation
- Paris, France
- M. Cettolo and M. Federico, "Model selection criteria for acoustic segmentation," in Proc. ISCAITRWASR '00 Automatic Speech Recognition, Paris, France, 2000, pp. 221-227.
- (2000) Proc. ISCAITRWASR '00 Automatic Speech Recognition , pp. 221-227
- Cettolo, M.¹ Federico, M.²

15
- 0032183995
- The minimum description length principle in coding and modeling
- A. Barron, J. Rissanen, and B. Yu, "The minimum description length principle in coding and modeling," IEEE Trans. Inform. Theory, vol. 44, no. 6, pp. 2743-2760, 1998.
- (1998) IEEE Trans. Inform. Theory , vol.44 , Issue.6 , pp. 2743-2760
- Barron, A.¹ Rissanen, J.² Yu, B.³

16
- 0004087635
- River Edge, NJ: World Scientific
- J. Rissanen, Stochastic Complexity in Statistical Inquiry. River Edge, NJ: World Scientific, 1989.
- (1989) Stochastic Complexity in Statistical Inquiry
- Rissanen, J.¹

17
- 0000208683
- Stochastic complexity
- _, "Stochastic complexity," J. R. Statist. Soc. B, vol. 49, pp. 223-239, 1987.
- (1987) J. R. Statist. Soc. B , vol.49 , pp. 223-239
- Rissanen, J.¹

18
- 0036567784
- Automatic transcription of broadcast news
- S. S. Chen, E. Edie, M. J. F. Gales, R. A. Gopinath, D. Kanvesky, and P. Olsen, "Automatic transcription of broadcast news," Speech Commun., vol. 37, pp. 69-87, 2002.
- (2002) Speech Commun , vol.37 , pp. 69-87
- Chen, S.S.¹ Edie, E.² Gales, M.J.F.³ Gopinath, R.A.⁴ Kanvesky, D.⁵ Olsen, P.⁶

19
- 0002082022
- The 1996 BBN Byblos Hub-4 transcription system
- F. Kubala et al., "The 1996 BBN Byblos Hub-4 transcription system," in Proc. Speech Recognition Workshop, 1997, pp. 90-93.
- (1997) Proc. Speech Recognition Workshop , pp. 90-93
- Kubala, F.¹

20
- 0042256392
- The development of the 1996 HTK broadcast news transcription system
- P. Woodland, M. Gales, D. Pye, and S. Young, "The development of the 1996 HTK broadcast news transcription system," in Proc. Speech Recognition Workshop, 1997, pp. 73-78.
- (1997) Proc. Speech Recognition Workshop , pp. 73-78
- Woodland, P.¹ Gales, M.² Pye, D.³ Young, S.⁴

21
- 0006337662
- Transcription of BN shows with the IBM LVCSR system
- R. Bakis et al., 'Transcription of BN shows with the IBM LVCSR system," in Proc. DARPA Speech Recognition Workshop, 1997.
- (1997) Proc. DARPA Speech Recognition Workshop
- Bakis, R.¹

22
- 0006281944
- Speaker, channel and environment change detection
- H. Beigi and S. Maes, "Speaker, channel and environment change detection," in Proc. World Congr. Automation, 1998.
- (1998) Proc. World Congr. Automation
- Beigi, H.¹ Maes, S.²

23
- 0028516097
- Text-independent speaker identification
- H. Gish and N. Schmidt, 'Text-independent speaker identification," IEEE Signal Processing Mag., pp. 18-21, 1994.
- (1994) IEEE Signal Processing Mag , pp. 18-21
- Gish, H.¹ Schmidt, N.²

24
- 79952385877
- L. Wilcox, F. Chen, D. Kimber, and V. Balasubramanian, Segmentation of speech using speaker identification, in Proc. ICASSP'94, S1, 1994, pp. 161-164.
- L. Wilcox, F. Chen, D. Kimber, and V. Balasubramanian, "Segmentation of speech using speaker identification," in Proc. ICASSP'94, vol. S1, 1994, pp. 161-164.

25
- 0004172718
- San Diego, CA: Academic
- S. Theodoridis and K. Koutroumbas, Pattern Recognition. San Diego, CA: Academic, 1999.
- (1999) Pattern Recognition
- Theodoridis, S.¹ Koutroumbas, K.²

26
- 0004255301
- New York: Wiley
- G. A. F. Seber, Multivariate Observations. New York: Wiley, 1984.
- (1984) Multivariate Observations
- Seber, G.A.F.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.