SCOPUS Information Search Platform

Volume 50, Issue 2, 2008, Pages 153-161

Speaker change detection in casual conversations using excitation source features

a INDIAN INSTITUTE OF TECHNOLOGY MADRAS (India)

b INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY (India)

Author keywords

Autoassociative neural network (AANN) models; Excitation source features; Linear prediction (LP) residual; Multispeaker conversation; Speaker change detection

Indexed keywords

FEATURE EXTRACTION; INFORMATION RETRIEVAL; NEURAL NETWORKS; TELEPHONE; VOCABULARY CONTROL; MULTISPEAKER CONVERSATION; SPEECH PRODUCTION; TELEPHONE CONVERSATIONS; SPEECH RECOGNITION

EID: 37649019590 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2007.08.003 Document Type: Article

Times cited: 10

References (12)

1
- 0026400244
- Gish, H., Siu, M., Rohlicek, R., 1991. Segregation of speakers for speech recognition and speaker identification. In: Proc. Internat. Conf. on Acoustics Speech and Signal Processing, Vol. 2, Toronto, Canada, pp. 873-876.

2
- 0002595416
- Chen, S., Gopalakrishna, P., 1998. Speaker, environment and channel change detection and clustering via the Bayesian information criterion. In: Proc. DARPA Broadcast News Transcription and Understanding Workshop, Morgan Kaufmann, San Mateo, CA, pp. 127-132.

3
- 37649022154
- Johnson, S., 1997. Speaker Tracking. Master's Thesis. Cambridge University Engineering Department, UK.

4
- 0034273195
- Delacourt, P., Wellekens, C.J., 2000. DISTBIC: a speaker-based segmentation for audio data indexing. Speech Comm. 32, 111-126.

5
- 33646914432
- Makhoul, J., Kubala, F., Leek, T., Liu, D., Nguyen, L., Schwartz, R., Srivastava, A., 2000. Speech and language technologies for audio indexing and retrieval. Proc. IEEE 88 (8), 1338-1353.

6
- 0037700756
- Lu, L., Zhang, H., 2002. Speaker change detection and tracking in real-time news broadcasting analysis. In: Proc. 10th ACM Multimedia, Juan-les-pins, France, pp. 602-610.

7
- 85039150822
- Graff, D., Miller, D., Walker, K., 2002. Switchboard-2 Phase III Audio. Linguistic Data Consortium, Philadelphia, USA.

8
- 33748443739
- Prasanna, S.R.M., Gupta, C.S., Yegnanarayana, B., 2006. Extraction of speaker-specific excitation information from linear prediction residual of speech. Speech Comm. 48 (10), 1243-1261.

9
- 0034856452
- Yegnanarayana, B., Reddy, K.S., Kishore, S.P., 2001. Source and system features for speaker recognition using AANN models. In: Proc. Internat. Conf. on Acoustics Speech and Signal Processing, Vol. 1, Salt Lake City, Utah, USA, pp. 409-412.

10
- 0029375490
- Smits, R., Yegnanarayana, B., 1995. Determination of instants of significant excitation in speech using group delay function. IEEE Trans. Speech Audio Process. 3 (5), 325-333.

11
- 37649003476
- Martin, A., Przybocki, M., 2002. The NIST Speaker Recognition Evaluation Plan. National Institute of Standards and Technology, USA.

12
- 33947613670
- Chan, W.N., Lee, T., Zheng, N., Ouyang, H., 2006. Use of vocal source features in speaker segmentation. In: Proc. Internat. Conf. on Acoustics Speech and Signal Processing, Toulouse, France, pp. 657-660.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.