Volume 50, Issue 2, 2008, Pages 153-161

Speaker change detection in casual conversations using excitation source features

Author keywords

Autoassociative neural network (AANN) models; Excitation source features; Linear prediction (LP) residual; Multispeaker conversation; Speaker change detection

Indexed keywords

FEATURE EXTRACTION; INFORMATION RETRIEVAL; NEURAL NETWORKS; TELEPHONE; VOCABULARY CONTROL

EID: 37649019590     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2007.08.003     Document Type: Article
Times cited: 10

References (12)
  • 1
    • 0026400244
    • Gish, H., Siu, M., Rohlicek, R., 1991. Segregation of speakers for speech recognition and speaker identification. In: Proc. Internat. Conf. on Acoustics Speech and Signal Processing, Vol. 2, Toronto, Canada, pp. 873-876.
  • 2
    • 0002595416
    • Chen, S., Gopalakrishna, P., 1998. Speaker, environment and channel change detection and clustering via the Bayesian information criterion. In: Proc. DARPA Broadcast News Transcription and Understanding Workshop, Morgan Kaufmann, San Mateo, CA, pp. 127-132.
  • 3
    • 37649022154
    • Johnson, S., 1997. Speaker Tracking. Master's Thesis. Cambridge University Engineering Department, UK.
  • 4
    • 0034273195
    • Delacourt, P., Wellekens, C.J., 2000. DISTBIC: a speaker-based segmentation for audio data indexing. Speech Comm. 32, 111-126.
  • 6
    • 0037700756
    • Lu, L., Zhang, H., 2002. Speaker change detection and tracking in real-time news broadcasting analysis. In: Proc. 10th ACM Multimedia, Juan-les-Pins, France, pp. 602-610.
  • 8
    • 33748443739
    • Prasanna, S.R.M., Gupta, C.S., Yegnanarayana, B., 2006. Extraction of speaker-specific excitation information from linear prediction residual of speech. Speech Comm. 48 (10), 1243-1261.
  • 10
    • 0029375490
    • Smits, R., Yegnanarayana, B., 1995. Determination of instants of significant excitation in speech using group delay function. IEEE Trans. Speech Audio Process. 3 (5), 325-333.
  • 11
    • 37649003476
    • Martin, A., Przybocki, M., 2002. The NIST Speaker Recognition Evaluation Plan. National Institute of Standards and Technology, USA.
  • 12
    • 33947613670
    • Chan, W.N., Lee, T., Zheng, N., Ouyang, H., 2006. Use of vocal source features in speaker segmentation. In: Proc. Internat. Conf. on Acoustics Speech and Signal Processing, Toulouse, France, pp. 657-660.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.