2010, Pages 3130-3133

Automatic turn segmentation in spoken conversations

Author keywords

Bayesian information criterion; Kullback Leibler divergence; Modulation spectrum; Spoken dialogs; Spoken turn boundary

Indexed keywords

HIDDEN MARKOV MODELS; MODULATION; SPECTRUM ANALYSIS; SPEECH; SPEECH COMMUNICATION; SPEECH TRANSMISSION; TIME DOMAIN ANALYSIS; SPEECH RECOGNITION; TRANSCRIPTION

EID: 79959849529     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 4

References (9)
  • 2
    • 84867193555
    • New advances in voice activity detection using HOS and optimization strategies
    • M. Grimm and K. Kroschel, Eds. I-Tech
    • J.M. Górriz, J. Ramírez, and C.G. Puntonet, "New Advances in Voice Activity Detection Using HOS and Optimization Strategies," in Robust Speech Recognition and Understanding, M. Grimm and K. Kroschel, Eds., p. 460. I-Tech, 2007.
    • (2007) Robust Speech Recognition and Understanding , pp. 460
    • Górriz, J.M.1    Ramírez, J.2    Puntonet, C.G.3
  • 3
    • 0442270734
    • A silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70
    • ITU
    • ITU, "A Silence Compression Scheme for G.729 Optimized for Terminals Conforming to Recommendation V.70," ITU-T Recommendation G.729 Annex B, 1996.
    • (1996) ITU-T Recommendation G.729 Annex B
  • 4
    • 26844432206
    • Voice activity detector (VAD) for adaptive multi-rate (AMR) speech traffic channels
    • ETSI
    • ETSI, "Voice Activity Detector (VAD) for Adaptive Multi-Rate (AMR) Speech Traffic Channels," ETSI EN 301 708 Recommendation, 1999.
    • (1999) ETSI EN 301 708 Recommendation
  • 6
    • 34047272330
    • Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations
    • DOI 10.1109/TSA.2005.858055
    • N. Mesgarani, M. Slaney, and S. A. Shamma, "Discrimination of Speech from Nonspeech Based on Multiscale Spectrotemporal Modulations," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, pp. 920-930, May 2006. (Pubitemid 46547653)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 920-930
    • Mesgarani, N.1    Slaney, M.2    Shamma, S.A.3
  • 7
    • 38949122754
    • Review. Speaker segmentation and clustering
    • M. Kotti, V. Moschou, and C. Kotropoulos, "Review. Speaker Segmentation and Clustering," Signal Processing, vol. 88, pp. 1091-1124, 2008.
    • (2008) Signal Processing , vol.88 , pp. 1091-1124
    • Kotti, M.1    Moschou, V.2    Kotropoulos, C.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.