메뉴 건너뛰기




Volumn , Issue , 2009, Pages 4197-4200

Audio segmentation for speech recognition using segment features

Author keywords

Audio segmentation; Broadcast news transcription; Speech recognition

Indexed keywords

AUDIO PROCESSING; AUDIO SEGMENTATION; AUDIO STREAM; BROADCAST NEWS TRANSCRIPTION; CHANGE-POINTS; LINEAR SEGMENTS; MAXIMUM A POSTERIORI DECODING; PRE-PROCESSING STEP; SEGMENTATION METHODS; SEGMENTATION QUALITY; SEGMENTATION TECHNIQUES; SIGNIFICANT IMPACTS; SPEECH RECOGNITION PERFORMANCE;

EID: 70349220959     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2009.4960554     Document Type: Conference Paper
Times cited : (60)

References (9)
  • 2
    • 0002595416 scopus 로고    scopus 로고
    • Speaker, environment and channel change detection and clustering via the Bayesian information criterion
    • Feb
    • S.S. Chen and P.S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the Bayesian information criterion," in DARPA Broadcast News Transcription and Understanding Workshop, Feb. 1998, pp. 127-132.
    • (1998) DARPA Broadcast News Transcription and Understanding Workshop , pp. 127-132
    • Chen, S.S.1    Gopalakrishnan, P.S.2
  • 3
    • 4544280424 scopus 로고    scopus 로고
    • Generating and evaluating segmentations for automatic speech recognition of conversational telephone speech
    • Montreal, Canada, May
    • S. E. Tranter, K. Yu, G. Evermann, and P. C.Woodland, "Generating and evaluating segmentations for automatic speech recognition of conversational telephone speech," in Proc. ICASSP, Montreal, Canada, May 2004, vol. 1, pp. 753-756.
    • (2004) Proc. ICASSP , vol.1 , pp. 753-756
    • Tranter, S.E.1    Yu, K.2    Evermann, G.3    Woodland, P.C.4
  • 4
    • 44849107456 scopus 로고    scopus 로고
    • Advances in Arabic broadcast news transcription at RWTH
    • Kyoto, Japan, Dec
    • D. Rybach, S. Hahn, C. Gollan, R. Schlüter, and H. Ney, "Advances in Arabic broadcast news transcription at RWTH," in Proc. ASRU, Kyoto, Japan, Dec. 2007, pp. 449-454.
    • (2007) Proc. ASRU , pp. 449-454
    • Rybach, D.1    Hahn, S.2    Gollan, C.3    Schlüter, R.4    Ney, H.5
  • 6
    • 0000120766 scopus 로고
    • Estimating the dimension of a model
    • G. Schwarz, "Estimating the dimension of a model," The Annals of Statistics, vol. 6, no. 2, pp. 461-464, 1978.
    • (1978) The Annals of Statistics , vol.6 , Issue.2 , pp. 461-464
    • Schwarz, G.1
  • 7
    • 10844275417 scopus 로고    scopus 로고
    • Evaluation of BIC-based algorithms for audio segmentation
    • Apr
    • M. Cettolo, M. Vescovi, and R. Rizzi, "Evaluation of BIC-based algorithms for audio segmentation," Computer Speech and Language, vol. 19, no. 2, pp. 147-170, Apr. 2005.
    • (2005) Computer Speech and Language , vol.19 , Issue.2 , pp. 147-170
    • Cettolo, M.1    Vescovi, M.2    Rizzi, R.3
  • 8
    • 0036567851 scopus 로고    scopus 로고
    • The LIMSI broadcast news transcription system
    • May
    • J.-L. Gauvain, L. Lamel, and G. Adda, "The LIMSI broadcast news transcription system," Speech Communication, vol. 37, no. 1-2, pp. 89-108, May 2002.
    • (2002) Speech Communication , vol.37 , Issue.1-2 , pp. 89-108
    • Gauvain, J.-L.1    Lamel, L.2    Adda, G.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.