



Volume , Issue , 2013, Pages 1516-1519

Technique for automatic sentence level alignment of long speech and transcripts

Author keywords

Long audio; Resource deficient; Speech corpus; Speech text alignment; Syllable

Indexed keywords

ALIGNMENT; SPEECH RECOGNITION

EID: 84906264108     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 9

References (16)
  • 1
    • 84869454077
    • Speech recognition for resource deficient languages using frugal speech corpus
    • Hong Kong, China, Aug, (to appear)
    • I. Ahmed and S. Kopparapu, "Speech recognition for resource deficient languages using frugal speech corpus," in ICSPCC2012, Hong Kong, China, Aug 2012, (to appear).
    • (2012) ICSPCC2012
    • Ahmed, I.1    Kopparapu, S.2
  • 2
    • 84865744412
    • Efficient harvesting of internet audio for resource-scarce ASR
    • Florence, Italy
    • M. Davel, C. Van Heerden, N. Kleynhans, and E. Barnard, "Efficient harvesting of internet audio for resource-scarce ASR," in Interspeech 2011, Florence, Italy, 2011.
    • (2011) Interspeech 2011
    • Davel, M.1    Van Heerden, C.2    Kleynhans, N.3    Barnard, E.4
  • 4
    • 84906271474
    • [Online]. Available
    • AIR, "All India Radio news archives." [Online]. Available: http://www.newsonair.com/.
    • All India Radio News Archives
  • 5
    • 84885726863
    • A recursive algorithm for the forced alignment of very long audio segments
    • P. Moreno, C. Joerg, J. Van Thong, and O. Glickman, "A recursive algorithm for the forced alignment of very long audio segments," in ICSLP 98, 1998, pp. 2711-2714.
    • (1998) ICSLP 98 , pp. 2711-2714
    • Moreno, P.1    Joerg, C.2    Van Thong, J.3    Glickman, O.4
  • 6
    • 34547521678
    • Automatic alignment and error correction of human generated transcripts for long speech recordings
    • T. J. Hazen, "Automatic alignment and error correction of human generated transcripts for long speech recordings," in ICSLP 06, 2006, pp. 1606-1609.
    • (2006) ICSLP 06 , pp. 1606-1609
    • Hazen, T.J.1
  • 9
    • 33745223233
    • Automatic closed caption alignment based on speech recognition transcripts
    • December
    • C. Huang, W. Hsu, and S. Chang, "Automatic closed caption alignment based on speech recognition transcripts," Columbia University, Tech. Rep., December 2003.
    • (2003) Columbia University, Tech. Rep.
    • Huang, C.1    Hsu, W.2    Chang, S.3
  • 11
    • 77954235232
    • A dynamic alignment algorithm for imperfect speech and transcript
    • Y. Tao, X. Li, and B. Wu, "A dynamic alignment algorithm for imperfect speech and transcript," Computer Science and Information Systems, vol. 7, 2010.
    • (2010) Computer Science and Information Systems , vol.7
    • Tao, Y.1    Li, X.2    Wu, B.3
  • 12
    • 79956282392
    • Segmentation of monologues in audio books for building synthetic voices
    • K. Prahallad and A. W. Black, "Segmentation of monologues in audio books for building synthetic voices," IEEE Trans. Audio, Speech and Language Processing, vol. 19, pp. 1444-1449, 2011.
    • (2011) IEEE Trans. Audio, Speech and Language Processing , vol.19 , pp. 1444-1449
    • Prahallad, K.1    Black, A.W.2
  • 13
    • 66149126976
    • Praat script to detect syllable nuclei and measure speech rate automatically
    • N. de Jong and T. Wempe, "Praat script to detect syllable nuclei and measure speech rate automatically," Behavior Research Methods, vol. 41, pp. 385-390, 2009.
    • (2009) Behavior Research Methods , vol.41 , pp. 385-390
    • De Jong, N.1    Wempe, T.2
  • 14
    • 0001835850
    • Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
    • P. Boersma, "Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound," in Institute of Phonetic Sciences, University of Amsterdam, Proceedings 17, 1993.
    • (1993) Institute of Phonetic Sciences, University of Amsterdam, Proceedings , vol.17
    • Boersma, P.1
  • 15
    • 84906232633
    • NU, [Online]. Available
    • NU, "English syllable counter from Northwestern University." [Online]. Available: http://morphadorner.northwestern.edu/morphadorner/documentation/javadoc/edu/northwestern/at/utils/corpuslinguistics/syllablecounter/EnglishSyllableCounter.html.
    • English Syllable Counter from Northwestern University
  • 16
    • 85009179208
    • Unit size in unit selection speech synthesis
    • Geneva, Switzerland
    • K. Prahallad and A. W. Black, "Unit size in unit selection speech synthesis," in Eurospeech 2003, Geneva, Switzerland, 2003.
    • (2003) Eurospeech 2003
    • Prahallad, K.1    Black, A.W.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.