메뉴 건너뛰기




Volumn , Issue , 2014, Pages 1405-1409

Audio-to-text alignment for speech recognition with very limited resources

Author keywords

Asr; Low resources; Phonetic alignment; Text to speech alignment

Indexed keywords

AUDIO RECORDINGS; CHARACTER RECOGNITION; DYNAMIC PROGRAMMING; EXPERIMENTS; LINGUISTICS; SPEECH; SPEECH COMMUNICATION;

EID: 84910072484     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (32)

References (25)
  • 2
    • 80155146584 scopus 로고    scopus 로고
    • Automatic synchronization of electronic and audio books via TTS alignment and silence filtering
    • X. Anguera, N. Perez, A. Urruela, and N. Oliver, "Automatic Synchronization of Electronic and Audio Books via TTS Alignment and Silence Filtering, " in Proc. ICME, 2011.
    • (2011) Proc. ICME
    • Anguera, X.1    Perez, N.2    Urruela, A.3    Oliver, N.4
  • 3
    • 79956282392 scopus 로고    scopus 로고
    • Segmentation of monologues in audio books for building synthetic voices
    • K. Prahallad and A. W. Black, "Segmentation of Monologues in Audio Books for Building Synthetic Voices, " Trans. Audio, Speech and Language Processing, vol. 19, no. 5, pp. 1444-1449, 2011.
    • (2011) Trans. Audio, Speech and Language Processing , vol.19 , Issue.5 , pp. 1444-1449
    • Prahallad, K.1    Black, A.W.2
  • 4
    • 34547521678 scopus 로고    scopus 로고
    • Automatic alignment and error correction of human generated transcripts for long speech recordings
    • T. J. Hazen, "Automatic Alignment and Error Correction of Human Generated Transcripts for Long Speech Recordings, " in Proc. Inter Speech, 2006, pp. 1606-1609.
    • (2006) Proc. Inter Speech , pp. 1606-1609
    • Hazen, T.J.1
  • 5
    • 0343950213 scopus 로고    scopus 로고
    • Improving acoustic models by watching television
    • Carnegie Mellon University, Tech. Rep
    • M. J.Witbrock and A. G. Hauptmann, "Improving Acoustic Models by Watching Television, " Technical Report CMU-CS-98-110, Carnegie Mellon University, Tech. Rep., 1998.
    • (1998) Technical Report CMU-CS-98-110
    • Witbrock, M.J.1    Hauptmann, A.G.2
  • 6
  • 7
    • 46449097482 scopus 로고    scopus 로고
    • Alignment of speech to highly imperfect text transcriptions
    • A. Haubold and J. R. Kender, "Alignment of Speech to Highly Imperfect Text Transcriptions, " in Proc. ICME, 2007.
    • (2007) Proc. ICME
    • Haubold, A.1    Kender, J.R.2
  • 11
    • 84910039499 scopus 로고    scopus 로고
    • Automatic generation of hyperlinks between audio and transcript
    • September
    • J. Robert-Ribes and R. Mukhtar, "Automatic Generation of Hyperlinks Between Audio and Transcript, " in Proc. Eurospeech, vol. 1997, no. September, 1997, pp. 903-906.
    • (1997) Proc. Eurospeech , vol.1997 , pp. 903-906
    • Robert-Ribes, J.1    Mukhtar, R.2
  • 12
    • 84885726863 scopus 로고    scopus 로고
    • A recursive algorithm for the forced alignment of very long audio segments
    • P. J. Moreno, C. Joerg, J.-m. Van Thong, and O. Glickman, "A Recursive Algorithm for the Forced Alignment of Very Long Audio Segments, " in Proc. ICSLP, 1998.
    • (1998) Proc. ICSLP
    • Moreno, P.J.1    Joerg, C.2    Van Thong, J.-M.3    Glickman, O.4
  • 13
    • 84906260292 scopus 로고    scopus 로고
    • Text-to-speech alignment of long recordings using universal phone models
    • August
    • S. Hoffmann and B. Pfister, "Text-to-Speech Alignment of Long Recordings Using Universal Phone Models, " Proc. Inter Speech, no. August, pp. 1520-1524, 2013.
    • (2013) Proc. Inter Speech , pp. 1520-1524
    • Hoffmann, S.1    Pfister, B.2
  • 14
    • 84906264108 scopus 로고    scopus 로고
    • Technique for automatic sentence level alignment of long speech and transcripts
    • August
    • I. Ahmed, S. K. Kopparapu, T. C. S. Innovation, L. Mumbai, Y. Park, and T. West, "Technique for Automatic Sentence Level Alignment of Long Speech and Transcripts, " in Proc. Inter Speech, no. August, 2013, pp. 1516-1519.
    • (2013) Proc. Inter Speech , pp. 1516-1519
    • Ahmed, I.1    Kopparapu, S.K.2    Innovation, T.C.S.3    Mumbai, L.4    Park, Y.5    West, T.6
  • 15
    • 84865764419 scopus 로고    scopus 로고
    • Rapid building of an ASR system for under-resourced languages based on multilingual unsupervised training
    • August
    • N. T. Vu, F. Kraus, and T. Schultz, "Rapid building of an ASR system for Under-Resourced Languages based on Multilingual Unsupervised Training, " in Proc. Inter Speech, no. August, 2011, pp. 3145-3148.
    • (2011) Proc. Inter Speech , pp. 3145-3148
    • Vu, N.T.1    Kraus, F.2    Schultz, T.3
  • 16
    • 0036460908 scopus 로고    scopus 로고
    • Lightly supervised recognition for automatic alignment of large coherent speech recordings
    • N. Braunschweiler, M. J. F. Gales, and S. Buchholz, "Lightly supervised recognition for automatic alignment of large coherent speech recordings, " Trans. Computer Speech and Language, vol. 16, no. 1, pp. 115-129, 2002.
    • (2002) Trans. Computer Speech and Language , vol.16 , Issue.1 , pp. 115-129
    • Braunschweiler, N.1    Gales, M.J.F.2    Buchholz, S.3
  • 19
    • 84906274473 scopus 로고    scopus 로고
    • An open-source state-of-the-art toolbox for broadcast news Diarization
    • [Online]
    • M. Rouvier, G. Dupuy, P. Gay, E. Khoury, T. Merlin, and S. Meignier, "An Open-source State-of-the-art Toolbox for Broadcast News Diarization, " in Proc. Inter Speech, 2013. [Online]. Available: Http://www-lium.univ-lemans.fr/diarization.
    • (2013) Proc. Inter Speech
    • Rouvier, M.1    Dupuy, G.2    Gay, P.3    Khoury, E.4    Merlin, T.5    Meignier, S.6
  • 21
    • 85009230817 scopus 로고    scopus 로고
    • Grapheme based speech recognition
    • M. Killer, S. Stüker, and T. Schultz, "Grapheme Based Speech Recognition, " in Eurospeech, 2003, pp. 3141-3144.
    • (2003) Eurospeech , pp. 3141-3144
    • Killer, M.1    Stüker, S.2    Schultz, T.3
  • 22
    • 78049527800 scopus 로고    scopus 로고
    • The Cere voice characterful speech synthesiser SDK
    • Newcastle
    • M. P. Aylett and C. J. Pidcock, "The CereVoice Characterful Speech Synthesiser SDK, " in Proc. AISB, Newcastle, 2007, pp. 174-178.
    • (2007) Proc. AISB , pp. 174-178
    • Aylett, M.P.1    Pidcock, C.J.2
  • 23
    • 84976375912 scopus 로고
    • CEUDEX: A data base oriented to context-dependent units training in Spanish for continuous speech recognition
    • September
    • C. de la Torre, L. Gernández-Gómez, and D. Tapias, "CEUDEX: A Data Base oriented to Context-Dependent Units Training in Spanish for Continuous Speech Recognition, " in Proc. Eurospeech, no. September, 1995, pp. 845-848.
    • (1995) N Proc. Eurospeech , pp. 845-848
    • De La Torre, C.1    Gernández-Gómez, L.2    Tapias, D.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.