메뉴 건너뛰기




Volumn 2015-January, Issue , 2015, Pages 3145-3149

The Cambridge university 2014 BOLT conversational telephone Mandarin Chinese lvcsr system for speech translation

Author keywords

Character LM; Conversational speech transcription; RNNLM; Speech translation; System combination

Indexed keywords

BOLTS; DECODING; RECURRENT NEURAL NETWORKS; SPEECH; SPEECH TRANSMISSION; TELEPHONE SETS; TRANSCRIPTION;

EID: 84959109976     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (13)

References (42)
  • 2
    • 0030638031 scopus 로고    scopus 로고
    • A post-processing system to yield reduced word error rates: Recogniser output voting error reduction (ROVER)
    • Santa Barbara, CA
    • J. G. Fiscus (1997). "A post-processing system to yield reduced word error rates: recogniser output voting error reduction (ROVER), " in Proc. IEEE ASRU, Santa Barbara, CA, pp. 347-354.
    • (1997) Proc. IEEE ASRU , pp. 347-354
    • Fiscus, J.G.1
  • 3
    • 4544253834 scopus 로고    scopus 로고
    • Posterior probability decoding, confidence estimation and system combination
    • College Park, MD
    • G. Evermann and P. C. Woodland (2000), "Posterior probability decoding, confidence estimation and system combination, " in Proc. Speech Transcription Workshop, College Park, MD, 2000.
    • (2000) Proc. Speech Transcription Workshop
    • Evermann, G.1    Woodland, P.C.2
  • 5
    • 0001076101 scopus 로고    scopus 로고
    • A stocastic finite-state word-segmentation algorithm for Chinese
    • R. Sproat, C. Shih, N. Chang, and W. Gale. (1996). A stocastic finite-state word-segmentation algorithm for Chinese, in Computational Linguistics, Vol. 22, Issue, 3, 1996, pp. 377-404.
    • (1996) Computational Linguistics , vol.22 , Issue.3 , pp. 377-404
    • Sproat, R.1    Shih, C.2    Chang, N.3    Gale, W.4
  • 6
    • 84872073683 scopus 로고    scopus 로고
    • Syllable language models for Mandarin speech recognition: Exploiting character sequence models
    • January
    • X. Liu, J. L. Hieronymus, M. J. F. Gales and P. C. Woodland (2013). "Syllable language models for Mandarin speech recognition: exploiting character sequence models", Journal of the Acoustical Society of America, Volume 133, Issue 1, pp. 519-528, January 2013.
    • (2013) Journal of the Acoustical Society of America , vol.133 , Issue.1 , pp. 519-528
    • Liu, X.1    Hieronymus, J.L.2    Gales, M.J.F.3    Woodland, P.C.4
  • 7
    • 0029747183 scopus 로고    scopus 로고
    • Speaker normalization using efficient frequency warping procedures
    • Atlanta, GA
    • L. Lee, and R. C. Rose (1996) "Speaker normalization using efficient frequency warping procedures, " in Proc. IEEE ICASSP, Atlanta, GA, 1996, vol. 1, pp. 353-356.
    • (1996) Proc. IEEE ICASSP , vol.1 , pp. 353-356
    • Lee, L.1    Rose, R.C.2
  • 8
    • 84959142742 scopus 로고    scopus 로고
    • A general artificial neural network extension for HTK
    • C. Zhang, and P. C. Woodland (2015). "A general artificial neural network extension for HTK", in submission to ISCA Interspeech.
    • (2015) ISCA Interspeech
    • Zhang, C.1    Woodland, P.C.2
  • 9
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pretrained deep neural networks for large vocabulary speech recognition
    • January
    • G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pretrained deep neural networks for large vocabulary speech recognition", in IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, January 2012.
    • (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 11
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • Istanbul, Turkey
    • H. Hermansky, D. Ellis and S. Sharma (2000). "Tandem connectionist feature extraction for conventional HMM systems", in Proc. IEEE ICASSP, Istanbul, Turkey, vol. 3, pp. 1635-1638.
    • (2000) Proc. IEEE ICASSP , vol.3 , pp. 1635-1638
    • Hermansky, H.1    Ellis, D.2    Sharma, S.3
  • 12
    • 84865785753 scopus 로고    scopus 로고
    • Improved bottleneck features using pretrained deep neural networks
    • Florence, Italy
    • D. Yu and M. L. Seltzer (2011). "Improved bottleneck features using pretrained deep neural networks", in Proc. ISCA Interspeech, Florence, Italy, 2011, pp. 237-240.
    • (2011) Proc. ISCA Interspeech , pp. 237-240
    • Yu, D.1    Seltzer, M.L.2
  • 13
    • 84903160476 scopus 로고    scopus 로고
    • Paraphrastic language models
    • November
    • X. Liu, M. J. F. Gales, and P. C. Woodland (2014). "Paraphrastic language models", Computer Speech and Language, vol. 28, Issue 6, pp. 1298-1316, November 2014.
    • (2014) Computer Speech and Language , vol.28 , Issue.6 , pp. 1298-1316
    • Liu, X.1    Gales, M.J.F.2    Woodland, P.C.3
  • 14
    • 84946066405 scopus 로고    scopus 로고
    • Paraphrastic recurrent neural network language models
    • Brisbane, Australia
    • X. Liu, M. J. F. Gales, and P. C. Woodland (2015), "Paraphrastic recurrent neural network language models, " in Proc. IEEE ICASSP, Brisbane, Australia, 2015.
    • (2015) Proc. IEEE ICASSP
    • Liu, X.1    Gales, M.J.F.2    Woodland, P.C.3
  • 15
    • 33947703664 scopus 로고    scopus 로고
    • The CU-HTK Mandarin broadcast news transcription system
    • Toulouse, France
    • R. Sinha, M. J. F. Gales, D. Y. Kim, X. Liu, K. C. Sim, and P. C. Woodland (2006). "The CU-HTK Mandarin broadcast news transcription system, " in Proc. IEEE ICASSP, Toulouse, France, 2006, vol. 1, pp. 1077-1080.
    • (2006) Proc. IEEE ICASSP , vol.1 , pp. 1077-1080
    • Sinha, R.1    Gales, M.J.F.2    Kim, D.Y.3    Liu, X.4    Sim, K.C.5    Woodland, P.C.6
  • 16
    • 33646821390 scopus 로고    scopus 로고
    • Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system
    • Philadelphia, PA
    • M. J. F. Gales, B. Jia, X. Liu, K. C. Sim, P. C. Woodland, and K. Yu (2005). "Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system, " in Proc. IEEE ICASSP, Philadelphia, PA, 2005, vol. 1, pp. 841-844.
    • (2005) Proc. IEEE ICASSP , vol.1 , pp. 841-844
    • Gales, M.J.F.1    Jia, B.2    Liu, X.3    Sim, K.C.4    Woodland, P.C.5    Yu, K.6
  • 20
    • 0141703325 scopus 로고    scopus 로고
    • Automatic complexity control for HLDA systems
    • Hong Kong, China
    • X. Liu, M. J. F. Gales, and P. C. Woodland (2003). "Automatic complexity control for HLDA systems", in Proc. IEEE ICASSP, Hong Kong, China, vol. 1, pp. 132-135.
    • (2003) Proc. IEEE ICASSP , vol.1 , pp. 132-135
    • Liu, X.1    Gales, M.J.F.2    Woodland, P.C.3
  • 22
    • 80051623316 scopus 로고    scopus 로고
    • Investigation of acoustic units for LVCSR systems
    • Prague, Czech Republic
    • X. Liu, M. J. F. Gales, J. L. Hieronymus and P. C. Woodland (2011). "Investigation of acoustic units for LVCSR systems", in Proc. IEEE ICASSP, Prague, Czech Republic, pp. 4872-4875.
    • (2011) Proc. IEEE ICASSP , pp. 4872-4875
    • Liu, X.1    Gales, M.J.F.2    Hieronymus, J.L.3    Woodland, P.C.4
  • 23
    • 0036296863 scopus 로고    scopus 로고
    • Minimum phone error and I-smoothing for improved discriminative training
    • Orlando, FL 2002
    • D. Povey and P. C. Woodland (2002). "Minimum phone error and I-smoothing for improved discriminative training", in Proc. IEEE ICASSP, Orlando, FL, 2002, vol. 1 105-108.
    • (2002) Proc. IEEE ICASSP , vol.1 , pp. 105-108
    • Povey, D.1    Woodland, P.C.2
  • 24
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales (1998). "Maximum likelihood linear transformations for HMM-based speech recognition, " Computer Speech and Language, 12 (2): 75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 25
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density HMMs
    • C. J. Leggetter and P. C. Woodland (1995). "Maximum likelihood linear regression for speaker adaptation of continuous density HMMs", Computer Speech and Language, 9 (2): 171-185, 1995.
    • (1995) Computer Speech and Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 27
    • 34547548235 scopus 로고    scopus 로고
    • Probabilistic and bottle-neck features for LVCSR of meetings
    • Honolulu, HI
    • F. Grezl, M. Karafiat, S. Kontar and J. Cernocky, "Probabilistic and bottle-neck features for LVCSR of meetings", in Proc. IEEE ICASSP, Honolulu, HI, 2007, vol. 4, pp. 757-760.
    • (2007) Proc. IEEE ICASSP , vol.4 , pp. 757-760
    • Grezl, F.1    Karafiat, M.2    Kontar, S.3    Cernocky, J.4
  • 28
    • 84890492591 scopus 로고    scopus 로고
    • Revisiting hybrid and GMM-HMM system combination techniques
    • Vancouver, Canada
    • P. Swietojanski, A. Ghoshal, and S. Renals (2013). "Revisiting hybrid and GMM-HMM system combination techniques, " in IEEE ICASSP, Vancouver, Canada, 2013, pp. 6744-6748.
    • (2013) IEEE ICASSP , pp. 6744-6748
    • Swietojanski, P.1    Ghoshal, A.2    Renals, S.3
  • 29
    • 84905265980 scopus 로고    scopus 로고
    • Joint training of convolutional and non-convolutional neural networks
    • Florence, Italy
    • H. Soltau, G. Saon, and T. N. Sainath (2014). "Joint training of convolutional and non-convolutional neural networks, " in IEEE ICASSP, Florence, Italy, 2014, pp. 5572-5576.
    • (2014) IEEE ICASSP , pp. 5572-5576
    • Soltau, H.1    Saon, G.2    Sainath, T.N.3
  • 30
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden markov models
    • M. J. F. Gales (1999). "Semi-tied Covariance Matrices for Hidden Markov Models", IEEE Transactions on Speech and Audio Processing, pp. 272-281, vol. 7, 1999.
    • (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , pp. 272-281
    • Gales, M.J.F.1
  • 32
    • 84910067710 scopus 로고    scopus 로고
    • Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch
    • Singapore
    • X. Chen, Y. Wang, X. Liu, M. J. F. Gales and P. C. Woodland (2014). "Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch", in Proc. ISCA Interspeech, Singapore, 2014, pp. 641-645.
    • (2014) Proc. ISCA Interspeech , pp. 641-645
    • Chen, X.1    Wang, Y.2    Liu, X.3    Gales, M.J.F.4    Woodland, P.C.5
  • 33
    • 84906240855 scopus 로고    scopus 로고
    • Prefix tree based n-best list re-scoring for recurrent neural network language model used in speech recognition system
    • Lyon, France
    • Y. Si, Q. Zhang, T. Li, J. Pan, and Y. Yan (2013), "Prefix tree based n-best list re-scoring for recurrent neural network language model used in speech recognition system, " in Proc. ISCA Interspeech, Lyon, France, 2013, pp. 3419-3423.
    • (2013) Proc. ISCA Interspeech , pp. 3419-3423
    • Si, Y.1    Zhang, Q.2    Li, T.3    Pan, J.4    Yan, Y.5
  • 34
    • 0034296009 scopus 로고    scopus 로고
    • Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
    • L. Mangu, E. Brill, and A. Stolcke (2000). "Finding consensus in speech recognition: word error minimization and other applications of confusion networks, " Computer Speech and Language, 14 (4): 373-400, 2000.
    • (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 373-400
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 35
    • 84905240726 scopus 로고    scopus 로고
    • Efficient lattice rescoring using recurrent neural network language models
    • Florence, Italy
    • X. Liu, Y. Wang, X. Chen, M. J. F. Gales, and P. C. Woodland (2014), "Efficient lattice rescoring using recurrent neural network language models, " in Proc. IEEE ICASSP, Florence, Italy, 2014, pp. 4941-4945.
    • (2014) Proc. IEEE ICASSP , pp. 4941-4945
    • Liu, X.1    Wang, Y.2    Chen, X.3    Gales, M.J.F.4    Woodland, P.C.5
  • 36
    • 84867332205 scopus 로고    scopus 로고
    • Use of contexts in language model interpolation and adaptation
    • January 2013
    • X. Liu, M. J. F. Gales, and P. C. Woodland (2013), "Use of contexts in language model interpolation and adaptation, " Computer Speech and Language, vol. 27, no. 1, pp. 301-321, January 2013.
    • (2013) Computer Speech and Language , vol.27 , Issue.1 , pp. 301-321
    • Liu, X.1    Gales, M.J.F.2    Woodland, P.C.3
  • 37
    • 84875943582 scopus 로고    scopus 로고
    • Language model cross adaptation for LVCSR system combination
    • June 2013
    • X. Liu, M. J. F. Gales & P. C. Woodland (2013). "Language model cross adaptation for LVCSR system combination", Computer Speech and Language, vol. 27, no. 4, pp. 928-942, June 2013.
    • (2013) Computer Speech and Language , vol.27 , Issue.4 , pp. 928-942
    • Liu, X.1    Gales, M.J.F.2    Woodland, P.C.3
  • 39
    • 4544354321 scopus 로고    scopus 로고
    • Speech recognition in multiple languages and domains: The 2003 BBN/LIMSI EARS system
    • Montreal, Canada
    • R. Schwartz et al. (2004). Speech Recognition in Multiple Languages and Domains: The 2003 BBN/LIMSI EARS System, in Proc. IEEE ICASSP, Montreal, Canada, 2004, vol. 3, pp. 753-756.
    • (2004) Proc. IEEE ICASSP , vol.3 , pp. 753-756
    • Schwartz, R.1
  • 40
    • 78049384511 scopus 로고    scopus 로고
    • The 2009 IBM gale Mandarin broadcast transcription system
    • Dallas, TX 2010
    • S. M. Chu et al. (2010). "The 2009 IBM GALE Mandarin Broadcast Transcription System, " in Proc. IEEE ICASSP, Dallas, TX, 2010, pp. 4374-4377.
    • (2010) Proc. IEEE ICASSP , pp. 4374-4377
    • Chu, S.M.1
  • 42
    • 33745208455 scopus 로고    scopus 로고
    • The 2004 bbn/limsi 20xrt english conversational telephone speech recognition system
    • Lisboa, Portugal 2005
    • R. Prasad et al. (2005). "The 2004 BBN/LIMSI 20xRT English conversational telephone speech recognition system, " in Proc. ISCA Interspeech, Lisboa, Portugal, 2005, pp. 1645-1648.
    • (2005) Proc. ISCA Interspeech , pp. 1645-1648
    • Prasad, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.