메뉴 건너뛰기




Volumn 133, Issue 1, 2013, Pages 519-528

Syllable language models for Mandarin speech recognition: Exploiting character language models

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO-RECOGNITION; CHARACTER ERROR RATES; CHARACTER LEVEL; CHINESE LANGUAGE; LANGUAGE MODEL; LINGUISTIC INFORMATION; MANDARIN CHINESE; MANDARIN SPEECH RECOGNITION; MODEL-BASED OPC; SPEECH RECOGNITION PERFORMANCE; SPOKEN LANGUAGES; TEXT SOURCES; WORD LEVEL;

EID: 84872073683     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.4768800     Document Type: Article
Times cited : (22)

References (44)
  • 2
    • 0001076101 scopus 로고    scopus 로고
    • A stochastic finite-state word-segmentation algorithm for Chinese
    • R. Sproat, C. Shih, N. Chang, and W. Gale, " A stochastic finite-state word-segmentation algorithm for Chinese.," Comput. Linguist. 22 (3), 377-404 (1996).
    • (1996) Comput. Linguist. , vol.22 , Issue.3 , pp. 377-404
    • Sproat, R.1    Shih, C.2    Chang, N.3    Gale, W.4
  • 6
    • 85178580924 scopus 로고
    • The syllable
    • in, edited by H. van der Hulst and Norval Smieth (Fortus, Dordrecht), Vol.
    • E. O. Selkirk, " The syllable.," in The Structure of Phonological Representations, edited by, H. van der Hulst, and, Norval Smieth, (Fortus, Dordrecht, 1982), Vol. 2, pp. 337-385.
    • (1982) The Structure of Phonological Representations , vol.2 , pp. 337-385
    • Selkirk, E.O.1
  • 7
    • 0026240875 scopus 로고
    • Markov modeling of Mandarin Chinese for decoding the phonetic sequence into Chinese characters
    • 10.1016/0885-2308(91)90004-A
    • H. Gu, C. Tseng, and L. Lee, " Markov modeling of Mandarin Chinese for decoding the phonetic sequence into Chinese characters.," Comput. Speech Lang. 5, 363-371 (1991). 10.1016/0885-2308(91)90004-A
    • (1991) Comput. Speech Lang. , vol.5 , pp. 363-371
    • Gu, H.1    Tseng, C.2    Lee, L.3
  • 9
  • 11
    • 70349210890 scopus 로고    scopus 로고
    • Modeling characters versus words for Mandarin speech recognition
    • in.
    • J. Luo, L. Lamel, and J-L. Gauvain, " Modeling characters versus words for Mandarin speech recognition.," in Proceedings of ICASSP2009 (2009).
    • (2009) Proceedings of ICASSP2009
    • Luo, J.1    Lamel, L.2    Gauvain, J.-L.3
  • 12
    • 0031103274 scopus 로고    scopus 로고
    • Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data
    • 10.1109/89.554782
    • H. M. Wang, T. H. Ho, R. C. Yang, J. L. Shen, B. R. Bai, J. C. Hong, W. P. Chen, T. L. Yu, and L. S. Lee, " Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data.," IEEE Trans. Speech Audio Process. 5 (2), 195-200 (1997). 10.1109/89.554782
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.2 , pp. 195-200
    • Wang, H.M.1    Ho, T.H.2    Yang, R.C.3    Shen, J.L.4    Bai, B.R.5    Hong, J.C.6    Chen, W.P.7    Yu, T.L.8    Lee, L.S.9
  • 13
    • 0023312404 scopus 로고
    • Estimation of probabilities from sparse data for the language model component of a speech recognizer
    • 10.1109/TASSP.1987.1165125
    • S. M. Katz, " Estimation of probabilities from sparse data for the language model component of a speech recognizer.," IEEE Trans. Acoust., Speech, Signal Process. 35 (3), 400-401 (1987). 10.1109/TASSP.1987.1165125
    • (1987) IEEE Trans. Acoust., Speech, Signal Process. , vol.35 , Issue.3 , pp. 400-401
    • Katz, S.M.1
  • 14
    • 0030715425 scopus 로고    scopus 로고
    • Language model adaptation using mixtures and an exponentially decaying cache
    • in, Munich
    • P. Clarkson and A. Robinson, " Language model adaptation using mixtures and an exponentially decaying cache.," in Proceedings of ICASSP1997, Munich (1997), pp. 799-802.
    • (1997) Proceedings of ICASSP1997 , pp. 799-802
    • Clarkson, P.1    Robinson, A.2
  • 15
    • 0019114666 scopus 로고
    • Interpolated estimation of Markov source parameters from sparse data
    • in, edited by E. S. Gelsema and L. N. Kanal (Norh-Holland, Amsterdam)
    • F. Jelinek and R. Mercer, " Interpolated estimation of Markov source parameters from sparse data.," in Pattern Recognition in Practice, edited by, E. S. Gelsema, and, L. N. Kanal, (Norh-Holland, Amsterdam, 1980), pp. 381-402.
    • (1980) Pattern Recognition in Practice , pp. 381-402
    • Jelinek, F.1    Mercer, R.2
  • 16
    • 0030181951 scopus 로고    scopus 로고
    • A maximum entropy approach to adaptive statistical language modeling
    • 10.1006/csla.1996.0011
    • R. Rosenfeld, " A maximum entropy approach to adaptive statistical language modeling.," Comput. Speech Lang. 10, 187-228 (1996). 10.1006/csla.1996.0011
    • (1996) Comput. Speech Lang. , vol.10 , pp. 187-228
    • Rosenfeld, R.1
  • 17
    • 44949090835 scopus 로고    scopus 로고
    • Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures
    • in, Edmonton.
    • I. Bulyko, M. Ostendorf, and A. Stolcke " Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures.," in Proceedings of HLT '03, Edmonton (2003).
    • (2003) Proceedings of HLT '03
    • Bulyko, I.1    Ostendorf, M.2    Stolcke, A.3
  • 19
    • 0009643324 scopus 로고    scopus 로고
    • Efficient language model adaptation through MDI estimation
    • in, Budapest.
    • M. Federico, " Efficient language model adaptation through MDI estimation.," in Proceedings of EuroSpeech '99, Budapest (1999).
    • (1999) Proceedings of EuroSpeech '99
    • Federico, M.1
  • 21
    • 70450161305 scopus 로고    scopus 로고
    • Use of contexts in language model interpolation and adaptation
    • in, Brighton.
    • X. Liu, M. J. F. Gales, and P. C. Woodland, " Use of contexts in language model interpolation and adaptation.," in Proceedings of Interspeech '09, Brighton (2009).
    • (2009) Proceedings of Interspeech '09
    • Liu, X.1    Gales, M.J.F.2    Woodland, P.C.3
  • 22
    • 0030638031 scopus 로고    scopus 로고
    • A post-processing system to yield reduced word error rates: Recogniser output voting error reduction (ROVER)
    • in.
    • J. G. Fiscus, " A post-processing system to yield reduced word error rates: Recogniser output voting error reduction (ROVER).," in Proceedings of IEEE ASRU '97 (1997).
    • (1997) Proceedings of IEEE ASRU '97
    • Fiscus, J.G.1
  • 24
    • 0013344078 scopus 로고    scopus 로고
    • Training products of experts by minimizing contrastive divergence
    • 10.1162/089976602760128018
    • G. Hinton, " Training products of experts by minimizing contrastive divergence.," Neural Comput. 14, 1771-1800 (2002). 10.1162/ 089976602760128018
    • (2002) Neural Comput. , vol.14 , pp. 1771-1800
    • Hinton, G.1
  • 25
    • 84891308106 scopus 로고    scopus 로고
    • SRILM - An extensible language modeling toolkit
    • in, Denver.
    • A. Stolcke, " SRILM-An extensible language modeling toolkit.," in Proceedings of ICSLP '02, Denver (2002).
    • (2002) Proceedings of ICSLP '02
    • Stolcke, A.1
  • 27
    • 0032661656 scopus 로고    scopus 로고
    • Network optimizations for large vocabulary speech recognition
    • 10.1016/S0167-6393(98)00026-0
    • M. Mohri and M. Riley, " Network optimizations for large vocabulary speech recognition.," Speech Commun. 25 (3), 1-12 (1998). 10.1016/S0167-6393(98)00026-0
    • (1998) Speech Commun. , vol.25 , Issue.3 , pp. 1-12
    • Mohri, M.1    Riley, M.2
  • 28
    • 0012306376 scopus 로고    scopus 로고
    • The design principles of a weighted finite-state transducer library
    • 10.1016/S0304-3975(99)00014-6
    • M. Mohri, F. C. N. Pereira, and M. Riley, " The design principles of a weighted finite-state transducer library.," Theor. Comput. Sci. 231, 17-32 (2000). 10.1016/S0304-3975(99)00014-6
    • (2000) Theor. Comput. Sci. , vol.231 , pp. 17-32
    • Mohri, M.1    Pereira, F.C.N.2    Riley, M.3
  • 29
    • 0036460907 scopus 로고    scopus 로고
    • Weighted finite-state transducers in speech recognition
    • 10.1006/csla.2001.0184
    • M. Mohri, F. C. N. Pereira, and M. Riley, " Weighted finite-state transducers in speech recognition.," Comput. Speech Lang. 16 (1), 69-88 (2002). 10.1006/csla.2001.0184
    • (2002) Comput. Speech Lang. , vol.16 , Issue.1 , pp. 69-88
    • Mohri, M.1    Pereira, F.C.N.2    Riley, M.3
  • 30
    • 70350376504 scopus 로고    scopus 로고
    • Weighted automata algorithms
    • in, edited by Manfred Droste, Werner Kuich, and Heiko Vogler (Springer, Berlin)
    • M. Mohri, " Weighted automata algorithms.," in Handbook of Weighted Automata. Monographs in Theoretical Computer Science, edited by, Manfred Droste, Werner Kuich, and, Heiko Vogler, (Springer, Berlin, 2009), pp. 213-254.
    • (2009) Handbook of Weighted Automata. Monographs in Theoretical Computer Science , pp. 213-254
    • Mohri, M.1
  • 31
    • 0001573124 scopus 로고
    • Generalized iterative scaling for log-linear models
    • 10.1214/aoms/1177692379
    • J. Darroch and D. Ratcliff, " Generalized iterative scaling for log-linear models.," Ann. Math. Stat. 43 (5), 1470-1480 (1972). 10.1214/aoms/1177692379
    • (1972) Ann. Math. Stat. , vol.43 , Issue.5 , pp. 1470-1480
    • Darroch, J.1    Ratcliff, D.2
  • 32
    • 0035059194 scopus 로고    scopus 로고
    • Whole-sentence exponential language models: A vehicle for linguistic-statistical integration
    • 10.1006/csla.2000.0159
    • R. Rosenfeld, S. F. Chen, and X. Zhu, " Whole-sentence exponential language models: A vehicle for linguistic-statistical integration.," Comput. Speech Lang. 15 (1), 55-73 (2001). 10.1006/csla.2000.0159
    • (2001) Comput. Speech Lang. , vol.15 , Issue.1 , pp. 55-73
    • Rosenfeld, R.1    Chen, S.F.2    Zhu, X.3
  • 35
    • 34047273021 scopus 로고    scopus 로고
    • A specialized on-The-fly algorithm for lexicon and language model composition
    • 10.1109/TSA.2005.860838
    • D. A. Caseiro and I. Trancoso, " A specialized on-the-fly algorithm for lexicon and language model composition.," IEEE Trans. Audio, Speech, Lang. Process. 14 (4), 1281-1291 (2006). 10.1109/TSA.2005.860838
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.4 , pp. 1281-1291
    • Caseiro, D.A.1    Trancoso, I.2
  • 37
    • 0036296863 scopus 로고    scopus 로고
    • Minimum phone error and I-smoothing for improved discriminative training
    • in, Orlando.
    • D. Povey and P. C. Woodland, " Minimum phone error and I-smoothing for improved discriminative training.," in Proceedings of IEEE ICASSP2002, Orlando (2002).
    • (2002) Proceedings of IEEE ICASSP2002
    • Povey, D.1    Woodland, P.C.2
  • 39
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density HMMs
    • 10.1006/csla.1995.0010
    • C. J. Leggetter and P. C. Woodland, " Maximum likelihood linear regression for speaker adaptation of continuous density HMMs.," Comput. Speech Lang. 9, 171-186 (1995). 10.1006/csla.1995.0010
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-186
    • Leggetter, C.J.1    Woodland, P.C.2
  • 41
    • 0141703325 scopus 로고    scopus 로고
    • Automatic complexity control for HLDA systems
    • in, Hong Kong , Vol.
    • X. Liu, M. J. F. Gales, and P. C. Woodland, " Automatic complexity control for HLDA systems.," in Proceedings of IEEE ICASSP2003, Hong Kong (2003), Vol. 1, pp. 132-135.
    • (2003) Proceedings of IEEE ICASSP2003 , vol.1 , pp. 132-135
    • Liu, X.1    Gales, M.J.F.2    Woodland, P.C.3
  • 43
    • 0033329799 scopus 로고    scopus 로고
    • An empirical study of smoothing techniques for language modeling
    • 10.1006/csla.1999.0128
    • S. F. Chen and J. T. Goodman, " An empirical study of smoothing techniques for language modeling.," Comput. Speech Lang. 13 (4), pp. 359-394 (1999). 10.1006/csla.1999.0128
    • (1999) Comput. Speech Lang. , vol.13 , Issue.4 , pp. 359-394
    • Chen, S.F.1    Goodman, J.T.2
  • 44
    • 33847610331 scopus 로고    scopus 로고
    • Continuous language models
    • 10.1016/j.csl.2006.09.003
    • H. Schwenk, " Continuous language models.," Comput. Speech Lang. 21 (3), 492-518 (2007). 10.1016/j.csl.2006.09.003
    • (2007) Comput. Speech Lang. , vol.21 , Issue.3 , pp. 492-518
    • Schwenk, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.