메뉴 건너뛰기




Volumn 4, Issue 6, 2010, Pages 994-1006

Speech recognition with flat direct models

Author keywords

Direct model; features; log linear model; maximum mutual information (MMI); speech recognition

Indexed keywords

ACOUSTIC DETECTION; AUDIO SIGNAL; DIRECT MODEL; DIRECT MODELING; FEATURES; INHERENT STRUCTURES; KEY PROBLEMS; LINEAR MODELING; LOGLINEAR MODEL; MARKOV ASSUMPTIONS; MARKOV MODEL; MAXIMUM MUTUAL INFORMATION; MUTUAL INFORMATIONS; SENTENCE ERRORS; TEMPLATE-BASED;

EID: 78649280264     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2010.2080812     Document Type: Article
Times cited : (15)

References (25)
  • 4
    • 4544293504 scopus 로고    scopus 로고
    • Moving beyond the 'beads-on-a-string' model of speech
    • M. Ostendorf, "Moving beyond the 'beads-on-a-string' model of speech," in Proc. IEEE ASRU Workshop, 1999, pp. 79-84.
    • (1999) Proc. IEEE ASRU Workshop , pp. 79-84
    • Ostendorf, M.1
  • 5
    • 85009110188 scopus 로고    scopus 로고
    • Learning long-term temporal features in LVCSR using neural networks
    • B. Y. Chen, Q. Zhu, and N. Morgan, "Learning long-term temporal features in LVCSR using neural networks," in Proc. ICSLP, 2004.
    • (2004) Proc. ICSLP
    • Chen, B.Y.1    Zhu, Q.2    Morgan, N.3
  • 6
    • 0032658253 scopus 로고    scopus 로고
    • Temporal patterns (TRAPS) in ASR of noisy speech
    • H. Hermansky and S. Sharma, "Temporal patterns (TRAPS) in ASR of noisy speech," in Proc. ICASSP, 1999, pp. 289-292.
    • (1999) Proc. ICASSP , pp. 289-292
    • Hermansky, H.1    Sharma, S.2
  • 10
    • 78649245809 scopus 로고    scopus 로고
    • [Online] Available
    • [Online]. Available: http://www.tellme.com/you
  • 11
    • 78649250686 scopus 로고    scopus 로고
    • [Online] Available
    • [Online]. Available: http://vlingo.com
  • 12
    • 78649297785 scopus 로고    scopus 로고
    • [Online] Available
    • [Online]. Available: http://www.google.com/mobile/apple/app.html
  • 13
    • 78649256443 scopus 로고    scopus 로고
    • [Online] Available
    • [Online]. Available: http://mobile.yahoo.com/onesearch
  • 14
    • 84946710255 scopus 로고    scopus 로고
    • Maximum entropy direct models for speech recognition
    • H.-K. J. Kuo and Y. Gao, "Maximum entropy direct models for speech recognition," in Proc. ASRU, 2003.
    • (2003) Proc. ASRU
    • Kuo, H.-K.J.1    Gao, Y.2
  • 15
    • 70349208656 scopus 로고    scopus 로고
    • A flat direct model for speech recognition
    • G. Heigold, G. Zweig, X. Li, and P. Nguyen, "A flat direct model for speech recognition," in Proc. ICASSP, 2009, pp. 3861-3864.
    • (2009) Proc. ICASSP , pp. 3861-3864
    • Heigold, G.1    Zweig, G.2    Li, X.3    Nguyen, P.4
  • 16
    • 70450201983 scopus 로고    scopus 로고
    • Maximum mutual information multiphone units in direct modeling
    • G. Zweig and P. Nguyen, "Maximum mutual information multiphone units in direct modeling," in Proc. Interspeech, 2009.
    • (2009) Proc. Interspeech
    • Zweig, G.1    Nguyen, P.2
  • 18
    • 0033887568 scopus 로고    scopus 로고
    • A survey of smoothing techniques for ME models
    • Jan.
    • S. Chen and R. Rosenfeld, "A survey of smoothing techniques for ME models," IEEE Trans. Speech Audio Process., vol. 8, no. 1, pp. 37-50, Jan. 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.1 , pp. 37-50
    • Chen, S.1    Rosenfeld, R.2
  • 19
    • 0004109478 scopus 로고    scopus 로고
    • Rprop\Description and implementation details Univ. of Karlsruhe Jan. 1994
    • M. Reidmiller, Rprop\Description and implementation details Univ. of Karlsruhe, Jan. 1994, Tech. Rep.
    • Tech. Rep
    • Reidmiller, M.1
  • 20
    • 85149106909 scopus 로고    scopus 로고
    • Discriminative language modeling with conditional Random fields and the perceptron algorithm
    • B. Roark, M. Saraclar, M. Collins, and M. Johnson, "Discriminative language modeling with conditional random fields and the perceptron algorithm," in Proc. ACL, 2004.
    • (2004) Proc. ACL
    • Roark, B.1    Saraclar, M.2    Collins, M.3    Johnson, M.4
  • 21
    • 56149117265 scopus 로고    scopus 로고
    • An investigation into a simulation of episodic memory for automatic speech recognition
    • Sep.
    • V. Maier and R. Moore, "An investigation into a simulation of episodic memory for automatic speech recognition," in Proc. Interspeech, Sep. 2005.
    • (2005) Proc. Interspeech
    • Maier, V.1    Moore, R.2
  • 22
    • 0032165145 scopus 로고    scopus 로고
    • A multispan language modeling framework for large vocabulary speech recognition
    • J. R. Bellegarda, "A multispan language modeling framework for large vocabulary speech recognition," IEEE Trans. Speech Audio Process., vol. 6, no. 5, pp. 456-467, 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.5 , pp. 456-467
    • Bellegarda, J.R.1
  • 24
    • 0029725372 scopus 로고    scopus 로고
    • Design of a speech recognition system based on acoustically derived segmental units
    • M. Bacchiani, M. Ostendorf, Y. Sagisaka, and K. Paliwal, "Design of a speech recognition system based on acoustically derived segmental units," in Proc. ICASSP, 1996, pp. 443-446.
    • (1996) Proc. ICASSP , pp. 443-446
    • Bacchiani, M.1    Ostendorf, M.2    Sagisaka, Y.3    Paliwal, K.4
  • 25
    • 0036476255 scopus 로고    scopus 로고
    • Automatic generation of subword units for speech recognition systems
    • Feb.
    • R. Singh, B. Raj, and R. Stern, "Automatic generation of subword units for speech recognition systems," IEEE Trans. Speech and Audio Processing, vol. 10, no. 2, pp. 89-99, Feb. 2002
    • (2002) IEEE Trans. Speech and Audio Processing , vol.10 , Issue.2 , pp. 89-99
    • Singh, R.1    Raj, B.2    Stern, R.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.