메뉴 건너뛰기




Volumn 101, Issue 5, 2013, Pages 1116-1135

Speech-centric information processing: An optimization-oriented approach

Author keywords

Joint optimization; speech recognition; speech centric information processing (SCIP); spoken language translation (SLT); spoken language understanding (SLU); voice search

Indexed keywords

INFORMATION RETRIEVAL; SEARCH ENGINES; SPEECH; SYSTEMS ANALYSIS; TRANSLATION (LANGUAGES);

EID: 84876669905     PISSN: 00189219     EISSN: None     Source Type: Journal    
DOI: 10.1109/JPROC.2012.2236631     Document Type: Article
Times cited : (40)

References (106)
  • 5
    • 0001024110 scopus 로고
    • First-and second-order methods for learning: Between steepest descent and Newton's method
    • R. Battiti, "First-and second-order methods for learning: Between steepest descent and Newton's method," Neural Comput., vol. 4, pp. 141-166, 1992.
    • (1992) Neural Comput. , vol.4 , pp. 141-166
    • Battiti, R.1
  • 6
    • 84972571328 scopus 로고
    • Growth transformations for functions on manifolds
    • L. Baum and G. Sell, "Growth transformations for functions on manifolds," Pacific J. Math., vol. 27, no. 2, pp. 211-227, 1968.
    • (1968) Pacific J. Math. , vol.27 , Issue.2 , pp. 211-227
    • Baum, L.1    Sell, G.2
  • 7
    • 84965063004 scopus 로고
    • An inequality with applications to statistical prediction for functions of Markov processes and to a model of ecology
    • L. Baum and J. Eagon, "An inequality with applications to statistical prediction for functions of Markov processes and to a model of ecology," Bull. Amer. Math. Soc., vol. 73, pp. 360-363, 1967.
    • (1967) Bull. Amer. Math. Soc. , vol.73 , pp. 360-363
    • Baum, L.1    Eagon, J.2
  • 8
    • 70350568323 scopus 로고    scopus 로고
    • Efficient speech translation through confusion network decoding
    • Nov.
    • N. Bertoldi, R. Zens, M. Federico, and W. Shen, "Efficient speech translation through confusion network decoding," IEEE Trans. Audio Speech Lang. Process., vol. 16, no. 8, pp. 1696-1705, Nov. 2008.
    • (2008) IEEE Trans. Audio Speech Lang. Process. , vol.16 , Issue.8 , pp. 1696-1705
    • Bertoldi, N.1    Zens, R.2    Federico, M.3    Shen, W.4
  • 13
    • 85044611587 scopus 로고
    • The mathematics of statistical machine translation: Parameter estimation
    • P. Brown, S. Pietra, V. Pietra, and R. Mercer, "The mathematics of statistical machine translation: Parameter estimation," Comput. Linguist., vol. 19, no. 2, pp. 263-311, 1993.
    • (1993) Comput. Linguist. , vol.19 , Issue.2 , pp. 263-311
    • Brown, P.1    Pietra, S.2    Pietra, V.3    Mercer, R.4
  • 15
    • 85032751967 scopus 로고    scopus 로고
    • Retrieval and browsing of spoken content
    • May
    • C. Chelba, T. Hazen, and M. Saraclar, "Retrieval and browsing of spoken content," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 39-49, May 2008.
    • (2008) IEEE Signal Process. Mag. , vol.25 , Issue.3 , pp. 39-49
    • Chelba, C.1    Hazen, T.2    Saraclar, M.3
  • 16
    • 34249656385 scopus 로고    scopus 로고
    • Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
    • Jan.
    • S. Axelrod, V. Goel, R. Gopinath, P. Olsen, and K. Visweswariah, "Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 1, pp. 172-189, Jan. 2007.
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.1 , pp. 172-189
    • Axelrod, S.1    Goel, V.2    Gopinath, R.3    Olsen, P.4    Visweswariah, K.5
  • 17
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
    • Jan.
    • G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. Audio Speech Lang. Process., vol. 20, no. 1, pp. 30-42, Jan. 2012.
    • (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 20
    • 4243109553 scopus 로고    scopus 로고
    • Challenges in adopting speech recognition
    • Jan.
    • L. Deng and X. Huang, "Challenges in adopting speech recognition," Commun. ACM, vol. 47, no. 1, pp. 11-13, Jan. 2004.
    • (2004) Commun. ACM , vol.47 , Issue.1 , pp. 11-13
    • Deng, L.1    Huang, X.2
  • 22
    • 84867617677 scopus 로고    scopus 로고
    • Front-end, back-end, and hybrid techniques to noise-robust speech recognition
    • D. Kolossa and R. Haeb-Umbach, Eds. New York: Springer-Verlag
    • L. Deng, "Front-end, back-end, and hybrid techniques to noise-robust speech recognition," in Robust Speech Recognition of Uncertain Data, D. Kolossa and R. Haeb-Umbach, Eds. New York: Springer-Verlag, 2011, pp. 67-99.
    • (2011) Robust Speech Recognition of Uncertain Data , pp. 67-99
    • Deng, L.1
  • 23
    • 0003315567 scopus 로고    scopus 로고
    • Numerical methods for unconstrained optimization and nonlinear equations
    • Philadelphia, PA: SIAM
    • J. E. Dennis and R. B. Schnabel, "Numerical methods for unconstrained optimization and nonlinear equations SIAM's Classics in Applied Mathematics. Philadelphia, PA: SIAM, 1996.
    • (1996) SIAM's Classics in Applied Mathematics
    • Dennis, J.E.1    Schnabel, R.B.2
  • 24
    • 33947702149 scopus 로고    scopus 로고
    • Joint discriminative front end and back end training for improved speech recognition accuracy
    • DOI: 10.1109/ICASSP.2006.1660012
    • J. Droppo and A. Acero, "Joint discriminative front end and back end training for improved speech recognition accuracy," in Proc. Int. Conf. Acoust. Speech Signal Process., 2006, DOI: 10.1109/ICASSP.2006.1660012.
    • (2006) Proc. Int. Conf. Acoust. Speech Signal Process.
    • Droppo, J.1    Acero, A.2
  • 25
    • 85009188309 scopus 로고    scopus 로고
    • Conceptual decoding for spoken dialog systems
    • Geneva, Switzerland, Sep. 1-4
    • Y. Est̀eve, C. Raymond, F. Bechet, and R. DeMori, "Conceptual decoding for spoken dialog systems," in Proc. Eurospeech Conf., Geneva, Switzerland, Sep. 1-4, 2003.
    • (2003) Proc. Eurospeech Conf
    • Est̀eve, Y.1    Raymond, C.2    Bechet, F.3    Demori, R.4
  • 27
    • 70450194285 scopus 로고    scopus 로고
    • Role of natural language understanding in voice local search
    • Brighton, U.K., Sep. 6-10
    • J. Feng, S. Bangalore, and M. Gilbert, "Role of natural language understanding in voice local search," in Proc. Interspeech 2009, Brighton, U.K., Sep. 6-10, 2009.
    • (2009) Proc. Interspeech 2009
    • Feng, J.1    Bangalore, S.2    Gilbert, M.3
  • 28
    • 77954590919 scopus 로고    scopus 로고
    • Query parsing in mobile voice search
    • Raleigh, NC, USA, Apr. 26-30
    • J. Feng, "Query parsing in mobile voice search," in Proc. World Wide Web 2010, Raleigh, NC, USA, Apr. 26-30, 2010.
    • (2010) Proc. World Wide Web 2010
    • Feng, J.1
  • 29
    • 85032751148 scopus 로고    scopus 로고
    • Speech and multimodal interaction in mobile search
    • Jul.
    • J. Feng, M. Johnston, and S. Bangalore, "Speech and multimodal interaction in mobile search," IEEE Signal Process. Mag., vol. 28, no. 4, pp. 40-49, Jul. 2011.
    • (2011) IEEE Signal Process. Mag. , vol.28 , Issue.4 , pp. 40-49
    • Feng, J.1    Johnston, M.2    Bangalore, S.3
  • 31
    • 2442562479 scopus 로고    scopus 로고
    • Segmental minimum Bayes-risk decoding for automatic speech recognition
    • May
    • V. Goel, S. Kumar, and W. Byrne, "Segmental minimum Bayes-risk decoding for automatic speech recognition," IEEE Trans. Speech Audio Process., vol. 12, no. 3, pp. 234-249, May 2004.
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.3 , pp. 234-249
    • Goel, V.1    Kumar, S.2    Byrne, W.3
  • 32
    • 0025952278 scopus 로고
    • An inequality for rational functions with applications to some statistical estimation problems
    • Jan.
    • P. Gopalakrishnan, D. Kanevsky, A. Nadas, and D. Nahamoo, "An inequality for rational functions with applications to some statistical estimation problems," IEEE Trans. Inf. Theory, vol. 37, no. 1, pp. 107-113, Jan. 1991.
    • (1991) IEEE Trans. Inf. Theory , vol.37 , Issue.1 , pp. 107-113
    • Gopalakrishnan, P.1    Kanevsky, D.2    Nadas, A.3    Nahamoo, D.4
  • 33
    • 85009119467 scopus 로고    scopus 로고
    • Discriminative speaker adaptation with conditional maximum likelihood linear regression
    • Aalborg, Denmark, Sep. 3-7
    • A. Gunawardana and W. Byrne, "Discriminative speaker adaptation with conditional maximum likelihood linear regression," in Proc. Eurospeech 2001, Aalborg, Denmark, Sep. 3-7, 2001.
    • (2001) Proc. Eurospeech 2001
    • Gunawardana, A.1    Byrne, W.2
  • 36
    • 84861092214 scopus 로고    scopus 로고
    • Impacts of machine translation and speech synthesis on speech-to-speech translation
    • Sep.
    • K. Hashimoto, J. Yamagishi, W. Byrne, S. King, and K. Tokuda, "Impacts of machine translation and speech synthesis on speech-to-speech translation," Speech Commun., vol. 54, no. 7, pp. 857-866, Sep. 2012.
    • (2012) Speech Commun. , vol.54 , Issue.7 , pp. 857-866
    • Hashimoto, K.1    Yamagishi, J.2    Byrne, W.3    King, S.4    Tokuda, K.5
  • 37
    • 85032751114 scopus 로고    scopus 로고
    • Speech recognition, machine translation, and speech translationVA unified discriminative learning paradigm
    • Sep.
    • X. He and L. Deng, "Speech recognition, machine translation, and speech translationVA unified discriminative learning paradigm," IEEE Signal Process. Mag., vol. 28, no. 5, pp. 126-133, Sep. 2011.
    • (2011) IEEE Signal Process. Mag. , vol.28 , Issue.5 , pp. 126-133
    • He, X.1    Deng, L.2
  • 38
    • 84876693434 scopus 로고    scopus 로고
    • Maximum expected BLEU training of phrase and lexicon translation models
    • Jul.
    • X. He and L. Deng, "Maximum expected BLEU training of phrase and lexicon translation models," in Proc. Annu. Meeting Assoc. Comput. Linguist., Jul. 2012, vol. 1, pp. 292-301.
    • (2012) Proc. Annu. Meeting Assoc. Comput. Linguist , vol.1 , pp. 292-301
    • He, X.1    Deng, L.2
  • 39
    • 80051663140 scopus 로고    scopus 로고
    • Why word error rate is not a good metric for speech recognizer training for the speech translation task?"
    • X. He, L. Deng, and A. Acero, "Why word error rate is not a good metric for speech recognizer training for the speech translation task?" in Proc. Int. Conf. Acoust. Speech Signal Process., 2011, pp. 5632-5635.
    • (2011) Proc. Int. Conf. Acoust. Speech Signal Process. , pp. 5632-5635
    • He, X.1    Deng, L.2    Acero, A.3
  • 41
    • 85032750905 scopus 로고    scopus 로고
    • Discriminative learning in sequential pattern recognition
    • Sep.
    • X. He, L. Deng, and W. Chou, "Discriminative learning in sequential pattern recognition," IEEE Signal Process. Mag., vol. 25, no. 5, pp. 14-36, Sep. 2008.
    • (2008) IEEE Signal Process. Mag. , vol.25 , Issue.5 , pp. 14-36
    • He, X.1    Deng, L.2    Chou, W.3
  • 45
    • 84865747510 scopus 로고    scopus 로고
    • Generalized Baum-Welch algorithm and its implication to a new extended Baum-Welch algorithm
    • Florence, Italy, Aug. 27-31
    • R. Hsiao and T. Schultz, "Generalized Baum-Welch algorithm and its implication to a new extended Baum-Welch algorithm," in Proc. Interspeech 2011, Florence, Italy, Aug. 27-31, 2011.
    • (2011) Proc. Interspeech 2011
    • Hsiao, R.1    Schultz, T.2
  • 46
    • 85019175281 scopus 로고    scopus 로고
    • An overview of modern speech recognition
    • 2nd ed. London, U.K.: Chapman & Hall/CRC Press
    • X. Huang and L. Deng, "An overview of modern speech recognition," in Handbook of Natural Language Processing, 2nd ed. London, U.K.: Chapman & Hall/CRC Press, 2010, pp. 339-366.
    • (2010) Handbook of Natural Language Processing , pp. 339-366
    • Huang, X.1    Deng, L.2
  • 48
    • 70450204768 scopus 로고    scopus 로고
    • A voice search approach to replying to SMS messages in automobiles
    • Brighton, U.K., Sep. 6-10
    • Y. Ju and T. Paek, "A voice search approach to replying to SMS messages in automobiles," in Proc. Interspeech 2009, Brighton, U.K., Sep. 6-10, 2009.
    • (2009) Proc. Interspeech 2009
    • Ju, Y.1    Paek, T.2
  • 55
    • 34250709904 scopus 로고    scopus 로고
    • Optimization for discriminative training
    • Lisbon, Portugal, Sep. 4-8
    • J. Le Roux and E. McDermott, "Optimization for discriminative training," in Proc. Interspeech 2005, Lisbon, Portugal, Sep. 4-8, 2005.
    • (2005) Proc. Interspeech 2005
    • Le Roux, J.1    McDermott, E.2
  • 56
    • 78049355806 scopus 로고    scopus 로고
    • Discriminatively estimated joint acoustic, duration and language model for speech recognition
    • M. Lehr and I. Shafran, "Discriminatively estimated joint acoustic, duration and language model for speech recognition," in Proc. Int. Conf. Acoust. Speech Signal Process., 2010, pp. 5542-5545.
    • (2010) Proc. Int. Conf. Acoust. Speech Signal Process. , pp. 5542-5545
    • Lehr, M.1    Shafran, I.2
  • 57
    • 79959859604 scopus 로고    scopus 로고
    • Cross-lingual spoken language understanding from unaligned data using discriminative classification models and machine translation
    • Makuhari, Japan, Sep. 26-30
    • F. Lef̀evre, F. Mairesse, and S. Young, "Cross-lingual spoken language understanding from unaligned data using discriminative classification models and machine translation," in Proc. Interspeech 2010, Makuhari, Japan, Sep. 26-30, 2010.
    • (2010) Proc. Interspeech 2010
    • Lef̀evre, F.1    Mairesse, F.2    Young, S.3
  • 58
    • 85032751176 scopus 로고    scopus 로고
    • Spoken document understanding and organization
    • Sep.
    • L.-S. Lee and B. Chen, "Spoken document understanding and organization," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 42-60, Sep. 2005.
    • (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 42-60
    • Lee, L.-S.1    Chen, B.2
  • 60
    • 84865779292 scopus 로고    scopus 로고
    • Multi-task learning for spoken language understanding with shared slots
    • Florence, Italy, Aug. 27-31
    • X. Li, Y. Wang, and G. Tur, "Multi-task learning for spoken language understanding with shared slots," in Proc. Interspeech 2011, Florence, Italy, Aug. 27-31, 2011.
    • (2011) Proc. Interspeech 2011
    • Li, X.1    Wang, Y.2    Tur, G.3
  • 61
    • 44049107487 scopus 로고    scopus 로고
    • How to access audio files of large databases using in-car speech dialogue systems
    • Antwerp, Belgium
    • S. Mann, A. Berton, and U. Ehrlich, "How to access audio files of large databases using in-car speech dialogue systems," in Proc. Interspeech Conf., Antwerp, Belgium, 2007, pp. 138-141.
    • (2007) Proc. Interspeech Conf , pp. 138-141
    • Mann, S.1    Berton, A.2    Ehrlich, U.3
  • 62
    • 33947615216 scopus 로고    scopus 로고
    • Integrating speech recognition and machine translation: Where do we stand?"
    • DOI: 10.1109/ICASSP.2006.1661501
    • E. Matusov, S. Kanthak, and H. Ney, "Integrating speech recognition and machine translation: Where do we stand?" in Proc. Int. Conf. Acoust. Speech Signal Process., 2006, DOI: 10.1109/ICASSP.2006.1661501.
    • (2006) Proc. Int. Conf. Acoust. Speech Signal Process.
    • Matusov, E.1    Kanthak, S.2    Ney, H.3
  • 63
    • 4544287474 scopus 로고    scopus 로고
    • Minimum classification error training of landmark models for real-time continuous speech recognition
    • E. McDermott and T. Hazen, "Minimum classification error training of landmark models for real-time continuous speech recognition," in Proc. Int. Conf. Acoust. Speech Signal Process., 2006, vol. 1, pp. 937-940.
    • (2006) Proc. Int. Conf. Acoust. Speech Signal Process. , vol.1 , pp. 937-940
    • McDermott, E.1    Hazen, T.2
  • 64
    • 34547522070 scopus 로고    scopus 로고
    • Discriminative training for large vocabulary speech recognition using minimum classification error
    • Jan.
    • E. McDermott, T. Hazen, J. Le Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition using minimum classification error," IEEE Trans. Speech Audio Process., vol. 15, no. 1, pp. 203-223, Jan. 2007.
    • (2007) IEEE Trans. Speech Audio Process. , vol.15 , Issue.1 , pp. 203-223
    • McDermott, E.1    Hazen, T.2    Le Roux, J.3    Nakamura, A.4    Katagiri, S.5
  • 66
    • 0032654483 scopus 로고    scopus 로고
    • Speech translation: Coupling of recognition and translation
    • H. Ney, "Speech translation: Coupling of recognition and translation," in Proc. Int. Conf. Acoust. Speech Signal Process., 1999, vol. 1, pp. 517-520.
    • (1999) Proc. Int. Conf. Acoust. Speech Signal Process. , vol.1 , pp. 517-520
    • Ney, H.1
  • 67
    • 84944098666 scopus 로고    scopus 로고
    • Minimum error rate training in statistical machine translation
    • F. Och, "Minimum error rate training in statistical machine translation," in Proc. Annu. Meeting Assoc. Comput. Linguist., 2003, pp. 160-167.
    • (2003) Proc. Annu. Meeting Assoc. Comput. Linguist. , pp. 160-167
    • Och, F.1
  • 71
    • 80052206377 scopus 로고    scopus 로고
    • Overview of the IWSLT 2010 evaluation campaign
    • Paris, France, Dec. 2-3
    • M. Paul, M. Federico, and S. Stücker, "Overview of the IWSLT 2010 evaluation campaign," in Proc. IWSLT, Paris, France, Dec. 2-3, 2010.
    • (2010) Proc. IWSLT
    • Paul, M.1    Federico, M.2    Stücker, S.3
  • 73
    • 77956541453 scopus 로고    scopus 로고
    • Integration of statistical models for dictation of document translations in a machine-aided human translation task
    • Nov.
    • A. Reddy and R. Rose, "Integration of statistical models for dictation of document translations in a machine-aided human translation task," IEEE Trans. Audio Speech Lang. Process., vol. 18, no. 8, pp. 2015-2027, Nov. 2010.
    • (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , Issue.8 , pp. 2015-2027
    • Reddy, A.1    Rose, R.2
  • 74
    • 84969232669 scopus 로고    scopus 로고
    • Stochastic language models for speech recognition and understanding
    • Sydney, Australia, Nov. 30-Dec. 4
    • G. Riccardi and A. L. Gorin, "Stochastic language models for speech recognition and understanding," in Proc. ICSLP, Sydney, Australia, Nov. 30-Dec. 4, 1998.
    • (1998) Proc. ICSLP
    • Riccardi, G.1    Gorin, A.L.2
  • 75
    • 84943274699 scopus 로고
    • A direct adaptive method for faster back propagation learning: The RPROP algorithm
    • San Francisco, CA
    • M. Riedmiller and H. Braun, "A direct adaptive method for faster back propagation learning: The RPROP algorithm," in Proc. IEEE Int. Conf. Neural Netw., San Francisco, CA, 1993, pp. 586-591.
    • (1993) Proc. IEEE Int. Conf. Neural Netw , pp. 586-591
    • Riedmiller, M.1    Braun, H.2
  • 78
    • 84945900998 scopus 로고    scopus 로고
    • Best practice for convolutional neural networks applied to visual document analysis
    • P. Simard, Y. Steinkraus, and J. Platt, "Best practice for convolutional neural networks applied to visual document analysis," in Proc. Int. Conf. Document Anal. Recognit., 2003, pp. 958-962.
    • (2003) Proc. Int. Conf. Document Anal. Recognit. , pp. 958-962
    • Simard, P.1    Steinkraus, Y.2    Platt, J.3
  • 81
    • 84867605416 scopus 로고    scopus 로고
    • Towards deeper understanding: Deep convex networks for semantic utterance classification
    • Kyoto, Japan, Mar.
    • G. Tur, L. Deng, D. Hakkani-Tür, and X. He, "Towards deeper understanding: Deep convex networks for semantic utterance classification," in Proc. Int. Conf. Acoust. Speech Signal Process., Kyoto, Japan, Mar. 2012, pp. 5045-5048.
    • (2012) Proc. Int. Conf. Acoust. Speech Signal Process , pp. 5045-5048
    • Tur, G.1    Deng, L.2    Hakkani-Tür, D.3    He, X.4
  • 82
    • 0030706648 scopus 로고    scopus 로고
    • Finite-state speech-to-speech translation
    • Munich, Germany
    • E. Vidal, "Finite-state speech-to-speech translation," in Proc. Int. Conf. Acoust. Speech Signal Process., Munich, Germany, 1997, pp. 111-114.
    • (1997) Proc. Int. Conf. Acoust. Speech Signal Process , pp. 111-114
    • Vidal, E.1
  • 83
    • 85032751718 scopus 로고    scopus 로고
    • Spoken language translation
    • May
    • A. Waibel and C. Fugen, "Spoken language translation," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 70-79, May 2008.
    • (2008) IEEE Signal Process. Mag. , vol.25 , Issue.3 , pp. 70-79
    • Waibel, A.1    Fugen, C.2
  • 84
    • 85032751364 scopus 로고    scopus 로고
    • An introduction to voice search
    • May
    • Y. Wang, D. Yu, Y. Ju, and A. Acero, "An introduction to voice search," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 28-38, May 2008.
    • (2008) IEEE Signal Process. Mag. , vol.25 , Issue.3 , pp. 28-38
    • Wang, Y.1    Yu, D.2    Ju, Y.3    Acero, A.4
  • 87
    • 85032753932 scopus 로고    scopus 로고
    • Spoken language understanding
    • Sep.
    • Y. Wang, L. Deng, and A. Acero, "Spoken language understanding," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 16-31, Sep. 2005.
    • (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 16-31
    • Wang, Y.1    Deng, L.2    Acero, A.3
  • 88
    • 84867625339 scopus 로고    scopus 로고
    • Phrase-level transduction model with reordering for spoken to written language transformation
    • P. Xu, P. Fung, and R. Chan, "Phrase-level transduction model with reordering for spoken to written language transformation," in Proc. Int. Conf. Acoust. Speech Signal Process., 2012, pp. 4965-4968.
    • (2012) Proc. Int. Conf. Acoust. Speech Signal Process. , pp. 4965-4968
    • Xu, P.1    Fung, P.2    Chan, R.3
  • 89
    • 66149085249 scopus 로고    scopus 로고
    • An integrative and discriminative technique for spoken utterance classification
    • Aug.
    • S. Yaman, L. Deng, D. Yu, Y. Wang, and A. Acero, "An integrative and discriminative technique for spoken utterance classification," IEEE Trans. Audio Speech Lang. Process., vol. 16, no. 6, pp. 1207-1214, Aug. 2008.
    • (2008) IEEE Trans. Audio Speech Lang. Process. , vol.16 , Issue.6 , pp. 1207-1214
    • Yaman, S.1    Deng, L.2    Yu, D.3    Wang, Y.4    Acero, A.5
  • 90
    • 85032752358 scopus 로고    scopus 로고
    • Cognitive user interfaces
    • May
    • S. Young, "Cognitive user interfaces," IEEE Signal Process. Mag., vol. 27, no. 3, pp. 128-140, May 2010.
    • (2010) IEEE Signal Process. Mag. , vol.27 , Issue.3 , pp. 128-140
    • Young, S.1
  • 91
    • 84876682878 scopus 로고    scopus 로고
    • POMDP-based statistical spoken dialog systems: A review
    • DOI: 10.1109/JPROC.2012.2225812
    • S. Young, M. Gasic, B. Thomson, and J. Williams, "POMDP-based statistical spoken dialog systems: A review," Proc. IEEE, 2013, DOI: 10.1109/JPROC.2012.2225812.
    • (2013) Proc. IEEE
    • Young, S.1    Gasic, M.2    Thomson, B.3    Williams, J.4
  • 92
    • 85032782045 scopus 로고    scopus 로고
    • Deep learning and its applications to signal and information processing
    • Jan.
    • D. Yu and L. Deng, "Deep learning and its applications to signal and information processing," IEEE Signal Process. Mag., vol. 28, no. 1, pp. 145-154, Jan. 2011.
    • (2011) IEEE Signal Process. Mag. , vol.28 , Issue.1 , pp. 145-154
    • Yu, D.1    Deng, L.2
  • 93
    • 44049108531 scopus 로고    scopus 로고
    • Automated directory assistance system: From theory to practice
    • Antwerp, Belgium
    • D. Yu, Y.-C. Ju, Y.-Y. Wang, G. Zweig, and A. Acero, "Automated directory assistance system: From theory to practice," in Proc. Interspeech Conf., Antwerp, Belgium, 2007, pp. 2709-2712.
    • (2007) Proc. Interspeech Conf , pp. 2709-2712
    • Yu, D.1    Ju, Y.-C.2    Wang, Y.-Y.3    Zweig, G.4    Acero, A.5
  • 94
    • 80051607493 scopus 로고    scopus 로고
    • A novel decision function and the associated decision-feedback learning for speech translation
    • Y. Zhang, L. Deng, X. He, and A. Acero, "A novel decision function and the associated decision-feedback learning for speech translation," in Proc. Int. Conf. Acoust. Speech Signal Process., 2011, pp. 5608-5611.
    • (2011) Proc. Int. Conf. Acoust. Speech Signal Process. , pp. 5608-5611
    • Zhang, Y.1    Deng, L.2    He, X.3    Acero, A.4
  • 95
    • 85080849330 scopus 로고    scopus 로고
    • Statistical translation for speech: A perspective on structures and learning
    • B. Zhou, "Statistical translation for speech: A perspective on structures and learning," Proc. IEEE, 2013.
    • (2013) Proc. IEEE
    • Zhou, B.1
  • 101
    • 84872203606 scopus 로고    scopus 로고
    • Comparison and combination of lightly supervised approaches for language portability of a spoken language understanding system
    • B. Jabaian, L. Besacier, and F. Lefevre, "Comparison and combination of lightly supervised approaches for language portability of a spoken language understanding system," IEEE Trans. Audio Speech Lang. Process., vol. 21, no. 3, 2013.
    • IEEE Trans. Audio Speech Lang. Process. , vol.21 , Issue.3 , pp. 2013
    • Jabaian, B.1    Besacier, L.2    Lefevre, F.3
  • 102
  • 103
    • 84874256530 scopus 로고    scopus 로고
    • Use of kernel deep convex networks and end-to-end learning for spoken language understanding
    • Dec.
    • L. Deng, G. Tur, X. He, and D. Hakkani-Tur, "Use of kernel deep convex networks and end-to-end learning for spoken language understanding," Proc. IEEE Workshop Spoken Lang. Technol., Dec. 2012.
    • (2012) Proc. IEEE Workshop Spoken Lang. Technol
    • Deng, L.1    Tur, G.2    He, X.3    Hakkani-Tur, D.4
  • 104
    • 79959812741 scopus 로고    scopus 로고
    • Investigating multiple approaches for SLU portability to a new language
    • B. Jabaian, L. Besacier, and F. Lefevre, "Investigating multiple approaches for SLU portability to a new language," in Proc. Interspeech, 2010.
    • (2010) Proc. Interspeech
    • Jabaian, B.1    Besacier, L.2    Lefevre, F.3
  • 105
    • 80051636817 scopus 로고    scopus 로고
    • Combination of stochastic understanding and machine translation systems for language portability of dialogue systems
    • B. Jabaian, L. Besacier, and F. Lefevre, "Combination of stochastic understanding and machine translation systems for language portability of dialogue systems," in Proc. ICASSP, 2011.
    • (2011) Proc. ICASSP
    • Jabaian, B.1    Besacier, L.2    Lefevre, F.3
  • 106
    • 78049394664 scopus 로고    scopus 로고
    • On the use of machine translation for spoken language understanding portability
    • N. Camelin, C. Raymond, F. Bechet, and R. De Mori, "On the use of machine translation for spoken language understanding portability," in Proc. ICASSP, 2010.
    • (2010) Proc. ICASSP
    • Camelin, N.1    Raymond, C.2    Bechet, F.3    De Mori, R.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.