-
1
-
-
51449093561
-
Deploying GOOG-411: Early lessons in data, measurement, and testing
-
M. Bacchiani, F. Beaufays, J. Schalkwyk, M. Schuster, and B. Strope, "Deploying GOOG-411: Early lessons in data, measurement, and testing," in Proc. Int. Conf. Acoust. Speech Signal Process., 2008, pp. 5260-5263.
-
(2008)
Proc. Int. Conf. Acoust. Speech Signal Process.
, pp. 5260-5263
-
-
Bacchiani, M.1
Beaufays, F.2
Schalkwyk, J.3
Schuster, M.4
Strope, B.5
-
3
-
-
85032751593
-
Research developments and directions in speech recognition and understandingVPart 1
-
May
-
J. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O'Shaughnessy, "Research developments and directions in speech recognition and understandingVPart 1," IEEE Signal Process. Mag., vol. 26, no. 3, pp. 75-80, May 2009.
-
(2009)
IEEE Signal Process. Mag.
, vol.26
, Issue.3
, pp. 75-80
-
-
Baker, J.1
Deng, L.2
Glass, J.3
Khudanpur, S.4
Lee, C.-H.5
Morgan, N.6
O'Shaughnessy, D.7
-
4
-
-
85032759066
-
Updated MINDS report on speech recognition and understandingVPart 2
-
Jul.
-
J. Baker, L. Deng, S. Khudanpur, C. Lee, J. Glass, N. Morgan, and D. O'Shaughnessy, "Updated MINDS report on speech recognition and understandingVPart 2," IEEE Signal Process. Mag., vol. 26, no. 4, pp. 78-85, Jul. 2009.
-
(2009)
IEEE Signal Process. Mag.
, vol.26
, Issue.4
, pp. 78-85
-
-
Baker, J.1
Deng, L.2
Khudanpur, S.3
Lee, C.4
Glass, J.5
Morgan, N.6
O'Shaughnessy, D.7
-
5
-
-
0001024110
-
First-and second-order methods for learning: Between steepest descent and Newton's method
-
R. Battiti, "First-and second-order methods for learning: Between steepest descent and Newton's method," Neural Comput., vol. 4, pp. 141-166, 1992.
-
(1992)
Neural Comput.
, vol.4
, pp. 141-166
-
-
Battiti, R.1
-
6
-
-
84972571328
-
Growth transformations for functions on manifolds
-
L. Baum and G. Sell, "Growth transformations for functions on manifolds," Pacific J. Math., vol. 27, no. 2, pp. 211-227, 1968.
-
(1968)
Pacific J. Math.
, vol.27
, Issue.2
, pp. 211-227
-
-
Baum, L.1
Sell, G.2
-
7
-
-
84965063004
-
An inequality with applications to statistical prediction for functions of Markov processes and to a model of ecology
-
L. Baum and J. Eagon, "An inequality with applications to statistical prediction for functions of Markov processes and to a model of ecology," Bull. Amer. Math. Soc., vol. 73, pp. 360-363, 1967.
-
(1967)
Bull. Amer. Math. Soc.
, vol.73
, pp. 360-363
-
-
Baum, L.1
Eagon, J.2
-
8
-
-
70350568323
-
Efficient speech translation through confusion network decoding
-
Nov.
-
N. Bertoldi, R. Zens, M. Federico, and W. Shen, "Efficient speech translation through confusion network decoding," IEEE Trans. Audio Speech Lang. Process., vol. 16, no. 8, pp. 1696-1705, Nov. 2008.
-
(2008)
IEEE Trans. Audio Speech Lang. Process.
, vol.16
, Issue.8
, pp. 1696-1705
-
-
Bertoldi, N.1
Zens, R.2
Federico, M.3
Shen, W.4
-
10
-
-
77955936729
-
ConQuest: An open-source dialog system for conferences
-
Human Lang. Technol.
-
D. Bohus, S. G. Puerto, D. Huggins-Daines, V. Keri, G. Krishna, R. Kumar, A. Raux, and S. Tomkoohus, "ConQuest: An open-source dialog system for conferences," in Proc. Conf. North Amer. Chapter Assoc. Comput. Linguist., Human Lang. Technol., 2007, pp. 9-12.
-
(2007)
Proc. Conf. North Amer. Chapter Assoc. Comput. Linguist
, pp. 9-12
-
-
Bohus, D.1
Puerto, S.G.2
Huggins-Daines, D.3
Keri, V.4
Krishna, G.5
Kumar, R.6
Raux, A.7
Tomkoohus, S.8
-
11
-
-
84876690716
-
Multi-scale personalization for voice search applications
-
Human Lang. Technol.
-
D. Bolanos, G. Zweig, and P. Nguyen, "Multi-scale personalization for voice search applications," in Proc. Conf. North Amer. Chapter Assoc. Comput. Linguist., Human Lang. Technol., 2009, pp. 101-104.
-
(2009)
Proc. Conf. North Amer. Chapter Assoc. Comput. Linguist
, pp. 101-104
-
-
Bolanos, D.1
Zweig, G.2
Nguyen, P.3
-
13
-
-
85044611587
-
The mathematics of statistical machine translation: Parameter estimation
-
P. Brown, S. Pietra, V. Pietra, and R. Mercer, "The mathematics of statistical machine translation: Parameter estimation," Comput. Linguist., vol. 19, no. 2, pp. 263-311, 1993.
-
(1993)
Comput. Linguist.
, vol.19
, Issue.2
, pp. 263-311
-
-
Brown, P.1
Pietra, S.2
Pietra, V.3
Mercer, R.4
-
14
-
-
85032751432
-
Recent efforts in spoken language translation
-
May
-
F. Casacuberta, M. Federico, H. Ney, and E. Vidal, "Recent efforts in spoken language translation," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 80-88, May 2008.
-
(2008)
IEEE Signal Process. Mag.
, vol.25
, Issue.3
, pp. 80-88
-
-
Casacuberta, F.1
Federico, M.2
Ney, H.3
Vidal, E.4
-
15
-
-
85032751967
-
Retrieval and browsing of spoken content
-
May
-
C. Chelba, T. Hazen, and M. Saraclar, "Retrieval and browsing of spoken content," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 39-49, May 2008.
-
(2008)
IEEE Signal Process. Mag.
, vol.25
, Issue.3
, pp. 39-49
-
-
Chelba, C.1
Hazen, T.2
Saraclar, M.3
-
16
-
-
34249656385
-
Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
-
Jan.
-
S. Axelrod, V. Goel, R. Gopinath, P. Olsen, and K. Visweswariah, "Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 1, pp. 172-189, Jan. 2007.
-
(2007)
IEEE Trans. Audio Speech Lang. Process.
, vol.15
, Issue.1
, pp. 172-189
-
-
Axelrod, S.1
Goel, V.2
Gopinath, R.3
Olsen, P.4
Visweswariah, K.5
-
17
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
-
Jan.
-
G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. Audio Speech Lang. Process., vol. 20, no. 1, pp. 30-42, Jan. 2012.
-
(2012)
IEEE Trans. Audio Speech Lang. Process.
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
Acero, A.4
-
18
-
-
85032750869
-
Spoken language understanding
-
May
-
R. De Mori, F. Bechet, D. Hakkani-Tur, M. McTear, G. Riccardi, and G. Tur, "Spoken language understanding," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 50-58, May 2008.
-
(2008)
IEEE Signal Process. Mag.
, vol.25
, Issue.3
, pp. 50-58
-
-
De Mori, R.1
Bechet, F.2
Hakkani-Tur, D.3
McTear, M.4
Riccardi, G.5
Tur, G.6
-
20
-
-
4243109553
-
Challenges in adopting speech recognition
-
Jan.
-
L. Deng and X. Huang, "Challenges in adopting speech recognition," Commun. ACM, vol. 47, no. 1, pp. 11-13, Jan. 2004.
-
(2004)
Commun. ACM
, vol.47
, Issue.1
, pp. 11-13
-
-
Deng, L.1
Huang, X.2
-
21
-
-
0036880074
-
Distributed speech processing in MiPad's multimodal user interface
-
Nov.
-
L. Deng, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis, Y. Wang, D. Jacoby, M. Mahajan, C. Chelba, and X. D. Huang, "Distributed speech processing in MiPad's multimodal user interface," IEEE Trans. Audio Speech Process., vol. 10, no. 8, pp. 605-619, Nov. 2002.
-
(2002)
IEEE Trans. Audio Speech Process.
, vol.10
, Issue.8
, pp. 605-619
-
-
Deng, L.1
Wang, K.2
Acero, A.3
Hon, H.4
Droppo, J.5
Boulis, C.6
Wang, Y.7
Jacoby, D.8
Mahajan, M.9
Chelba, C.10
Huang, X.D.11
-
22
-
-
84867617677
-
Front-end, back-end, and hybrid techniques to noise-robust speech recognition
-
D. Kolossa and R. Haeb-Umbach, Eds. New York: Springer-Verlag
-
L. Deng, "Front-end, back-end, and hybrid techniques to noise-robust speech recognition," in Robust Speech Recognition of Uncertain Data, D. Kolossa and R. Haeb-Umbach, Eds. New York: Springer-Verlag, 2011, pp. 67-99.
-
(2011)
Robust Speech Recognition of Uncertain Data
, pp. 67-99
-
-
Deng, L.1
-
23
-
-
0003315567
-
Numerical methods for unconstrained optimization and nonlinear equations
-
Philadelphia, PA: SIAM
-
J. E. Dennis and R. B. Schnabel, "Numerical methods for unconstrained optimization and nonlinear equations SIAM's Classics in Applied Mathematics. Philadelphia, PA: SIAM, 1996.
-
(1996)
SIAM's Classics in Applied Mathematics
-
-
Dennis, J.E.1
Schnabel, R.B.2
-
24
-
-
33947702149
-
Joint discriminative front end and back end training for improved speech recognition accuracy
-
DOI: 10.1109/ICASSP.2006.1660012
-
J. Droppo and A. Acero, "Joint discriminative front end and back end training for improved speech recognition accuracy," in Proc. Int. Conf. Acoust. Speech Signal Process., 2006, DOI: 10.1109/ICASSP.2006.1660012.
-
(2006)
Proc. Int. Conf. Acoust. Speech Signal Process.
-
-
Droppo, J.1
Acero, A.2
-
25
-
-
85009188309
-
Conceptual decoding for spoken dialog systems
-
Geneva, Switzerland, Sep. 1-4
-
Y. Est̀eve, C. Raymond, F. Bechet, and R. DeMori, "Conceptual decoding for spoken dialog systems," in Proc. Eurospeech Conf., Geneva, Switzerland, Sep. 1-4, 2003.
-
(2003)
Proc. Eurospeech Conf
-
-
Est̀eve, Y.1
Raymond, C.2
Bechet, F.3
Demori, R.4
-
26
-
-
0003578240
-
-
Canergie Mellon Univ., Pittsburgh, PA, Tech. Rep.
-
S. E. Fahlman, "An empirical study of learning speed in back-propagation networks," Canergie Mellon Univ., Pittsburgh, PA, Tech. Rep., 1988.
-
(1988)
An Empirical Study of Learning Speed in Back-propagation Networks
-
-
Fahlman, S.E.1
-
27
-
-
70450194285
-
Role of natural language understanding in voice local search
-
Brighton, U.K., Sep. 6-10
-
J. Feng, S. Bangalore, and M. Gilbert, "Role of natural language understanding in voice local search," in Proc. Interspeech 2009, Brighton, U.K., Sep. 6-10, 2009.
-
(2009)
Proc. Interspeech 2009
-
-
Feng, J.1
Bangalore, S.2
Gilbert, M.3
-
28
-
-
77954590919
-
Query parsing in mobile voice search
-
Raleigh, NC, USA, Apr. 26-30
-
J. Feng, "Query parsing in mobile voice search," in Proc. World Wide Web 2010, Raleigh, NC, USA, Apr. 26-30, 2010.
-
(2010)
Proc. World Wide Web 2010
-
-
Feng, J.1
-
29
-
-
85032751148
-
Speech and multimodal interaction in mobile search
-
Jul.
-
J. Feng, M. Johnston, and S. Bangalore, "Speech and multimodal interaction in mobile search," IEEE Signal Process. Mag., vol. 28, no. 4, pp. 40-49, Jul. 2011.
-
(2011)
IEEE Signal Process. Mag.
, vol.28
, Issue.4
, pp. 40-49
-
-
Feng, J.1
Johnston, M.2
Bangalore, S.3
-
31
-
-
2442562479
-
Segmental minimum Bayes-risk decoding for automatic speech recognition
-
May
-
V. Goel, S. Kumar, and W. Byrne, "Segmental minimum Bayes-risk decoding for automatic speech recognition," IEEE Trans. Speech Audio Process., vol. 12, no. 3, pp. 234-249, May 2004.
-
(2004)
IEEE Trans. Speech Audio Process.
, vol.12
, Issue.3
, pp. 234-249
-
-
Goel, V.1
Kumar, S.2
Byrne, W.3
-
32
-
-
0025952278
-
An inequality for rational functions with applications to some statistical estimation problems
-
Jan.
-
P. Gopalakrishnan, D. Kanevsky, A. Nadas, and D. Nahamoo, "An inequality for rational functions with applications to some statistical estimation problems," IEEE Trans. Inf. Theory, vol. 37, no. 1, pp. 107-113, Jan. 1991.
-
(1991)
IEEE Trans. Inf. Theory
, vol.37
, Issue.1
, pp. 107-113
-
-
Gopalakrishnan, P.1
Kanevsky, D.2
Nadas, A.3
Nahamoo, D.4
-
33
-
-
85009119467
-
Discriminative speaker adaptation with conditional maximum likelihood linear regression
-
Aalborg, Denmark, Sep. 3-7
-
A. Gunawardana and W. Byrne, "Discriminative speaker adaptation with conditional maximum likelihood linear regression," in Proc. Eurospeech 2001, Aalborg, Denmark, Sep. 3-7, 2001.
-
(2001)
Proc. Eurospeech 2001
-
-
Gunawardana, A.1
Byrne, W.2
-
34
-
-
79957695367
-
Comparing stochastic approaches to spoken language understanding in multiple languages
-
Aug.
-
S. Hahn, M. Dinarelli, C. Raymond, F. Lef̀evre, P. Lehnen, R. De Mori, H. Ney, and G. Riccardi, "Comparing stochastic approaches to spoken language understanding in multiple languages," IEEE Trans. Audio Speech Lang. Process., vol. 19, no. 6, pp. 1569-1583, Aug. 2011.
-
(2011)
IEEE Trans. Audio Speech Lang. Process.
, vol.19
, Issue.6
, pp. 1569-1583
-
-
Hahn, S.1
Dinarelli, M.2
Raymond, C.3
Lef̀evre, F.4
Lehnen, P.5
De Mori, R.6
Ney, H.7
Riccardi, G.8
-
36
-
-
84861092214
-
Impacts of machine translation and speech synthesis on speech-to-speech translation
-
Sep.
-
K. Hashimoto, J. Yamagishi, W. Byrne, S. King, and K. Tokuda, "Impacts of machine translation and speech synthesis on speech-to-speech translation," Speech Commun., vol. 54, no. 7, pp. 857-866, Sep. 2012.
-
(2012)
Speech Commun.
, vol.54
, Issue.7
, pp. 857-866
-
-
Hashimoto, K.1
Yamagishi, J.2
Byrne, W.3
King, S.4
Tokuda, K.5
-
37
-
-
85032751114
-
Speech recognition, machine translation, and speech translationVA unified discriminative learning paradigm
-
Sep.
-
X. He and L. Deng, "Speech recognition, machine translation, and speech translationVA unified discriminative learning paradigm," IEEE Signal Process. Mag., vol. 28, no. 5, pp. 126-133, Sep. 2011.
-
(2011)
IEEE Signal Process. Mag.
, vol.28
, Issue.5
, pp. 126-133
-
-
He, X.1
Deng, L.2
-
38
-
-
84876693434
-
Maximum expected BLEU training of phrase and lexicon translation models
-
Jul.
-
X. He and L. Deng, "Maximum expected BLEU training of phrase and lexicon translation models," in Proc. Annu. Meeting Assoc. Comput. Linguist., Jul. 2012, vol. 1, pp. 292-301.
-
(2012)
Proc. Annu. Meeting Assoc. Comput. Linguist
, vol.1
, pp. 292-301
-
-
He, X.1
Deng, L.2
-
39
-
-
80051663140
-
Why word error rate is not a good metric for speech recognizer training for the speech translation task?"
-
X. He, L. Deng, and A. Acero, "Why word error rate is not a good metric for speech recognizer training for the speech translation task?" in Proc. Int. Conf. Acoust. Speech Signal Process., 2011, pp. 5632-5635.
-
(2011)
Proc. Int. Conf. Acoust. Speech Signal Process.
, pp. 5632-5635
-
-
He, X.1
Deng, L.2
Acero, A.3
-
40
-
-
84876671702
-
The MSR system for IWSLT 2011 evaluation
-
San Francisco, CA, USA, Dec. 8-9
-
X. He, A. Axelrod, L. Deng, A. Acero, M. Hwang, A. Nguyen, A. Wang, and X. Huang, "The MSR system for IWSLT 2011 evaluation," in Proc. IWSLT, San Francisco, CA, USA, Dec. 8-9, 2011.
-
(2011)
Proc. IWSLT
-
-
He, X.1
Axelrod, A.2
Deng, L.3
Acero, A.4
Hwang, M.5
Nguyen, A.6
Wang, A.7
Huang, X.8
-
41
-
-
85032750905
-
Discriminative learning in sequential pattern recognition
-
Sep.
-
X. He, L. Deng, and W. Chou, "Discriminative learning in sequential pattern recognition," IEEE Signal Process. Mag., vol. 25, no. 5, pp. 14-36, Sep. 2008.
-
(2008)
IEEE Signal Process. Mag.
, vol.25
, Issue.5
, pp. 14-36
-
-
He, X.1
Deng, L.2
Chou, W.3
-
42
-
-
78649262962
-
Margin-based discriminative training for string recognition
-
Dec.
-
G. Heigold, P. Dreuw, S. Hahn, R. Schlüter, and H. Ney, "Margin-based discriminative training for string recognition," IEEE J. Sel. Top. Signal Process., vol. 4, no. 6, pp. 917-925, Dec. 2010.
-
(2010)
IEEE J. Sel. Top. Signal Process.
, vol.4
, Issue.6
, pp. 917-925
-
-
Heigold, G.1
Dreuw, P.2
Hahn, S.3
Schlüter, R.4
Ney, H.5
-
43
-
-
85008035419
-
Equivalence of generative and log-linear models
-
Jul.
-
G. Heigold, H. Ney, P. Lehnen, T. Gass, and R. Schluter, "Equivalence of generative and log-linear models," IEEE Trans. Audio Speech Lang. Process., vol. 19, no. 5, pp. 1138-1148, Jul. 2011.
-
(2011)
IEEE Trans. Audio Speech Lang. Process.
, vol.19
, Issue.5
, pp. 1138-1148
-
-
Heigold, G.1
Ney, H.2
Lehnen, P.3
Gass, T.4
Schluter, R.5
-
44
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
Nov.
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, Nov. 2012.
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
45
-
-
84865747510
-
Generalized Baum-Welch algorithm and its implication to a new extended Baum-Welch algorithm
-
Florence, Italy, Aug. 27-31
-
R. Hsiao and T. Schultz, "Generalized Baum-Welch algorithm and its implication to a new extended Baum-Welch algorithm," in Proc. Interspeech 2011, Florence, Italy, Aug. 27-31, 2011.
-
(2011)
Proc. Interspeech 2011
-
-
Hsiao, R.1
Schultz, T.2
-
46
-
-
85019175281
-
An overview of modern speech recognition
-
2nd ed. London, U.K.: Chapman & Hall/CRC Press
-
X. Huang and L. Deng, "An overview of modern speech recognition," in Handbook of Natural Language Processing, 2nd ed. London, U.K.: Chapman & Hall/CRC Press, 2010, pp. 339-366.
-
(2010)
Handbook of Natural Language Processing
, pp. 339-366
-
-
Huang, X.1
Deng, L.2
-
48
-
-
70450204768
-
A voice search approach to replying to SMS messages in automobiles
-
Brighton, U.K., Sep. 6-10
-
Y. Ju and T. Paek, "A voice search approach to replying to SMS messages in automobiles," in Proc. Interspeech 2009, Brighton, U.K., Sep. 6-10, 2009.
-
(2009)
Proc. Interspeech 2009
-
-
Ju, Y.1
Paek, T.2
-
50
-
-
80051622448
-
A-functions: A generalization of extended Baum-Welch transformations to convex optimization
-
D. Kanevsky, D. Nahamoo, T. Sainath, B. Ramabhadran, and P. Olsen, "A-functions: A generalization of extended Baum-Welch transformations to convex optimization," in Proc. Int. Conf. Acoust. Speech Signal Process., 2011, pp. 5164-5167.
-
(2011)
Proc. Int. Conf. Acoust. Speech Signal Process.
, pp. 5164-5167
-
-
Kanevsky, D.1
Nahamoo, D.2
Sainath, T.3
Ramabhadran, B.4
Olsen, P.5
-
51
-
-
84865788268
-
Convergence of line search A-function methods
-
Florence, Italy, Aug. 27-31
-
D. Kanevsky, D. Nahamoo, T. Sainath, and B. Ramabhadran, "Convergence of line search A-function methods," in Proc. Interspeech 2011, Florence, Italy, Aug. 27-31, 2011.
-
(2011)
Proc. Interspeech 2011
-
-
Kanevsky, D.1
Nahamoo, D.2
Sainath, T.3
Ramabhadran, B.4
-
52
-
-
85118138826
-
Statistical phrase-based translation
-
P. Koehn, F. Och, and D. Marcu, "Statistical phrase-based translation," in Proc. Conf. North Amer. Chapter Assoc. Comput. Linguist., Human Lang. Technol., 2003, vol. 1, pp. 48-54.
-
(2003)
Proc. Conf. North Amer. Chapter Assoc. Comput. Linguist., Human Lang. Technol.
, vol.1
, pp. 48-54
-
-
Koehn, P.1
Och, F.2
Marcu, D.3
-
54
-
-
43849085316
-
Spoken language translation
-
Madrid, Spain
-
S. Krauwer, D. Arnold, W. Kasper, M. Rayner, and H. Somers, "Spoken language translation," in Proc. ACL Spoken Lang. Transl. Workshop, Madrid, Spain, 1997, pp. 1-5.
-
(1997)
Proc. ACL Spoken Lang. Transl. Workshop
, pp. 1-5
-
-
Krauwer, S.1
Arnold, D.2
Kasper, W.3
Rayner, M.4
Somers, H.5
-
55
-
-
34250709904
-
Optimization for discriminative training
-
Lisbon, Portugal, Sep. 4-8
-
J. Le Roux and E. McDermott, "Optimization for discriminative training," in Proc. Interspeech 2005, Lisbon, Portugal, Sep. 4-8, 2005.
-
(2005)
Proc. Interspeech 2005
-
-
Le Roux, J.1
McDermott, E.2
-
56
-
-
78049355806
-
Discriminatively estimated joint acoustic, duration and language model for speech recognition
-
M. Lehr and I. Shafran, "Discriminatively estimated joint acoustic, duration and language model for speech recognition," in Proc. Int. Conf. Acoust. Speech Signal Process., 2010, pp. 5542-5545.
-
(2010)
Proc. Int. Conf. Acoust. Speech Signal Process.
, pp. 5542-5545
-
-
Lehr, M.1
Shafran, I.2
-
57
-
-
79959859604
-
Cross-lingual spoken language understanding from unaligned data using discriminative classification models and machine translation
-
Makuhari, Japan, Sep. 26-30
-
F. Lef̀evre, F. Mairesse, and S. Young, "Cross-lingual spoken language understanding from unaligned data using discriminative classification models and machine translation," in Proc. Interspeech 2010, Makuhari, Japan, Sep. 26-30, 2010.
-
(2010)
Proc. Interspeech 2010
-
-
Lef̀evre, F.1
Mairesse, F.2
Young, S.3
-
58
-
-
85032751176
-
Spoken document understanding and organization
-
Sep.
-
L.-S. Lee and B. Chen, "Spoken document understanding and organization," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 42-60, Sep. 2005.
-
(2005)
IEEE Signal Process. Mag.
, vol.22
, Issue.5
, pp. 42-60
-
-
Lee, L.-S.1
Chen, B.2
-
59
-
-
84867608088
-
End-to-end speech recognition accuracy metric for voice search tasks
-
M. Levit, S. Chang, B. Buntschuh, and N. Kibre, "End-to-end speech recognition accuracy metric for voice search tasks," in Proc. Int. Conf. Acoust. Speech Signal Process., 2012, pp. 5141-5144.
-
(2012)
Proc. Int. Conf. Acoust. Speech Signal Process.
, pp. 5141-5144
-
-
Levit, M.1
Chang, S.2
Buntschuh, B.3
Kibre, N.4
-
60
-
-
84865779292
-
Multi-task learning for spoken language understanding with shared slots
-
Florence, Italy, Aug. 27-31
-
X. Li, Y. Wang, and G. Tur, "Multi-task learning for spoken language understanding with shared slots," in Proc. Interspeech 2011, Florence, Italy, Aug. 27-31, 2011.
-
(2011)
Proc. Interspeech 2011
-
-
Li, X.1
Wang, Y.2
Tur, G.3
-
61
-
-
44049107487
-
How to access audio files of large databases using in-car speech dialogue systems
-
Antwerp, Belgium
-
S. Mann, A. Berton, and U. Ehrlich, "How to access audio files of large databases using in-car speech dialogue systems," in Proc. Interspeech Conf., Antwerp, Belgium, 2007, pp. 138-141.
-
(2007)
Proc. Interspeech Conf
, pp. 138-141
-
-
Mann, S.1
Berton, A.2
Ehrlich, U.3
-
62
-
-
33947615216
-
Integrating speech recognition and machine translation: Where do we stand?"
-
DOI: 10.1109/ICASSP.2006.1661501
-
E. Matusov, S. Kanthak, and H. Ney, "Integrating speech recognition and machine translation: Where do we stand?" in Proc. Int. Conf. Acoust. Speech Signal Process., 2006, DOI: 10.1109/ICASSP.2006.1661501.
-
(2006)
Proc. Int. Conf. Acoust. Speech Signal Process.
-
-
Matusov, E.1
Kanthak, S.2
Ney, H.3
-
63
-
-
4544287474
-
Minimum classification error training of landmark models for real-time continuous speech recognition
-
E. McDermott and T. Hazen, "Minimum classification error training of landmark models for real-time continuous speech recognition," in Proc. Int. Conf. Acoust. Speech Signal Process., 2006, vol. 1, pp. 937-940.
-
(2006)
Proc. Int. Conf. Acoust. Speech Signal Process.
, vol.1
, pp. 937-940
-
-
McDermott, E.1
Hazen, T.2
-
64
-
-
34547522070
-
Discriminative training for large vocabulary speech recognition using minimum classification error
-
Jan.
-
E. McDermott, T. Hazen, J. Le Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition using minimum classification error," IEEE Trans. Speech Audio Process., vol. 15, no. 1, pp. 203-223, Jan. 2007.
-
(2007)
IEEE Trans. Speech Audio Process.
, vol.15
, Issue.1
, pp. 203-223
-
-
McDermott, E.1
Hazen, T.2
Le Roux, J.3
Nakamura, A.4
Katagiri, S.5
-
65
-
-
33751057590
-
The ATR multilingual speech-to-speech translation system
-
Mar.
-
S. Nakamura, K. Markov, H. Nakaiwa, G. Kikui, H. Kawai, T. Jitsuhiro, J. Zhang, H. Yamamoto, E. Sumita, and S. Yamamoto, "The ATR multilingual speech-to-speech translation system," IEEE Trans. Audio Speech Lang. Process., vol. 14, no. 2, pp. 365-376, Mar. 2006.
-
(2006)
IEEE Trans. Audio Speech Lang. Process.
, vol.14
, Issue.2
, pp. 365-376
-
-
Nakamura, S.1
Markov, K.2
Nakaiwa, H.3
Kikui, G.4
Kawai, H.5
Jitsuhiro, T.6
Zhang, J.7
Yamamoto, H.8
Sumita, E.9
Yamamoto, S.10
-
66
-
-
0032654483
-
Speech translation: Coupling of recognition and translation
-
H. Ney, "Speech translation: Coupling of recognition and translation," in Proc. Int. Conf. Acoust. Speech Signal Process., 1999, vol. 1, pp. 517-520.
-
(1999)
Proc. Int. Conf. Acoust. Speech Signal Process.
, vol.1
, pp. 517-520
-
-
Ney, H.1
-
67
-
-
84944098666
-
Minimum error rate training in statistical machine translation
-
F. Och, "Minimum error rate training in statistical machine translation," in Proc. Annu. Meeting Assoc. Comput. Linguist., 2003, pp. 160-167.
-
(2003)
Proc. Annu. Meeting Assoc. Comput. Linguist.
, pp. 160-167
-
-
Och, F.1
-
69
-
-
85032751513
-
Speech segmentation and spoken document processing
-
May
-
M. Ostendorf, B. Favre, R. Grishman, D. Hakkani-Tur, M. Harper, D. Hillard, J. Hirschberg, J. Heng, J. Kahn, L. Yang, S. Maskey, E. Matusov, H. Ney, A. Rosenberg, E. Shriberg, W. Wen, and C. Woofers, "Speech segmentation and spoken document processing," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 59-69, May 2008.
-
(2008)
IEEE Signal Process. Mag.
, vol.25
, Issue.3
, pp. 59-69
-
-
Ostendorf, M.1
Favre, B.2
Grishman, R.3
Hakkani-Tur, D.4
Harper, M.5
Hillard, D.6
Hirschberg, J.7
Heng, J.8
Kahn, J.9
Yang, L.10
Maskey, S.11
Matusov, E.12
Ney, H.13
Rosenberg, A.14
Shriberg, E.15
Wen, W.16
Woofers, C.17
-
70
-
-
85133336275
-
BLEU: A method for automatic evaluation of machine translation
-
K. Papineni, S. Roukos, T. Ward, and W. Zhu, "BLEU: A method for automatic evaluation of machine translation," in Proc. Annu. Meeting Assoc. Comput. Linguist., 2002, pp. 311-318.
-
(2002)
Proc. Annu. Meeting Assoc. Comput. Linguist.
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.4
-
71
-
-
80052206377
-
Overview of the IWSLT 2010 evaluation campaign
-
Paris, France, Dec. 2-3
-
M. Paul, M. Federico, and S. Stücker, "Overview of the IWSLT 2010 evaluation campaign," in Proc. IWSLT, Paris, France, Dec. 2-3, 2010.
-
(2010)
Proc. IWSLT
-
-
Paul, M.1
Federico, M.2
Stücker, S.3
-
72
-
-
4544265717
-
-
Ph.D. dissertation, Cambridge Univ. Eng. Dept., Cambridge Univ., Cambridge, U.K.
-
D. Povey, "Discriminative training for large vocabulary speech recognition," Ph.D. dissertation, Cambridge Univ. Eng. Dept., Cambridge Univ., Cambridge, U.K., 2004.
-
(2004)
Discriminative Training for Large Vocabulary Speech Recognition
-
-
Povey, D.1
-
73
-
-
77956541453
-
Integration of statistical models for dictation of document translations in a machine-aided human translation task
-
Nov.
-
A. Reddy and R. Rose, "Integration of statistical models for dictation of document translations in a machine-aided human translation task," IEEE Trans. Audio Speech Lang. Process., vol. 18, no. 8, pp. 2015-2027, Nov. 2010.
-
(2010)
IEEE Trans. Audio Speech Lang. Process.
, vol.18
, Issue.8
, pp. 2015-2027
-
-
Reddy, A.1
Rose, R.2
-
74
-
-
84969232669
-
Stochastic language models for speech recognition and understanding
-
Sydney, Australia, Nov. 30-Dec. 4
-
G. Riccardi and A. L. Gorin, "Stochastic language models for speech recognition and understanding," in Proc. ICSLP, Sydney, Australia, Nov. 30-Dec. 4, 1998.
-
(1998)
Proc. ICSLP
-
-
Riccardi, G.1
Gorin, A.L.2
-
75
-
-
84943274699
-
A direct adaptive method for faster back propagation learning: The RPROP algorithm
-
San Francisco, CA
-
M. Riedmiller and H. Braun, "A direct adaptive method for faster back propagation learning: The RPROP algorithm," in Proc. IEEE Int. Conf. Neural Netw., San Francisco, CA, 1993, pp. 586-591.
-
(1993)
Proc. IEEE Int. Conf. Neural Netw
, pp. 586-591
-
-
Riedmiller, M.1
Braun, H.2
-
76
-
-
0003459132
-
-
Ph.D. dissertation, Electr. Comput. Eng. Dept., McGill Univ., Montreal, QC, Canada
-
Y. Normandin, "Hidden Markov models, maximum mutual information estimation, and the speech recognition problem," Ph.D. dissertation, Electr. Comput. Eng. Dept., McGill Univ., Montreal, QC, Canada, 1991.
-
(1991)
Hidden Markov Models, Maximum Mutual Information Estimation, and the Speech Recognition Problem
-
-
Normandin, Y.1
-
77
-
-
85032751602
-
In-car media search
-
Jul.
-
M. Seltzer, Y. Ju, I. Tashev, Y. Wang, and D. Yu, "In-car media search," IEEE Signal Process. Mag., vol. 28, no. 4, pp. 50-60, Jul. 2011.
-
(2011)
IEEE Signal Process. Mag.
, vol.28
, Issue.4
, pp. 50-60
-
-
Seltzer, M.1
Ju, Y.2
Tashev, I.3
Wang, Y.4
Yu, D.5
-
78
-
-
84945900998
-
Best practice for convolutional neural networks applied to visual document analysis
-
P. Simard, Y. Steinkraus, and J. Platt, "Best practice for convolutional neural networks applied to visual document analysis," in Proc. Int. Conf. Document Anal. Recognit., 2003, pp. 958-962.
-
(2003)
Proc. Int. Conf. Document Anal. Recognit.
, pp. 958-962
-
-
Simard, P.1
Steinkraus, Y.2
Platt, J.3
-
79
-
-
70349200804
-
Voice search of structured media data
-
Y. Song, Y.-Y. Wang, Y. Ju, M. Seltzer, I. Tashev, and A. Acero, "Voice search of structured media data," in Proc. Int. Conf. Acoust. Speech Signal Process., 2009, pp. 3941-3944.
-
(2009)
Proc. Int. Conf. Acoust. Speech Signal Process.
, pp. 3941-3944
-
-
Song, Y.1
Wang, Y.-Y.2
Ju, Y.3
Seltzer, M.4
Tashev, I.5
Acero, A.6
-
80
-
-
84886667968
-
Intent determination and spoken utterance classification
-
Tur and De Mori, Eds. New York: Wiley
-
G. Tur and L. Deng, "Intent determination and spoken utterance classification," in Spoken Language Understanding: Systems for Extracting Semantic Information from Speech, Tur and De Mori, Eds. New York: Wiley, 2011, pp. 81-104.
-
(2011)
Spoken Language Understanding: Systems for Extracting Semantic Information from Speech
, pp. 81-104
-
-
Tur, G.1
Deng, L.2
-
81
-
-
84867605416
-
Towards deeper understanding: Deep convex networks for semantic utterance classification
-
Kyoto, Japan, Mar.
-
G. Tur, L. Deng, D. Hakkani-Tür, and X. He, "Towards deeper understanding: Deep convex networks for semantic utterance classification," in Proc. Int. Conf. Acoust. Speech Signal Process., Kyoto, Japan, Mar. 2012, pp. 5045-5048.
-
(2012)
Proc. Int. Conf. Acoust. Speech Signal Process
, pp. 5045-5048
-
-
Tur, G.1
Deng, L.2
Hakkani-Tür, D.3
He, X.4
-
82
-
-
0030706648
-
Finite-state speech-to-speech translation
-
Munich, Germany
-
E. Vidal, "Finite-state speech-to-speech translation," in Proc. Int. Conf. Acoust. Speech Signal Process., Munich, Germany, 1997, pp. 111-114.
-
(1997)
Proc. Int. Conf. Acoust. Speech Signal Process
, pp. 111-114
-
-
Vidal, E.1
-
83
-
-
85032751718
-
Spoken language translation
-
May
-
A. Waibel and C. Fugen, "Spoken language translation," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 70-79, May 2008.
-
(2008)
IEEE Signal Process. Mag.
, vol.25
, Issue.3
, pp. 70-79
-
-
Waibel, A.1
Fugen, C.2
-
84
-
-
85032751364
-
An introduction to voice search
-
May
-
Y. Wang, D. Yu, Y. Ju, and A. Acero, "An introduction to voice search," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 28-38, May 2008.
-
(2008)
IEEE Signal Process. Mag.
, vol.25
, Issue.3
, pp. 28-38
-
-
Wang, Y.1
Yu, D.2
Ju, Y.3
Acero, A.4
-
85
-
-
84886703610
-
Semantic frame based spoken language understanding
-
Tur and De Mori, Eds. New York: Wiley
-
Y. Wang, L. Deng, and A. Acero, "Semantic frame based spoken language understanding," in Spoken Language Understanding: Systems for Extracting Semantic Information from Speech, Tur and De Mori, Eds. New York: Wiley, 2011, pp. 35-80.
-
(2011)
Spoken Language Understanding: Systems for Extracting Semantic Information from Speech
, pp. 35-80
-
-
Wang, Y.1
Deng, L.2
Acero, A.3
-
86
-
-
84946714447
-
Is word error rate a good indicator for spoken language understanding accuracy
-
Y. Wang, A. Acero, and C. Chelba, "Is word error rate a good indicator for spoken language understanding accuracy," in Proc. IEEE Workshop Autom. Speech Recognit. Understand., 2003, pp. 577-582.
-
(2003)
Proc. IEEE Workshop Autom. Speech Recognit. Understand.
, pp. 577-582
-
-
Wang, Y.1
Acero, A.2
Chelba, C.3
-
87
-
-
85032753932
-
Spoken language understanding
-
Sep.
-
Y. Wang, L. Deng, and A. Acero, "Spoken language understanding," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 16-31, Sep. 2005.
-
(2005)
IEEE Signal Process. Mag.
, vol.22
, Issue.5
, pp. 16-31
-
-
Wang, Y.1
Deng, L.2
Acero, A.3
-
88
-
-
84867625339
-
Phrase-level transduction model with reordering for spoken to written language transformation
-
P. Xu, P. Fung, and R. Chan, "Phrase-level transduction model with reordering for spoken to written language transformation," in Proc. Int. Conf. Acoust. Speech Signal Process., 2012, pp. 4965-4968.
-
(2012)
Proc. Int. Conf. Acoust. Speech Signal Process.
, pp. 4965-4968
-
-
Xu, P.1
Fung, P.2
Chan, R.3
-
89
-
-
66149085249
-
An integrative and discriminative technique for spoken utterance classification
-
Aug.
-
S. Yaman, L. Deng, D. Yu, Y. Wang, and A. Acero, "An integrative and discriminative technique for spoken utterance classification," IEEE Trans. Audio Speech Lang. Process., vol. 16, no. 6, pp. 1207-1214, Aug. 2008.
-
(2008)
IEEE Trans. Audio Speech Lang. Process.
, vol.16
, Issue.6
, pp. 1207-1214
-
-
Yaman, S.1
Deng, L.2
Yu, D.3
Wang, Y.4
Acero, A.5
-
90
-
-
85032752358
-
Cognitive user interfaces
-
May
-
S. Young, "Cognitive user interfaces," IEEE Signal Process. Mag., vol. 27, no. 3, pp. 128-140, May 2010.
-
(2010)
IEEE Signal Process. Mag.
, vol.27
, Issue.3
, pp. 128-140
-
-
Young, S.1
-
91
-
-
84876682878
-
POMDP-based statistical spoken dialog systems: A review
-
DOI: 10.1109/JPROC.2012.2225812
-
S. Young, M. Gasic, B. Thomson, and J. Williams, "POMDP-based statistical spoken dialog systems: A review," Proc. IEEE, 2013, DOI: 10.1109/JPROC.2012.2225812.
-
(2013)
Proc. IEEE
-
-
Young, S.1
Gasic, M.2
Thomson, B.3
Williams, J.4
-
92
-
-
85032782045
-
Deep learning and its applications to signal and information processing
-
Jan.
-
D. Yu and L. Deng, "Deep learning and its applications to signal and information processing," IEEE Signal Process. Mag., vol. 28, no. 1, pp. 145-154, Jan. 2011.
-
(2011)
IEEE Signal Process. Mag.
, vol.28
, Issue.1
, pp. 145-154
-
-
Yu, D.1
Deng, L.2
-
93
-
-
44049108531
-
Automated directory assistance system: From theory to practice
-
Antwerp, Belgium
-
D. Yu, Y.-C. Ju, Y.-Y. Wang, G. Zweig, and A. Acero, "Automated directory assistance system: From theory to practice," in Proc. Interspeech Conf., Antwerp, Belgium, 2007, pp. 2709-2712.
-
(2007)
Proc. Interspeech Conf
, pp. 2709-2712
-
-
Yu, D.1
Ju, Y.-C.2
Wang, Y.-Y.3
Zweig, G.4
Acero, A.5
-
94
-
-
80051607493
-
A novel decision function and the associated decision-feedback learning for speech translation
-
Y. Zhang, L. Deng, X. He, and A. Acero, "A novel decision function and the associated decision-feedback learning for speech translation," in Proc. Int. Conf. Acoust. Speech Signal Process., 2011, pp. 5608-5611.
-
(2011)
Proc. Int. Conf. Acoust. Speech Signal Process.
, pp. 5608-5611
-
-
Zhang, Y.1
Deng, L.2
He, X.3
Acero, A.4
-
95
-
-
85080849330
-
Statistical translation for speech: A perspective on structures and learning
-
B. Zhou, "Statistical translation for speech: A perspective on structures and learning," Proc. IEEE, 2013.
-
(2013)
Proc. IEEE
-
-
Zhou, B.1
-
96
-
-
34547554364
-
On efficient coupling of ASR and SMT for speech translation
-
B. Zhou, L. Besacier, and Y. Gao, "On efficient coupling of ASR and SMT for speech translation," in Proc. Int. Conf. Acoust. Speech Signal Process., 2007, vol. IV, pp. 101-104.
-
(2007)
Proc. Int. Conf. Acoust. Speech Signal Process.
, vol.4
, pp. 101-104
-
-
Zhou, B.1
Besacier, L.2
Gao, Y.3
-
97
-
-
40549085885
-
The voice rate dialog system for consumer ratings
-
G. Zweig, P. Nguyen, Y.-C. Ju, Y.-Y. Wang, D. Yu, and A. Acero, "The voice rate dialog system for consumer ratings," in Proc. Interspeech Conf., 2007, pp. 2713-2716.
-
(2007)
Proc. Interspeech Conf.
, pp. 2713-2716
-
-
Zweig, G.1
Nguyen, P.2
Ju, Y.-C.3
Wang, Y.-Y.4
Yu, D.5
Acero, A.6
-
99
-
-
84863365416
-
11, 001 new features for statistical machine translation
-
D. Chiang, K. Knight, and W. Wang, "11, 001 new features for statistical machine translation," in Proc. of Conf. North Amer. Chapter Assoc. Comput. Linguist., Human Lang. Technol., 2009, pp. 218-226.
-
(2009)
Proc. of Conf. North Amer. Chapter Assoc. Comput. Linguist., Human Lang. Technol.
, pp. 218-226
-
-
Chiang, D.1
Knight, K.2
Wang, W.3
-
100
-
-
80053255482
-
An end-to-end discriminative approach to machine translation
-
P. Liang, A. Bouchard-Cote, D. Klein, and B. Taskar, "An end-to-end discriminative approach to machine translation," in Proc. Conf. Comput. Linguist./Assoc. Comput. Linguist., 2006, pp. 761-768.
-
(2006)
Proc. Conf. Comput. Linguist./Assoc. Comput. Linguist.
, pp. 761-768
-
-
Liang, P.1
Bouchard-Cote, A.2
Klein, D.3
Taskar, B.4
-
101
-
-
84872203606
-
Comparison and combination of lightly supervised approaches for language portability of a spoken language understanding system
-
B. Jabaian, L. Besacier, and F. Lefevre, "Comparison and combination of lightly supervised approaches for language portability of a spoken language understanding system," IEEE Trans. Audio Speech Lang. Process., vol. 21, no. 3, 2013.
-
IEEE Trans. Audio Speech Lang. Process.
, vol.21
, Issue.3
, pp. 2013
-
-
Jabaian, B.1
Besacier, L.2
Lefevre, F.3
-
102
-
-
85009070292
-
Large-vocabulary speech recognition under adverse acoustic environments
-
L. Deng, A. Acero, M. Plumpe, and X. D. Huang, "Large-vocabulary speech recognition under adverse acoustic environments," Proc. Int. Conf. Spoken Lang., pp. 806-809.
-
Proc. Int. Conf. Spoken Lang.
, pp. 806-809
-
-
Deng, L.1
Acero, A.2
Plumpe, M.3
Huang, X.D.4
-
103
-
-
84874256530
-
Use of kernel deep convex networks and end-to-end learning for spoken language understanding
-
Dec.
-
L. Deng, G. Tur, X. He, and D. Hakkani-Tur, "Use of kernel deep convex networks and end-to-end learning for spoken language understanding," Proc. IEEE Workshop Spoken Lang. Technol., Dec. 2012.
-
(2012)
Proc. IEEE Workshop Spoken Lang. Technol
-
-
Deng, L.1
Tur, G.2
He, X.3
Hakkani-Tur, D.4
-
104
-
-
79959812741
-
Investigating multiple approaches for SLU portability to a new language
-
B. Jabaian, L. Besacier, and F. Lefevre, "Investigating multiple approaches for SLU portability to a new language," in Proc. Interspeech, 2010.
-
(2010)
Proc. Interspeech
-
-
Jabaian, B.1
Besacier, L.2
Lefevre, F.3
-
105
-
-
80051636817
-
Combination of stochastic understanding and machine translation systems for language portability of dialogue systems
-
B. Jabaian, L. Besacier, and F. Lefevre, "Combination of stochastic understanding and machine translation systems for language portability of dialogue systems," in Proc. ICASSP, 2011.
-
(2011)
Proc. ICASSP
-
-
Jabaian, B.1
Besacier, L.2
Lefevre, F.3
-
106
-
-
78049394664
-
On the use of machine translation for spoken language understanding portability
-
N. Camelin, C. Raymond, F. Bechet, and R. De Mori, "On the use of machine translation for spoken language understanding portability," in Proc. ICASSP, 2010.
-
(2010)
Proc. ICASSP
-
-
Camelin, N.1
Raymond, C.2
Bechet, F.3
De Mori, R.4
|