SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 21, Issue 11, 2013, Pages 2231-2243

Optimization algorithms and applications for speech and language processing

(6) Wright, Stephen J a Kanevsky, Dimitri b Deng, Li c He, Xiaodong c Heigold, Georg d Li, Haizhou e

a University of Wisconsin (United States)

b IBM RESEARCH (United States)

c MICROSOFT RESEARCH (United States)

d GOOGLE INC (United States)

e INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

Author keywords

Natural language processing; Optimization methods; Speech processing

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; COMPUTATIONAL PROBLEM; NATURAL LANGUAGE PROCESSING; OPTIMIZATION ALGORITHMS; OPTIMIZATION FORMULATIONS; OPTIMIZATION METHOD; OPTIMIZATION PROCEDURES; OPTIMIZATION TECHNIQUES;

NATURAL LANGUAGE PROCESSING SYSTEMS; OPTIMIZATION; SPEECH PROCESSING;

ALGORITHMS;

EID: 84887037596 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2013.2283777 Document Type: Article

Times cited : (24)

References (97)

1
- 0000342467
- Statistical inference for probabilistic functions of finite state Markov chains
- L. E. Baum and T. Petrie, "Statistical inference for probabilistic functions of finite state Markov chains," Ann. Math. Statist., vol. 37, no. 6, pp. 1554-1563, 1966.
- (1966) Ann. Math. Statist. , vol.37 , Issue.6 , pp. 1554-1563
- Baum, L.E.¹ Petrie, T.²

2
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. Leggetter and P.Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput., Speech, Lang., vol. 9, pp. 171-185, 1995.
- (1995) Comput., Speech, Lang. , vol.9 , pp. 171-185
- Leggetter, C.¹ Woodland, P.²

3
- 67149108139
- NY, USA: Wiley
- R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. New York, NY, USA: Wiley, 2001.
- (2001) Pattern Classification. New York
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

4
- 0036461035
- Large scale discriminative training of hidden markov models for speech recognition
- P. C. Woodland and D. Povey, "Large scale discriminative training of hidden markov models for speech recognition," Comput. Speech, Lang., pp. 25-47, 2002.
- (2002) Comput. Speech, Lang. , pp. 25-47
- Woodland, P.C.¹ Povey, D.²

5
- 4544265717
- Ph.D. dissertation, University of Cambridge, Cambridge,U.K.
- D. Povey, "Discriminative training for large vocabulary speech recognition," Ph.D. dissertation, University of Cambridge, Cambridge,U.K., 2003.
- (2003) Discriminative Training for Large Vocabulary Speech Recognition
- Povey, D.¹

6
- 4243117872
- New York, NY, USA: Marcel Dekker
- L. Deng and D. O'Shaughnessy, Speech Processing-A Dynamic and Optimization-Oriented Approach. New York, NY, USA: Marcel Dekker, 2003.
- (2003) Speech Processing-A Dynamic and Optimization-Oriented Approach
- Deng, L.¹ O'shaughnessy, D.²

7
- 85009216465
- A comparative study on maximum entropy and discriminative training for acoustic modeling in automatic speech recognition
- W.Macherey and H. Ney, "A comparative study on maximum entropy and discriminative training for acoustic modeling in automatic speech recognition," in Proc. Eurospeech, 2003, pp. 493-496.
- (2003) Proc. Eurospeech , pp. 493-496
- Macherey, W.¹ Ney, H.²

8
- 33947618431
- Hidden conditional random fields for phone classification
- Sep.
- A. Gunawardana, M. Mahajan, A. Acero, and J. C. Platt, "Hidden conditional random fields for phone classification," in Interspeech, Sep. 2005.
- (2005) Interspeech
- Gunawardana, A.¹ Mahajan, M.² Acero, A.³ Platt, J.C.⁴

9
- 34249656385
- Discriminative estimation of subspace constrained gaussian mixture models for speech recognition
- Jan.
- S.Axelrod, V. Goel, R.A.Gopinath, P. A. Olsen, andK.Visweswariah, "Discriminative estimation of subspace constrained gaussian mixture models for speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 172-189, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 172-189
- Axelrod, S.¹ Goel, V.² Gopinath, R.A.³ Olsen, P.A.⁴ Visweswariah, K.⁵

10
- 85032750905
- Discriminative learning in sequential pattern recognition-A unifying review for optimization-oriented speech recognition
- Sep.
- X. He, L. Deng, and W. Chou, "Discriminative learning in sequential pattern recognition-a unifying review for optimization-oriented speech recognition," IEEE Signal Process. Mag., vol. 25, no. 5, pp. 14-36, Sep. 2008.
- (2008) IEEE Signal Process. Mag. , vol.25 , Issue.5 , pp. 14-36
- He, X.¹ Deng, L.² Chou, W.³

11
- 81555228871
- Exemplar-based sparse representation features for speech recognition
- T. Sainath, B. Ramabhadran, D. Nahamoo, D. Kanevsky, and A. Sethy, "Exemplar-based sparse representation features for speech recognition," in Proc. Interspeech, 2010.
- (2010) Proc. Interspeech
- Sainath, T.¹ Ramabhadran, B.² Nahamoo, D.³ Kanevsky, D.⁴ Sethy, A.⁵

12
- 84886829539
- Optimization techniques to improve training speech of deep belief networks for large speech tasks
- Nov.
- T. Sainath, B. Kingsbury, H. Soltau, and B. Ramabhadran, "Optimization techniques to improve training speech of deep belief networks for large speech tasks," IEEE Trans. Audio, Speech, Lang. Process., Spec. Iss. Large-Scale Optimization for Audio, Speech, Lang. Process., vol. 21, no. 11, Nov. 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Process., Spec. Iss. Large-Scale Optimization for Audio, Speech, Lang. Process. , vol.21 , Issue.11
- Sainath, T.¹ Kingsbury, B.² Soltau, H.³ Ramabhadran, B.⁴

13
- 80051618443
- EM-style optimization of hidden conditional random fields for grapheme-to-phoneme conversion
- G. Heigold, S. Hahn, P. Lehnen, and H. Ney, "EM-style optimization of hidden conditional random fields for grapheme-to-phoneme conversion," in Proc. ICASSP, 2011, pp. 4920-4923.
- (2011) Proc. ICASSP , pp. 4920-4923
- Heigold, G.¹ Hahn, S.² Lehnen, P.³ Ney, H.⁴

14
- 84877743396
- Optimizing the performance of spoken language recognition with discriminative training
- Aug.
- V. Hautamäki, K. A. Lee, T. Kinnunen, B.Ma, and H. Li, "Optimizing the performance of spoken language recognition with discriminative training," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 8, pp. 1622-1631, Aug. 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , Issue.8 , pp. 1622-1631
- Hautamäki, V.¹ Lee, K.A.² Kinnunen, T.³ Ma, B.⁴ Li, H.⁵

15
- 0012352869
- Carnegie Mellon Univ., Tech. Rep. 738
- T. P. Minka, "Algorithms for maximum-likelihood logistic regression," Carnegie Mellon Univ., Tech. Rep. 738, 2001.
- (2001) Algorithms for Maximum-likelihood Logistic Regression
- Minka, T.P.¹

16
- 29044444825
- Support vector machines for speaker and language recognition
- DOI 10.1016/j.csl.2005.06.003, PII S0885230805000318, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
- W. Campbell, J. Campbell, D. Reynolds, E. Singer, and P. Torres-Carrasquillo, "Support vector machines for speaker and language recognition," Computer Speech Lang., pp. 210-229, Apr. 2006. (Pubitemid 41787537)
- (2006) Computer Speech and Language , vol.20 , Issue.2-3 SPEC. ISSUE , pp. 210-229
- Campbell, W.M.¹ Campbell, J.P.² Reynolds, D.A.³ Singer, E.⁴ Torres-Carrasquillo, P.A.⁵

17
- 84055217796
- Bayesian sensing hidden Markov models
- Jan.
- G. Saon and J.-T. Chien, "Bayesian sensing hidden Markov models," IEEE Trans. Audio, Speech Lang. Process., vol. 20, no. 1, pp. 43-54, Jan. 2012.
- (2012) IEEE Trans. Audio, Speech Lang. Process. , vol.20 , Issue.1 , pp. 43-54
- Saon, G.¹ Chien, J.-T.²

18
- 0000159105
- On adaptive decision rules and decision parameter adaptation for automatic speech recognition
- Aug.
- C. H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol. 88, no. 8, pp. 1241-1269, Aug. 2000.
- (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1241-1269
- Lee, C.H.¹ Huo, Q.²

19
- 84887091716
- Bayesian approaches to acousticmodeling: A Review
- S.Watanabe and A. Nakamura, "Bayesian approaches to acousticmodeling: A Review," APSIPA Trans. Signal Inf. Process., vol. 1, 2012.
- (2012) APSIPA Trans. Signal Inf. Process. , vol.1
- Watanabe, S.¹ Nakamura, A.²

20
- 85032751865
- A geometric perspective of large-margin training of Gaussian models
- Nov.
- L. Xiao and L.Deng, "A geometric perspective of large-margin training of Gaussian models," IEEE Signal Process. Mag., vol. 27, no. 6, pp. 118-123, Nov. 2010.
- (2010) IEEE Signal Process. Mag. , vol.27 , Issue.6 , pp. 118-123
- Xiao, L.¹ Deng, L.²

21
- 51449090596
- A convex optimization method for joint mean and variance parameter estimation of large-margin CDHMM
- T.-H. Chang, Z.-Q. Luo, L. Deng, and C.-Y. Chi, "A convex optimization method for joint mean and variance parameter estimation of large-margin CDHMM," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2008, pp. 4053-4056.
- (2008) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 4053-4056
- Chang, T.-H.¹ Luo, Z.-Q.² Deng, L.³ Chi, C.-Y.⁴

22
- 84876669905
- Speech-centric information processing: An optimization-oriented approach
- May
- X. He and L. Deng, "Speech-centric information processing: An optimization-oriented approach," Proc. IEEE, vol. 101, no. 5, pp. 1116-1135, May 2013.
- (2013) Proc. IEEE , vol.101 , Issue.5 , pp. 1116-1135
- He, X.¹ Deng, L.²

23
- 0003982971
- 2nd ed. ed. New York, NY, USA: Springer
- J. Nocedal and S. J. Wright, Numerical Optimization, 2nd ed. ed. New York, NY, USA: Springer, 2006.
- (2006) Numerical Optimization
- Nocedal, J.¹ Wright, S.J.²

24
- 43949084890
- U.K.: Cambridge Univ. Press
- S. P. Boyd and L. Vandenberghe, Convex optimization. Cambridge, U.K.: Cambridge Univ. Press, 2004.
- (2004) Convex Optimization. Cambridge
- Boyd, S.P.¹ Vandenberghe, L.²

25
- 85049776636
- Ann Arbor, MI, USA: Optimization Software
- B. T. Polyak, Introduction to Optimization. Ann Arbor, MI, USA: Optimization Software, 1987.
- (1987) Introduction to Optimization
- Polyak, B.T.¹

26
- 80053182852
- Trust region-based optimization for maximum mutual information estimation of hmms in speech recognition
- Nov.
- C. Liu, Y. Hu, L.-R. Dai, and H. Jiang, "Trust region-based optimization for maximum mutual information estimation of hmms in speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2474-2485, Nov. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.8 , pp. 2474-2485
- Liu, C.¹ Hu, Y.² Dai, L.-R.³ Jiang, H.⁴

27
- 77955783938
- Error approximation and minimum phone error acoustic model estimation
- Aug.
- M. Gibson and T. Hain, "Error approximation and minimum phone error acoustic model estimation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1269-1279, Aug. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1269-1279
- Gibson, M.¹ Hain, T.²

28
- 0004014502
- CarnegieMellonUniv., Tech. Rep. CMUCS-99-108
- S. Chen and R. Rosenfeld, A Gaussian prior for smoothing maximum entropymodels Comput. Sci.Dept., CarnegieMellonUniv., Tech. Rep. CMUCS-99-108, 1999.
- (1999) A Gaussian Prior for Smoothing Maximum Entropymodels Comput. Sci.Dept.
- Chen, S.¹ Rosenfeld, R.²

29
- 51449120120
- BoostedMMI for model and feature-space discriminative training
- Las Vegas, NV, USA, Apr.
- D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "BoostedMMI for model and feature-space discriminative training," in Proc. IEEE Int. Conf. Acoustic, Speech, Signal Process., Las Vegas, NV, USA, Apr. 2008, pp. 4057-4060.
- (2008) Proc. IEEE Int. Conf. Acoustic, Speech, Signal Process. , pp. 4057-4060
- Povey, D.¹ Kanevsky, D.² Kingsbury, B.³ Ramabhadran, B.⁴ Saon, G.⁵ Visweswariah, K.⁶

30
- 80051640064
- Ph.D. dissertation, RWTH Aachen Univ., Aachen, Germany
- G. Heigold, "A log-linear discriminative modeling framework for speech recognition," Ph.D. dissertation, RWTH Aachen Univ., Aachen, Germany, 2010.
- (2010) A Log-linear Discriminative Modeling Framework for Speech Recognition
- Heigold, G.¹

31
- 0003845417
- The present status of automatic translation of languages
- Y. Bar-Hillel, "The present status of automatic translation of languages," Adv. Comput., pp. 158-163, 1960.
- (1960) Adv. Comput. , pp. 158-163
- Bar-Hillel, Y.¹

32
- 85044611587
- The mathematics of statistical machine translation: Parameter estimation
- P. Brown, S. Pietra, V. Pietra, and R.Mercer, "The mathematics of statistical machine translation: Parameter estimation," Comput. Linguist., vol. 19, no. 2, pp. 263-311, 1993.
- (1993) Comput. Linguist. , vol.19 , Issue.2 , pp. 263-311
- Brown, P.¹ Pietra, S.² Pietra, V.³ Mercer, R.⁴

33
- 85118138826
- Statistical phrase-based translation
- P. Koehn, F. Och, and D.Marcu, "Statistical phrase-based translation," in Proc. HLT-NAACL, 2003.
- (2003) Proc. HLT-NAACL
- Koehn, P.¹ Och, F.² Marcu, D.³

34
- 84944098666
- Minimum error rate training in statistical machine translation
- F. Och, "Minimum error rate training in statistical machine translation," in Proc. ACL, 2003.
- (2003) Proc. ACL
- Och, F.¹

35
- 85032751114
- Speech recognition, machine translation, and speech translation-A unified discriminative learning paradigm
- Sep.
- X. He, L. Deng, and W. Chou, "Speech recognition, machine translation, and speech translation-a unified discriminative learning paradigm," IEEE Signal Process. Mag., vol. 28, no. 5, pp. 126-133, Sep. 2011.
- (2011) IEEE Signal Process. Mag. , vol.28 , Issue.5 , pp. 126-133
- He, X.¹ Deng, L.² Chou, W.³

36
- 80053214216
- A maximum-entropy segmentation model for statistical machine translation
- Nov.
- D. Xiong, M. Zhang, and H. Li, "A maximum-entropy segmentation model for statistical machine translation," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2494-2505, Nov. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.8 , pp. 2494-2505
- Xiong, D.¹ Zhang, M.² Li, H.³

37
- 84988221520
- Exploiting morphology and local word reordering in english-to-turkish phrase-based statistical machine translation
- Aug.
- I. D. El-Kahlout and K. Oflazer, "Exploiting morphology and local word reordering in english-to-turkish phrase-based statistical machine translation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1313-1322, Aug. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1313-1322
- El-Kahlout, I.D.¹ Oflazer, K.²

38
- 84876693434
- Maximum expected bleu training of phrase and lexicon translation models
- X. He and L. Deng, "Maximum expected bleu training of phrase and lexicon translation models," in Proc. ACL, Assoc. Comput. Linguist., 2012.
- (2012) Proc. ACL, Assoc. Comput. Linguist.
- He, X.¹ Deng, L.²

39
- 85133336275
- Bleu: A method for evaluation of machine translation
- K. Papineni, S. Roukos, T. Ward, and W. Zhu, "Bleu: a method for evaluation of machine translation," inProc. 40th Annu. Meeting Assoc. Comput. Linguist., 2002, pp. 311-318.
- (2002) InProc. 40th Annu. Meeting Assoc. Comput. Linguist. , pp. 311-318
- Papineni, K.¹ Roukos, S.² Ward, T.³ Zhu, W.⁴

40
- 70350125882
- An overview of text-independent speaker recognition: From features to supervectors
- T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: from features to supervectors," Speech Commun., vol. 52, no. 1, pp. 12-40, 2010.
- (2010) Speech Commun. , vol.52 , Issue.1 , pp. 12-40
- Kinnunen, T.¹ Li, H.²

41
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- DOI 10.1006/dspr.1999.0361
- D. Reynolds, T. Quatieri, and R. Dunn, "Speaker verification using adapted gaussian mixture models," Digital Signal Process., vol. 10, no. 1, pp. 19-41, Jan. 2000. (Pubitemid 30592166)
- (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

42
- 58349106697
- A study of inter-speaker variability in speaker verification
- Jul.
- P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel, "A study of inter-speaker variability in speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 5, pp. 980-988, Jul. 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.5 , pp. 980-988
- Kenny, P.¹ Ouellet, P.² Dehak, N.³ Gupta, V.⁴ Dumouchel, P.⁵

43
- 79953277529
- Using discrete probabilities with bhattacharyya measure for svm-based speaker verification
- May
- K. A. Lee, C. H. You, H. Li, T. Kinnunen, and K. C. Sim, "Using discrete probabilities with bhattacharyya measure for svm-based speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 4, pp. 861-870, May 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.4 , pp. 861-870
- Lee, K.A.¹ You, C.H.² Li, H.³ Kinnunen, T.⁴ Sim, K.C.⁵

44
- 79951609039
- Front-end factor analysis for speaker verification
- May
- N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 4, pp. 788-798, May 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

45
- 84876676725
- Spoken language recognition: From fundamentals to practice
- May
- H. Li, B. Ma, and K. A. Lee, "Spoken language recognition: From fundamentals to practice," Proc. IEEE, vol. 101, no. 5, pp. 1136-1159, May 2013.
- (2013) Proc. IEEE , vol.101 , Issue.5 , pp. 1136-1159
- Li, H.¹ Ma, B.² Lee, K.A.³

46
- 84887109920
- Vector-based spoken language classification
- J. Benesty, M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer
- H. Li, B. Ma, and C.-H. Lee, "Vector-based spoken language classification," in Springer Handbook of Speech Processing, J. Benesty, M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer, 2007.
- (2007) Springer Handbook of Speech Processing
- Li, H.¹ Ma, B.² Lee, C.-H.³

47
- 34547502608
- A vector space modeling approach to spoken language identification
- Jan.
- H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 271-284, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 271-284
- Li, H.¹ Ma, B.² Lee, C.-H.³

48
- 29044433376
- Application-independent evaluation of speaker detection
- DOI 10.1016/j.csl.2005.08.001, PII S0885230805000483, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
- N. Brümmer and J. Preez, "Application-independent evaluation of speaker detection," Comput. Speech Lang., vol. 20, no. 2, pp. 230-275, 2006. (Pubitemid 41787538)
- (2006) Computer Speech and Language , vol.20 , Issue.2-3 SPEC. ISSUE , pp. 230-275
- Brummer, N.¹ Du Preez, J.²

49
- 36248952139
- An introduction to application independent evaluation of speaker recognition systems
- R. Müller, Ed. New York, NY, USA: Springer
- D. A. van Leeuwen and N. Brümmer, "An introduction to application independent evaluation of speaker recognition systems," in Speaker Classification, Lecture Notes in Computer Science/Artificial Intelligence, R. Müller, Ed. New York, NY, USA: Springer, 2007, vol. 4343.
- (2007) Speaker Classification, Lecture Notes in Computer Science/Artificial Intelligence , vol.4343
- Van Leeuwen, D.A.¹ Brümmer, N.²

50
- 85032751399
- TechWare: Speaker and spoken language recognition resources
- Nov.
- H. Li and B.Ma, "TechWare: Speaker and spoken language recognition resources," IEEE Signal Process. Mag., vol. 27, no. 6, pp. 139-142, Nov. 2010.
- (2010) IEEE Signal Process. Mag. , vol.27 , Issue.6 , pp. 139-142
- Li, H.¹ Ma, B.²

51
- 85084012091
- NIST 2007 language recognition evaluation
- A. F. Martin and A. N. Le, "NIST 2007 language recognition evaluation," in Proc. Odyssey: Speaker Lang. Recogn. Workshop, 2008, p. 016.
- (2008) Proc. Odyssey: Speaker Lang. Recogn. Workshop , pp. 016
- Martin, A.F.¹ Le, A.N.²

52
- 37649031157
- The current state of language recognition: NIST 2005 evaluation results
- A. F. Martin and A. N. Le, "The current state of language recognition: NIST 2005 evaluation results," in Proc. Odyssey: Speaker Lang. Recogn. Workshop, 2006, pp. 1-6.
- (2006) Proc. Odyssey: Speaker Lang. Recogn. Workshop , pp. 1-6
- Martin, A.F.¹ Le, A.N.²

53
- 84969216997
- NIST speech processing evaluations: Lvcsr, speaker recognition, language recognition
- A. F. Martin and J. S. Garofolo, "NIST speech processing evaluations: Lvcsr, speaker recognition, language recognition," in Proc. IEEE Workshop Signal Process. Applicat. Public Security Forensics, 2007, pp. 1-7.
- (2007) Proc. IEEE Workshop Signal Process. Applicat. Public Security Forensics , pp. 1-7
- Martin, A.F.¹ Garofolo, J.S.²

54
- 85073106909
- NIST 2009 language recognition evaluation
- A. F. Martin and C. Greenberg, "NIST 2009 language recognition evaluation," in Proc. Odyssey: Speaker Lang. Recogn.Workshop, 2010, pp. 165-171.
- (2010) Proc. Odyssey: Speaker Lang. Recogn.Workshop , pp. 165-171
- Martin, A.F.¹ Greenberg, C.²

55
- 70350444555
- Optimizing the performance of spoken language recognition with discriminative training
- Nov.
- D. Zhu, H. Li, B. Ma, and C. H. Lee, "Optimizing the performance of spoken language recognition with discriminative training," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 8, pp. 1642-1653, Nov. 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.8 , pp. 1642-1653
- Zhu, D.¹ Li, H.² Ma, B.³ Lee, C.H.⁴

56
- 0031139839
- Minimum classification error rate methods for speech recognition
- PII S1063667697035937
- B.-H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 257-265, May 1997. (Pubitemid 127745998)
- (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.3 , pp. 257-265
- Juang, B.-H.¹ Chou, W.² Lee, C.-H.³

57
- 84887071028
- N. Brümmer, "Focal bilinear: Tools for detector fusion and calibration, with use of side-information," [Online]. Available: https://sites.google. com/site/nikobrummer/focalbilinear.
- Focal Bilinear: Tools for Detector Fusion and Calibration, with Use of Side-information
- Brümmer, N.¹

58
- 80052047297
- Ph.D. dissertation, Stellenbosch Univ., Stellenbosch, South Africa
- N. Brümmer, "Measuring, refining and calibrating speaker and language information extracted from speech," Ph.D. dissertation, Stellenbosch Univ., Stellenbosch, South Africa, 2010.
- (2010) Measuring, Refining and Calibrating Speaker and Language Information Extracted from Speech
- Brümmer, N.¹

59
- 42749108057
- On calibration of language recognition scores
- N. Brümmer and D. Leeuwen, "On calibration of language recognition scores," in Proc. IEEE Odyssey: Speaker Lang. Recogn. Workshop, 2006, pp. 1-8.
- (2006) Proc. IEEE Odyssey: Speaker Lang. Recogn. Workshop , pp. 1-8
- Brümmer, N.¹ Leeuwen, D.²

60
- 0025952278
- An inequality for rational functions with applications to some statistical estimation problems
- Jan.
- P. S. Gopalakrishnan, D. Kanevsky, D. Nahamoo, and A. Nadas, "An inequality for rational functions with applications to some statistical estimation problems," IEEE Trans. Inf. Theory, vol. 37, no. 1, pp. 107-113, Jan. 1991.
- (1991) IEEE Trans. Inf. Theory , vol.37 , Issue.1 , pp. 107-113
- Gopalakrishnan, P.S.¹ Kanevsky, D.² Nahamoo, D.³ Nadas, A.⁴

61
- 0026372945
- An improvedMMIE training algorithmfor speaker-independent, small vocabulary, continuous speech recognition
- Y. Normandin, "An improvedMMIE training algorithmfor speaker-independent, small vocabulary, continuous speech recognition," in Proc. ICASSP, 1991, pp. 537-540.
- (1991) Proc. ICASSP , pp. 537-540
- Normandin, Y.¹

62
- 4544265717
- Ph.D. dissertation, Univ. of Cambridge, Cambridge, U.K.
- D. Povey, "Discriminative training for large vocabulary speech recognition," Ph.D. dissertation, Univ. of Cambridge, Cambridge, U.K., 2003.
- (2003) Discriminative Training for Large Vocabulary Speech Recognition
- Povey, D.¹

63
- 44849142532
- Extended Baum transformations for general functions, II
- D. Kanevsky, "Extended Baum transformations for general functions, II," Human Language Technol., IBM, Tech. Rep. RC23645(W0506-120), 2005.
- (2005) Human Language Technol., IBM, Tech. Rep. RC23645(W0506-120)
- Kanevsky, D.¹

64
- 2142684272
- On reversing Jensen's inequality
- T. Jebara, "On reversing Jensen's inequality," in Proc. NIPS, 2002.
- (2002) Proc. NIPS
- Jebara, T.¹

65
- 34249656385
- Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
- Jan.
- S. Axelrod, V. Goel, P. Gopinath, R. Olsen, and K. Visweswariah, "Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 172-189, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 172-189
- Axelrod, S.¹ Goel, V.² Gopinath, P.³ Olsen, R.⁴ Visweswariah, K.⁵

66
- 70349995255
- Generalization of extended Baum-Welch parameter estimation for discriminative training and decoding
- D. Kanevsky, T. Sainath, B. Ramabhadran, and D. Nahamoo, "Generalization of extended Baum-Welch parameter estimation for discriminative training and decoding," in Proc. Interspeech, 2008.
- (2008) Proc. Interspeech
- Kanevsky, D.¹ Sainath, T.² Ramabhadran, B.³ Nahamoo, D.⁴

67
- 80051622448
- A-Functions: A generalization of extended Baum-Welch transformations to convex optimization
- D. Kanevsky, D. Nahamoo, T. N. Sainath, B. Ramabhadran, and P. A. Olsen, "A-Functions: A generalization of extended Baum-Welch transformations to convex optimization," in Proc. ICASSP, 2011, pp. 5164-5167.
- (2011) Proc. ICASSP , pp. 5164-5167
- Kanevsky, D.¹ Nahamoo, D.² Sainath, T.N.³ Ramabhadran, B.⁴ Olsen, P.A.⁵

68
- 0035342391
- Comparison of discriminative training criteria and optimization methods for speech recognition
- DOI 10.1016/S0167-6393(00)00035-2, PII S0167639300000352
- R. Schlüter, W. Macherey, B. Müller, and H. Ney, "Comparison of discriminative training criteria and optimization methods for speech recognition," Speech Commun., pp. 287-310, 2001. (Pubitemid 32284868)
- (2001) Speech Communication , vol.34 , Issue.3 , pp. 287-310
- Schluter, R.¹ Macherey, W.² Muller, B.³ Ney, H.⁴

69
- 34547530690
- Constrained line search optimization for discriminative training in speech recognition
- C. Liu, P. Liu, H. Jiang, F. Soong, and R. Wang, "Constrained Line Search Optimization for Discriminative Training in Speech Recognition," in Proc. ICASSP, 2007, pp. 329-332.
- (2007) Proc. ICASSP , pp. 329-332
- Liu, C.¹ Liu, P.² Jiang, H.³ Soong, F.⁴ Wang, R.⁵

70
- 84865747510
- Generalized Baum-Welch algorithm and its application to new extended Baum-Welch algorithm
- DR. Hsiao and T. Schultz, "Generalized Baum-Welch algorithm and its application to new extended Baum-Welch algorithm," in Proc. Interspeech, 2011.
- (2011) Proc. Interspeech
- Hsiao, D.R.¹ Schultz, T.²

71
- 48849083725
- Extended Baum-Welch reestimation of Gaussian mixture models based on reverse Jensen inequality
- Lisbon, Portugal, Sep.
- M. Afify, "Extended Baum-Welch reestimation of Gaussian mixture models based on reverse Jensen inequality," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005.
- (2005) Proc. Interspeech
- Afify, M.¹

72
- 0012708049
- On reversing Jensen's inequality
- Dec.
- T. Jebara and A. Pentland, "On reversing Jensen's inequality," Adv. Neural Inf. Process. Syst., Dec. 2000.
- (2000) Adv. Neural Inf. Process. Syst.
- Jebara, T.¹ Pentland, A.²

73
- 33846516584
- NewYork, NY, USA: Springer
- C. Bishop, Pattern Recognition and Machine Learning. NewYork, NY, USA: Springer, 2006.
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.¹

74
- 0002210265
- On the convergence properties of the em algorithm
- C. F. J. Wu, "On the convergence properties of the EM algorithm," Ann. Statist., vol. 11, no. 1, pp. 95-103, 1983.
- (1983) Ann. Statist. , vol.11 , Issue.1 , pp. 95-103
- Wu, C.F.J.¹

75
- 0024610919
- Tutorial on hidden Markov models and selected applications in speech recognition
- Feb.
- L. Rabiner, "Tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.¹

76
- 0028412908
- High-performance connected digit recognition using maximum mutual information estimation
- Apr.
- Y. Normandin, R. Cardin, and R. Demori, "High-performance connected digit recognition using maximum mutual information estimation," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 299-311, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 299-311
- Normandin, Y.¹ Cardin, R.² Demori, R.³

77
- 0036296863
- Minimum phone error and i-smoothing for improved discriminative training
- D. Povey and P. C.Woodland, "Minimum phone error and i-smoothing for improved discriminative training," in Proc. IEEE Int. Conf. Acoustic, Speech, Signal Process., 2002, pp. 105-108.
- (2002) Proc. IEEE Int. Conf. Acoustic, Speech, Signal Process. , pp. 105-108
- Povey, D.¹ Woodland, P.C.²

78
- 34547522070
- Discriminative training for large vocabulary speech recognition usingminimumclassification error
- Jan.
- E. McDermott, T. J. Hazen, J. Le Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition usingminimumclassification error," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 203-223, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 203-223
- McDermott, E.¹ Hazen, T.J.² Le Roux, J.³ Nakamura, A.⁴ Katagiri, S.⁵

79
- 85008035419
- Equivalence of generative and log-linearmodels
- Jul.
- G. Heigold, H. Ney, P. Lehnen, T. Gass, and R. Schlüter, "Equivalence of generative and log-linearmodels," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 5, pp. 1138-1148, Jul. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.5 , pp. 1138-1148
- Heigold, G.¹ Ney, H.² Lehnen, P.³ Gass, T.⁴ Schlüter, R.⁵

80
- 0002629270
- Maximum-likelihood from incomplete data via the em algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum-likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc. Ser. B., vol. 39, 1977.
- (1977) J. R. Statist. Soc. Ser. B. , vol.39
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

81
- 0001573124
- Generalized iterative scaling for log-linear models
- J. Darroch and D. Ratcliff, "Generalized iterative scaling for log-linear models," Ann. Math. Statist., vol. 43, pp. 1470-1480, 1972.
- (1972) Ann. Math. Statist. , vol.43 , pp. 1470-1480
- Darroch, J.¹ Ratcliff, D.²

82
- 0031120321
- Inducing features of random fields
- S. A. Della Pietra, V. J. Della Pietra, and J. Lafferty, "Inducing features of random fields," IEEE Trans. Pattern Anal. Mach. Intell., vol. 19, no. 4, pp. 380-393, Apr.. 1997. (Pubitemid 127762893)
- (1997) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.19 , Issue.4 , pp. 380-393
- Pietra, S.D.¹ Pietra, V.D.² Lafferty, J.³

83
- 51449099268
- GIS-like estimation of log-linear models with hidden variables
- G. Heigold, T. Deselaers, R. Schlüter, andH. Ney, "GIS-like estimation of log-linear models with hidden variables," in Proc. ICASSP, 2008, pp. 4045-4048.
- (2008) Proc. ICASSP , pp. 4045-4048
- Heigold, G.¹ Deselaers, T.² Schlüter, R.³ Ney, H.⁴

84
- 84943274699
- A direct adaptivemethod for faster backpropagation learning: The Rprop algorithm
- M. Riedmiller and H. Braun, "A direct adaptivemethod for faster backpropagation learning: The Rprop algorithm," in IEEE International Conference on Neural Networks (ICNN), 1993.
- (1993) IEEE International Conference on Neural Networks (ICNN)
- Riedmiller, M.¹ Braun, H.²

85
- 84876672166
- Machine learning paradigms for speech recognition: An overview
- May
- L. Deng and X. Li, "Machine learning paradigms for speech recognition: An overview," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 5, pp. 1060-1089, May 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , Issue.5 , pp. 1060-1089
- Deng, L.¹ Li, X.²

86
- 84878379108
- Scalable minimum Bayes risk training of deep neural network acoustic models using distributed hessian-free optimization
- B. Kingsbury, T. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed hessian-free optimization," in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Kingsbury, B.¹ Sainath, T.² Soltau, H.³

87
- 84877760312
- Large scale distributed deep networks
- J. Dean, G. Corrado, R. Monga, K. Chen, M. Devin, Q. Le, M. W. Mao, M.-A. Ranzato, A.-W. Senior, P. A. Tucker, K. Yang, and A. Y. Ng, "Large scale distributed deep networks," NIPS, 2012.
- (2012) NIPS
- Dean, J.¹ Corrado, G.² Monga, R.³ Chen, K.⁴ Devin, M.⁵ Le, Q.⁶ Mao, M.W.⁷ Ranzato, M.-A.⁸ Senior, A.-W.⁹ Tucker, P.A.¹⁰ Yang, K.¹¹ Ng, A.Y.¹²

88
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition
- Nov.
- G. Hinton, L. Deng, D. Yu, G. Dahl, A.Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, Nov. 2012.
- (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

89
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- Jan.
- G. Dahl,D.Yu, L.Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 30-42, Jan. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.¹ Yu, D.² Deng, L.³ Acero, A.⁴

90
- 84055211743
- Acoustic modeling using deep belief networks
- Jan.
- A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 14-22, Jan. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

91
- 84255177123
- Deep and wide: Multiple layers in automatic speech recognition
- Jan.
- N. Morgan, "Deep and wide: Multiple layers in automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 7-13, Jan. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 7-13
- Morgan, N.¹

92
- 84875405186
- Exploiting deep neural networks for detection-based speech recognition
- M. Siniscalchi, L. Deng, D. Yu, and C.-H. Lee, "Exploiting deep neural networks for detection-based speech recognition," Neurocomputing, pp. 148-157, 2013.
- (2013) Neurocomputing , pp. 148-157
- Siniscalchi, M.¹ Deng, L.² Yu, D.³ Lee, C.-H.⁴

93
- 84865768819
- Deep convex network: A scalable architecture for speech pattern classification
- L. Deng and D. Yu, "Deep convex network: A scalable architecture for speech pattern classification," in Proc. Interspeech, 2011.
- (2011) Proc. Interspeech
- Deng, L.¹ Yu, D.²

94
- 84867614591
- Scalable stacking and learning for building deep architectures
- L. Deng, D. Yu, and J. Platt, "Scalable stacking and learning for building deep architectures," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2012, pp. 2133-2136.
- (2012) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 2133-2136
- Deng, L.¹ Yu, D.² Platt, J.³

95
- 84879301618
- Tensor deep stacking networks
- Aug.
- B. Hutchinson, L. Deng, and D. Yu, "Tensor deep stacking networks," IEEE Trans. Pattern Anal.Mach. Intell., vol. 35, no. 8, pp. 1944-1957, Aug. 2013.
- (2013) IEEE Trans. Pattern Anal.Mach. Intell. , vol.35 , Issue.8 , pp. 1944-1957
- Hutchinson, B.¹ Deng, L.² Yu, D.³

96
- 80053459857
- Generating text with recurrent neural networks
- I. Suskever, J. Martens, and G. E. Hinton, "Generating text with recurrent neural networks," in Proc. 28th Int. Conf. Mach. Learn., 2011.
- (2011) Proc. 28th Int. Conf. Mach. Learn.
- Suskever, I.¹ Martens, J.² Hinton, G.E.³

97
- 84890526837
- New types of deep neural network learning for speech recognition and related applications: An overview
- L. Deng, G. E. Hinton, and B. Kingsbury, "New types of deep neural network learning for speech recognition and related applications: An overview," in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Deng, L.¹ Hinton, G.E.² Kingsbury, B.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.