-
1
-
-
0000342467
-
Statistical inference for probabilistic functions of finite state Markov chains
-
L. E. Baum and T. Petrie, "Statistical inference for probabilistic functions of finite state Markov chains," Ann. Math. Statist., vol. 37, no. 6, pp. 1554-1563, 1966.
-
(1966)
Ann. Math. Statist.
, vol.37
, Issue.6
, pp. 1554-1563
-
-
Baum, L.E.1
Petrie, T.2
-
2
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput., Speech, Lang., vol. 9, pp. 171-185, 1995.
-
(1995)
Comput., Speech, Lang.
, vol.9
, pp. 171-185
-
-
Leggetter, C.1
Woodland, P.2
-
3
-
-
67149108139
-
-
NY, USA: Wiley
-
R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. New York, NY, USA: Wiley, 2001.
-
(2001)
Pattern Classification. New York
-
-
Duda, R.O.1
Hart, P.E.2
Stork, D.G.3
-
4
-
-
0036461035
-
Large scale discriminative training of hidden Markov models for speech recognition
-
P. C. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models for speech recognition," Comput. Speech, Lang., pp. 25-47, 2002.
-
(2002)
Comput. Speech, Lang.
, pp. 25-47
-
-
Woodland, P.C.1
Povey, D.2
-
5
-
-
4544265717
-
-
Ph.D. dissertation, University of Cambridge, Cambridge, U.K.
-
D. Povey, "Discriminative training for large vocabulary speech recognition," Ph.D. dissertation, University of Cambridge, Cambridge, U.K., 2003.
-
(2003)
Discriminative Training for Large Vocabulary Speech Recognition
-
-
Povey, D.1
-
7
-
-
85009216465
-
A comparative study on maximum entropy and discriminative training for acoustic modeling in automatic speech recognition
-
W. Macherey and H. Ney, "A comparative study on maximum entropy and discriminative training for acoustic modeling in automatic speech recognition," in Proc. Eurospeech, 2003, pp. 493-496.
-
(2003)
Proc. Eurospeech
, pp. 493-496
-
-
Macherey, W.1
Ney, H.2
-
8
-
-
33947618431
-
Hidden conditional random fields for phone classification
-
Sep.
-
A. Gunawardana, M. Mahajan, A. Acero, and J. C. Platt, "Hidden conditional random fields for phone classification," in Interspeech, Sep. 2005.
-
(2005)
Interspeech
-
-
Gunawardana, A.1
Mahajan, M.2
Acero, A.3
Platt, J.C.4
-
9
-
-
34249656385
-
Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
-
Jan.
-
S. Axelrod, V. Goel, R. A. Gopinath, P. A. Olsen, and K. Visweswariah, "Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 172-189, Jan. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.1
, pp. 172-189
-
-
Axelrod, S.1
Goel, V.2
Gopinath, R.A.3
Olsen, P.A.4
Visweswariah, K.5
-
10
-
-
85032750905
-
Discriminative learning in sequential pattern recognition-A unifying review for optimization-oriented speech recognition
-
Sep.
-
X. He, L. Deng, and W. Chou, "Discriminative learning in sequential pattern recognition-a unifying review for optimization-oriented speech recognition," IEEE Signal Process. Mag., vol. 25, no. 5, pp. 14-36, Sep. 2008.
-
(2008)
IEEE Signal Process. Mag.
, vol.25
, Issue.5
, pp. 14-36
-
-
He, X.1
Deng, L.2
Chou, W.3
-
11
-
-
81555228871
-
Exemplar-based sparse representation features for speech recognition
-
T. Sainath, B. Ramabhadran, D. Nahamoo, D. Kanevsky, and A. Sethy, "Exemplar-based sparse representation features for speech recognition," in Proc. Interspeech, 2010.
-
(2010)
Proc. Interspeech
-
-
Sainath, T.1
Ramabhadran, B.2
Nahamoo, D.3
Kanevsky, D.4
Sethy, A.5
-
12
-
-
84886829539
-
Optimization techniques to improve training speed of deep belief networks for large speech tasks
-
Nov.
-
T. Sainath, B. Kingsbury, H. Soltau, and B. Ramabhadran, "Optimization techniques to improve training speed of deep belief networks for large speech tasks," IEEE Trans. Audio, Speech, Lang. Process., Spec. Iss. Large-Scale Optimization for Audio, Speech, Lang. Process., vol. 21, no. 11, Nov. 2013.
-
(2013)
IEEE Trans. Audio, Speech, Lang. Process., Spec. Iss. Large-Scale Optimization for Audio, Speech, Lang. Process.
, vol.21
, Issue.11
-
-
Sainath, T.1
Kingsbury, B.2
Soltau, H.3
Ramabhadran, B.4
-
13
-
-
80051618443
-
EM-style optimization of hidden conditional random fields for grapheme-to-phoneme conversion
-
G. Heigold, S. Hahn, P. Lehnen, and H. Ney, "EM-style optimization of hidden conditional random fields for grapheme-to-phoneme conversion," in Proc. ICASSP, 2011, pp. 4920-4923.
-
(2011)
Proc. ICASSP
, pp. 4920-4923
-
-
Heigold, G.1
Hahn, S.2
Lehnen, P.3
Ney, H.4
-
14
-
-
84877743396
-
Optimizing the performance of spoken language recognition with discriminative training
-
Aug.
-
V. Hautamäki, K. A. Lee, T. Kinnunen, B. Ma, and H. Li, "Optimizing the performance of spoken language recognition with discriminative training," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 8, pp. 1622-1631, Aug. 2013.
-
(2013)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.21
, Issue.8
, pp. 1622-1631
-
-
Hautamäki, V.1
Lee, K.A.2
Kinnunen, T.3
Ma, B.4
Li, H.5
-
16
-
-
29044444825
-
Support vector machines for speaker and language recognition
-
DOI 10.1016/j.csl.2005.06.003, PII S0885230805000318, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
-
W. Campbell, J. Campbell, D. Reynolds, E. Singer, and P. Torres-Carrasquillo, "Support vector machines for speaker and language recognition," Computer Speech Lang., pp. 210-229, Apr. 2006. (Pubitemid 41787537)
-
(2006)
Computer Speech and Language
, vol.20
, Issue.2-3 SPEC. ISSUE
, pp. 210-229
-
-
Campbell, W.M.1
Campbell, J.P.2
Reynolds, D.A.3
Singer, E.4
Torres-Carrasquillo, P.A.5
-
17
-
-
84055217796
-
Bayesian sensing hidden Markov models
-
Jan.
-
G. Saon and J.-T. Chien, "Bayesian sensing hidden Markov models," IEEE Trans. Audio, Speech Lang. Process., vol. 20, no. 1, pp. 43-54, Jan. 2012.
-
(2012)
IEEE Trans. Audio, Speech Lang. Process.
, vol.20
, Issue.1
, pp. 43-54
-
-
Saon, G.1
Chien, J.-T.2
-
18
-
-
0000159105
-
On adaptive decision rules and decision parameter adaptation for automatic speech recognition
-
Aug.
-
C. H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol. 88, no. 8, pp. 1241-1269, Aug. 2000.
-
(2000)
Proc. IEEE
, vol.88
, Issue.8
, pp. 1241-1269
-
-
Lee, C.H.1
Huo, Q.2
-
20
-
-
85032751865
-
A geometric perspective of large-margin training of Gaussian models
-
Nov.
-
L. Xiao and L. Deng, "A geometric perspective of large-margin training of Gaussian models," IEEE Signal Process. Mag., vol. 27, no. 6, pp. 118-123, Nov. 2010.
-
(2010)
IEEE Signal Process. Mag.
, vol.27
, Issue.6
, pp. 118-123
-
-
Xiao, L.1
Deng, L.2
-
21
-
-
51449090596
-
A convex optimization method for joint mean and variance parameter estimation of large-margin CDHMM
-
T.-H. Chang, Z.-Q. Luo, L. Deng, and C.-Y. Chi, "A convex optimization method for joint mean and variance parameter estimation of large-margin CDHMM," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2008, pp. 4053-4056.
-
(2008)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, pp. 4053-4056
-
-
Chang, T.-H.1
Luo, Z.-Q.2
Deng, L.3
Chi, C.-Y.4
-
22
-
-
84876669905
-
Speech-centric information processing: An optimization-oriented approach
-
May
-
X. He and L. Deng, "Speech-centric information processing: An optimization-oriented approach," Proc. IEEE, vol. 101, no. 5, pp. 1116-1135, May 2013.
-
(2013)
Proc. IEEE
, vol.101
, Issue.5
, pp. 1116-1135
-
-
He, X.1
Deng, L.2
-
26
-
-
80053182852
-
Trust region-based optimization for maximum mutual information estimation of HMMs in speech recognition
-
Nov.
-
C. Liu, Y. Hu, L.-R. Dai, and H. Jiang, "Trust region-based optimization for maximum mutual information estimation of HMMs in speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2474-2485, Nov. 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.8
, pp. 2474-2485
-
-
Liu, C.1
Hu, Y.2
Dai, L.-R.3
Jiang, H.4
-
27
-
-
77955783938
-
Error approximation and minimum phone error acoustic model estimation
-
Aug.
-
M. Gibson and T. Hain, "Error approximation and minimum phone error acoustic model estimation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1269-1279, Aug. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.6
, pp. 1269-1279
-
-
Gibson, M.1
Hain, T.2
-
29
-
-
51449120120
-
Boosted MMI for model and feature-space discriminative training
-
Las Vegas, NV, USA, Apr.
-
D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "Boosted MMI for model and feature-space discriminative training," in Proc. IEEE Int. Conf. Acoustic, Speech, Signal Process., Las Vegas, NV, USA, Apr. 2008, pp. 4057-4060.
-
(2008)
Proc. IEEE Int. Conf. Acoustic, Speech, Signal Process.
, pp. 4057-4060
-
-
Povey, D.1
Kanevsky, D.2
Kingsbury, B.3
Ramabhadran, B.4
Saon, G.5
Visweswariah, K.6
-
30
-
-
80051640064
-
-
Ph.D. dissertation, RWTH Aachen Univ., Aachen, Germany
-
G. Heigold, "A log-linear discriminative modeling framework for speech recognition," Ph.D. dissertation, RWTH Aachen Univ., Aachen, Germany, 2010.
-
(2010)
A Log-linear Discriminative Modeling Framework for Speech Recognition
-
-
Heigold, G.1
-
31
-
-
0003845417
-
The present status of automatic translation of languages
-
Y. Bar-Hillel, "The present status of automatic translation of languages," Adv. Comput., pp. 158-163, 1960.
-
(1960)
Adv. Comput.
, pp. 158-163
-
-
Bar-Hillel, Y.1
-
32
-
-
85044611587
-
The mathematics of statistical machine translation: Parameter estimation
-
P. Brown, S. Pietra, V. Pietra, and R. Mercer, "The mathematics of statistical machine translation: Parameter estimation," Comput. Linguist., vol. 19, no. 2, pp. 263-311, 1993.
-
(1993)
Comput. Linguist.
, vol.19
, Issue.2
, pp. 263-311
-
-
Brown, P.1
Pietra, S.2
Pietra, V.3
Mercer, R.4
-
34
-
-
84944098666
-
Minimum error rate training in statistical machine translation
-
F. Och, "Minimum error rate training in statistical machine translation," in Proc. ACL, 2003.
-
(2003)
Proc. ACL
-
-
Och, F.1
-
35
-
-
85032751114
-
Speech recognition, machine translation, and speech translation-A unified discriminative learning paradigm
-
Sep.
-
X. He, L. Deng, and W. Chou, "Speech recognition, machine translation, and speech translation-a unified discriminative learning paradigm," IEEE Signal Process. Mag., vol. 28, no. 5, pp. 126-133, Sep. 2011.
-
(2011)
IEEE Signal Process. Mag.
, vol.28
, Issue.5
, pp. 126-133
-
-
He, X.1
Deng, L.2
Chou, W.3
-
36
-
-
80053214216
-
A maximum-entropy segmentation model for statistical machine translation
-
Nov.
-
D. Xiong, M. Zhang, and H. Li, "A maximum-entropy segmentation model for statistical machine translation," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2494-2505, Nov. 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.8
, pp. 2494-2505
-
-
Xiong, D.1
Zhang, M.2
Li, H.3
-
37
-
-
84988221520
-
Exploiting morphology and local word reordering in English-to-Turkish phrase-based statistical machine translation
-
Aug.
-
I. D. El-Kahlout and K. Oflazer, "Exploiting morphology and local word reordering in English-to-Turkish phrase-based statistical machine translation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1313-1322, Aug. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.6
, pp. 1313-1322
-
-
El-Kahlout, I.D.1
Oflazer, K.2
-
38
-
-
84876693434
-
Maximum expected BLEU training of phrase and lexicon translation models
-
X. He and L. Deng, "Maximum expected BLEU training of phrase and lexicon translation models," in Proc. ACL, Assoc. Comput. Linguist., 2012.
-
(2012)
Proc. ACL, Assoc. Comput. Linguist.
-
-
He, X.1
Deng, L.2
-
39
-
-
85133336275
-
Bleu: A method for evaluation of machine translation
-
K. Papineni, S. Roukos, T. Ward, and W. Zhu, "Bleu: a method for evaluation of machine translation," in Proc. 40th Annu. Meeting Assoc. Comput. Linguist., 2002, pp. 311-318.
-
(2002)
Proc. 40th Annu. Meeting Assoc. Comput. Linguist.
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.4
-
40
-
-
70350125882
-
An overview of text-independent speaker recognition: From features to supervectors
-
T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: from features to supervectors," Speech Commun., vol. 52, no. 1, pp. 12-40, 2010.
-
(2010)
Speech Commun.
, vol.52
, Issue.1
, pp. 12-40
-
-
Kinnunen, T.1
Li, H.2
-
41
-
-
0033884858
-
Speaker verification using adapted Gaussian mixture models
-
DOI 10.1006/dspr.1999.0361
-
D. Reynolds, T. Quatieri, and R. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, no. 1, pp. 19-41, Jan. 2000. (Pubitemid 30592166)
-
(2000)
Digital Signal Processing: A Review Journal
, vol.10
, Issue.1
, pp. 19-41
-
-
Reynolds, D.A.1
Quatieri, T.F.2
Dunn, R.B.3
-
42
-
-
58349106697
-
A study of inter-speaker variability in speaker verification
-
Jul.
-
P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel, "A study of inter-speaker variability in speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 5, pp. 980-988, Jul. 2008.
-
(2008)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.16
, Issue.5
, pp. 980-988
-
-
Kenny, P.1
Ouellet, P.2
Dehak, N.3
Gupta, V.4
Dumouchel, P.5
-
43
-
-
79953277529
-
Using discrete probabilities with Bhattacharyya measure for SVM-based speaker verification
-
May
-
K. A. Lee, C. H. You, H. Li, T. Kinnunen, and K. C. Sim, "Using discrete probabilities with Bhattacharyya measure for SVM-based speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 4, pp. 861-870, May 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.4
, pp. 861-870
-
-
Lee, K.A.1
You, C.H.2
Li, H.3
Kinnunen, T.4
Sim, K.C.5
-
44
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
May
-
N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 4, pp. 788-798, May 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.4
, pp. 788-798
-
-
Dehak, N.1
Kenny, P.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
45
-
-
84876676725
-
Spoken language recognition: From fundamentals to practice
-
May
-
H. Li, B. Ma, and K. A. Lee, "Spoken language recognition: From fundamentals to practice," Proc. IEEE, vol. 101, no. 5, pp. 1136-1159, May 2013.
-
(2013)
Proc. IEEE
, vol.101
, Issue.5
, pp. 1136-1159
-
-
Li, H.1
Ma, B.2
Lee, K.A.3
-
46
-
-
84887109920
-
Vector-based spoken language classification
-
J. Benesty, M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer
-
H. Li, B. Ma, and C.-H. Lee, "Vector-based spoken language classification," in Springer Handbook of Speech Processing, J. Benesty, M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer, 2007.
-
(2007)
Springer Handbook of Speech Processing
-
-
Li, H.1
Ma, B.2
Lee, C.-H.3
-
47
-
-
34547502608
-
A vector space modeling approach to spoken language identification
-
Jan.
-
H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 271-284, Jan. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.1
, pp. 271-284
-
-
Li, H.1
Ma, B.2
Lee, C.-H.3
-
48
-
-
29044433376
-
Application-independent evaluation of speaker detection
-
DOI 10.1016/j.csl.2005.08.001, PII S0885230805000483, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
-
N. Brümmer and J. du Preez, "Application-independent evaluation of speaker detection," Comput. Speech Lang., vol. 20, no. 2, pp. 230-275, 2006. (Pubitemid 41787538)
-
(2006)
Computer Speech and Language
, vol.20
, Issue.2-3 SPEC. ISSUE
, pp. 230-275
-
-
Brummer, N.1
Du Preez, J.2
-
49
-
-
36248952139
-
An introduction to application independent evaluation of speaker recognition systems
-
R. Müller, Ed. New York, NY, USA: Springer
-
D. A. van Leeuwen and N. Brümmer, "An introduction to application independent evaluation of speaker recognition systems," in Speaker Classification, Lecture Notes in Computer Science/Artificial Intelligence, R. Müller, Ed. New York, NY, USA: Springer, 2007, vol. 4343.
-
(2007)
Speaker Classification, Lecture Notes in Computer Science/Artificial Intelligence
, vol.4343
-
-
Van Leeuwen, D.A.1
Brümmer, N.2
-
50
-
-
85032751399
-
TechWare: Speaker and spoken language recognition resources
-
Nov.
-
H. Li and B. Ma, "TechWare: Speaker and spoken language recognition resources," IEEE Signal Process. Mag., vol. 27, no. 6, pp. 139-142, Nov. 2010.
-
(2010)
IEEE Signal Process. Mag.
, vol.27
, Issue.6
, pp. 139-142
-
-
Li, H.1
Ma, B.2
-
52
-
-
37649031157
-
The current state of language recognition: NIST 2005 evaluation results
-
A. F. Martin and A. N. Le, "The current state of language recognition: NIST 2005 evaluation results," in Proc. Odyssey: Speaker Lang. Recogn. Workshop, 2006, pp. 1-6.
-
(2006)
Proc. Odyssey: Speaker Lang. Recogn. Workshop
, pp. 1-6
-
-
Martin, A.F.1
Le, A.N.2
-
53
-
-
84969216997
-
NIST speech processing evaluations: LVCSR, speaker recognition, language recognition
-
A. F. Martin and J. S. Garofolo, "NIST speech processing evaluations: LVCSR, speaker recognition, language recognition," in Proc. IEEE Workshop Signal Process. Applicat. Public Security Forensics, 2007, pp. 1-7.
-
(2007)
Proc. IEEE Workshop Signal Process. Applicat. Public Security Forensics
, pp. 1-7
-
-
Martin, A.F.1
Garofolo, J.S.2
-
55
-
-
70350444555
-
Optimizing the performance of spoken language recognition with discriminative training
-
Nov.
-
D. Zhu, H. Li, B. Ma, and C. H. Lee, "Optimizing the performance of spoken language recognition with discriminative training," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 8, pp. 1642-1653, Nov. 2008.
-
(2008)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.16
, Issue.8
, pp. 1642-1653
-
-
Zhu, D.1
Li, H.2
Ma, B.3
Lee, C.H.4
-
56
-
-
0031139839
-
Minimum classification error rate methods for speech recognition
-
PII S1063667697035937
-
B.-H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 257-265, May 1997. (Pubitemid 127745998)
-
(1997)
IEEE Transactions on Speech and Audio Processing
, vol.5
, Issue.3
, pp. 257-265
-
-
Juang, B.-H.1
Chou, W.2
Lee, C.-H.3
-
58
-
-
80052047297
-
-
Ph.D. dissertation, Stellenbosch Univ., Stellenbosch, South Africa
-
N. Brümmer, "Measuring, refining and calibrating speaker and language information extracted from speech," Ph.D. dissertation, Stellenbosch Univ., Stellenbosch, South Africa, 2010.
-
(2010)
Measuring, Refining and Calibrating Speaker and Language Information Extracted from Speech
-
-
Brümmer, N.1
-
60
-
-
0025952278
-
An inequality for rational functions with applications to some statistical estimation problems
-
Jan.
-
P. S. Gopalakrishnan, D. Kanevsky, D. Nahamoo, and A. Nadas, "An inequality for rational functions with applications to some statistical estimation problems," IEEE Trans. Inf. Theory, vol. 37, no. 1, pp. 107-113, Jan. 1991.
-
(1991)
IEEE Trans. Inf. Theory
, vol.37
, Issue.1
, pp. 107-113
-
-
Gopalakrishnan, P.S.1
Kanevsky, D.2
Nahamoo, D.3
Nadas, A.4
-
61
-
-
0026372945
-
An improved MMIE training algorithm for speaker-independent, small vocabulary, continuous speech recognition
-
Y. Normandin, "An improved MMIE training algorithm for speaker-independent, small vocabulary, continuous speech recognition," in Proc. ICASSP, 1991, pp. 537-540.
-
(1991)
Proc. ICASSP
, pp. 537-540
-
-
Normandin, Y.1
-
62
-
-
4544265717
-
-
Ph.D. dissertation, Univ. of Cambridge, Cambridge, U.K.
-
D. Povey, "Discriminative training for large vocabulary speech recognition," Ph.D. dissertation, Univ. of Cambridge, Cambridge, U.K., 2003.
-
(2003)
Discriminative Training for Large Vocabulary Speech Recognition
-
-
Povey, D.1
-
63
-
-
44849142532
-
Extended Baum transformations for general functions, II
-
D. Kanevsky, "Extended Baum transformations for general functions, II," Human Language Technol., IBM, Tech. Rep. RC23645(W0506-120), 2005.
-
(2005)
Human Language Technol., IBM, Tech. Rep. RC23645(W0506-120)
-
-
Kanevsky, D.1
-
64
-
-
2142684272
-
On reversing Jensen's inequality
-
T. Jebara, "On reversing Jensen's inequality," in Proc. NIPS, 2002.
-
(2002)
Proc. NIPS
-
-
Jebara, T.1
-
65
-
-
34249656385
-
Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
-
Jan.
-
S. Axelrod, V. Goel, R. A. Gopinath, P. A. Olsen, and K. Visweswariah, "Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 172-189, Jan. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.1
, pp. 172-189
-
-
Axelrod, S.1
Goel, V.2
Gopinath, R.A.3
Olsen, P.A.4
Visweswariah, K.5
-
66
-
-
70349995255
-
Generalization of extended Baum-Welch parameter estimation for discriminative training and decoding
-
D. Kanevsky, T. Sainath, B. Ramabhadran, and D. Nahamoo, "Generalization of extended Baum-Welch parameter estimation for discriminative training and decoding," in Proc. Interspeech, 2008.
-
(2008)
Proc. Interspeech
-
-
Kanevsky, D.1
Sainath, T.2
Ramabhadran, B.3
Nahamoo, D.4
-
67
-
-
80051622448
-
A-Functions: A generalization of extended Baum-Welch transformations to convex optimization
-
D. Kanevsky, D. Nahamoo, T. N. Sainath, B. Ramabhadran, and P. A. Olsen, "A-Functions: A generalization of extended Baum-Welch transformations to convex optimization," in Proc. ICASSP, 2011, pp. 5164-5167.
-
(2011)
Proc. ICASSP
, pp. 5164-5167
-
-
Kanevsky, D.1
Nahamoo, D.2
Sainath, T.N.3
Ramabhadran, B.4
Olsen, P.A.5
-
68
-
-
0035342391
-
Comparison of discriminative training criteria and optimization methods for speech recognition
-
DOI 10.1016/S0167-6393(00)00035-2, PII S0167639300000352
-
R. Schlüter, W. Macherey, B. Müller, and H. Ney, "Comparison of discriminative training criteria and optimization methods for speech recognition," Speech Commun., pp. 287-310, 2001. (Pubitemid 32284868)
-
(2001)
Speech Communication
, vol.34
, Issue.3
, pp. 287-310
-
-
Schluter, R.1
Macherey, W.2
Muller, B.3
Ney, H.4
-
69
-
-
34547530690
-
Constrained line search optimization for discriminative training in speech recognition
-
C. Liu, P. Liu, H. Jiang, F. Soong, and R. Wang, "Constrained Line Search Optimization for Discriminative Training in Speech Recognition," in Proc. ICASSP, 2007, pp. 329-332.
-
(2007)
Proc. ICASSP
, pp. 329-332
-
-
Liu, C.1
Liu, P.2
Jiang, H.3
Soong, F.4
Wang, R.5
-
70
-
-
84865747510
-
Generalized Baum-Welch algorithm and its application to new extended Baum-Welch algorithm
-
DR. Hsiao and T. Schultz, "Generalized Baum-Welch algorithm and its application to new extended Baum-Welch algorithm," in Proc. Interspeech, 2011.
-
(2011)
Proc. Interspeech
-
-
Hsiao, D.R.1
Schultz, T.2
-
71
-
-
48849083725
-
Extended Baum-Welch reestimation of Gaussian mixture models based on reverse Jensen inequality
-
Lisbon, Portugal, Sep.
-
M. Afify, "Extended Baum-Welch reestimation of Gaussian mixture models based on reverse Jensen inequality," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005.
-
(2005)
Proc. Interspeech
-
-
Afify, M.1
-
74
-
-
0002210265
-
On the convergence properties of the EM algorithm
-
C. F. J. Wu, "On the convergence properties of the EM algorithm," Ann. Statist., vol. 11, no. 1, pp. 95-103, 1983.
-
(1983)
Ann. Statist.
, vol.11
, Issue.1
, pp. 95-103
-
-
Wu, C.F.J.1
-
75
-
-
0024610919
-
Tutorial on hidden Markov models and selected applications in speech recognition
-
Feb.
-
L. Rabiner, "Tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
-
(1989)
Proc. IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.1
-
76
-
-
0028412908
-
High-performance connected digit recognition using maximum mutual information estimation
-
Apr.
-
Y. Normandin, R. Cardin, and R. Demori, "High-performance connected digit recognition using maximum mutual information estimation," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 299-311, Apr. 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.2
, pp. 299-311
-
-
Normandin, Y.1
Cardin, R.2
Demori, R.3
-
77
-
-
0036296863
-
Minimum phone error and i-smoothing for improved discriminative training
-
D. Povey and P. C. Woodland, "Minimum phone error and i-smoothing for improved discriminative training," in Proc. IEEE Int. Conf. Acoustic, Speech, Signal Process., 2002, pp. 105-108.
-
(2002)
Proc. IEEE Int. Conf. Acoustic, Speech, Signal Process.
, pp. 105-108
-
-
Povey, D.1
Woodland, P.C.2
-
78
-
-
34547522070
-
Discriminative training for large vocabulary speech recognition using minimum classification error
-
Jan.
-
E. McDermott, T. J. Hazen, J. Le Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition using minimum classification error," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 203-223, Jan. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.1
, pp. 203-223
-
-
McDermott, E.1
Hazen, T.J.2
Le Roux, J.3
Nakamura, A.4
Katagiri, S.5
-
79
-
-
85008035419
-
Equivalence of generative and log-linear models
-
Jul.
-
G. Heigold, H. Ney, P. Lehnen, T. Gass, and R. Schlüter, "Equivalence of generative and log-linear models," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 5, pp. 1138-1148, Jul. 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.5
, pp. 1138-1148
-
-
Heigold, G.1
Ney, H.2
Lehnen, P.3
Gass, T.4
Schlüter, R.5
-
80
-
-
0002629270
-
Maximum-likelihood from incomplete data via the EM algorithm
-
A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum-likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc. Ser. B., vol. 39, 1977.
-
(1977)
J. R. Statist. Soc. Ser. B.
, vol.39
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
81
-
-
0001573124
-
Generalized iterative scaling for log-linear models
-
J. Darroch and D. Ratcliff, "Generalized iterative scaling for log-linear models," Ann. Math. Statist., vol. 43, pp. 1470-1480, 1972.
-
(1972)
Ann. Math. Statist.
, vol.43
, pp. 1470-1480
-
-
Darroch, J.1
Ratcliff, D.2
-
82
-
-
0031120321
-
Inducing features of random fields
-
S. A. Della Pietra, V. J. Della Pietra, and J. Lafferty, "Inducing features of random fields," IEEE Trans. Pattern Anal. Mach. Intell., vol. 19, no. 4, pp. 380-393, Apr. 1997. (Pubitemid 127762893)
-
(1997)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.19
, Issue.4
, pp. 380-393
-
-
Pietra, S.D.1
Pietra, V.D.2
Lafferty, J.3
-
83
-
-
51449099268
-
GIS-like estimation of log-linear models with hidden variables
-
G. Heigold, T. Deselaers, R. Schlüter, and H. Ney, "GIS-like estimation of log-linear models with hidden variables," in Proc. ICASSP, 2008, pp. 4045-4048.
-
(2008)
Proc. ICASSP
, pp. 4045-4048
-
-
Heigold, G.1
Deselaers, T.2
Schlüter, R.3
Ney, H.4
-
85
-
-
84876672166
-
Machine learning paradigms for speech recognition: An overview
-
May
-
L. Deng and X. Li, "Machine learning paradigms for speech recognition: An overview," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 5, pp. 1060-1089, May 2013.
-
(2013)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.21
, Issue.5
, pp. 1060-1089
-
-
Deng, L.1
Li, X.2
-
86
-
-
84878379108
-
Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
-
B. Kingsbury, T. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in Proc. Interspeech, 2012.
-
(2012)
Proc. Interspeech
-
-
Kingsbury, B.1
Sainath, T.2
Soltau, H.3
-
87
-
-
84877760312
-
Large scale distributed deep networks
-
J. Dean, G. Corrado, R. Monga, K. Chen, M. Devin, Q. Le, M. W. Mao, M.-A. Ranzato, A.-W. Senior, P. A. Tucker, K. Yang, and A. Y. Ng, "Large scale distributed deep networks," NIPS, 2012.
-
(2012)
NIPS
-
-
Dean, J.1
Corrado, G.2
Monga, R.3
Chen, K.4
Devin, M.5
Le, Q.6
Mao, M.W.7
Ranzato, M.-A.8
Senior, A.-W.9
Tucker, P.A.10
Yang, K.11
Ng, A.Y.12
-
88
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
Nov.
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, Nov. 2012.
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
89
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
Jan.
-
G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 30-42, Jan. 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
Acero, A.4
-
90
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
Jan.
-
A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 14-22, Jan. 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.2
Hinton, G.3
-
91
-
-
84255177123
-
Deep and wide: Multiple layers in automatic speech recognition
-
Jan.
-
N. Morgan, "Deep and wide: Multiple layers in automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 7-13, Jan. 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.20
, Issue.1
, pp. 7-13
-
-
Morgan, N.1
-
92
-
-
84875405186
-
Exploiting deep neural networks for detection-based speech recognition
-
M. Siniscalchi, L. Deng, D. Yu, and C.-H. Lee, "Exploiting deep neural networks for detection-based speech recognition," Neurocomputing, pp. 148-157, 2013.
-
(2013)
Neurocomputing
, pp. 148-157
-
-
Siniscalchi, M.1
Deng, L.2
Yu, D.3
Lee, C.-H.4
-
93
-
-
84865768819
-
Deep convex network: A scalable architecture for speech pattern classification
-
L. Deng and D. Yu, "Deep convex network: A scalable architecture for speech pattern classification," in Proc. Interspeech, 2011.
-
(2011)
Proc. Interspeech
-
-
Deng, L.1
Yu, D.2
-
94
-
-
84867614591
-
Scalable stacking and learning for building deep architectures
-
L. Deng, D. Yu, and J. Platt, "Scalable stacking and learning for building deep architectures," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2012, pp. 2133-2136.
-
(2012)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, pp. 2133-2136
-
-
Deng, L.1
Yu, D.2
Platt, J.3
-
95
-
-
84879301618
-
Tensor deep stacking networks
-
Aug.
-
B. Hutchinson, L. Deng, and D. Yu, "Tensor deep stacking networks," IEEE Trans. Pattern Anal. Mach. Intell., vol. 35, no. 8, pp. 1944-1957, Aug. 2013.
-
(2013)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.35
, Issue.8
, pp. 1944-1957
-
-
Hutchinson, B.1
Deng, L.2
Yu, D.3
-
97
-
-
84890526837
-
New types of deep neural network learning for speech recognition and related applications: An overview
-
L. Deng, G. E. Hinton, and B. Kingsbury, "New types of deep neural network learning for speech recognition and related applications: An overview," in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Deng, L.1
Hinton, G.E.2
Kingsbury, B.3