-
1
-
-
33947619591
-
Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons
-
A. Stolcke, F. Grezl, M.-Y. Hwang, X. Lei, N. Morgan, and D. Vergyri, "Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2006, pp. 321-324.
-
(2006)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, pp. 321-324
-
-
Stolcke, A.1
Grezl, F.2
Hwang, M.-Y.3
Lei, X.4
Morgan, N.5
Vergyri, D.6
-
2
-
-
44849132075
-
Monolingual and cross lingual comparison of tandem features derived from articulatory and phone MLPs
-
O. Cetin, M. Magimai-Doss, K. Livescu, A. Kantor, S. King, C. Bartels, and J. Frankel, "Monolingual and cross lingual comparison of tandem features derived from articulatory and phone MLPs, " in Proc. of IEEE Automatic Speech Recognition and Understanding Workshop, 2007, pp. 36-41.
-
(2007)
Proc. of IEEE Automatic Speech Recognition and Understanding Workshop
, pp. 36-41
-
-
Cetin, O.1
Magimai-Doss, M.2
Livescu, K.3
Kantor, A.4
King, S.5
Bartels, C.6
Frankel, J.7
-
3
-
-
84858976609
-
Cross-lingual portability of Chinese and English neural network features for French and German LVCSR
-
C. Plahl, R. Schluter, and H. Ney, "Cross-lingual Portability of Chinese and English Neural Network Features for French and German LVCSR, " in Proc. of IEEE Automatic Speech Recognition and Understanding Workshop, 2011, pp. 371-376.
-
(2011)
Proc. of IEEE Automatic Speech Recognition and Understanding Workshop
, pp. 371-376
-
-
Plahl, C.1
Schluter, R.2
Ney, H.3
-
4
-
-
84858985238
-
Cross-lingual portability of MLP-based tandem features-a case study for English and Hungarian
-
L. Toth, J. Frankel, G. Gosztolya, and S. King, "Cross-lingual Portability of MLP-Based Tandem Features-A Case Study for English and Hungarian, " in Proc. of Interspeech, 2008, pp. 2695- 2698.
-
(2008)
Proc. of Interspeech
, pp. 2695-2698
-
-
Toth, L.1
Frankel, J.2
Gosztolya, G.3
King, S.4
-
5
-
-
84858955616
-
Study of probabilistic and bottle-neck features in multilingual environment
-
F. Grezl, M. Karafiat, and M. Janda, "Study of probabilistic and bottle-neck features in multilingual environment, " in Proc. of IEEE Automatic Speech Recognition and Understanding Workshop, 2011, pp. 359-364.
-
(2011)
Proc. of IEEE Automatic Speech Recognition and Understanding Workshop
, pp. 359-364
-
-
Grezl, F.1
Karafiat, M.2
Janda, M.3
-
6
-
-
84890474441
-
Investigation on cross- And multilingual MLP features under matched and mismatched acoustical conditions
-
accepted for publication
-
Z. Tuske, J. Pinto, D. Willett, and R. Schluter, "Investigation on cross- And multilingual MLP features under matched and mismatched acoustical conditions, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2013, accepted for publication.
-
(2013)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing
-
-
Tuske, Z.1
Pinto, J.2
Willett, D.3
Schluter, R.4
-
7
-
-
84878559540
-
An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance
-
N. T. Vu, W. Breiter, F. Metze, and T. Schultz, "An Investigation on Initialization Schemes for Multilayer Perceptron Training Using Multilingual Data and Their Effect on ASR Performance, " in Proc. of Interspeech, 2012.
-
(2012)
Proc. of Interspeech
-
-
Vu, N.T.1
Breiter, W.2
Metze, F.3
Schultz, T.4
-
8
-
-
84867606552
-
Multilingual MLP features for low-resource LVCSR systems
-
S. Thomas, S. Ganapathy, and H. Hermansky, "Multilingual MLP features for low-resource LVCSR systems, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2012, pp. 4269-4272.
-
(2012)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, pp. 4269-4272
-
-
Thomas, S.1
Ganapathy, S.2
Hermansky, H.3
-
9
-
-
0033709098
-
Tandem connectionist feature extraction for conventional HMM systems
-
H. Hermansky, D. P. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, vol. 3, 2000, pp. 1635-1638.
-
(2000)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, vol.3
, pp. 1635-1638
-
-
Hermansky, H.1
Ellis, D.P.2
Sharma, S.3
-
10
-
-
34547548235
-
Probabilistic and bottle-neck features for LVCSR of meetings
-
F. Grezl, M. Karafiat, S. Kontar, and J. Cernocky, "Probabilistic and bottle-neck features for LVCSR of meetings, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2007, pp. 757-760.
-
(2007)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, pp. 757-760
-
-
Grezl, F.1
Karafiat, M.2
Kontar, S.3
Cernocky, J.4
-
11
-
-
85135166225
-
Fast bootstrapping of LVCSR systems with multilingual phoneme sets
-
T. Schultz and A. Waibel, "Fast Bootstrapping Of LVCSR Systems With Multilingual Phoneme Sets, " in Proc of Eurospeech, 1997.
-
(1997)
Proc of Eurospeech
-
-
Schultz, T.1
Waibel, A.2
-
12
-
-
70349220094
-
A study on multilingual acoustic modeling for large vocabulary ASR
-
H. Lin, L. Deng, D. Yu, Y. Gong, A. Acero, and C.-H. Lee, "A study on multilingual acoustic modeling for large vocabulary ASR, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2009, pp. 4333-4336.
-
(2009)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, pp. 4333-4336
-
-
Lin, H.1
Deng, L.2
Yu, D.3
Gong, Y.4
Acero, A.5
Lee, C.-H.6
-
13
-
-
79959816770
-
Towards mixed language speech recognition systems
-
D. Imseng, H. Bourlard, and M. Magimai-Doss, "Towards mixed language speech recognition systems, " in Proc. of Interspeech, 2010, pp. 278-281.
-
(2010)
Proc. of Interspeech
, pp. 278-281
-
-
Imseng, D.1
Bourlard, H.2
Magimai-Doss, M.3
-
15
-
-
0033690885
-
Towards language independent acoustic modeling
-
W. Byrne, P. Beyerlein, J. M. Huerta, S. Khudanpur, B. Marthi, J. Morgan, N. Peterek, J. Picone, D. Vergyri, and W. Wang, "Towards language independent acoustic modeling, " in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, 2000, pp. 1029-1032.
-
(2000)
Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing
, vol.2
, pp. 1029-1032
-
-
Byrne, W.1
Beyerlein, P.2
Huerta, J.M.3
Khudanpur, S.4
Marthi, B.5
Morgan, J.6
Peterek, N.7
Picone, J.8
Vergyri, D.9
Wang, W.10
-
16
-
-
79959819891
-
Cross-lingual and multi stream posterior features for low resource LVCSR systems
-
S. Thomas, S. Ganapathy, and H. Hermansky, "Cross-lingual and multi stream posterior features for low resource LVCSR systems, " in Proc. of Interspeech, 2010, pp. 877-880.
-
(2010)
Proc. of Interspeech
, pp. 877-880
-
-
Thomas, S.1
Ganapathy, S.2
Hermansky, H.3
-
17
-
-
84867224965
-
On the use of a multilingual neural network front-end
-
S. Scanzio, P. Laface, L. Fissore, R. Gemello, and F. Mana, "On the Use of a Multilingual Neural Network Front-End, " in Proc. of Interspeech, 2008, pp. 2711-2714.
-
(2008)
Proc. of Interspeech
, pp. 2711-2714
-
-
Scanzio, S.1
Laface, P.2
Fissore, L.3
Gemello, R.4
Mana, F.5
-
18
-
-
78049394188
-
Multilingual acoustic modeling for speech recognition based on subspace gaussian mixture models
-
L. Burget, P. Schwarz, M. Agarwal, P. Akayazi, K. Feng, A. Ghoshal, O. Glembek, N. Goel, M. Karafiat, D. Povey, A. Rastrow, R. C. Rose, and S. Thomas, "Multilingual acoustic modeling for speech recognition based on subspace Gaussian mixture models, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2013, 2010, pp. 4334-4337.
-
(2010)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2013
, pp. 4334-4337
-
-
Burget, L.1
Schwarz, P.2
Agarwal, M.3
Akayazi, P.4
Feng, K.5
Ghoshal, A.6
Glembek, O.7
Goel, N.8
Karafiat, M.9
Povey, D.10
Rastrow, A.11
Rose, R.C.12
Thomas, S.13
-
19
-
-
84874226274
-
The language-independent bottleneck features
-
K. Vesely, M. Karafiat, F. Grezl, M. Janda, and E. Egorova, "The language-independent bottleneck features, " in Proc. of IEEE Workshop on Spoken Language Technology, 2012, pp. 336-341.
-
(2012)
Proc. of IEEE Workshop on Spoken Language Technology
, pp. 336-341
-
-
Vesely, K.1
Karafiat, M.2
Grezl, F.3
Janda, M.4
Egorova, E.5
-
20
-
-
33745213373
-
Multi-resolution RASTA filtering for TANDEM-based ASR
-
H. Hermansky and P. Fousek, "Multi-resolution RASTA filtering for TANDEM-based ASR, " in Proc. of Interspeech, 2005, pp. 361-364.
-
(2005)
Proc. of Interspeech
, pp. 361-364
-
-
Hermansky, H.1
Fousek, P.2
-
21
-
-
79959844505
-
Hierarchical bottle neck features for LVCSR
-
C. Plahl, R. Schluter, and H. Ney, "Hierarchical Bottle Neck Features for LVCSR, " in Proc. of Interspeech, 2010, pp. 1197-1200.
-
(2010)
Proc. of Interspeech
, pp. 1197-1200
-
-
Plahl, C.1
Schluter, R.2
Ney, H.3
-
22
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
F. Seide, G. Li, and D. Yu, "Conversational Speech Transcription Using Context-Dependent Deep Neural Networks, " in Proc of Interspeech, 2011, pp. 437-440.
-
(2011)
Proc of Interspeech
, pp. 437-440
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
23
-
-
84867593213
-
Auto-encoder bottleneck features using deep belief networks
-
T. N. Sainath, B. Kingsbury, and B. Ramabhadran, "Auto-encoder bottleneck features using deep belief networks, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2012, pp. 4153-4156.
-
(2012)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, pp. 4153-4156
-
-
Sainath, T.N.1
Kingsbury, B.2
Ramabhadran, B.3
-
24
-
-
84890543571
-
Deep hierarchical bottleneck MRASTA features for LVCSR
-
accepted for publication
-
Z. Tuske, R. Schluter, and H. Ney, "Deep hierarchical bottleneck MRASTA features for LVCSR, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2013, accepted for publication.
-
(2013)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing
-
-
Tuske, Z.1
Schluter, R.2
Ney, H.3
-
25
-
-
79959839253
-
The RWTH 2009 QUAERO ASR evaluation system for English and German
-
M. Nußbaum-Thom, S. Wiesler, M. Sundermeyer, C. Plahl, S. Hahn, R. Schluter, and H. Ney, "The RWTH 2009 QUAERO ASR evaluation system for English and German, " in Proc. of Interspeech, 2010, pp. 1517-1520.
-
(2010)
Proc. of Interspeech
, pp. 1517-1520
-
-
Nußbaum-Thom, M.1
Wiesler, S.2
Sundermeyer, M.3
Plahl, C.4
Hahn, S.5
Schluter, R.6
Ney, H.7
-
26
-
-
80051609102
-
The RWTH 2010 QUAERO ASR evaluation system for English, French, and German
-
M. Sundermeyer, M. Nußbaum-Thom, S.Wiesler, C. Plahl, A.-D. Mousa, S. Hahn, D. Nolden, R. Schluter, and H. Ney, "The RWTH 2010 QUAERO ASR Evaluation System for English, French, and German, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2011, pp. 2212-2215.
-
(2011)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, pp. 2212-2215
-
-
Sundermeyer, M.1
Nußbaum-Thom, M.2
Wiesler, S.3
Plahl, C.4
Mousa, A.-D.5
Hahn, S.6
Nolden, D.7
Schluter, R.8
Ney, H.9
-
28
-
-
84878410921
-
RASR - The RWTH Aachen university open source speech recognition toolkit
-
D. Rybach, S. Hahn, P. Lehnen, D. Nolden, M. Sundermeyer, Z. Tuske, S. Wiesler, R. Schluter, and H. Ney, "RASR - The RWTH Aachen University Open Source Speech Recognition Toolkit, " in Proc. of IEEE Automatic Speech Recognition and Understanding Workshop, 2011.
-
(2011)
Proc. of IEEE Automatic Speech Recognition and Understanding Workshop
-
-
Rybach, D.1
Hahn, S.2
Lehnen, P.3
Nolden, D.4
Sundermeyer, M.5
Tuske, Z.6
Wiesler, S.7
Schluter, R.8
Ney, H.9
-
29
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition, " Computer Speech and Language, vol. 12, pp. 75-98, 1998.
-
(1998)
Computer Speech and Language
, vol.12
, pp. 75-98
-
-
Gales, M.J.F.1
-
30
-
-
33646759965
-
Adaptive training using simple target models
-
G. Stemmer, F. Brugnara, and D. Giuliani, "Adaptive training using simple target models, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, vol. 1, 2005, pp. 997-1000.
-
(2005)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, vol.1
, pp. 997-1000
-
-
Stemmer, G.1
Brugnara, F.2
Giuliani, D.3
-
31
-
-
84865756657
-
Hybrid language models using mixed types of sub-lexical units for open vocabulary german LVCSR
-
M. A. B. Shaik, A. E.-D. Mousa, R. Schluter, and H. Ney, "Hybrid language models using mixed types of sub-lexical units for open vocabulary German LVCSR, " in Proc. of Interspeech, 2011, pp. 1441-1444.
-
(2011)
Proc. of Interspeech
, pp. 1441-1444
-
-
Shaik, M.A.B.1
Mousa, A.E.-D.2
Schluter, R.3
Ney, H.4
-
32
-
-
80051609913
-
Using morpheme and syllable based sub-words for polish LVCSR
-
M. A. B. Shaik, A. E.-D. Mousa, R. Schluter, and H. Ney, "Using morpheme and syllable based sub-words for Polish LVCSR, " in Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2011, pp. 4680-4683.
-
(2011)
Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, pp. 4680-4683
-
-
Shaik, M.A.B.1
Mousa, A.E.-D.2
Schluter, R.3
Ney, H.4
|