-
1
-
-
84865767134
-
Rapid evaluation of speech representations for spoken term discovery
-
M. A. Carlin, S. Thomas, A. Jansen, and H. Hermansky, "Rapid evaluation of speech representations for spoken term discovery, " in Proceedings of Interspeech, 2011.
-
(2011)
Proceedings of Interspeech
-
-
Carlin, M.A.1
Thomas, S.2
Jansen, A.3
Hermansky, H.4
-
2
-
-
84055212007
-
Sparse multilayer perceptron for phoneme recognition
-
G. Sivaram and H. Hermansky, "Sparse multilayer perceptron for phoneme recognition, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 23-29, 2012.
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.1
, pp. 23-29
-
-
Sivaram, G.1
Hermansky, H.2
-
3
-
-
84855980817
-
New nonsense syllables database analyses and preliminary ASR experiments
-
P. Fousek, P. Svojanovsky, F. Grezl, and H. Hermansky, "New nonsense syllables database analyses and preliminary asr experiments, " in Proceedings of the International Conference on Spoken Language Processing (ICSLP), 2004, pp. 2004-29.
-
(2004)
Proceedings of the International Conference on Spoken Language Processing (ICSLP)
, pp. 2004-2029
-
-
Fousek, P.1
Svojanovsky, P.2
Grezl, F.3
Hermansky, H.4
-
5
-
-
0025041264
-
Perceptual linear predictive (PLP) analysis of speech
-
H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech, " The Journal of the Acoustical Society of America, vol. 87, pp. 1738-1752, 1990.
-
(1990)
The Journal of the Acoustical Society of America
, vol.87
, pp. 1738-1752
-
-
Hermansky, H.1
-
6
-
-
0038133939
-
Distance measures for speech recognition, psychological and instrumental
-
P. Mermelstein, "Distance measures for speech recognition, psychological and instrumental, " Pattern recognition and artificial intelligence, vol. 116, pp. 91-103, 1976.
-
(1976)
Pattern Recognition and Artificial Intelligence
, vol.116
, pp. 91-103
-
-
Mermelstein, P.1
-
7
-
-
79955978656
-
ASR systems in noisy environment: Analysis and solutions for increasing noise robustness
-
J. Rajnoha and P. Pollak, "ASR systems in noisy environment: Analysis and solutions for increasing noise robustness, " Radioengineering, vol. 20, no. 1, pp. 74-84, 2011.
-
(2011)
Radioengineering
, vol.20
, Issue.1
, pp. 74-84
-
-
Rajnoha, J.1
Pollak, P.2
-
8
-
-
0028517164
-
RASTA processing of speech
-
H. Hermansky and N. Morgan, "RASTA processing of speech, " IEEE Transactions on Speech and Audio Processing, vol. 2, no. 4, pp. 578-589, 1994.
-
(1994)
IEEE Transactions on Speech and Audio Processing
, vol.2
, Issue.4
, pp. 578-589
-
-
Hermansky, H.1
Morgan, N.2
-
9
-
-
84867214871
-
-
(and MFCC, and inversion) in Matlab, [Online]. Available
-
D. P. W. Ellis, "PLP and RASTA (and MFCC, and inversion) in Matlab, " 2005. [Online]. Available: http://www.ee.columbia.edu/dpwe/resources/ matlab/rastamat/.
-
(2005)
PLP and RASTA
-
-
Ellis, D.P.W.1
-
10
-
-
84890488932
-
A summary of the 2012 JH CLSP workshop on zero resource speech technologies and models of early language acquisition
-
A. Jansen, E. Dupoux, S. Goldwater, M. Johnson, S. Khudanpur, K. Church, N. Feldman, H. Hermansky, F. Metze, R. Rose, M. Seltzer, P. Clark, I. McGraw, B. Varadarajan, E. Bennett, B. Borschinger, J. Chiu, E. Dunbar, A. Fourtassi, D. Harwath, C.-y. Lee, K. Levin, A. Norouzian, V. Peddinti, R. Richardson, T. Schatz, and S. Thomas, "A summary of the 2012 JH CLSP workshop on zero resource speech technologies and models of early language acquisition, " in Proceedings of ICASSP 2013, 2013.
-
(2013)
Proceedings of ICASSP 2013
-
-
Jansen, A.1
Dupoux, E.2
Goldwater, S.3
Johnson, M.4
Khudanpur, S.5
Church, K.6
Feldman, N.7
Hermansky, H.8
Metze, F.9
Rose, R.10
Seltzer, M.11
Clark, P.12
McGraw, I.13
Varadarajan, B.14
Bennett, E.15
Borschinger, B.16
Chiu, J.17
Dunbar, E.18
Fourtassi, A.19
Harwath, D.20
Lee, C.-Y.21
Levin, K.22
Norouzian, A.23
Peddinti, V.24
Richardson, R.25
Schatz, T.26
Thomas, S.27
more..
-
12
-
-
77950593007
-
Unsupervised learning of acoustic sub-word units
-
B. Varadarajan, S. Khudanpur, and E. Dupoux, "Unsupervised learning of acoustic sub-word units, " in Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, 2008, pp. 165- 168.
-
(2008)
Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
, pp. 165-168
-
-
Varadarajan, B.1
Khudanpur, S.2
Dupoux, E.3
-
13
-
-
84878421054
-
Intrinsic spectral analysis for zero and high resource speech recognition
-
A. Jansen, S. Thomas, and H. Hermansky, "Intrinsic spectral analysis for zero and high resource speech recognition, " in Proceedings of Interspeech, 2012.
-
(2012)
Proceedings of Interspeech
-
-
Jansen, A.1
Thomas, S.2
Hermansky, H.3
-
14
-
-
84867809023
-
A nonparametric Bayesian approach to acoustic model discovery
-
C.-y. Lee and J. Glass, "A nonparametric bayesian approach to acoustic model discovery, " in Proceedings of ACL, 2012.
-
(2012)
Proceedings of ACL
-
-
Lee, C.-Y.1
Glass, J.2
-
15
-
-
25444478852
-
A functional model of neural activity patterns and auditory images
-
R. D. Patterson and J. Holdsworth, "A functional model of neural activity patterns and auditory images, " Advances in speech, hearing and language processing, vol. 3, pp. 547-563, 1996.
-
(1996)
Advances in Speech, Hearing and Language Processing
, vol.3
, pp. 547-563
-
-
Patterson, R.D.1
Holdsworth, J.2
-
16
-
-
79251542316
-
A computational model of filtering, detection, and compression in the cochlea
-
R. Lyon, "A computational model of filtering, detection, and compression in the cochlea, " in IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP'82., vol. 7, 1982, pp. 1282-1285.
-
(1982)
IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP'82
, vol.7
, pp. 1282-1285
-
-
Lyon, R.1
-
17
-
-
30244474384
-
A joint synchrony/mean-rate model of auditory speech processing
-
S. Seneff, "A joint synchrony/mean-rate model of auditory speech processing, " in Readings in speech recognition, 1990, pp. 101- 111.
-
(1990)
Readings in Speech Recognition
, pp. 101-111
-
-
Seneff, S.1
-
18
-
-
23744508888
-
Multiresolution spectrotemporal analysis of complex sounds
-
T. Chi, P. Ru, and S. A. Shamma, "Multiresolution spectrotemporal analysis of complex sounds, " The Journal of the Acoustical Society of America, vol. 118, pp. 887-906, 2005.
-
(2005)
The Journal of the Acoustical Society of America
, vol.118
, pp. 887-906
-
-
Chi, T.1
Ru, P.2
Shamma, S.A.3
-
19
-
-
26244461684
-
Clustering with Bregman divergences
-
A. Banerjee, S. Merugu, I. S. Dhillon, and J. Ghosh, "Clustering with bregman divergences, " The Journal of Machine Learning Research, vol. 6, pp. 1705-1749, 2005.
-
(2005)
The Journal of Machine Learning Research
, vol.6
, pp. 1705-1749
-
-
Banerjee, A.1
Merugu, S.2
Dhillon, I.S.3
Ghosh, J.4
|