-
1
-
-
0004244302
-
-
Prentice Hall, Englewood Cliffs, NJ, USA
-
L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition, Prentice Hall, Englewood Cliffs, NJ, USA, 1993.
-
(1993)
Fundamentals of Speech Recognition
-
-
Rabiner, L.R.1
Juang, B.H.2
-
2
-
-
0027167185
-
A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition
-
Minneapolis, Minn, USA, April
-
K. Aikawa, H. Singer, H. Kawahara, and Y. Tohkura, "A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '93), vol. 2, pp. 668-671, Minneapolis, Minn, USA, April 1993.
-
(1993)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '93)
, vol.2
, pp. 668-671
-
-
Aikawa, K.1
Singer, H.2
Kawahara, H.3
Tohkura, Y.4
-
3
-
-
0002735918
-
Optimization of time-frequency masking filters using the minimum classification error criterion
-
Adelaide, SA, Australia, April
-
M. Bacchiani and K. Aikawa, "Optimization of time-frequency masking filters using the minimum classification error criterion," in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '94), vol. 2, pp. 197-200, Adelaide, SA, Australia, April 1994.
-
(1994)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '94)
, vol.2
, pp. 197-200
-
-
Bacchiani, M.1
Aikawa, K.2
-
4
-
-
0025041264
-
Perceptual linear predictive (PLP) analysis of speech
-
H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752, 1990.
-
(1990)
Journal of the Acoustical Society of America
, vol.87
, Issue.4
, pp. 1738-1752
-
-
Hermansky, H.1
-
5
-
-
0028312802
-
Auditory models and human performance in tasks related to speech coding and speech recognition
-
O. Ghitza, "Auditory models and human performance in tasks related to speech coding and speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 1, part 2, pp. 115-132, 1994.
-
(1994)
IEEE Transactions on Speech and Audio Processing
, vol.2
, Issue.1 PART 2
, pp. 115-132
-
-
Ghitza, O.1
-
6
-
-
0023841401
-
Vowel processing by a model of the auditory periphery: A comparison to eighth-nerve responses
-
K. L. Payton, "Vowel processing by a model of the auditory periphery: a comparison to eighth-nerve responses," The Journal of the Acoustical Society of America, vol. 83, no. 1, pp. 145-162, 1988.
-
(1988)
The Journal of the Acoustical Society of America
, vol.83
, Issue.1
, pp. 145-162
-
-
Payton, K.L.1
-
7
-
-
79251542316
-
A computational model of filtering, detection, and compression in the cochlea
-
Paris, France, May
-
R. Lyon, "A computational model of filtering, detection, and compression in the cochlea," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '82), vol. 7, pp. 1282-1285, Paris, France, May 1982.
-
(1982)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '82)
, vol.7
, pp. 1282-1285
-
-
Lyon, R.1
-
8
-
-
84928837806
-
A joint synchrony/mean-rate model of auditory speech processing
-
S. Seneff, "A joint synchrony/mean-rate model of auditory speech processing," Journal of Phonetics, vol. 16, no. 1, pp. 55-76, 1988.
-
(1988)
Journal of Phonetics
, vol.16
, Issue.1
, pp. 55-76
-
-
Seneff, S.1
-
9
-
-
0024392496
-
Application of an auditory model to speech recognition
-
J. R. Cohen, "Application of an auditory model to speech recognition," The Journal of the Acoustical Society of America, vol. 85, no. 6, pp. 2623-2629, 1989.
-
(1989)
The Journal of the Acoustical Society of America
, vol.85
, Issue.6
, pp. 2623-2629
-
-
Cohen, J.R.1
-
10
-
-
0026400245
-
An investigation of PLP and IMELDA acoustic representations and of their potential for combination
-
Toronto, Ont, Canada, May
-
M. J. Hunt, S. M. Richardson, D. C. Bateman, and A. Piau, "An investigation of PLP and IMELDA acoustic representations and of their potential for combination," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '91), vol. 2, pp. 881-884, Toronto, Ont, Canada, May 1991.
-
(1991)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '91)
, vol.2
, pp. 881-884
-
-
Hunt, M.J.1
Richardson, S.M.2
Bateman, D.C.3
Piau, A.4
-
11
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357-366, 1980.
-
(1980)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.B.1
Mermelstein, P.2
-
12
-
-
13544259544
-
On the usefulness of STFT phase spectrum in human listening tests
-
K. K. Paliwal and L. D. Alsteris, "On the usefulness of STFT phase spectrum in human listening tests," Speech Communication, vol. 45, no. 2, pp. 153-170, 2005.
-
(2005)
Speech Communication
, vol.45
, Issue.2
, pp. 153-170
-
-
Paliwal, K.K.1
Alsteris, L.D.2
-
13
-
-
33745196479
-
Some experiments on iterative reconstruction of speech from STFT phase and magnitude spectra
-
Lisbon, Portugal, September
-
L. D. Alsteris and K. K. Paliwal, "Some experiments on iterative reconstruction of speech from STFT phase and magnitude spectra," in Proceedings of 9th European Conference on Speech Communication and Technology (EUROSPEECH '05), pp. 337-340, Lisbon, Portugal, September 2005.
-
(2005)
Proceedings of 9th European Conference on Speech Communication and Technology (EUROSPEECH '05)
, pp. 337-340
-
-
Alsteris, L.D.1
Paliwal, K.K.2
-
14
-
-
0141480080
-
The modified group delay function and its application to phoneme recognition
-
Hong Kong, April
-
H. A. Murthy and V. R. R. Gadde, "The modified group delay function and its application to phoneme recognition," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03), vol. 1, pp. 68-71, Hong Kong, April 2003.
-
(2003)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03)
, vol.1
, pp. 68-71
-
-
Murthy, H.A.1
Gadde, V.R.R.2
-
15
-
-
4544293687
-
Application of the modified group delay function to speaker identification and discrimination
-
Montreal, Quebec, Canada
-
R. M. Hegde, H. A. Murthy, and V. R. R. Gadde, "Application of the modified group delay function to speaker identification and discrimination," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), vol. 1, pp. 517-520, Montreal, Quebec, Canada, 2004.
-
(2004)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04)
, vol.1
, pp. 517-520
-
-
Hegde, R.M.1
Murthy, H.A.2
Gadde, V.R.R.3
-
16
-
-
85009067731
-
Continuous speech recognition using joint features derived from the modified group delay function and MFCC
-
Jeju Island, Korea, October
-
R. M. Hegde, H. A. Murthy, and V. R. R. Gadde, "Continuous speech recognition using joint features derived from the modified group delay function and MFCC," in Proceedings of 8th International Conference on Spoken Language Processing (INTERSPEECH '04), vol. 2, pp. 905-908, Jeju Island, Korea, October 2004.
-
(2004)
Proceedings of 8th International Conference on Spoken Language Processing (INTERSPEECH '04)
, vol.2
, pp. 905-908
-
-
Hegde, R.M.1
Murthy, H.A.2
Gadde, V.R.R.3
-
17
-
-
85009070582
-
The modified group delay feature: A new spectral representation of speech
-
Jeju Island, Korea, October
-
R. M. Hegde, H. A. Murthy, and V. R. R. Gadde, "The modified group delay feature: a new spectral representation of speech," in Proceedings of 8th International Conference on Spoken Language Processing (INTERSPEECH '04), vol. 2, pp. 913-916, Jeju Island, Korea, October 2004.
-
(2004)
Proceedings of 8th International Conference on Spoken Language Processing (INTERSPEECH '04)
, vol.2
, pp. 913-916
-
-
Hegde, R.M.1
Murthy, H.A.2
Gadde, V.R.R.3
-
19
-
-
33646756506
-
Speech processing using joint features derived from the modified group delay function
-
Philadelphia, Pa, USA, March
-
R. M. Hegde, H. A. Murthy, and V. R. R. Gadde, "Speech processing using joint features derived from the modified group delay function," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '05), vol. 1, pp. 541-544, Philadelphia, Pa, USA, March 2005.
-
(2005)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '05)
, vol.1
, pp. 541-544
-
-
Hegde, R.M.1
Murthy, H.A.2
Gadde, V.R.R.3
-
20
-
-
84892189317
-
Multi-band speech recognition in noisy environments
-
Seattle, Wash, USA, May
-
S. Okawa, E. Bocchieri, and A. Potamianos, "Multi-band speech recognition in noisy environments," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '98), vol. 2, pp. 641-644, Seattle, Wash, USA, May 1998.
-
(1998)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '98)
, vol.2
, pp. 641-644
-
-
Okawa, S.1
Bocchieri, E.2
Potamianos, A.3
-
21
-
-
74549174907
-
Feature stream combination before and/or after the acoustic model
-
Tech. Rep. TR-00-007, International Computer Science Institute, Berkeley, Calif, USA
-
D. Ellis, "Feature stream combination before and/or after the acoustic model," Tech. Rep. TR-00-007, International Computer Science Institute, Berkeley, Calif, USA, 2000.
-
(2000)
-
-
Ellis, D.1
-
22
-
-
33845961024
-
Speech recognition using heterogenous information extraction in multi-stream based systems
-
Ph.D. dissertation, Aalborg University, Aalborg, Denmark
-
H. Christensen, "Speech recognition using heterogenous information extraction in multi-stream based systems," Ph.D. dissertation, Aalborg University, Aalborg, Denmark, 2002.
-
(2002)
-
-
Christensen, H.1
-
23
-
-
0030682292
-
Recognizing reverberant speech with RASTA-PLP
-
Munich, Germany, April
-
B. E. D. Kingsbury and N. Morgan, "Recognizing reverberant speech with RASTA-PLP," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97), vol. 2, pp. 1259-1262, Munich, Germany, April 1997.
-
(1997)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)
, vol.2
, pp. 1259-1262
-
-
Kingsbury, B.E.D.1
Morgan, N.2
-
24
-
-
84892186467
-
Incorporating information from syllable-length time scales into automatic speech recognition
-
Seattle, Wash, USA, May
-
S.-L. Wu, B. E. D. Kingsbury, N. Morgan, and S. Greenberg, "Incorporating information from syllable-length time scales into automatic speech recognition," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '98), vol. 2, pp. 721-724, Seattle, Wash, USA, May 1998.
-
(1998)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '98)
, vol.2
, pp. 721-724
-
-
Wu, S.-L.1
Kingsbury, B.E.D.2
Morgan, N.3
Greenberg, S.4
-
25
-
-
84994350739
-
Multi-stream speech recognition: Ready for prime time?
-
Budapest, Hungary, September
-
A. Janin, D. Ellis, and N. Morgan, "Multi-stream speech recognition: ready for prime time?" in Proceedings of 6th European Conference on Speech Communication and Technology (EUROSPEECH '99), pp. 591-594, Budapest, Hungary, September 1999.
-
(1999)
Proceedings of 6th European Conference on Speech Communication and Technology (EUROSPEECH '99)
, pp. 591-594
-
-
Janin, A.1
Ellis, D.2
Morgan, N.3
-
26
-
-
0032627223
-
-
K. Kirchhoff and J. A. Bilmes, Dynamic classifier combination in hybrid speech recognition systems using utterance-level confidence values, in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '99), vol. 2, pp. 693-696, Phoenix, Ariz, USA, March 1999.
-
K. Kirchhoff and J. A. Bilmes, "Dynamic classifier combination in hybrid speech recognition systems using utterance-level confidence values," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '99), vol. 2, pp. 693-696, Phoenix, Ariz, USA, March 1999.
-
-
-
-
27
-
-
28244462378
-
-
Speech and Vision Lab, IIT Madras, Chennai, India
-
Database for Indian Languages, Speech and Vision Lab, IIT Madras, Chennai, India, 2001.
-
(2001)
Database for Indian Languages
-
-
-
29
-
-
0025680225
-
-
C. Jankowski, A. Kalyanswamy, S. Basson, and J. Spitz, NTIMIT: a phonetically balanced, continuous speech, telephone bandwidth speech database, in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '90), vol. 1, pp. 109-112, Albuquerque, NM, USA, April 1990.
-
C. Jankowski, A. Kalyanswamy, S. Basson, and J. Spitz, "NTIMIT: a phonetically balanced, continuous speech, telephone bandwidth speech database," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '90), vol. 1, pp. 109-112, Albuquerque, NM, USA, April 1990.
-
-
-
-
30
-
-
33845936606
-
Time and frequency pruning for speaker identification
-
Brisbane, Qld, Australia, August
-
L. Besacier and J. F. Bonastre, "Time and frequency pruning for speaker identification," in Proceedings of the 14th International Conference on Pattern Recognition (ICPR '98), vol. 2, pp. 1619-1621, Brisbane, Qld, Australia, August 1998.
-
(1998)
Proceedings of the 14th International Conference on Pattern Recognition (ICPR '98)
, vol.2
, pp. 1619-1621
-
-
Besacier, L.1
Bonastre, J.F.2
-
31
-
-
0028996867
-
CTIMIT: A speech corpus for the cellular environment with applications to automatic speech recognition
-
Detroit, Mich, USA, May
-
K. L. Brown and E. B. George, "CTIMIT: a speech corpus for the cellular environment with applications to automatic speech recognition," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '95), vol. 1, pp. 105-108, Detroit, Mich, USA, May 1995.
-
(1995)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '95)
, vol.1
, pp. 105-108
-
-
Brown, K.L.1
George, E.B.2
-
32
-
-
85081009009
-
The OGI multi-language telephone speech corpus
-
Banff, Alberta, Canada, October
-
Y. K. Muthusamy, R. A. Cole, and B. T. Oshika, "The OGI multi-language telephone speech corpus," in Proceedings of the 2nd International Conference on Spoken Language Processing (ICSLP '92), pp. 895-898, Banff, Alberta, Canada, October 1992.
-
(1992)
Proceedings of the 2nd International Conference on Spoken Language Processing (ICSLP '92)
, pp. 895-898
-
-
Muthusamy, Y.K.1
Cole, R.A.2
Oshika, B.T.3
-
33
-
-
33751561891
-
Linear and order statistics combiners for reliable pattern classification
-
Ph.D. dissertation, University of Texas at Austin, Austin, Tex, USA, May
-
K. Turner, "Linear and order statistics combiners for reliable pattern classification," Ph.D. dissertation, University of Texas at Austin, Austin, Tex, USA, May 1996.
-
(1996)
-
-
Turner, K.1
-
34
-
-
0000926506
-
When networks disagree: Ensemble methods for hybrid neural networks
-
Chapman-Hall, London, UK
-
M. P. Perrone and L. N. Cooper, "When networks disagree: ensemble methods for hybrid neural networks," in Neural Networks for Speech and Image Processing, pp. 126-142, Chapman-Hall, London, UK, 1993.
-
(1993)
Neural Networks for Speech and Image Processing
, pp. 126-142
-
-
Perrone, M.P.1
Cooper, L.N.2
-
35
-
-
85009124169
-
Analysis of the root-cepstrum for acoustic modeling and fast decoding in speech recognition
-
Aalborg, Denmark, September
-
R. Sarikaya and J. H. L. Hansen, "Analysis of the root-cepstrum for acoustic modeling and fast decoding in speech recognition," in Proceedings of the 7th European Conference on Speech Communication and Technology (EUROSPEECH '01), pp. 687-690, Aalborg, Denmark, September 2001.
-
(2001)
Proceedings of the 7th European Conference on Speech Communication and Technology (EUROSPEECH '01)
, pp. 687-690
-
-
Sarikaya, R.1
Hansen, J.H.L.2
-
36
-
-
85054435084
-
Neural network ensembles, cross validation, and active learning
-
MIT Press, Cambridge, Mass, USA
-
A. Krogh and J. Vedelsby, "Neural network ensembles, cross validation, and active learning," in Advances in Neural Information Processing Systems, vol. 7, pp. 231-238, MIT Press, Cambridge, Mass, USA, 1995.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 231-238
-
-
Krogh, A.1
Vedelsby, J.2
-
37
-
-
0026204672
-
Formant extraction from group delay function
-
H. A. Murthy and B. Yegnanarayana, "Formant extraction from group delay function," Speech Communication, vol. 10, no. 3, pp. 209-221, 1991.
-
(1991)
Speech Communication
, vol.10
, Issue.3
, pp. 209-221
-
-
Murthy, H.A.1
Yegnanarayana, B.2
-
38
-
-
0021439544
-
Significance of group delay functions in signal reconstruction from spectral magnitude or phase
-
B. Yegnanarayana, D. K. Saikia, and T. R. Krishnan, "Significance of group delay functions in signal reconstruction from spectral magnitude or phase," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 32, no. 3, pp. 610-623, 1984.
-
(1984)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.32
, Issue.3
, pp. 610-623
-
-
Yegnanarayana, B.1
Saikia, D.K.2
Krishnan, T.R.3
-
39
-
-
1842475640
-
Automatic segmentation of continuous speech using minimum phase group delay functions
-
V. K. Prasad, T. Nagarajan, and H. A. Murthy, "Automatic segmentation of continuous speech using minimum phase group delay functions," Speech Communication, vol. 42, no. 3-4, pp. 429-446, 2004.
-
(2004)
Speech Communication
, vol.42
, Issue.3-4
, pp. 429-446
-
-
Prasad, V.K.1
Nagarajan, T.2
Murthy, H.A.3
-
40
-
-
0026923568
-
Significance of group delay functions in spectrum estimation
-
B. Yegnanarayana and H. A. Murthy, "Significance of group delay functions in spectrum estimation," IEEE Transactions on Signal Processing, vol. 40, no. 9, pp. 2281-2289, 1992.
-
(1992)
IEEE Transactions on Signal Processing
, vol.40
, Issue.9
, pp. 2281-2289
-
-
Yegnanarayana, B.1
Murthy, H.A.2
-
41
-
-
0003685404
-
-
Academic Press, San Diego, Calif, USA
-
P. Yip and K. R. Rao, Discrete Cosine Transform: Algorithms, Advantages, and Applications, Academic Press, San Diego, Calif, USA, 1997.
-
(1997)
Discrete Cosine Transform: Algorithms, Advantages, and Applications
-
-
Yip, P.1
Rao, K.R.2
-
42
-
-
0004319975
-
Acoustical and environmental robustness in automatic speech recognition
-
Ph.D. dissertation, Carnegie Mellon University, Pittsburgh, Pa, USA
-
A. Acero, "Acoustical and environmental robustness in automatic speech recognition," Ph.D. dissertation, Carnegie Mellon University, Pittsburgh, Pa, USA, 1990.
-
(1990)
-
-
Acero, A.1
-
43
-
-
0032595177
-
Robust text-independent speaker identification over telephone channels
-
H. A. Murthy, R. Beaufays, L. P. Heck, and M. Weintraub, "Robust text-independent speaker identification over telephone channels," IEEE Transactions on Speech and Audio Processing, vol. 7, no. 5, pp. 554-568, 1999.
-
(1999)
IEEE Transactions on Speech and Audio Processing
, vol.7
, Issue.5
, pp. 554-568
-
-
Murthy, H.A.1
Beaufays, R.2
Heck, L.P.3
Weintraub, M.4
-
44
-
-
0027622158
-
Root cepstral analysis: A unified view. Application to speech processing in car noise environments
-
P. Alexandre and P. Lockwood, "Root cepstral analysis: a unified view. Application to speech processing in car noise environments," Speech Communication, vol. 12, no. 3, pp. 277-288, 1993.
-
(1993)
Speech Communication
, vol.12
, Issue.3
, pp. 277-288
-
-
Alexandre, P.1
Lockwood, P.2
-
45
-
-
0141589377
-
-
SRI: Menlo Park, Calif, USA
-
V. R. R. Gadde, A. Stolcke, J. Z. D. Vergyri, K. Sonmez, and A. Venkatraman, "The SRI SPINE 2001 Evaluation System," SRI: Menlo Park, Calif, USA, 2001.
-
(2001)
The SRI SPINE 2001 Evaluation System
-
-
Gadde, V.R.R.1
Stolcke, A.2
Vergyri, J.Z.D.3
Sonmez, K.4
Venkatraman, A.5