-
1
-
-
0028378020
-
Applications of voice processing to telecommunications
-
L. Rabiner, "Applications of voice processing to telecommunications," in Proc. IEEE, vol. 82, no. 2, pp. 199-228, Feb. 1994.
-
(1994)
Proc. IEEE
, vol.82
, Issue.2
, pp. 199-228
-
-
Rabiner, L.1
-
2
-
-
0003572996
-
-
Ph.D. thesis, Carnegie Mellon Univ., Pittsburgh, PA, Apr.
-
P. Brown, "The acoustic-modeling problem in automatic speech recognition," Ph.D. thesis, Carnegie Mellon Univ., Pittsburgh, PA, Apr. 1987.
-
(1987)
The acoustic-modeling problem in automatic speech recognition
-
-
Brown, P.1
-
3
-
-
0027579316
-
Discriminative training of dynamic programming based speech recognizers
-
Apr.
-
P. Chang and B. Juang, "Discriminative training of dynamic programming based speech recognizers," IEEE Trans. Speech Audio Processing, vol. 1, no. 2, pp. 135-143, Apr. 1993.
-
(1993)
IEEE Trans. Speech Audio Processing
, vol.1
, Issue.2
, pp. 135-143
-
-
Chang, P.1
Juang, B.2
-
4
-
-
0025588058
-
A probabilistic acoustic MAP based discriminative HMM training
-
E. Huang and F. Soong, "A probabilistic acoustic MAP based discriminative HMM training," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 1990, pp. 693-696.
-
(1990)
Proc. Int. Conf. Acoustics, Speech, Signal Processing
, pp. 693-696
-
-
Huang, E.1
Soong, F.2
-
5
-
-
0028195650
-
Speech recognition using weighted HMM and subspace projection approaches
-
Jan.
-
K. Su and C. Lee, "Speech recognition using weighted HMM and subspace projection approaches," IEEE Trans. Speech Audio Processing, vol. 2, no. 1, pp. 69-79, Jan. 1994.
-
(1994)
IEEE Trans. Speech Audio Processing
, vol.2
, Issue.1
, pp. 69-79
-
-
Su, K.1
Lee, C.2
-
6
-
-
0001462521
-
A cross-language study of voicing in initial stops: Acoustical measurements
-
L. Lisker and S. Abramson, "A cross-language study of voicing in initial stops: Acoustical measurements," Word, vol. 20, pp. 384-422, 1964.
-
(1964)
Word
, vol.20
, pp. 384-422
-
-
Lisker, L.1
Abramson, S.2
-
7
-
-
0003467241
-
-
Ph.D. thesis, Mass. Inst. of Technol., Cambridge, MA, May
-
V. Zue, "Acoustic characteristics of stop consonants: A controlled study," Ph.D. thesis, Mass. Inst. of Technol., Cambridge, MA, May 1976.
-
(1976)
Acoustic characteristics of stop consonants: A controlled study
-
-
Zue, V.1
-
10
-
-
0001559782
-
Analysis of nasal consonants
-
Dec.
-
O. Fujimura, "Analysis of nasal consonants," J. Acoust. Soc. Amer., vol. 34, no. 12, pp. 1865-1875, Dec. 1962.
-
(1962)
J. Acoust. Soc. Amer.
, vol.34
, Issue.12
, pp. 1865-1875
-
-
Fujimura, O.1
-
11
-
-
0008457913
-
Speech coding and recognition: A review
-
Feb.
-
A. Spanias and F. Wu, "Speech coding and recognition: A review," IEICE Trans. Fundamentals, vol. E75-A, no. 2, pp. 132-148, Feb. 1992.
-
(1992)
IEICE Trans. Fundamentals
, vol.E75-A
, Issue.2
, pp. 132-148
-
-
Spanias, A.1
Wu, F.2
-
12
-
-
0016467604
-
Minimum prediction residual applied to speech recognition
-
F. Itakura, "Minimum prediction residual applied to speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-23, no. 1, pp. 67-72, Feb. 1975.
-
(1975)
IEEE Trans. Acoust., Speech, Signal Processing
, vol.ASSP-23
, Issue.1
, pp. 67-72
-
-
Itakura, F.1
-
13
-
-
0018656519
-
Speaker independent recognition of isolated words using clustering techniques
-
L. Rabiner, S. Levinson, A. Rosenberg, and J. Wilpon, "Speaker independent recognition of isolated words using clustering techniques," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-27, pp. 336-349, Aug. 1979.
-
(1979)
IEEE Trans. Acoust., Speech, Signal Processing
, vol.ASSP-27
, pp. 336-349
-
-
Rabiner, L.1
Levinson, S.2
Rosenberg, A.3
Wilpon, J.4
-
14
-
-
0019680113
-
Isolated word recognition using a two-pass pattern recognition approach
-
L. Rabiner and J. Wilpon, "Isolated word recognition using a two-pass pattern recognition approach," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, 1981, pp. 724-727.
-
(1981)
Proc. Int. Conf. Acoustics, Speech, Signal Processing
, vol.2
, pp. 724-727
-
-
Rabiner, L.1
Wilpon, J.2
-
15
-
-
85061400113
-
Comparison of learning techniques in speech recognition
-
G. Bradshaw, R. Cole, and L. Zi, "Comparison of learning techniques in speech recognition," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 1982, pp. 554-557.
-
(1982)
Proc. Int. Conf. Acoustics, Speech, Signal Processing
, pp. 554-557
-
-
Bradshaw, G.1
Cole, R.2
Zi, L.3
-
16
-
-
33646908717
-
Performance improvement in a dynamic-programming based isolated word recognition system for the alpha-digit task
-
L. Lamel and V. Zue, "Performance improvement in a dynamic-programming based isolated word recognition system for the alpha-digit task," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1982, pp. 558-561.
-
(1982)
Proc. Int. Conf. Acoust., Speech, Signal Processing
, pp. 558-561
-
-
Lamel, L.1
Zue, V.2
-
17
-
-
0001887625
-
Performing fine phonetic distinctions: Templates vs. features
-
J. Perkell and D. Klatt, Eds. New York: Lawrence Erlbaum
-
R. Cole, R. Stern, and M. Lasry, "Performing fine phonetic distinctions: Templates vs. features," in Invariance and Variability of Speech Processes, J. Perkell and D. Klatt, Eds. New York: Lawrence Erlbaum, 1986, pp. 325-341.
-
(1986)
Invariance and Variability of Speech Processes
, pp. 325-341
-
-
Cole, R.1
Stern, R.2
Lasry, M.3
-
18
-
-
0039627352
-
Speech as patterns on paper
-
R. Cole, Ed. New York: Lawrence Erlbaum
-
R. Cole, A. Rudnicky, V. Zue, and R. Reddy, "Speech as patterns on paper," in Perception and Production of Fluent Speech, R. Cole, Ed. New York: Lawrence Erlbaum, 1978.
-
(1978)
Perception and Production of Fluent Speech
-
-
Cole, R.1
Rudnicky, A.2
Zue, V.3
Reddy, R.4
-
19
-
-
0004989362
-
Some performance benchmarks for isolated word speech recognition systems
-
L. Rabiner and J. Wilpon, "Some performance benchmarks for isolated word speech recognition systems," Comput. Speech Language, vol. 2, pp. 343-357, 1987.
-
(1987)
Comput. Speech Language
, vol.2
, pp. 343-357
-
-
Rabiner, L.1
Wilpon, J.2
-
20
-
-
0025659601
-
Statistical segmentation and word modeling techniques in isolated word recognition
-
S. Euler, B. Juang, C. Lee, and F. Soong, "Statistical segmentation and word modeling techniques in isolated word recognition," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1990, pp. 745-748.
-
(1990)
Proc. Int. Conf. Acoust., Speech, Signal Processing
, pp. 745-748
-
-
Euler, S.1
Juang, B.2
Lee, C.3
Soong, F.4
-
21
-
-
0025557590
-
Speaker-independent recognition of spoken English letters
-
June
-
R. Cole, M. Fanty, Y. Muthusamy, and M. Gopalakrishnan, "Speaker-independent recognition of spoken English letters," in Proc. Int. Joint Conf. Neural Networks, vol. 2, June 1990, pp. 45-51.
-
(1990)
Proc. Int. Joint Conf. Neural Networks
, vol.2
, pp. 45-51
-
-
Cole, R.1
Fanty, M.2
Muthusamy, Y.3
Gopalakrishnan, M.4
-
23
-
-
33646901212
-
English alphabet recognition of telephone speech
-
R. Cole, K. Roginski, and M. Fanty, "English alphabet recognition of telephone speech," in Proc. 2nd Euro. Conf. Speech Commun. Technol., 1991, pp. 24-26.
-
(1991)
Proc. 2nd Euro. Conf. Speech Commun. Technol.
, pp. 24-26
-
-
Cole, R.1
Roginski, K.2
Fanty, M.3
-
24
-
-
0003640523
-
The ISOLET spoken letter database
-
Oregon Graduate Inst.
-
R. Cole, Y. Muthusamy, and M. Fanty, "The ISOLET spoken letter database," Tech. Rep. 90-004, Oregon Graduate Inst., 1990.
-
(1990)
Tech. Rep. 90-004
-
-
Cole, R.1
Muthusamy, Y.2
Fanty, M.3
-
25
-
-
0002583871
-
Speech database development: Design and analysis of the acoustic phonetic corpus
-
L. Lamel, R. Kassel, and S. Seneff, "Speech database development: Design and analysis of the acoustic phonetic corpus," in Proc. DARPA Speech Recognition Workshop, 1986, pp. 100-109.
-
(1986)
Proc. DARPA Speech Recognition Workshop
, pp. 100-109
-
-
Lamel, L.1
Kassel, R.2
Seneff, S.3
-
26
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Aug.
-
B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
-
(1980)
IEEE Trans. Acoust., Speech, Signal Processing
, vol.ASSP-28
, Issue.4
, pp. 357-366
-
-
Davis, B.1
Mermelstein, P.2
-
27
-
-
0025493667
-
The segmental k-means algorithm for estimating parameters of hidden Markov models
-
B. Juang and L. Rabiner, "The segmental k-means algorithm for estimating parameters of hidden Markov models," IEEE Trans. Acoust., Speech, Signal Processing, vol. 38, no. 9, pp. 1639-1641, 1990.
-
(1990)
IEEE Trans. Acoust., Speech, Signal Processing
, vol.38
, Issue.9
, pp. 1639-1641
-
-
Juang, B.1
Rabiner, L.2
-
28
-
-
0001882615
-
Self-organized language modeling for speech recognition
-
A. Waibel and K. Lee, Eds. San Francisco, CA: Morgan Kaufmann
-
F. Jelinek, "Self-organized language modeling for speech recognition," in Readings in Speech Recognition, A. Waibel and K. Lee, Eds. San Francisco, CA: Morgan Kaufmann, 1990, pp. 450-506.
-
(1990)
Readings in Speech Recognition
, pp. 450-506
-
-
Jelinek, F.1
-
29
-
-
0028573857
-
Context-dependent modeling in alphabet recognition
-
P. Loizou and A. Spanias, "Context-dependent modeling in alphabet recognition," in Proc. Int. Symp. Circuits Syst., 1994, pp. 189-192.
-
(1994)
Proc. Int. Symp. Circuits Syst.
, pp. 189-192
-
-
Loizou, P.1
Spanias, A.2
-
30
-
-
0022859679
-
The role of word-dependent coarticulatory effects in a phoneme-based speech recognition system
-
Y. Chow et al., "The role of word-dependent coarticulatory effects in a phoneme-based speech recognition system," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1986, pp. 1593-1596.
-
(1986)
Proc. Int. Conf. Acoust., Speech, Signal Processing
, pp. 1593-1596
-
-
Chow, Y.1
-
31
-
-
0003539541
-
-
Ph.D. thesis, Carnegie Mellon Univ., Pittsburgh, PA, Apr.
-
K. Lee, "Large vocabulary speaker-independent continuous speech recognition: The SPHINX system," Ph.D. thesis, Carnegie Mellon Univ., Pittsburgh, PA, Apr. 1988.
-
(1988)
Large Vocabulary Speaker-independent Continuous Speech Recognition: the SPHINX System
-
-
Lee, K.1
-
33
-
-
0003459982
-
Evaluation of LPC spectral matching measures for phonetic unit recognition
-
Carnegie Mellon Univ.
-
K. Shikano, "Evaluation of LPC spectral matching measures for phonetic unit recognition," Tech. Rep. CMU-CS-86-108, Carnegie Mellon Univ., 1986.
-
(1986)
Tech. Rep. CMU-CS-86-108
-
-
Shikano, K.1
-
34
-
-
0022914334
-
Detection and recognition of nasal consonants in American English
-
J. Glass and V. Zue, "Detection and recognition of nasal consonants in American English," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1986, pp. 2767-2770.
-
(1986)
Proc. Int. Conf. Acoust., Speech, Signal Processing
, pp. 2767-2770
-
-
Glass, J.1
Zue, V.2
-
35
-
-
0021475513
-
Perceptual integration of the murmur and formant transitions for place of articulation in nasal consonants
-
K. Kurowski and S. Blumstein, "Perceptual integration of the murmur and formant transitions for place of articulation in nasal consonants," J. Acoust. Soc. Amer., vol. 76, pp. 383-390, 1984.
-
(1984)
J. Acoust. Soc. Amer.
, vol.76
, pp. 383-390
-
-
Kurowski, K.1
Blumstein, S.2
-
36
-
-
0022523182
-
Perception of the [m]-[n] distinction in CV syllables
-
B. Repp, "Perception of the [m]-[n] distinction in CV syllables," J. Acoust. Soc. Amer., vol. 79, pp. 1987-1999, 1986.
-
(1986)
J. Acoust. Soc. Amer.
, vol.79
, pp. 1987-1999
-
-
Repp, B.1
-
37
-
-
0000629601
-
Acoustic cues for nasal consonants: An experimental study involving a tape-splicing technique
-
A. Malecot, "Acoustic cues for nasal consonants: An experimental study involving a tape-splicing technique," Language, vol. 32, pp. 274-284, 1956.
-
(1956)
Language
, vol.32
, pp. 274-284
-
-
Malecot, A.1
-
38
-
-
0028936631
-
Automatic recognition of syllable-final nasals preceded by /eh/
-
Mar.
-
P. Loizou, M. Dorman, and A. Spanias, "Automatic recognition of syllable-final nasals preceded by /eh/," J. Acoust. Soc. Amer., vol. 97, no. 3, pp. 1925-1928, Mar. 1995.
-
(1995)
J. Acoust. Soc. Amer.
, vol.97
, Issue.3
, pp. 1925-1928
-
-
Loizou, P.1
Dorman, M.2
Spanias, A.3
-
39
-
-
33646920550
-
-
Ph.D. thesis, Arizona State Univ., Tempe, AZ
-
P. Loizou, "Robust speaker-independent recognition of a confusable vocabulary," Ph.D. thesis, Arizona State Univ., Tempe, AZ, 1995.
-
(1995)
Robust speaker-independent recognition of a confusable vocabulary
-
-
Loizou, P.1
-
41
-
-
0002215069
-
On a measure of divergence between two statistical populations defined by their probability distributions
-
A. Bhattacharyya, "On a measure of divergence between two statistical populations defined by their probability distributions," Bull. Calcutta Math. Soc., vol. 35, pp. 99-109, 1943.
-
(1943)
Bull. Calcutta Math. Soc.
, vol.35
, pp. 99-109
-
-
Bhattacharyya, A.1
-
42
-
-
65249157560
-
The divergence and Bhattacharyya distance measures in signal selection
-
T. Kailath, "The divergence and Bhattacharyya distance measures in signal selection," IEEE Trans. Commun. Technol., vol. COM-15, no. 1, pp. 52-60, 1967.
-
(1967)
IEEE Trans. Commun. Technol.
, vol.COM-15
, Issue.1
, pp. 52-60
-
-
Kailath, T.1
-
43
-
-
0000042860
-
Signal selection in communication and radar systems
-
Oct.
-
T. Grettenberg, "Signal selection in communication and radar systems," IEEE Trans. Inform. Theory, vol. IT-9, pp. 265-275, Oct. 1963.
-
(1963)
IEEE Trans. Inform. Theory
, vol.IT-9
, pp. 265-275
-
-
Grettenberg, T.1
-
44
-
-
84914813506
-
On the effectiveness of receptors in recognition systems
-
T. Marill and M. Green, "On the effectiveness of receptors in recognition systems," IEEE Trans. Inform. Theory, vol. IT-9, pp. 11-17, 1963.
-
(1963)
IEEE Trans. Inform. Theory
, vol.IT-9
, pp. 11-17
-
-
Marill, T.1
Green, M.2
-
45
-
-
0009061528
-
Some approaches to optimum feature extraction
-
J. Tou, Ed. New York: Academic
-
J. Tou and R. Heydorn, "Some approaches to optimum feature extraction," in Computer and Information Sciences-II, J. Tou, Ed. New York: Academic, 1967, pp. 57-89.
-
(1967)
Computer and Information Sciences-II
, pp. 57-89
-
-
Tou, J.1
Heydorn, R.2
-
47
-
-
0014604351
-
A class of upper bounds on probability of error for multihypothesis pattern recognition
-
G. Lainiotis, "A class of upper bounds on probability of error for multihypothesis pattern recognition," IEEE Trans. Inform. Theory, vol. IT-15, pp. 730-731, 1969.
-
(1969)
IEEE Trans. Inform. Theory
, vol.IT-15
, pp. 730-731
-
-
Lainiotis, G.1
-
48
-
-
0346838156
-
English alphabet recognition with telephone speech
-
J. Moody, S. Hanson, and R. Lippmann, Eds. San Francisco, CA: Morgan Kaufmann
-
M. Fanty, R. Cole, and K. Roginsky, "English alphabet recognition with telephone speech," in Advances in Neural Information Processing Systems 4, J. Moody, S. Hanson, and R. Lippmann, Eds. San Francisco, CA: Morgan Kaufmann, 1992.
-
(1992)
Advances in Neural Information Processing Systems 4
-
-
Fanty, M.1
Cole, R.2
Roginsky, K.3
-
49
-
-
85135100178
-
A telephone speech database of spelled and spoken names
-
R. Cole, M. Fanty, and K. Roginsky, "A telephone speech database of spelled and spoken names," in Proc. Int. Conf. Spoken Language Processing, 1992, pp. 891-893.
-
(1992)
Proc. Int. Conf. Spoken Language Processing
, pp. 891-893
-
-
Cole, R.1
Fanty, M.2
Roginsky, K.3
-
50
-
-
0025145948
-
Modeling the microsegments of stop consonants in a hidden Markov model based recognizer
-
June
-
L. Deng, M. Lennig, and P. Mermelstein, "Modeling the microsegments of stop consonants in a hidden Markov model based recognizer," J. Acoust. Soc. Amer., vol. 87, no. 6, pp. 2738-2747, June 1990.
-
(1990)
J. Acoust. Soc. Amer.
, vol.87
, Issue.6
, pp. 2738-2747
-
-
Deng, L.1
Lennig, M.2
Mermelstein, P.3
-
51
-
-
0010568388
-
Telephone alphabet recognition for name-retrieval applications
-
Oct.
-
P. Loizou, A. Mekkoth, and A. Spanias, "Telephone alphabet recognition for name-retrieval applications," in Proc. Int. Conf. Signal Processing Applications Technol., vol. II, Oct. 1995, pp. 2014-2018.
-
(1995)
Proc. Int. Conf. Signal Processing Applications Technol.
, vol.2
, pp. 2014-2018
-
-
Loizou, P.1
Mekkoth, A.2
Spanias, A.3
-
52
-
-
1842272975
-
Improved speech recognition using the weighted average divergence measure
-
P. Loizou and A. Spanias, "Improved speech recognition using the weighted average divergence measure," in Proc. Int. Conf. Digital Signal Processing, 1995, pp. 90-95.
-
(1995)
Proc. Int. Conf. Digital Signal Processing
, pp. 90-95
-
-
Loizou, P.1
Spanias, A.2
-
55
-
-
6244257245
-
Comparative study of nonlinear time warping techniques in isolated word speech recognition systems
-
Carnegie Mellon Univ., Pittsburgh, PA
-
A. Waibel and B. Yegnanarayana, "Comparative study of nonlinear time warping techniques in isolated word speech recognition systems," Tech. Rep. CMU-CS-81-125, Carnegie Mellon Univ., Pittsburgh, PA, 1981.
-
(1981)
Tech. Rep. CMU-CS-81-125
-
-
Waibel, A.1
Yegnanarayana, B.2
-
56
-
-
0026370985
-
Optimising hidden Markov models using discriminative output distributions
-
P. Woodland and D. Cole, "Optimising hidden Markov models using discriminative output distributions," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1991, pp. 545-548.
-
(1991)
Proc. Int. Conf. Acoust., Speech, Signal Processing
, pp. 545-548
-
-
Woodland, P.1
Cole, D.2
-
57
-
-
0028251797
-
Stochastic modeling of temporal information in speech for Hidden Markov Models
-
Jan.
-
J. Dai, I. MacKenzie, and J. Tyler, "Stochastic modeling of temporal information in speech for Hidden Markov Models," IEEE Trans. Speech Audio Processing, vol. 2, no. 1, pp. 102-104, Jan. 1994.
-
(1994)
IEEE Trans. Speech Audio Processing
, vol.2
, Issue.1
, pp. 102-104
-
-
Dai, J.1
MacKenzie, I.2
Tyler, J.3