-
1
-
-
34047266607
-
Enriching speech recognition with automatic detection of sentence boundaries and disfluencies
-
Sep
-
Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, and M. Harper, "Enriching speech recognition with automatic detection of sentence boundaries and disfluencies," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1524-1538, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.5
, pp. 1524-1538
-
-
Liu, Y.1
Shriberg, E.2
Stolcke, A.3
Hillard, D.4
Ostendorf, M.5
Harper, M.6
-
2
-
-
0025041264
-
Perceptual linear predictive (PLP) analysis of speech
-
Apr
-
H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, pp. 1738-1752, Apr. 1990.
-
(1990)
J. Acoust. Soc. Amer
, vol.87
, pp. 1738-1752
-
-
Hermansky, H.1
-
3
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of HMMs
-
C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of HMMs," Comput. Speech Lang., vol. 9, pp. 171-186, 1995.
-
(1995)
Comput. Speech Lang
, vol.9
, pp. 171-186
-
-
Leggetter, C.1
Woodland, P.2
-
4
-
-
0141703284
-
Prosodic knowledge sources for automatic speech recognition
-
Hong Kong, China, Apr
-
D. Vergyri, A. Stolcke, V. R. R. Gadde, L. Ferrer, and E. Shriberg, "Prosodic knowledge sources for automatic speech recognition," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Hong Kong, China, Apr. 2003, pp. 208-211.
-
(2003)
Proc. IEEE Conf. Acoust., Speech, Signal Process
, vol.1
, pp. 208-211
-
-
Vergyri, D.1
Stolcke, A.2
Gadde, V.R.R.3
Ferrer, L.4
Shriberg, E.5
-
5
-
-
0029764708
-
Speaker normalization on conversational telephone speech
-
Atlanta, GA, May
-
S. Wegmann, D. McAllaster, J. Orloff, and B. Peskin, "Speaker normalization on conversational telephone speech," in Proc. IEEE Conf. Acoust., Speech. Signal Process., vol. 1, Atlanta, GA, May 1996, pp. 339-341.
-
(1996)
Proc. IEEE Conf. Acoust., Speech. Signal Process
, vol.1
, pp. 339-341
-
-
Wegmann, S.1
McAllaster, D.2
Orloff, J.3
Peskin, B.4
-
6
-
-
0003871508
-
Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition,
-
Ph.D. dissertation, Johns Hopkins Univ, Baltimore, MD
-
N. Kumar, "Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition," Ph.D. dissertation, Johns Hopkins Univ., Baltimore, MD, 1997.
-
(1997)
-
-
Kumar, N.1
-
7
-
-
0036475982
-
Maximum likelihood multiple subspace projections for hidden Markov models
-
Feb
-
M. J. Gales, "Maximum likelihood multiple subspace projections for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 10, no. 2, pp. 37-17, Feb. 2002.
-
(2002)
IEEE Trans. Speech Audio Process
, vol.10
, Issue.2
, pp. 37-17
-
-
Gales, M.J.1
-
8
-
-
0009938649
-
Fast robust inverse transform SAT and multi-stage adaptation
-
Lansdowne, VA, Feb
-
H. Jin, S. Matsoukas, R. Schwartz, and F. Kubala, "Fast robust inverse transform SAT and multi-stage adaptation," in Proc. DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, VA, Feb. 1998, pp. 105-109.
-
(1998)
Proc. DARPA Broadcast News Transcription and Understanding Workshop
, pp. 105-109
-
-
Jin, H.1
Matsoukas, S.2
Schwartz, R.3
Kubala, F.4
-
9
-
-
0036296863
-
Minimum phone error and I-smoothing for improved discriminative training
-
Orlando, FL, May
-
D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Orlando, FL, May 2002, pp. 105-108.
-
(2002)
Proc. IEEE Conf. Acoust., Speech, Signal Process
, vol.1
, pp. 105-108
-
-
Povey, D.1
Woodland, P.C.2
-
10
-
-
44949090835
-
Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures
-
M. Hearst and M. Ostendorf, Eds, Edmonton, AB, Canada, Mar
-
I. Bulyko, M. Ostendorf, and A. Stolcke, "Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures," in Proc. HLT-NAACL, Conf. North Amer. Chap. Assoc. Comput. Ling., vol. 2, M. Hearst and M. Ostendorf, Eds., Edmonton, AB, Canada, Mar. 2003, pp. 7-9.
-
(2003)
Proc. HLT-NAACL, Conf. North Amer. Chap. Assoc. Comput. Ling
, vol.2
, pp. 7-9
-
-
Bulyko, I.1
Ostendorf, M.2
Stolcke, A.3
-
11
-
-
4544351495
-
Voicing feature integration in SRI's Decipher LVCSR system
-
Montreal, QC, Canada, May
-
M. Graciarena, H. Franco, J. Zheng, D. Vergyri, and A. Stolcke, "Voicing feature integration in SRI's Decipher LVCSR system," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Montreal, QC, Canada, May 2004, pp. 921-924.
-
(2004)
Proc. IEEE Conf. Acoust., Speech, Signal Process
, vol.1
, pp. 921-924
-
-
Graciarena, M.1
Franco, H.2
Zheng, J.3
Vergyri, D.4
Stolcke, A.5
-
12
-
-
0016067897
-
Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
-
B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, 1974.
-
(1974)
J. Acoust. Soc. Amer
, vol.55
, pp. 1304-1312
-
-
Atal, B.1
-
13
-
-
0028517164
-
RASTA processing of speech
-
Oct
-
H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578-589, Oct. 1994.
-
(1994)
IEEE Trans. Speech Audio Process
, vol.2
, Issue.4
, pp. 578-589
-
-
Hermansky, H.1
Morgan, N.2
-
14
-
-
0032658253
-
Temporal patterns (TRAPs) in ASR of noisy speech
-
Phoenix, AZ, Mar
-
H. Hermansky and S. Sharma, "Temporal patterns (TRAPs) in ASR of noisy speech," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 2, Phoenix, AZ, Mar. 1999, pp. 289-292.
-
(1999)
Proc. IEEE Conf. Acoust., Speech, Signal Process
, vol.2
, pp. 289-292
-
-
Hermansky, H.1
Sharma, S.2
-
15
-
-
0023540097
-
Multilayer perceptrons and automatic speech recognition
-
San Diego, CA
-
H. Bourlard and C. Wellekens, "Multilayer perceptrons and automatic speech recognition," in Proc. 1st Int. Conf. Neural Netw., vol. IV, San Diego, CA, 1987, pp. 407-416.
-
(1987)
Proc. 1st Int. Conf. Neural Netw
, vol.4
, pp. 407-416
-
-
Bourlard, H.1
Wellekens, C.2
-
16
-
-
0141676589
-
New entropy based combination rules in HMM/ANN multi-stream ASR
-
Hong Kong, Apr
-
H. Misra, H. Bourlard, and V. Tyagi, "New entropy based combination rules in HMM/ANN multi-stream ASR," in Proc. IEEE Conf. Acoust., Speech. Signal Process., vol. 2, Hong Kong, Apr. 2003, pp. 741-744.
-
(2003)
Proc. IEEE Conf. Acoust., Speech. Signal Process
, vol.2
, pp. 741-744
-
-
Misra, H.1
Bourlard, H.2
Tyagi, V.3
-
17
-
-
34047245552
-
Learning discriminant narrow-band temporal patterns for automatic recognition of conversational telephone speech,
-
Ph.D. dissertation, Univ. California, Berkeley
-
B. Y. Chen, "Learning discriminant narrow-band temporal patterns for automatic recognition of conversational telephone speech," Ph.D. dissertation, Univ. California, Berkeley, 2005.
-
(2005)
-
-
Chen, B.Y.1
-
18
-
-
33745185321
-
Using MLP features in SRI's conversational speech recognition system
-
Lisbon, Portugal, Sep
-
Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan, "Using MLP features in SRI's conversational speech recognition system," in Proc. 9th Eur. Conf. Speech Commun. Technol., Lisbon, Portugal, Sep. 2005, pp. 2141-2144.
-
(2005)
Proc. 9th Eur. Conf. Speech Commun. Technol
, pp. 2141-2144
-
-
Zhu, Q.1
Stolcke, A.2
Chen, B.Y.3
Morgan, N.4
-
19
-
-
0019555090
-
Cepstral analysis technique for automatic speaker verification
-
Apr
-
S. Furui, "Cepstral analysis technique for automatic speaker verification," IEEE Trans. Acoust., Speech. Signal Process., vol. ASSP-29, no. 2, pp. 254-272, Apr. 1981.
-
(1981)
IEEE Trans. Acoust., Speech. Signal Process
, vol.ASSP-29
, Issue.2
, pp. 254-272
-
-
Furui, S.1
-
20
-
-
0001893347
-
Transcribing broadcast news: The LIMSI Nov96 Hub4 system
-
Chantilly, VA, Feb
-
J. L. Gauvain, G. Adda, L. Lamel, and M. Adda-Decker, "Transcribing broadcast news: The LIMSI Nov96 Hub4 system," in Proc. DARPA Speech Recognition Workshop, Chantilly, VA, Feb. 1997, pp. 56-63.
-
(1997)
Proc. DARPA Speech Recognition Workshop
, pp. 56-63
-
-
Gauvain, J.L.1
Adda, G.2
Lamel, L.3
Adda-Decker, M.4
-
21
-
-
84946807902
-
-
V. R. R. Gadde, A. Stolcke, D. Vergyri, J. Zheng, K. Sonmez, and A. Venkataraman, Building an ASR system for noisy environments: SRI's 2001 SPINE evaluation system, in Proc. Int. Conf. Spoken Lang. Process., 3, J. H. L. Hansen and B. Pellom, Eds., Denver, CO, Sep. 2002, pp. 1577-1580.
-
V. R. R. Gadde, A. Stolcke, D. Vergyri, J. Zheng, K. Sonmez, and A. Venkataraman, "Building an ASR system for noisy environments: SRI's 2001 SPINE evaluation system," in Proc. Int. Conf. Spoken Lang. Process., vol. 3, J. H. L. Hansen and B. Pellom, Eds., Denver, CO, Sep. 2002, pp. 1577-1580.
-
-
-
-
22
-
-
33846247945
-
Multirate ASR models for phone-class dependent n-best list rescoring
-
San Juan, PR, Nov
-
V. R. Gadde, K. Sonmez, and H. Franco, "Multirate ASR models for phone-class dependent n-best list rescoring," in Proc. IEEE Workshop Speech Recognition and Understanding, San Juan, PR, Nov. 2005, pp. 265-269.
-
(2005)
Proc. IEEE Workshop Speech Recognition and Understanding
, pp. 265-269
-
-
Gadde, V.R.1
Sonmez, K.2
Franco, H.3
-
23
-
-
0022890536
-
Maximum mutual information estimation of hidden Markov model parameters for speech recognition
-
Tokyo, Japan, Apr
-
L. R. Bahl, P. F. Brown, P. V. de Souza, and R. L. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., vol. 1, Tokyo, Japan, Apr. 1986, pp. 49-52.
-
(1986)
Proc. IEEE Int. Conf. Acoust., Speech. Signal Process
, vol.1
, pp. 49-52
-
-
Bahl, L.R.1
Brown, P.F.2
de Souza, P.V.3
Mercer, R.L.4
-
24
-
-
0036461035
-
Large scale discriminative training of hidden Markov models of speech recognition
-
P. C. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models of speech recognition," Comput. Speech Lang., vol. 16, pp. 25-47, 2002.
-
(2002)
Comput. Speech Lang
, vol.16
, pp. 25-47
-
-
Woodland, P.C.1
Povey, D.2
-
25
-
-
33646791906
-
Improvements to the IBM Hub-5E system
-
Vienna, VA, May
-
J. Huang, B. Kingsbury, L. Mangu, G. Saon, R. Sarikaya, and G. Zweig, :. "Improvements to the IBM Hub-5E system," in Proc. NIST Rich Transcription Workshop, Vienna, VA, May 2002.
-
(2002)
Proc. NIST Rich Transcription Workshop
-
-
Huang, J.1
Kingsbury, B.2
Mangu, L.3
Saon, G.4
Sarikaya, R.5
Zweig, G.6
-
26
-
-
0348198473
-
Finite-state transducers in language and speech processing
-
M. Mohri, "Finite-state transducers in language and speech processing," Comput. Ling., vol. 23, pp. 269-311, 1997.
-
(1997)
Comput. Ling
, vol.23
, pp. 269-311
-
-
Mohri, M.1
-
27
-
-
85135253868
-
Efficient general lattice generation and rescoring
-
Budapest, Hungary, Sep
-
A. Ljolje, F. Pereira, and M. Riley, "Efficient general lattice generation and rescoring," in Proc. 6th Eur. Conf. Speech Commun. Technol., vol. 3, Budapest, Hungary, Sep. 1999, pp. 1251-1254.
-
(1999)
Proc. 6th Eur. Conf. Speech Commun. Technol
, vol.3
, pp. 1251-1254
-
-
Ljolje, A.1
Pereira, F.2
Riley, M.3
-
28
-
-
0034296009
-
Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
-
L. Mangu, E. Brill, and A. Stolcke, "Finding consensus in speech recognition: Word error minimization and other applications of confusion networks," Comput. Speech Lang., vol. 14, no. 4, pp. 373-400, 2000.
-
(2000)
Comput. Speech Lang
, vol.14
, Issue.4
, pp. 373-400
-
-
Mangu, L.1
Brill, E.2
Stolcke, A.3
-
29
-
-
0141477960
-
Posterior probability decoding, confidence estimation, and system combination
-
College Park, MD, May
-
G. Evermann and P. Woodland, "Posterior probability decoding, confidence estimation, and system combination," in Proc. NIST Speech Transcription Workshop, College Park, MD, May 2000.
-
(2000)
Proc. NIST Speech Transcription Workshop
-
-
Evermann, G.1
Woodland, P.2
-
30
-
-
33745214663
-
Leveraging speaker-dependent variation of adaptation
-
Lisbon, Portugal, Sep
-
A. Mandal, M. Ostendorf, and A. Stolcke, "Leveraging speaker-dependent variation of adaptation," in Proc. 9th Eur. Conf. Speech Commun. Technol., Lisbon, Portugal, Sep. 2005, pp. 1793-1796.
-
(2005)
Proc. 9th Eur. Conf. Speech Commun. Technol
, pp. 1793-1796
-
-
Mandal, A.1
Ostendorf, M.2
Stolcke, A.3
-
32
-
-
34047268272
-
-
M. J. Gales, The generation and use of regression class trees for MLLR adaptation, Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR263, 1996.
-
M. J. Gales, "The generation and use of regression class trees for MLLR adaptation," Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR263, 1996.
-
-
-
-
33
-
-
4544358964
-
The SuperARV language model: Investigating the effectiveness of tightly integrating multiple knowledge sources
-
W. Wang and M. Harper, "The SuperARV language model: Investigating the effectiveness of tightly integrating multiple knowledge sources," in Proc. Conf. Empirical Methods Natural Language Process., 2002, pp. 238-247.
-
(2002)
Proc. Conf. Empirical Methods Natural Language Process
, pp. 238-247
-
-
Wang, W.1
Harper, M.2
-
34
-
-
85149132266
-
Structural disambiguation with constraints propagation
-
Pittsburgh, PA, Jun
-
H. Maruyama, "Structural disambiguation with constraints propagation," in Proc. 28th Annu. Meeting Assoc. Comput. Ling., Pittsburgh, PA, Jun. 1990, pp. 31-38.
-
(1990)
Proc. 28th Annu. Meeting Assoc. Comput. Ling
, pp. 31-38
-
-
Maruyama, H.1
-
35
-
-
34047258426
-
Statistical parsing and language modeling based on constraint dependency grammar,
-
Ph.D. dissertation, Purdue Univ, West Lafayette, IN
-
W. Wang, "Statistical parsing and language modeling based on constraint dependency grammar," Ph.D. dissertation, Purdue Univ., West Lafayette, IN, 2003.
-
(2003)
-
-
Wang, W.1
-
36
-
-
0141480038
-
The robustness of an almost-parsing language model given errorful training data
-
Hong Kong, China, Apr
-
W. Wang, M. P. Harper, and A. Stolcke, "The robustness of an almost-parsing language model given errorful training data," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Hong Kong, China, Apr. 2003, pp. 240-243.
-
(2003)
Proc. IEEE Conf. Acoust., Speech, Signal Process
, vol.1
, pp. 240-243
-
-
Wang, W.1
Harper, M.P.2
Stolcke, A.3
-
37
-
-
4544383109
-
The use of a linguistically motivated language model in conversational speech recognition
-
Montreal, QC, Canada, May
-
W. Wang, A. Stolcke, and M. P. Harper, "The use of a linguistically motivated language model in conversational speech recognition," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Montreal, QC, Canada, May 2004, pp. 261-264.
-
(2004)
Proc. IEEE Conf. Acoust., Speech, Signal Process
, vol.1
, pp. 261-264
-
-
Wang, W.1
Stolcke, A.2
Harper, M.P.3
-
38
-
-
34047245727
-
-
S. F. Chen and J. Goodman, An empirical study of smoothing techniques for language modeling, Computer Science Group, Harvard Univ., Cambridge, MA, Tech. Rep. TR-10-98, 1998.
-
S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Computer Science Group, Harvard Univ., Cambridge, MA, Tech. Rep. TR-10-98, 1998.
-
-
-
-
39
-
-
85009223249
-
Techniques for effective vocabulary selection
-
Geneva, Switzerland, Sep
-
A. Venkataraman and W. Wang, "Techniques for effective vocabulary selection," in Proc. 8th Eur. Conf. Speech Commun. Technol., Geneva, Switzerland, Sep. 2003, pp. 245-248.
-
(2003)
Proc. 8th Eur. Conf. Speech Commun. Technol
, pp. 245-248
-
-
Venkataraman, A.1
Wang, W.2
-
40
-
-
34047266379
-
Progress in the CU-HTK broadcast news transcription system
-
Sep
-
M. J. F. Gales, D. Y. Kim, P. C. Woodland, H. Y. Chan, D. Mrva, R. Sinha, and S. E. Tranter, "Progress in the CU-HTK broadcast news transcription system," IEEE Trans. Audio, Speech. Lang. Process., vol. 14, no. 5, pp. 1511-1523, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech. Lang. Process
, vol.14
, Issue.5
, pp. 1511-1523
-
-
Gales, M.J.F.1
Kim, D.Y.2
Woodland, P.C.3
Chan, H.Y.4
Mrva, D.5
Sinha, R.6
Tranter, S.E.7
-
41
-
-
84907336951
-
An efficient repair procedure for quick transcriptions
-
S. H. Kim and D. H. Youn, Eds, Jeju Island, Korea, Oct
-
A. Venkataraman, A. Stolcke, W. Wang, D. Vergyri, V. R. R. Gadde, and J. Zheng, "An efficient repair procedure for quick transcriptions," in Proc. Int. Conf. Spoken Language Process., S. H. Kim and D. H. Youn, Eds., Jeju Island, Korea, Oct. 2004, pp. 1961-1964.
-
(2004)
Proc. Int. Conf. Spoken Language Process
, pp. 1961-1964
-
-
Venkataraman, A.1
Stolcke, A.2
Wang, W.3
Vergyri, D.4
Gadde, V.R.R.5
Zheng, J.6
-
42
-
-
0002144369
-
Tree-based state tying for high accuracy acoustic modeling
-
S. Young, J. Odell, and P. Woodland, "Tree-based state tying for high accuracy acoustic modeling," in Proc. ARPA Workshop Human language, 1994, pp. 307-312.
-
(1994)
Proc. ARPA Workshop Human language
, pp. 307-312
-
-
Young, S.1
Odell, J.2
Woodland, P.3
-
43
-
-
0028996852
-
The 1994 HTK large vocabulary speech recognition system
-
Detroit, MI
-
P. Woodland, C. Leggetter, J. Odell, V. Valtchev, and S. Young, "The 1994 HTK large vocabulary speech recognition system," in Proc. ICASSP, Detroit, MI, 1995, pp. 73-76.
-
(1995)
Proc. ICASSP
, pp. 73-76
-
-
Woodland, P.1
Leggetter, C.2
Odell, J.3
Valtchev, V.4
Young, S.5
-
45
-
-
85093280076
-
Factored language models and generalized parallel backoff
-
J. Bilmes and K. Kirchhoff, "Factored language models and generalized parallel backoff," in Proc. HLT/NACCL, 2003, pp. 4-6.
-
(2003)
Proc. HLT/NACCL
, pp. 4-6
-
-
Bilmes, J.1
Kirchhoff, K.2
-
47
-
-
85009110467
-
Morphology-based language modeling for Arabic speech recognition
-
D. Vergyri, K. Kirchhoff, K. Duh, and A. Stolcke, "Morphology-based language modeling for Arabic speech recognition," in Proc. ICSLP, 2004, pp. 2245-2248.
-
(2004)
Proc. ICSLP
, pp. 2245-2248
-
-
Vergyri, D.1
Kirchhoff, K.2
Duh, K.3
Stolcke, A.4
-
49
-
-
34047258983
-
Porting Decipher from English to Mandarin
-
presented at the, Elect. Eng. Dept, Univ. Washington, Tech. Rep. UWEETR-2006-0013, Seattle, WA
-
M. Hwang, X. Lei, T. Ng, M. Ostendorf, A. Stolcke, W. Wang, J. Zheng, and V. Gadde, "Porting Decipher from English to Mandarin," presented at the NIST RT-04 EARS Fall Workshop 2004. Elect. Eng. Dept., Univ. Washington, Tech. Rep. UWEETR-2006-0013, Seattle, WA.
-
(2004)
NIST RT-04 EARS Fall Workshop
-
-
Hwang, M.1
Lei, X.2
Ng, T.3
Ostendorf, M.4
Stolcke, A.5
Wang, W.6
Zheng, J.7
Gadde, V.8
-
50
-
-
34047258615
-
-
New Mexico State Univ, Las Cruces, NM, Tech. Rep. MCCS-92-227
-
W. Jin, "Chinese segmentation and its diambiguation," New Mexico State Univ., Las Cruces, NM, Tech. Rep. MCCS-92-227, 1992.
-
(1992)
Chinese segmentation and its diambiguation
-
-
Jin, W.1
-
51
-
-
29144436747
-
Webdata augmented language models for Mandarin conversational speech recognition
-
Philadelphia, PA, Mar
-
T. Ng, M. Ostendorf, M.-Y. Hwang, M. Siu, I. Bulyko, and X. Lei, "Webdata augmented language models for Mandarin conversational speech recognition," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Philadelphia, PA, Mar. 2005, pp. 589-593.
-
(2005)
Proc. IEEE Conf. Acoust., Speech, Signal Process
, vol.1
, pp. 589-593
-
-
Ng, T.1
Ostendorf, M.2
Hwang, M.-Y.3
Siu, M.4
Bulyko, I.5
Lei, X.6
-
52
-
-
84905283451
-
New methods in continuous Mandarin speech recognition
-
G. Kokkinakis, N. Fakotakis, and E. Dermatas, Eds, Rhodes, Greece, Sep
-
C. J. Chen, R. A. Gopinath, M. D. Monkowski, M. A. Picheny, and K. Shen, "New methods in continuous Mandarin speech recognition," in Proc. 5th Eur. Conf. Speech Commun. Technol., vol. 3, G. Kokkinakis, N. Fakotakis, and E. Dermatas, Eds., Rhodes, Greece, Sep. 1997, pp. 1543-1546.
-
(1997)
Proc. 5th Eur. Conf. Speech Commun. Technol
, vol.3
, pp. 1543-1546
-
-
Chen, C.J.1
Gopinath, R.A.2
Monkowski, M.D.3
Picheny, M.A.4
Shen, K.5
-
53
-
-
85135139722
-
A lognormal tied mixture model of pitch for prosody-based speaker recognition
-
G. Kokkinakis, N. Fakotakis, and E. Dermatas, Eds, Rhodes, Greece, Sep
-
M. K. Sönmez, L. Heck, M. Weintraub, and E. Shriberg, "A lognormal tied mixture model of pitch for prosody-based speaker recognition," in Proc. 5th Eur. Conf. Speech Commun. Technol., G. Kokkinakis, N. Fakotakis, and E. Dermatas, Eds., Rhodes, Greece, Sep. 1997, pp. 1391-1394.
-
(1997)
Proc. 5th Eur. Conf. Speech Commun. Technol
, pp. 1391-1394
-
-
Sönmez, M.K.1
Heck, L.2
Weintraub, M.3
Shriberg, E.4
|