SCOPUS Information Search Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 14, Issue 5, 2006, Pages 1729-1742

Recent innovations in speech-to-text transcription at SRI-ICSI-UW

(18) Stolcke, Andreas a,b,c Chen, Barry b,d Franco, Horacio a,e Gadde, Venkata Ramana Rao e Graciarena, Martin a,e Hwang, Mei Yuh c Kirchhoff, Katrin a,c Mandal, Arindam c Morgan, Nelson a,f Lei, Xin c Ng, Tim e,f Ostendorf, Mari a,c Sönmez, Kemal a,e Venkataraman, Anand e Vergyri, Dimitra a,e Wang, Wen a,e Zheng, Jing a,e Zhu, Qifeng a,c,g

a IEEE (United States)

b SRI INTERNATIONAL (United States)

c INTERNATIONAL COMPUTER SCIENCE INSTITUTE (United States)

d LAWRENCE LIVERMORE NATIONAL LABORATORY (United States)

e University of Washington (United States)

f BBN TECHNOLOGIES (United States)

g TEXAS INSTRUMENTS (United States)

Author keywords

Broadcast news (BN); Conversational telephone speech (CTS); Speech to text (STT)

Indexed keywords

ACOUSTIC FEATURES; BROADCAST NEWS (BN); CONVERSATIONAL TELEPHONE SPEECH (CTS); SPEECH TO TEXT (STT);

COMPUTATIONAL COMPLEXITY; FORMAL LANGUAGES; MATHEMATICAL MODELS; REGRESSION ANALYSIS; SPEECH RECOGNITION; VOCABULARY CONTROL;

SPEECH PROCESSING;

EID: 34047270914 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.879807 Document Type: Article

Times cited : (75)

References (53)

1
- 34047266607
- Enriching speech recognition with automatic detection of sentence boundaries and disfluencies
- Sep
- Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, and M. Harper, "Enriching speech recognition with automatic detection of sentence boundaries and disfluencies," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1524-1538, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1524-1538
- Liu, Y.¹ Shriberg, E.² Stolcke, A.³ Hillard, D.⁴ Ostendorf, M.⁵ Harper, M.⁶

2
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Apr
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, pp. 1738-1752, Apr. 1990.
- (1990) J. Acoust. Soc. Amer , vol.87 , pp. 1738-1752
- Hermansky, H.¹

3
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of HMMs
- C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of HMMs," Comput. Speech Lang., vol. 9, pp. 171-186, 1995.
- (1995) Comput. Speech Lang , vol.9 , pp. 171-186
- Leggetter, C.¹ Woodland, P.²

4
- 0141703284
- Prosodic knowledge sources for automatic speech recognition
- Hong Kong, China, Apr
- D. Vergyri, A. Stolcke, V. R. R. Gadde, L. Ferrer, and E. Shriberg, "Prosodic knowledge sources for automatic speech recognition," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Hong Kong, China, Apr. 2003, pp. 208-211.
- (2003) Proc. IEEE Conf. Acoust., Speech, Signal Process , vol.1 , pp. 208-211
- Vergyri, D.¹ Stolcke, A.² Gadde, V.R.R.³ Ferrer, L.⁴ Shriberg, E.⁵

5
- 0029764708
- Speaker normalization on conversational telephone speech
- Atlanta, GA, May
- S. Wegmann, D. McAllaster, J. Orloff, and B. Peskin, "Speaker normalization on conversational telephone speech," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Atlanta, GA, May 1996, pp. 339-341.
- (1996) Proc. IEEE Conf. Acoust., Speech, Signal Process , vol.1 , pp. 339-341
- Wegmann, S.¹ McAllaster, D.² Orloff, J.³ Peskin, B.⁴

6
- 0003871508
- Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition
- Ph.D. dissertation, Johns Hopkins Univ, Baltimore, MD
- N. Kumar, "Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition," Ph.D. dissertation, Johns Hopkins Univ., Baltimore, MD, 1997.
- (1997)
- Kumar, N.¹

7
- 0036475982
- Maximum likelihood multiple subspace projections for hidden Markov models
- Feb
- M. J. Gales, "Maximum likelihood multiple subspace projections for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 10, no. 2, pp. 37-47, Feb. 2002.
- (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.2 , pp. 37-47
- Gales, M.J.¹

8
- 0009938649
- Fast robust inverse transform SAT and multi-stage adaptation
- Lansdowne, VA, Feb
- H. Jin, S. Matsoukas, R. Schwartz, and F. Kubala, "Fast robust inverse transform SAT and multi-stage adaptation," in Proc. DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, VA, Feb. 1998, pp. 105-109.
- (1998) Proc. DARPA Broadcast News Transcription and Understanding Workshop , pp. 105-109
- Jin, H.¹ Matsoukas, S.² Schwartz, R.³ Kubala, F.⁴

9
- 0036296863
- Minimum phone error and I-smoothing for improved discriminative training
- Orlando, FL, May
- D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Orlando, FL, May 2002, pp. 105-108.
- (2002) Proc. IEEE Conf. Acoust., Speech, Signal Process , vol.1 , pp. 105-108
- Povey, D.¹ Woodland, P.C.²

10
- 44949090835
- Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures
- M. Hearst and M. Ostendorf, Eds, Edmonton, AB, Canada, Mar
- I. Bulyko, M. Ostendorf, and A. Stolcke, "Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures," in Proc. HLT-NAACL, Conf. North Amer. Chap. Assoc. Comput. Ling., vol. 2, M. Hearst and M. Ostendorf, Eds., Edmonton, AB, Canada, Mar. 2003, pp. 7-9.
- (2003) Proc. HLT-NAACL, Conf. North Amer. Chap. Assoc. Comput. Ling , vol.2 , pp. 7-9
- Bulyko, I.¹ Ostendorf, M.² Stolcke, A.³

11
- 4544351495
- Voicing feature integration in SRI's Decipher LVCSR system
- Montreal, QC, Canada, May
- M. Graciarena, H. Franco, J. Zheng, D. Vergyri, and A. Stolcke, "Voicing feature integration in SRI's Decipher LVCSR system," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Montreal, QC, Canada, May 2004, pp. 921-924.
- (2004) Proc. IEEE Conf. Acoust., Speech, Signal Process , vol.1 , pp. 921-924
- Graciarena, M.¹ Franco, H.² Zheng, J.³ Vergyri, D.⁴ Stolcke, A.⁵

12
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, 1974.
- (1974) J. Acoust. Soc. Amer , vol.55 , pp. 1304-1312
- Atal, B.¹

13
- 0028517164
- RASTA processing of speech
- Oct
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578-589, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

14
- 0032658253
- Temporal patterns (TRAPs) in ASR of noisy speech
- Phoenix, AZ, Mar
- H. Hermansky and S. Sharma, "Temporal patterns (TRAPs) in ASR of noisy speech," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 2, Phoenix, AZ, Mar. 1999, pp. 289-292.
- (1999) Proc. IEEE Conf. Acoust., Speech, Signal Process , vol.2 , pp. 289-292
- Hermansky, H.¹ Sharma, S.²

15
- 0023540097
- Multilayer perceptrons and automatic speech recognition
- San Diego, CA
- H. Bourlard and C. Wellekens, "Multilayer perceptrons and automatic speech recognition," in Proc. 1st Int. Conf. Neural Netw., vol. IV, San Diego, CA, 1987, pp. 407-416.
- (1987) Proc. 1st Int. Conf. Neural Netw , vol.4 , pp. 407-416
- Bourlard, H.¹ Wellekens, C.²

16
- 0141676589
- New entropy based combination rules in HMM/ANN multi-stream ASR
- Hong Kong, Apr
- H. Misra, H. Bourlard, and V. Tyagi, "New entropy based combination rules in HMM/ANN multi-stream ASR," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 2, Hong Kong, Apr. 2003, pp. 741-744.
- (2003) Proc. IEEE Conf. Acoust., Speech, Signal Process , vol.2 , pp. 741-744
- Misra, H.¹ Bourlard, H.² Tyagi, V.³

17
- 34047245552
- Learning discriminant narrow-band temporal patterns for automatic recognition of conversational telephone speech
- Ph.D. dissertation, Univ. California, Berkeley
- B. Y. Chen, "Learning discriminant narrow-band temporal patterns for automatic recognition of conversational telephone speech," Ph.D. dissertation, Univ. California, Berkeley, 2005.
- (2005)
- Chen, B.Y.¹

18
- 33745185321
- Using MLP features in SRI's conversational speech recognition system
- Lisbon, Portugal, Sep
- Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan, "Using MLP features in SRI's conversational speech recognition system," in Proc. 9th Eur. Conf. Speech Commun. Technol., Lisbon, Portugal, Sep. 2005, pp. 2141-2144.
- (2005) Proc. 9th Eur. Conf. Speech Commun. Technol , pp. 2141-2144
- Zhu, Q.¹ Stolcke, A.² Chen, B.Y.³ Morgan, N.⁴

19
- 0019555090
- Cepstral analysis technique for automatic speaker verification
- Apr
- S. Furui, "Cepstral analysis technique for automatic speaker verification," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-29, no. 2, pp. 254-272, Apr. 1981.
- (1981) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-29 , Issue.2 , pp. 254-272
- Furui, S.¹

20
- 0001893347
- Transcribing broadcast news: The LIMSI Nov96 Hub4 system
- Chantilly, VA, Feb
- J. L. Gauvain, G. Adda, L. Lamel, and M. Adda-Decker, "Transcribing broadcast news: The LIMSI Nov96 Hub4 system," in Proc. DARPA Speech Recognition Workshop, Chantilly, VA, Feb. 1997, pp. 56-63.
- (1997) Proc. DARPA Speech Recognition Workshop , pp. 56-63
- Gauvain, J.L.¹ Adda, G.² Lamel, L.³ Adda-Decker, M.⁴

21
- 84946807902
- Building an ASR system for noisy environments: SRI's 2001 SPINE evaluation system
- V. R. R. Gadde, A. Stolcke, D. Vergyri, J. Zheng, K. Sonmez, and A. Venkataraman, "Building an ASR system for noisy environments: SRI's 2001 SPINE evaluation system," in Proc. Int. Conf. Spoken Lang. Process., vol. 3, J. H. L. Hansen and B. Pellom, Eds., Denver, CO, Sep. 2002, pp. 1577-1580.

22
- 33846247945
- Multirate ASR models for phone-class dependent n-best list rescoring
- San Juan, PR, Nov
- V. R. Gadde, K. Sonmez, and H. Franco, "Multirate ASR models for phone-class dependent n-best list rescoring," in Proc. IEEE Workshop Speech Recognition and Understanding, San Juan, PR, Nov. 2005, pp. 265-269.
- (2005) Proc. IEEE Workshop Speech Recognition and Understanding , pp. 265-269
- Gadde, V.R.¹ Sonmez, K.² Franco, H.³

23
- 0022890536
- Maximum mutual information estimation of hidden Markov model parameters for speech recognition
- Tokyo, Japan, Apr
- L. R. Bahl, P. F. Brown, P. V. de Souza, and R. L. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 1, Tokyo, Japan, Apr. 1986, pp. 49-52.
- (1986) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 49-52
- Bahl, L.R.¹ Brown, P.F.² de Souza, P.V.³ Mercer, R.L.⁴

24
- 0036461035
- Large scale discriminative training of hidden Markov models for speech recognition
- P. C. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models for speech recognition," Comput. Speech Lang., vol. 16, pp. 25-47, 2002.
- (2002) Comput. Speech Lang , vol.16 , pp. 25-47
- Woodland, P.C.¹ Povey, D.²

25
- 33646791906
- Improvements to the IBM Hub-5E system
- Vienna, VA, May
- J. Huang, B. Kingsbury, L. Mangu, G. Saon, R. Sarikaya, and G. Zweig, "Improvements to the IBM Hub-5E system," in Proc. NIST Rich Transcription Workshop, Vienna, VA, May 2002.
- (2002) Proc. NIST Rich Transcription Workshop
- Huang, J.¹ Kingsbury, B.² Mangu, L.³ Saon, G.⁴ Sarikaya, R.⁵ Zweig, G.⁶

26
- 0348198473
- Finite-state transducers in language and speech processing
- M. Mohri, "Finite-state transducers in language and speech processing," Comput. Ling., vol. 23, pp. 269-311, 1997.
- (1997) Comput. Ling , vol.23 , pp. 269-311
- Mohri, M.¹

27
- 85135253868
- Efficient general lattice generation and rescoring
- Budapest, Hungary, Sep
- A. Ljolje, F. Pereira, and M. Riley, "Efficient general lattice generation and rescoring," in Proc. 6th Eur. Conf. Speech Commun. Technol., vol. 3, Budapest, Hungary, Sep. 1999, pp. 1251-1254.
- (1999) Proc. 6th Eur. Conf. Speech Commun. Technol , vol.3 , pp. 1251-1254
- Ljolje, A.¹ Pereira, F.² Riley, M.³

28
- 0034296009
- Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
- L. Mangu, E. Brill, and A. Stolcke, "Finding consensus in speech recognition: Word error minimization and other applications of confusion networks," Comput. Speech Lang., vol. 14, no. 4, pp. 373-400, 2000.
- (2000) Comput. Speech Lang , vol.14 , Issue.4 , pp. 373-400
- Mangu, L.¹ Brill, E.² Stolcke, A.³

29
- 0141477960
- Posterior probability decoding, confidence estimation, and system combination
- College Park, MD, May
- G. Evermann and P. Woodland, "Posterior probability decoding, confidence estimation, and system combination," in Proc. NIST Speech Transcription Workshop, College Park, MD, May 2000.
- (2000) Proc. NIST Speech Transcription Workshop
- Evermann, G.¹ Woodland, P.²

30
- 33745214663
- Leveraging speaker-dependent variation of adaptation
- Lisbon, Portugal, Sep
- A. Mandal, M. Ostendorf, and A. Stolcke, "Leveraging speaker-dependent variation of adaptation," in Proc. 9th Eur. Conf. Speech Commun. Technol., Lisbon, Portugal, Sep. 2005, pp. 1793-1796.
- (2005) Proc. 9th Eur. Conf. Speech Commun. Technol , pp. 1793-1796
- Mandal, A.¹ Ostendorf, M.² Stolcke, A.³

31
- 0009623939
- Flexible speaker adaptation using maximum likelihood linear regression
- C. J. Leggetter and P. C. Woodland, "Flexible speaker adaptation using maximum likelihood linear regression," in Proc. ARPA Spoken Lang. Technol. Workshop, 1995, pp. 104-109.
- (1995) Proc. ARPA Spoken Lang. Technol. Workshop , pp. 104-109
- Leggetter, C.J.¹ Woodland, P.C.²

32
- 34047268272
- The generation and use of regression class trees for MLLR adaptation
- M. J. Gales, "The generation and use of regression class trees for MLLR adaptation," Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR263, 1996.

33
- 4544358964
- The SuperARV language model: Investigating the effectiveness of tightly integrating multiple knowledge sources
- W. Wang and M. Harper, "The SuperARV language model: Investigating the effectiveness of tightly integrating multiple knowledge sources," in Proc. Conf. Empirical Methods Natural Language Process., 2002, pp. 238-247.
- (2002) Proc. Conf. Empirical Methods Natural Language Process , pp. 238-247
- Wang, W.¹ Harper, M.²

34
- 85149132266
- Structural disambiguation with constraints propagation
- Pittsburgh, PA, Jun
- H. Maruyama, "Structural disambiguation with constraints propagation," in Proc. 28th Annu. Meeting Assoc. Comput. Ling., Pittsburgh, PA, Jun. 1990, pp. 31-38.
- (1990) Proc. 28th Annu. Meeting Assoc. Comput. Ling , pp. 31-38
- Maruyama, H.¹

35
- 34047258426
- Statistical parsing and language modeling based on constraint dependency grammar
- Ph.D. dissertation, Purdue Univ, West Lafayette, IN
- W. Wang, "Statistical parsing and language modeling based on constraint dependency grammar," Ph.D. dissertation, Purdue Univ., West Lafayette, IN, 2003.
- (2003)
- Wang, W.¹

36
- 0141480038
- The robustness of an almost-parsing language model given errorful training data
- Hong Kong, China, Apr
- W. Wang, M. P. Harper, and A. Stolcke, "The robustness of an almost-parsing language model given errorful training data," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Hong Kong, China, Apr. 2003, pp. 240-243.
- (2003) Proc. IEEE Conf. Acoust., Speech, Signal Process , vol.1 , pp. 240-243
- Wang, W.¹ Harper, M.P.² Stolcke, A.³

37
- 4544383109
- The use of a linguistically motivated language model in conversational speech recognition
- Montreal, QC, Canada, May
- W. Wang, A. Stolcke, and M. P. Harper, "The use of a linguistically motivated language model in conversational speech recognition," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Montreal, QC, Canada, May 2004, pp. 261-264.
- (2004) Proc. IEEE Conf. Acoust., Speech, Signal Process , vol.1 , pp. 261-264
- Wang, W.¹ Stolcke, A.² Harper, M.P.³

38
- 34047245727
- An empirical study of smoothing techniques for language modeling
- S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Computer Science Group, Harvard Univ., Cambridge, MA, Tech. Rep. TR-10-98, 1998.

39
- 85009223249
- Techniques for effective vocabulary selection
- Geneva, Switzerland, Sep
- A. Venkataraman and W. Wang, "Techniques for effective vocabulary selection," in Proc. 8th Eur. Conf. Speech Commun. Technol., Geneva, Switzerland, Sep. 2003, pp. 245-248.
- (2003) Proc. 8th Eur. Conf. Speech Commun. Technol , pp. 245-248
- Venkataraman, A.¹ Wang, W.²

40
- 34047266379
- Progress in the CU-HTK broadcast news transcription system
- Sep
- M. J. F. Gales, D. Y. Kim, P. C. Woodland, H. Y. Chan, D. Mrva, R. Sinha, and S. E. Tranter, "Progress in the CU-HTK broadcast news transcription system," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1511-1523, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1511-1523
- Gales, M.J.F.¹ Kim, D.Y.² Woodland, P.C.³ Chan, H.Y.⁴ Mrva, D.⁵ Sinha, R.⁶ Tranter, S.E.⁷

41
- 84907336951
- An efficient repair procedure for quick transcriptions
- S. H. Kim and D. H. Youn, Eds, Jeju Island, Korea, Oct
- A. Venkataraman, A. Stolcke, W. Wang, D. Vergyri, V. R. R. Gadde, and J. Zheng, "An efficient repair procedure for quick transcriptions," in Proc. Int. Conf. Spoken Language Process., S. H. Kim and D. H. Youn, Eds., Jeju Island, Korea, Oct. 2004, pp. 1961-1964.
- (2004) Proc. Int. Conf. Spoken Language Process , pp. 1961-1964
- Venkataraman, A.¹ Stolcke, A.² Wang, W.³ Vergyri, D.⁴ Gadde, V.R.R.⁵ Zheng, J.⁶

42
- 0002144369
- Tree-based state tying for high accuracy acoustic modeling
- S. Young, J. Odell, and P. Woodland, "Tree-based state tying for high accuracy acoustic modeling," in Proc. ARPA Workshop Human language, 1994, pp. 307-312.
- (1994) Proc. ARPA Workshop Human language , pp. 307-312
- Young, S.¹ Odell, J.² Woodland, P.³

43
- 0028996852
- The 1994 HTK large vocabulary speech recognition system
- Detroit, MI
- P. Woodland, C. Leggetter, J. Odell, V. Valtchev, and S. Young, "The 1994 HTK large vocabulary speech recognition system," in Proc. ICASSP, Detroit, MI, 1995, pp. 73-76.
- (1995) Proc. ICASSP , pp. 73-76
- Woodland, P.¹ Leggetter, C.² Odell, J.³ Valtchev, V.⁴ Young, S.⁵

44
- 34047258249
- Johns Hopkins Univ., Baltimore, MD
- Tech. Rep
- K. Kirchhoff et al., "Novel approaches to Arabic speech recognition-Final report from the JHU Summer Workshop 2002," Johns Hopkins Univ., Baltimore, MD, Tech. Rep., 2002.
- (2002) Novel approaches to Arabic speech recognition-Final report from the JHU Summer Workshop 2002
- Kirchhoff, K.¹

45
- 85093280076
- Factored language models and generalized parallel backoff
- J. Bilmes and K. Kirchhoff, "Factored language models and generalized parallel backoff," in Proc. HLT/NAACL, 2003, pp. 4-6.
- (2003) Proc. HLT/NAACL , pp. 4-6
- Bilmes, J.¹ Kirchhoff, K.²

46
- 85119098721
- Automatic learning of language model structure
- K. Duh and K. Kirchhoff, "Automatic learning of language model structure," in Proc. 20th Int. Conf. Comput. Ling. (COLING), 2004, pp. 148-154.
- (2004) Proc. 20th Int. Conf. Comput. Ling. (COLING) , pp. 148-154
- Duh, K.¹ Kirchhoff, K.²

47
- 85009110467
- Morphology-based language modeling for Arabic speech recognition
- D. Vergyri, K. Kirchhoff, K. Duh, and A. Stolcke, "Morphology-based language modeling for Arabic speech recognition," in Proc. ICSLP, 2004, pp. 2245-2248.
- (2004) Proc. ICSLP , pp. 2245-2248
- Vergyri, D.¹ Kirchhoff, K.² Duh, K.³ Stolcke, A.⁴

48
- 85149118016
- Building a shallow Arabic morphological analyzer in one day
- Philadelphia, PA
- K. Darwish, "Building a shallow Arabic morphological analyzer in one day," in Proc. ACL Workshop Computational Approaches to Semitic Languages, Philadelphia, PA, 2002, pp. 47-54.
- (2002) Proc. ACL Workshop Computational Approaches to Semitic Languages , pp. 47-54
- Darwish, K.¹

49
- 34047258983
- Porting Decipher from English to Mandarin
- presented at the, Elect. Eng. Dept, Univ. Washington, Tech. Rep. UWEETR-2006-0013, Seattle, WA
- M. Hwang, X. Lei, T. Ng, M. Ostendorf, A. Stolcke, W. Wang, J. Zheng, and V. Gadde, "Porting Decipher from English to Mandarin," presented at the NIST RT-04 EARS Fall Workshop 2004. Elect. Eng. Dept., Univ. Washington, Tech. Rep. UWEETR-2006-0013, Seattle, WA.
- (2004) NIST RT-04 EARS Fall Workshop
- Hwang, M.¹ Lei, X.² Ng, T.³ Ostendorf, M.⁴ Stolcke, A.⁵ Wang, W.⁶ Zheng, J.⁷ Gadde, V.⁸

50
- 34047258615
- New Mexico State Univ, Las Cruces, NM, Tech. Rep. MCCS-92-227
- W. Jin, "Chinese segmentation and its disambiguation," New Mexico State Univ., Las Cruces, NM, Tech. Rep. MCCS-92-227, 1992.
- (1992) Chinese segmentation and its disambiguation
- Jin, W.¹

51
- 29144436747
- Webdata augmented language models for Mandarin conversational speech recognition
- Philadelphia, PA, Mar
- T. Ng, M. Ostendorf, M.-Y. Hwang, M. Siu, I. Bulyko, and X. Lei, "Webdata augmented language models for Mandarin conversational speech recognition," in Proc. IEEE Conf. Acoust., Speech, Signal Process., vol. 1, Philadelphia, PA, Mar. 2005, pp. 589-593.
- (2005) Proc. IEEE Conf. Acoust., Speech, Signal Process , vol.1 , pp. 589-593
- Ng, T.¹ Ostendorf, M.² Hwang, M.-Y.³ Siu, M.⁴ Bulyko, I.⁵ Lei, X.⁶

52
- 84905283451
- New methods in continuous Mandarin speech recognition
- G. Kokkinakis, N. Fakotakis, and E. Dermatas, Eds, Rhodes, Greece, Sep
- C. J. Chen, R. A. Gopinath, M. D. Monkowski, M. A. Picheny, and K. Shen, "New methods in continuous Mandarin speech recognition," in Proc. 5th Eur. Conf. Speech Commun. Technol., vol. 3, G. Kokkinakis, N. Fakotakis, and E. Dermatas, Eds., Rhodes, Greece, Sep. 1997, pp. 1543-1546.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol , vol.3 , pp. 1543-1546
- Chen, C.J.¹ Gopinath, R.A.² Monkowski, M.D.³ Picheny, M.A.⁴ Shen, K.⁵

53
- 85135139722
- A lognormal tied mixture model of pitch for prosody-based speaker recognition
- G. Kokkinakis, N. Fakotakis, and E. Dermatas, Eds, Rhodes, Greece, Sep
- M. K. Sönmez, L. Heck, M. Weintraub, and E. Shriberg, "A lognormal tied mixture model of pitch for prosody-based speaker recognition," in Proc. 5th Eur. Conf. Speech Commun. Technol., G. Kokkinakis, N. Fakotakis, and E. Dermatas, Eds., Rhodes, Greece, Sep. 1997, pp. 1391-1394.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol , pp. 1391-1394
- Sönmez, M.K.¹ Heck, L.² Weintraub, M.³ Shriberg, E.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.