SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 18, Issue 6, 2010, Pages 1539-1549

Statistical transformation of language and pronunciation models for spontaneous speech recognition

Author keywords

Automatic speech recognition (ASR); Language model (LM); Pronunciation model; Spontaneous speech; Statistical transformation

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; LANGUAGE MODEL; PRONUNCIATION MODEL; SPONTANEOUS SPEECH; STATISTICAL TRANSFORMATION;

COMPUTATIONAL LINGUISTICS; LAGRANGE MULTIPLIERS; STATISTICS;

SPEECH RECOGNITION;

EID: 77955729683 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2009.2037400 Document Type: Article

Times cited : (22)

References (32)

1
- 84892179040
- The BBN Byblos 1997 large vocabulary conversational speech recognition system
- G. Zavaliagkos, J. McDonough, D. Miller, A. El-Jaroudi, J. Billa, F. Richardson, K. Ma, M. Siu, and H. Gish, "The BBN Byblos 1997 large vocabulary conversational speech recognition system," in Proc. ICASSP, 1998, pp. 905-908.
- (1998) Proc. ICASSP , pp. 905-908
- Zavaliagkos, G.¹ McDonough, J.² Miller, D.³ El-Jaroudi, A.⁴ Billa, J.⁵ Richardson, F.⁶ Ma, K.⁷ Siu, M.⁸ Gish, H.⁹

2
- 0034847002
- The 1998 HTK system for transcription of conversational telephone speech
- T. Hain, P. Woodland, T. Niesler, and E. Whittaker, "The 1998 HTK system for transcription of conversational telephone speech," in Proc. ICASSP, 1999, pp. 57-60.
- (1999) Proc. ICASSP , pp. 57-60
- Hain, T.¹ Woodland, P.² Niesler, T.³ Whittaker, E.⁴

3
- 33745525361
- The rich transcription 2004 spring meeting recognition evaluation
- J. Garofolo, C. Laprun, and J. Fiscus, "The rich transcription 2004 spring meeting recognition evaluation," in Proc. ICASSP Meeting Recognition Workshop, 2004.
- (2004) Proc. ICASSP Meeting Recognition Workshop
- Garofolo, J.¹ Laprun, C.² Fiscus, J.³

4
- 44849090969
- Recognition and understanding of meetings: The AMI and AMIDA projects
- S. Renals, T. Hain, and H. Bourlard, "Recognition and understanding of meetings: The AMI and AMIDA projects," in Proc. ASRU, 2007, pp. 238-247.
- (2007) Proc. ASRU , pp. 238-247
- Renals, S.¹ Hain, T.² Bourlard, H.³

5
- 85009067726
- Toward the realization of spontaneous speech recognition-Introduction of a Japanese priority program and preliminary results
- S. Furui, K. Maekawa, and H. Isahara, "Toward the realization of spontaneous speech recognition-Introduction of a Japanese priority program and preliminary results," in Proc. ICSLP, 2000, pp. 518-521.
- (2000) Proc. ICSLP , pp. 518-521
- Furui, S.¹ Maekawa, K.² Isahara, H.³

6
- 0141591531
- Language modeling and transcription of the TED corpus lectures
- E. Leeuwis, M. Federico, and M. Cettolo, "Language modeling and transcription of the TED corpus lectures," in Proc. ICASSP, 2003, pp. 232-235.
- (2003) Proc. ICASSP , pp. 232-235
- Leeuwis, E.¹ Federico, M.² Cettolo, M.³

7
- 33745188014
- Transcribing lectures and seminars
- L. Lamel, G. Adda, E. Bilinski, and J. Gauvain, "Transcribing lectures and seminars," in Proc. Eurospeech, 2005, pp. 1657-1660.
- (2005) Proc. Eurospeech , pp. 1657-1660
- Lamel, L.¹ Adda, G.² Bilinski, E.³ Gauvain, J.⁴

8
- 43849107616
- Recent progress in the MIT spoken lecture processing project
- J. Glass, T. Hazen, S. Cyphers, I. Malioutov, D. Huynh, and R. Barzilay, "Recent progress in the MIT spoken lecture processing project," in Proc. Eurospeech, 2007, pp. 2553-2556.
- (2007) Proc. Eurospeech , pp. 2553-2556
- Glass, J.¹ Hazen, T.² Cyphers, S.³ Malioutov, I.⁴ Huynh, D.⁵ Barzilay, R.⁶

9
- 51449113481
- Automatic lecture transcription by exploiting presentation slide information for language model adaptation
- T. Kawahara, Y. Nemoto, and Y. Akita, "Automatic lecture transcription by exploiting presentation slide information for language model adaptation," in Proc. ICASSP, 2008, pp. 4929-4932.
- (2008) Proc. ICASSP , pp. 4929-4932
- Kawahara, T.¹ Nemoto, Y.² Akita, Y.³

10
- 85009286782
- Automatic transcription of courtroom speech
- R. Prasad, L. Nguyen, R. Schwartz, and J. Makhoul, "Automatic transcription of courtroom speech," in Proc. ICSLP, 2002, pp. 1745-1748.
- (2002) Proc. ICSLP , pp. 1745-1748
- Prasad, R.¹ Nguyen, L.² Schwartz, R.³ Makhoul, J.⁴

11
- 34547531456
- The LIMSI 2006 TC-STAR EPPS transcription systems
- L. Lamel, J.-L. Gauvain, G. Adda, C. Barras, E. Bilinski, O. Galibert, A. Pujol, H. Schwenk, and X. Zhu, "The LIMSI 2006 TC-STAR EPPS transcription systems," in Proc. ICASSP, 2007, vol.4, pp. 997-1000.
- (2007) Proc. ICASSP , vol.4 , pp. 997-1000
- Lamel, L.¹ Gauvain, J.-L.² Adda, G.³ Barras, C.⁴ Bilinski, E.⁵ Galibert, O.⁶ Pujol, A.⁷ Schwenk, H.⁸ Zhu, X.⁹

12
- 44949265179
- The 2006 RWTH parliamentary speeches transcription system
- J. Loof, M. Bisani, C. Gollan, G. Heigold, B. Hoffmeister, C. Plahl, R. Schluter, and H. Ney, "The 2006 RWTH parliamentary speeches transcription system," in Proc. ICSLP, 2006, pp. 105-108.
- (2006) Proc. ICSLP , pp. 105-108
- Loof, J.¹ Bisani, M.² Gollan, C.³ Heigold, G.⁴ Hoffmeister, B.⁵ Plahl, C.⁶ Schluter, R.⁷ Ney, H.⁸

13
- 44949221858
- The IBM 2006 speech transcription system for European parliamentary speeches
- B. Ramabhadran, O. Siohan, L. Mangu, G. Zweig, M. Westphal, H. Schulz, and A. Soneiro, "The IBM 2006 speech transcription system for European parliamentary speeches," in Proc. ICSLP, 2006, pp. 1225-1228.
- (2006) Proc. ICSLP , pp. 1225-1228
- Ramabhadran, B.¹ Siohan, O.² Mangu, L.³ Zweig, G.⁴ Westphal, M.⁵ Schulz, H.⁶ Soneiro, A.⁷

14
- 4544316882
- Advances in the automatic transcription of lectures
- M. Cettolo, F. Brugnara, and M. Federico, "Advances in the automatic transcription of lectures," in Proc. ICASSP, 2004, pp. 769-772.
- (2004) Proc. ICASSP , pp. 769-772
- Cettolo, M.¹ Brugnara, F.² Federico, M.³

15
- 85044611587
- The mathematics of statistical machine translation: Parameter estimation
- P. Brown, S. Pietra, V. Pietra, and R. Mercer, "The mathematics of statistical machine translation: Parameter estimation," Comput. Linguist., vol.19, no.2, pp. 263-311, 1993.
- (1993) Comput. Linguist. , vol.19 , Issue.2 , pp. 263-311
- Brown, P.¹ Pietra, S.² Pietra, V.³ Mercer, R.⁴

16
- 10444241409
- Filled-pause modeling for medical transcriptions
- H. Schramm, X. Aubert, C. Meyer, and J. Peters, "Filled-pause modeling for medical transcriptions," in Proc. Workshop Spontaneous Speech Process. Recognition, 2003, pp. 143-146.
- (2003) Proc. Workshop Spontaneous Speech Process. Recognition , pp. 143-146
- Schramm, H.¹ Aubert, X.² Meyer, C.³ Peters, J.⁴

17
- 34547522348
- Reconstructing medical dictations from automatically recognized and non-literal transcripts with phonetic similarity matching
- S. Petrik and G. Kubin, "Reconstructing medical dictations from automatically recognized and non-literal transcripts with phonetic similarity matching," in Proc. ICASSP, 2007, vol.4, pp. 1125-1128.
- (2007) Proc. ICASSP , vol.4 , pp. 1125-1128
- Petrik, S.¹ Kubin, G.²

18
- 0141480041
- Language model adaptation using WFST-based speaking-style translation
- T. Hori, D.Willett, and Y. Minami, "Language model adaptation using WFST-based speaking-style translation," in Proc. ICASSP, 2003, vol.1, pp. 228-231.
- (2003) Proc. ICASSP , vol.1 , pp. 228-231
- Hori, T.¹ Willett, D.² Minami, Y.³

19
- 4043075534
- Extended models and tools for high-performance part-of-speech tagger
- M. Asahara and Y. Matsumoto, "Extended models and tools for high-performance part-of-speech tagger," in Proc. COLING, 2000, pp. 21-27.
- (2000) Proc. COLING , pp. 21-27
- Asahara, M.¹ Matsumoto, Y.²

20
- 0002652285
- A maximum entropy approach to natural language processing
- A. Berger, V. Della Pietra, and S. Della Pietra, "A maximum entropy approach to natural language processing," Comput. Linguist., vol.22, no.1, pp. 39-71, 1996.
- (1996) Comput. Linguist. , vol.22 , Issue.1 , pp. 39-71
- Berger, A.¹ Della Pietra, V.² Della Pietra, S.³

21
- 0030351374
- On designing pronunciation lexicons for large vocabulary, continuous speech recognition
- L. Lamel and G. Adda, "On designing pronunciation lexicons for large vocabulary, continuous speech recognition," in Proc. ICSLP, 1996, pp. 6-9.
- (1996) Proc. ICSLP , pp. 6-9
- Lamel, L.¹ Adda, G.²

22
- 0030363039
- Dictionary learning for spontaneous speech recognition
- T. Sloboda and A.Waibel, "Dictionary learning for spontaneous speech recognition," in Proc. ICSLP, 1996, pp. 2328-2331.
- (1996) Proc. ICSLP , pp. 2328-2331
- Sloboda, T.¹ Waibel, A.²

23
- 3042704466
- Language model and speaking rate adaptation for spontaneous presentation speech recognition
- Jul.
- H. Nanjo and T. Kawahara, "Language model and speaking rate adaptation for spontaneous presentation speech recognition," IEEE Trans. Speech Audio Process., vol.12, no.4, pp. 391-400, Jul. 2004.
- (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.4 , pp. 391-400
- Nanjo, H.¹ Kawahara, T.²

24
- 0033353288
- Stochastic pronunciation modelling from hand-labelled phonetic corpora
- M. Riley, W. Byrne, M. Finke, S. Khudanpur,A. Ljolje, J. McDonough, H. Nock, M. Saraclar, C. Wooters, and G. Zavaliagkos, "Stochastic pronunciation modelling from hand-labelled phonetic corpora," Speech Commun., vol.29, pp. 209-224, 1999.
- (1999) Speech Commun. , vol.29 , pp. 209-224
- Riley, M.¹ Byrne, W.² Finke, M.³ Khudanpur, S.⁴ Ljolje, A.⁵ McDonough, J.⁶ Nock, H.⁷ Saraclar, M.⁸ Wooters, C.⁹ Zavaliagkos, G.¹⁰

25
- 0033077780
- Automatic generation of multiple pronunciations based on neural networks
- T. Fukada, T. Yoshimura, and Y. Sagisaka, "Automatic generation of multiple pronunciations based on neural networks," Speech Communication, vol.27, pp. 63-73, 1999.
- (1999) Speech Communication , vol.27 , pp. 63-73
- Fukada, T.¹ Yoshimura, T.² Sagisaka, Y.³

26
- 0030672090
- Automatic alternative transcription generation and vocabulary selection for flexible word recognizers
- D. Torre, L. Villarrubia, J. Elvira, and L. Hernandez-Gomez, "Automatic alternative transcription generation and vocabulary selection for flexible word recognizers," in Proc. ICASSP, 1997, pp. 1463-1466.
- (1997) Proc. ICASSP , pp. 1463-1466
- Torre, D.¹ Villarrubia, L.² Elvira, J.³ Hernandez-Gomez, L.⁴

27
- 33646759445
- Pronunciation variation modeling for ASR: Large improvements are possible but small ones are likely to achieve
- Q. Yang, J.-P. Martens, P.-J. Ghesquiere, and D. Compernolle, "Pronunciation variation modeling for ASR: Large improvements are possible but small ones are likely to achieve," in Proc. ICSLP Workshop Pronunciation Modeling Lexicon Adaptation for Spoken Lang. Technol., 2002, pp. 123-128.
- (2002) Proc. ICSLP Workshop Pronunciation Modeling Lexicon Adaptation for Spoken Lang. Technol. , pp. 123-128
- Yang, Q.¹ Martens, J.-P.² Ghesquiere, P.-J.³ Compernolle, D.⁴

28
- 3042777118
- Corpus of spontaneous Japanese: Its design and evaluation
- K. Maekawa, "Corpus of spontaneous Japanese: Its design and evaluation," in Proc. Workshop Spontaneous Speech Process. Recognition, 2003, pp. 7-12.
- (2003) Proc. Workshop Spontaneous Speech Process. Recognition , pp. 7-12
- Maekawa, K.¹

29
- 0029725604
- A parametric approach to vocal tract length normalization
- E. Eide and H. Gish, "A parametric approach to vocal tract length normalization," in Proc. ICASSP, 1996, vol.1, pp. 346-349.
- (1996) Proc. ICASSP , vol.1 , pp. 346-349
- Eide, E.¹ Gish, H.²

30
- 0029747183
- Speaker normalization using efficient frequency warping procedures
- L. Lee and R. Rose, "Speaker normalization using efficient frequency warping procedures," in Proc. ICASSP, 1996, vol.1, pp. 353-356.
- (1996) Proc. ICASSP , vol.1 , pp. 353-356
- Lee, L.¹ Rose, R.²

31
- 0036296863
- Minimum phone error and I-smoothing for improved discriminative training
- D. Povey and P. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP, 2002, vol.1, pp. 105-108.
- (2002) Proc. ICASSP , vol.1 , pp. 105-108
- Povey, D.¹ Woodland, P.²

32
- 33646809034
- Generalized statistical modeling of pronunciation variations using variable-length phone context
- Y. Akita and T. Kawahara, "Generalized statistical modeling of pronunciation variations using variable-length phone context," in Proc. ICASSP, 2005, vol.1, pp. 689-692.
- (2005) Proc. ICASSP , vol.1 , pp. 689-692
- Akita, Y.¹ Kawahara, T.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.