SCOPUS Information Retrieval Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume 1, Issue , 2006, Pages

Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons

(6) Stolcke, Andreas a,b; Grézl, František b,e; Hwang, Mei Yuh c; Lei, Xin c; Morgan, Nelson b,d; Vergyri, Dimitra a

a SRI INTERNATIONAL (United States)

b INTERNATIONAL COMPUTER SCIENCE INSTITUTE (United States)

c UNIVERSITY OF WASHINGTON (United States)

d UNIVERSITY OF CALIFORNIA (United States)

e BRNO UNIVERSITY OF TECHNOLOGY (Czech Republic)

Author keywords

[No Author keywords available]

Indexed keywords

MATHEMATICAL MODELS; PARAMETER ESTIMATION; SPEECH RECOGNITION; TELEPHONE SETS; VOCABULARY CONTROL;

ACOUSTIC MODELS; CROSS LANGUAGE PORTABILITY; MULTILAYER PERCEPTRONS (MLP); PHONE CLASSIFICATION;

FEATURE EXTRACTION;

EID: 33947619591; PISSN: 15206149; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper

Times cited : (105)

References (22)

1
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Apr
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech", J. Acoust. Soc. Am., vol. 87, pp. 1738-1752, Apr. 1990.
- (1990) J. Acoust. Soc. Am , vol.87 , pp. 1738-1752
- Hermansky, H.¹

2
- 0033709098
- Tandem connectionist feature extraction for conventional HMM systems
- Istanbul, June
- H. Hermansky, D. P. W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems", in Proc. ICASSP, pp. 1635-1638, Istanbul, June 2000.
- (2000) Proc. ICASSP , pp. 1635-1638
- Hermansky, H.¹ Ellis, D.P.W.² Sharma, S.³

3
- 0003573244
- Kluwer Academic Publishers, Boston, MA
- H. Bourlard and N. Morgan, Connectionist Speech Recognition. A Hybrid Approach, Kluwer Academic Publishers, Boston, MA, 1993.
- (1993) Connectionist Speech Recognition. A Hybrid Approach
- Bourlard, H.¹ Morgan, N.²

4
- 0032658253
- Temporal patterns (TRAPs) in ASR of noisy speech
- Phoenix, AZ, Mar
- H. Hermansky and S. Sharma, "Temporal patterns (TRAPs) in ASR of noisy speech", in Proc. ICASSP, vol. 2, pp. 289-292, Phoenix, AZ, Mar. 1999.
- (1999) Proc. ICASSP , vol.2 , pp. 289-292
- Hermansky, H.¹ Sharma, S.²

5
- 4544224866
- TRAPping conversational speech: Extending TRAP/Tandem approaches to conversational telephone speech recognition
- Montreal, May
- N. Morgan, B. Y. Chen, Q. Zhu, and A. Stolcke, "TRAPping conversational speech: Extending TRAP/Tandem approaches to conversational telephone speech recognition", in Proc. ICASSP, vol. 1, pp. 536-539, Montreal, May 2004.
- (2004) Proc. ICASSP , vol.1 , pp. 536-539
- Morgan, N.¹ Chen, B.Y.² Zhu, Q.³ Stolcke, A.⁴

6
- 33745185321
- Using MLP features in SRI's conversational speech recognition system
- Lisbon, Sep
- Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan, "Using MLP features in SRI's conversational speech recognition system", in Proc. Interspeech, pp. 2141-2144, Lisbon, Sep. 2005.
- (2005) Proc. Interspeech , pp. 2141-2144
- Zhu, Q.¹ Stolcke, A.² Chen, B.Y.³ Morgan, N.⁴

7
- 85009097225
- On using MLP features in LVCSR
- S. H. Kim and D. H. Youn, editors, Jeju, Korea, Oct
- Q. Zhu, B. Chen, N. Morgan, and A. Stolcke, "On using MLP features in LVCSR", in S. H. Kim and D. H. Youn, editors, Proc. ICSLP, pp. 921-924, Jeju, Korea, Oct. 2004.
- (2004) Proc. ICSLP , pp. 921-924
- Zhu, Q.¹ Chen, B.² Morgan, N.³ Stolcke, A.⁴

8
- 85009110188
- Learning long-term temporal features in LVCSR using neural networks
- S. H. Kim and D. H. Youn, editors, Jeju, Korea, Oct
- B. Y. Chen, Q. Zhu, and N. Morgan, "Learning long-term temporal features in LVCSR using neural networks", in S. H. Kim and D. H. Youn, editors, Proc. ICSLP, Jeju, Korea, Oct. 2004.
- (2004) Proc. ICSLP
- Chen, B.Y.¹ Zhu, Q.² Morgan, N.³

9
- 0141676589
- New entropy based combination rules in HMM/ANN multi-stream ASR
- Hong Kong, Apr
- H. Misra, H. Bourlard, and V. Tyagi, "New entropy based combination rules in HMM/ANN multi-stream ASR", in Proc. ICASSP, vol. 2, pp. 741-744, Hong Kong, Apr. 2003.
- (2003) Proc. ICASSP , vol.2 , pp. 741-744
- Misra, H.¹ Bourlard, H.² Tyagi, V.³

10
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of HMMs
- C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of HMMs", Computer Speech and Language, vol. 9, pp. 171-186, 1995.
- (1995) Computer Speech and Language , vol.9 , pp. 171-186
- Leggetter, C.¹ Woodland, P.²

11
- 0141703284
- Prosodic knowledge sources for automatic speech recognition
- Hong Kong, Apr
- D. Vergyri, A. Stolcke, V. R. R. Gadde, L. Ferrer, and E. Shriberg, "Prosodic knowledge sources for automatic speech recognition", in Proc. ICASSP, vol. 1, pp. 208-211, Hong Kong, Apr. 2003.
- (2003) Proc. ICASSP , vol.1 , pp. 208-211
- Vergyri, D.¹ Stolcke, A.² Gadde, V.R.R.³ Ferrer, L.⁴ Shriberg, E.⁵

12
- 0029764708
- Speaker normalization on conversational telephone speech
- Atlanta, May
- S. Wegmann, D. McAllaster, J. Orloff, and B. Peskin, "Speaker normalization on conversational telephone speech", in Proc. ICASSP, vol. 1, pp. 339-341, Atlanta, May 1996.
- (1996) Proc. ICASSP , vol.1 , pp. 339-341
- Wegmann, S.¹ McAllaster, D.² Orloff, J.³ Peskin, B.⁴

13
- 0003871508
- PhD thesis, Johns Hopkins University, Baltimore
- N. Kumar, Investigation of Silicon-Auditory Models and Generalization of Linear Discriminant Analysis for Improved Speech Recognition, PhD thesis, Johns Hopkins University, Baltimore, 1997.
- (1997) Investigation of Silicon-Auditory Models and Generalization of Linear Discriminant Analysis for Improved Speech Recognition
- Kumar, N.¹

14
- 0036475982
- Maximum likelihood multiple subspace projections for hidden Markov models
- M. J. Gales, "Maximum likelihood multiple subspace projections for hidden Markov models", IEEE Trans. Speech Audio Process., vol. 10, pp. 37-47, 2002.
- (2002) IEEE Trans. Speech Audio Process , vol.10 , pp. 37-47
- Gales, M.J.¹

15
- 0009938649
- Fast robust inverse transform SAT and multi-stage adaptation
- Lansdowne, VA, Feb, Morgan Kaufmann
- H. Jin, S. Matsoukas, R. Schwartz, and F. Kubala, "Fast robust inverse transform SAT and multi-stage adaptation", in Proceedings DARPA Broadcast News Transcription and Understanding Workshop, pp. 105-109, Lansdowne, VA, Feb. 1998. Morgan Kaufmann.
- (1998) Proceedings DARPA Broadcast News Transcription and Understanding Workshop , pp. 105-109
- Jin, H.¹ Matsoukas, S.² Schwartz, R.³ Kubala, F.⁴

16
- 0036296863
- Minimum phone error and I-smoothing for improved discriminative training
- Orlando, FL, May
- D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training", in Proc. ICASSP, vol. 1, pp. 105-108, Orlando, FL, May 2002.
- (2002) Proc. ICASSP , vol.1 , pp. 105-108
- Povey, D.¹ Woodland, P.C.²

17
- 44949090835
- Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures
- M. Hearst and M. Ostendorf, editors, Edmonton, Alberta, Canada, Mar, Association for Computational Linguistics
- I. Bulyko, M. Ostendorf, and A. Stolcke, "Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures", in M. Hearst and M. Ostendorf, editors, Proc. HLT-NAACL, vol. 2, pp. 7-9, Edmonton, Alberta, Canada, Mar. 2003. Association for Computational Linguistics.
- (2003) Proc. HLT-NAACL , vol.2 , pp. 7-9
- Bulyko, I.¹ Ostendorf, M.² Stolcke, A.³

18
- 33947616153
- Further progress in meeting recognition: The ICSI-SRI Spring 2005 speech-to-text evaluation system
- Edinburgh, July, National Institute of Standards and Technology
- A. Stolcke, X. Anguera, K. Boakye, Ö. Çetin, F. Grézl, A. Janin, A. Mandal, B. Peskin, C. Wooters, and J. Zheng, "Further progress in meeting recognition: The ICSI-SRI Spring 2005 speech-to-text evaluation system", in Proceedings of the Rich Transcription 2005 Spring Meeting Recognition Evaluation, pp. 39-50, Edinburgh, July 2005. National Institute of Standards and Technology.
- (2005) Proceedings of the Rich Transcription 2005 Spring Meeting Recognition Evaluation , pp. 39-50
- Stolcke, A.¹ Anguera, X.² Boakye, K.³ Çetin, Ö.⁴ Grézl, F.⁵ Janin, A.⁶ Mandal, A.⁷ Peskin, B.⁸ Wooters, C.⁹ Zheng, J.¹⁰

19
- 33745210540
- Incorporating tone-related MLP posteriors in the feature representation for Mandarin ASR
- Lisbon, Sep
- X. Lei, M.-Y. Hwang, and M. Ostendorf, "Incorporating tone-related MLP posteriors in the feature representation for Mandarin ASR", in Proc. Interspeech, pp. 2981-2984, Lisbon, Sep. 2005.
- (2005) Proc. Interspeech , pp. 2981-2984
- Lei, X.¹ Hwang, M.-Y.² Ostendorf, M.³

20
- 33745197525
- Porting Decipher from English to Mandarin
- Palisades, NY, Nov
- M. Hwang, X. Lei, T. Ng, M. Ostendorf, A. Stolcke, W. Wang, J. Zheng, and V. Gadde, "Porting Decipher from English to Mandarin", in Proc. DARPA Rich Transcription Workshop, Palisades, NY, Nov. 2004.
- (2004) Proc. DARPA Rich Transcription Workshop
- Hwang, M.¹ Lei, X.² Ng, T.³ Ostendorf, M.⁴ Stolcke, A.⁵ Wang, W.⁶ Zheng, J.⁷ Gadde, V.⁸

21
- 33745207357
- Development of a conversational telephone speech recognizer for Levantine Arabic
- Lisbon, Sep
- D. Vergyri, K. Kirchhoff, R. Gadde, A. Stolcke, and J. Zheng, "Development of a conversational telephone speech recognizer for Levantine Arabic", in Proc. Interspeech, pp. 1613-1616, Lisbon, Sep. 2005.
- (2005) Proc. Interspeech , pp. 1613-1616
- Vergyri, D.¹ Kirchhoff, K.² Gadde, R.³ Stolcke, A.⁴ Zheng, J.⁵

22
- 85009110467
- Morphology-based language modeling for Arabic speech recognition
- S. H. Kim and D. H. Youn, editors, Jeju, Korea, Oct
- D. Vergyri, K. Kirchhoff, K. Duh, and A. Stolcke, "Morphology-based language modeling for Arabic speech recognition", in S. H. Kim and D. H. Youn, editors, Proc. ICSLP, pp. 2245-2248, Jeju, Korea, Oct. 2004.
- (2004) Proc. ICSLP , pp. 2245-2248
- Vergyri, D.¹ Kirchhoff, K.² Duh, K.³ Stolcke, A.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.