메뉴 건너뛰기




Volumn 1, Issue , 2006, Pages

Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons

Author keywords

[No Author keywords available]

Indexed keywords

MATHEMATICAL MODELS; PARAMETER ESTIMATION; SPEECH RECOGNITION; TELEPHONE SETS; VOCABULARY CONTROL;

EID: 33947619591     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (105)

References (22)
  • 1
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • Apr
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech", J. Acoust. Soc. Am., vol. 87, pp. 1738-1752, Apr. 1990.
    • (1990) J. Acoust. Soc. Am , vol.87 , pp. 1738-1752
    • Hermansky, H.1
  • 2
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • Istanbul, June
    • H. Hermansky, D. P. W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems", in Proc. ICASSP, pp. 1635-1638, Istanbul, June 2000.
    • (2000) Proc. ICASSP , pp. 1635-1638
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 4
    • 0032658253 scopus 로고    scopus 로고
    • Temporal patterns (TRAPs) in ASR of noisy speech
    • Phoenix, AZ, Mar
    • H. Hermansky and S. Sharma, "Temporal patterns (TRAPs) in ASR of noisy speech", in Proc. ICASSP, vol. 2, pp. 289-292, Phoenix, AZ, Mar. 1999.
    • (1999) Proc. ICASSP , vol.2 , pp. 289-292
    • Hermansky, H.1    Sharma, S.2
  • 5
    • 4544224866 scopus 로고    scopus 로고
    • TRAPping conversational speech: Extending TRAP/Tandem approaches to conversational telephone speech recognition
    • Montreal, May
    • N. Morgan, B. Y. Chen, Q. Zhu, and A. Stolcke, "TRAPping conversational speech: Extending TRAP/Tandem approaches to conversational telephone speech recognition", in Proc. ICASSP, vol. 1, pp. 536-539, Montreal, May 2004.
    • (2004) Proc. ICASSP , vol.1 , pp. 536-539
    • Morgan, N.1    Chen, B.Y.2    Zhu, Q.3    Stolcke, A.4
  • 6
    • 33745185321 scopus 로고    scopus 로고
    • Using MLP features in SRI's conversational speech recognition system
    • Lisbon, Sep
    • Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan, "Using MLP features in SRI's conversational speech recognition system", in Proc. Interspeech, pp. 2141-2144, Lisbon, Sep. 2005.
    • (2005) Proc. Interspeech , pp. 2141-2144
    • Zhu, Q.1    Stolcke, A.2    Chen, B.Y.3    Morgan, N.4
  • 7
    • 85009097225 scopus 로고    scopus 로고
    • On using MLP features in LVCSR
    • S. H. Kim and D. H. Youn, editors, Jeju, Korea, Oct
    • Q. Zhu, B. Chen, N. Morgan, and A. Stolcke, "On using MLP features in LVCSR", in S. H. Kim and D. H. Youn, editors, Proc. ICSLP, pp. 921-924, Jeju, Korea, Oct. 2004.
    • (2004) Proc. ICSLP , pp. 921-924
    • Zhu, Q.1    Chen, B.2    Morgan, N.3    Stolcke, A.4
  • 8
    • 85009110188 scopus 로고    scopus 로고
    • Learning long-term temporal features in LVCSR using neural networks
    • S. H. Kim and D. H. Youn, editors, Jeju, Korea, Oct
    • B. Y. Chen, Q. Zhu, and N. Morgan, "Learning long-term temporal features in LVCSR using neural networks", in S. H. Kim and D. H. Youn, editors, Proc. ICSLP, Jeju, Korea, Oct. 2004.
    • (2004) Proc. ICSLP
    • Chen, B.Y.1    Zhu, Q.2    Morgan, N.3
  • 9
    • 0141676589 scopus 로고    scopus 로고
    • New entropy based combination rules in HMM/ANN multi-stream ASR
    • Hong Kong, Apr
    • H. Misra, H. Bourlard, and V. Tyagi, "New entropy based combination rules in HMM/ANN multi-stream ASR", in Proc. ICASSP, vol. 2, pp. 741-744, Hong Kong, Apr. 2003.
    • (2003) Proc. ICASSP , vol.2 , pp. 741-744
    • Misra, H.1    Bourlard, H.2    Tyagi, V.3
  • 10
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of HMMs
    • C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of HMMs", Computer Speech and Language, vol. 9, pp. 171-186, 1995.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-186
    • Leggetter, C.1    Woodland, P.2
  • 11
    • 0141703284 scopus 로고    scopus 로고
    • Prosodic knowledge sources for automatic speech recognition
    • Hong Kong, Apr
    • D. Vergyri, A. Stolcke, V. R. R. Gadde, L. Ferrer, and E. Shriberg, "Prosodic knowledge sources for automatic speech recognition", in Proc. ICASSP, vol. 1, pp. 208-211, Hong Kong, Apr. 2003.
    • (2003) Proc. ICASSP , vol.1 , pp. 208-211
    • Vergyri, D.1    Stolcke, A.2    Gadde, V.R.R.3    Ferrer, L.4    Shriberg, E.5
  • 12
    • 0029764708 scopus 로고    scopus 로고
    • Speaker normalization on conversational telephone speech
    • Atlanta, May
    • S. Wegmann, D. McAllaster, J. Orloff, and B. Peskin, "Speaker normalization on conversational telephone speech", in Proc. ICASSP, vol. 1, pp. 339-341, Atlanta, May 1996.
    • (1996) Proc. ICASSP , vol.1 , pp. 339-341
    • Wegmann, S.1    McAllaster, D.2    Orloff, J.3    Peskin, B.4
  • 14
    • 0036475982 scopus 로고    scopus 로고
    • Maximum likelihood multiple subspace projections for hidden Markov models
    • M. J. Gales, "Maximum likelihood multiple subspace projections for hidden Markov models", IEEE Trans. Speech Audio Process., vol. 10, pp. 37-47, 2002.
    • (2002) IEEE Trans. Speech Audio Process , vol.10 , pp. 37-47
    • Gales, M.J.1
  • 16
    • 0036296863 scopus 로고    scopus 로고
    • Minimum phone error and Ismoothing for improved discriminative training
    • Orlando, FL, May
    • D. Povey and P. C. Woodland, "Minimum phone error and Ismoothing for improved discriminative training", in Proc. ICASSP, vol. 1, pp. 105-108, Orlando, FL, May 2002.
    • (2002) Proc. ICASSP , vol.1 , pp. 105-108
    • Povey, D.1    Woodland, P.C.2
  • 17
    • 44949090835 scopus 로고    scopus 로고
    • Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures
    • M. Hearst and M. Ostendorf, editors, Edmonton, Alberta, Canada, Mar, Association for Computational Linguistics
    • I. Bulyko, M. Ostendorf, and A. Stolcke, "Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures", in M. Hearst and M. Ostendorf, editors, Proc. HLT-NAACL, vol. 2, pp. 7-9, Edmonton, Alberta, Canada, Mar. 2003. Association for Computational Linguistics.
    • (2003) Proc. HLT-NAACL , vol.2 , pp. 7-9
    • Bulyko, I.1    Ostendorf, M.2    Stolcke, A.3
  • 19
    • 33745210540 scopus 로고    scopus 로고
    • Incorporating tone-related MLP posteriors in the feature representation for mandarin ASR
    • Lisbon, Sep
    • X. Lei, M.-Y. Hwang, and M. Ostendorf, "Incorporating tone-related MLP posteriors in the feature representation for mandarin ASR", in Proc. Interspeech, pp. 2981-2984, Lisbon, Sep. 2005.
    • (2005) Proc. Interspeech , pp. 2981-2984
    • Lei, X.1    Hwang, M.-Y.2    Ostendorf, M.3
  • 21
    • 33745207357 scopus 로고    scopus 로고
    • Development of a conversational telephone speech recognizer for Levantine Arabic
    • Lisbon, Sep
    • D. Vergyri, K. Kirchhoff, R. Gadde, A. Stolcke, and J. Zheng, "Development of a conversational telephone speech recognizer for Levantine Arabic", in Proc. Interspeech, pp. 1613-1616, Lisbon, Sep. 2005.
    • (2005) Proc. Interspeech , pp. 1613-1616
    • Vergyri, D.1    Kirchhoff, K.2    Gadde, R.3    Stolcke, A.4    Zheng, J.5
  • 22
    • 85009110467 scopus 로고    scopus 로고
    • Morphology-based language modeling for Arabic speech recognition
    • S. H. Kim and D. H. Youn, editors, Jeju, Korea, Oct
    • D. Vergyri, K. Kirchhoff, K. Duh, and A. Stolcke, "Morphology-based language modeling for Arabic speech recognition", in S. H. Kim and D. H. Youn, editors, Proc. ICSLP, pp. 2245-2248, Jeju, Korea, Oct. 2004.
    • (2004) Proc. ICSLP , pp. 2245-2248
    • Vergyri, D.1    Kirchhoff, K.2    Duh, K.3    Stolcke, A.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.