Volume 15, Issue 3, 2007, Pages 803-812

Using broad phonetic group experts for improved speech recognition

Author keywords

Automatic speech recognition; Broad phonetic groups (BPGs); Mixture of experts; Mutual information (MI)

Indexed keywords

ACOUSTIC MODELS; AUTOMATIC SPEECH RECOGNITION; BROAD PHONETIC GROUPS (BPGS); CONTEXT WINDOWS; CONTINUOUS SPEECH CORPORA; ERROR RATE REDUCTIONS; FEATURE SETS; INPUT FEATURES; MIXTURE OF EXPERTS; MUTUAL INFORMATION (MI); PHONE RECOGNITION; PHONEME RECOGNITION; PROPOSED ARCHITECTURES; SPECTRAL STRUCTURES; SPEECH RECOGNITION SYSTEMS; TEMPORAL CHARACTERISTICS; TIME FREQUENCIES

EID: 44849122716     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.885907     Document Type: Article
Times cited: 52

References (14)
  • 1
    • 0031619381
    • Maximum mutual information based reduction strategies for cross-correlation based joint distribution modelling
    • Seattle, WA
    • J. Bilmes, "Maximum mutual information based reduction strategies for cross-correlation based joint distribution modelling," in Proc. Int. Conf. Acoust., Speech, Signal Process., Seattle, WA, 1998, pp. 469-472.
    • (1998) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 469-472
    • Bilmes, J.1
  • 3
    • 85009072541
    • An elitist approach to articulatory-acoustic feature classification
    • S. Chang, S. Greenberg, and M. Wester, "An elitist approach to articulatory-acoustic feature classification," in Proc. Eurospeech, 2001, pp. 1725-1728.
    • (2001) Proc. Eurospeech , pp. 1725-1728
    • Chang, S.1    Greenberg, S.2    Wester, M.3
  • 5
    • 85135152036
    • Heterogeneous acoustic measurements for phonetic classification
    • A. Halberstadt and J. Glass, "Heterogeneous acoustic measurements for phonetic classification," in Proc. Eurospeech, 1997, pp. 401-404.
    • (1997) Proc. Eurospeech , pp. 401-404
    • Halberstadt, A.1    Glass, J.2
  • 6
    • 0025041264
    • Perceptual linear predictive (PLP) analysis for speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis for speech," J. Acoust. Soc. Amer., vol. 87, pp. 1738-1752, 1990.
    • (1990) J. Acoust. Soc. Amer , vol.87 , pp. 1738-1752
    • Hermansky, H.1
  • 7
    • 0027574303
    • An information theoretical investigation into the distribution of phonetic information across the auditory spectrogram
    • A. Morris, J. Schwartz, and P. Escudier, "An information theoretical investigation into the distribution of phonetic information across the auditory spectrogram," Comput. Speech Lang., vol. 7, no. 2, pp. 121-136, 1993.
    • (1993) Comput. Speech Lang , vol.7 , Issue.2 , pp. 121-136
    • Morris, A.1    Schwartz, J.2    Escudier, P.3
  • 9
    • 0141642202
    • Experiments in speech recognition using a modular MLP architecture for acoustic modelling
    • T. Reynolds and C. Antoniou, "Experiments in speech recognition using a modular MLP architecture for acoustic modelling," Inf. Sci., vol. 156, pp. 39-54, 2003.
    • (2003) Inf. Sci , vol.156 , pp. 39-54
    • Reynolds, T.1    Antoniou, C.2
  • 10
    • 85009216422
    • Using mutual information to design class specific phone recognizers
    • P. Scanlon, D. P. W. Ellis, and R. Reilly, "Using mutual information to design class specific phone recognizers," in Proc. Eurospeech, 2003, pp. 857-860.
    • (2003) Proc. Eurospeech , pp. 857-860
    • Scanlon, P.1    Ellis, D.P.W.2    Reilly, R.3
  • 11
    • 0032639912
    • Using boosting to improve a hybrid HMM/neural network speech recognizer
    • H. Schwenk, "Using boosting to improve a hybrid HMM/neural network speech recognizer," in Proc. Int. Conf. Acoust., Speech, Signal Process., 1999, pp. 1009-1012.
    • (1999) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 1009-1012
    • Schwenk, H.1
  • 13
    • 0002915083
    • Relevance of time-frequency features for phonetic and speaker-channel classification
    • H. Yang, S. V. Vuuren, S. Sharma, and H. Hermansky, "Relevance of time-frequency features for phonetic and speaker-channel classification," Speech Commun., vol. 31, pp. 35-50, 2000.
    • (2000) Speech Commun , vol.31 , pp. 35-50
    • Yang, H.1    Vuuren, S.V.2    Sharma, S.3    Hermansky, H.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.