Volume 22, Issue 5, 2005, Pages 81-88

Pushing the envelope - Aside

(15)  Morgan, Nelson a   Zhu, Qifeng b   Stolcke, Andreas b   Sönmez, Kemal c   Sivadas, Sunil d   Shinozaki, Takahiro e   Ostendorf, Mari f   Jain, Pratibha g   Hermansky, Hynek h   Ellis, Dan i   Doddington, George j   Chen, Barry a   Çetin, Özgür b   Bourlard, Hervé k   Athineos, Marios l  


Author keywords

[No Author keywords available]

Indexed keywords

FEATURE EXTRACTION; MARKOV PROCESSES; MATHEMATICAL MODELS; PROBABILITY; SPEECH PROCESSING; SPEECH SYNTHESIS; STATISTICAL METHODS;

EID: 85032751546     PISSN: 10535888     EISSN: None     Source Type: Journal    
DOI: 10.1109/MSP.2005.1511826     Document Type: Article
Times cited : (77)

References (28)
  • 1
    • 0028516073
    • "How do humans process and recognize speech?"
    • Oct
    • J. Allen, "How do humans process and recognize speech?" IEEE Trans. Speech Audio Processing, vol. 2, no. 4, pp. 567-577, Oct. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.4 , pp. 567-577
    • Allen, J.1
  • 2
    • 0016067897
    • "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification"
    • B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. America, vol. 55, no. 6, pp. 1304-1312, 1974.
    • (1974) J. Acoust. Soc. America , vol.55 , Issue.6 , pp. 1304-1312
    • Atal, B.1
  • 3
    • 84863773378
    • "Frequency-domain linear prediction for temporal features"
    • M. Athineos and D.P.W. Ellis, "Frequency-domain linear prediction for temporal features," in Proc. ASRU, 2003, pp. 261-266.
    • (2003) Proc. ASRU , pp. 261-266
    • Athineos, M.1    Ellis, D.P.W.2
  • 4
    • 53049096459
    • "LP-TRAP: Linear predictive temporal patterns"
    • M. Athineos, H. Hermansky, and D. Ellis, "LP-TRAP: Linear predictive temporal patterns," in Proc. ICSLP, 2004, pp. 949-952.
    • (2004) Proc. ICSLP , pp. 949-952
    • Athineos, M.1    Hermansky, H.2    Ellis, D.3
  • 6
    • 0031619381
    • "Maximum mutual information based reduction strategies for cross-correlation based joint distributional modeling"
    • Seattle
    • J. Bilmes, "Maximum mutual information based reduction strategies for cross-correlation based joint distributional modeling," in Proc. ICASSP-98, Seattle, 1998, pp. 469-472.
    • (1998) Proc. ICASSP-98 , pp. 469-472
    • Bilmes, J.1
  • 7
    • 0030142722
    • "Towards increasing speech recognition error rates"
    • May
    • H. Bourlard, H. Hermansky, and N. Morgan, "Towards increasing speech recognition error rates," Speech Commun., vol. 18, no. 3, pp. 205-231, May 1996.
    • (1996) Speech Commun. , vol.18 , Issue.3 , pp. 205-231
    • Bourlard, H.1    Hermansky, H.2    Morgan, N.3
  • 8
    • 27144520907
    • "Multi-rate and variable-rate modeling of speech at phone and syllable time scales"
    • Ö. Çetin and M. Ostendorf, "Multi-rate and variable-rate modeling of speech at phone and syllable time scales," in Proc. ICASSP 2005, pp. I-665-668.
    • (2005) Proc. ICASSP
    • Çetin, Ö.1    Ostendorf, M.2
  • 10
    • 27144509179
    • "Learning long term temporal features in LVCSR using neural networks"
    • B. Chen, Q. Zhu, and N. Morgan, "Learning long term temporal features in LVCSR using neural networks," in Proc. ICSLP, 2004, pp. 612-615.
    • (2004) Proc. ICSLP , pp. 612-615
    • Chen, B.1    Zhu, Q.2    Morgan, N.3
  • 11
    • 27144558023
    • "Eyes and ears for computers"
    • E. Davis and O. Selfridge, "Eyes and ears for computers," Proc. IRE, vol. 50, pp. 1093-1101, 1962.
    • (1962) Proc. IRE , vol.50 , pp. 1093-1101
    • Davis, E.1    Selfridge, O.2
  • 12
    • 0002629270
    • "Maximum likelihood from incomplete data via the EM algorithm"
    • A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. Royal Statist. Soc. Series B, vol. 39, pp. 1-38, 1977.
    • (1977) J. Royal Statist. Soc. Series B , vol.39 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 13
    • 85079090910
    • "Phonetic classification and recognition using HMM representation of overlapping articulatory features for all classes of English sounds"
    • Apr
    • L. Deng and D. Sun, "Phonetic classification and recognition using HMM representation of overlapping articulatory features for all classes of English sounds," Proc. ICASSP, Apr. 1994, pp. 45-48.
    • (1994) Proc. ICASSP , pp. 45-48
    • Deng, L.1    Sun, D.2
  • 14
    • 0002174507
    • "The vocoder"
    • Dec
    • H. Dudley, "The vocoder," Bell Labs Record, vol. 17, pp. 122-126, Dec. 1939.
    • (1939) Bell Labs Record , vol.17 , pp. 122-126
    • Dudley, H.1
  • 15
    • 0005029290
    • "The road not taken"
    • New York: Henry Holt and Co
    • R. Frost, "The road not taken," in Mountain Interval. New York: Henry Holt and Co., 1920.
    • (1920) Mountain Interval
    • Frost, R.1
  • 16
    • 0022667694
    • "Speaker independent isolated word recognizer using dynamic features of speech spectrum"
    • S. Furui, "Speaker independent isolated word recognizer using dynamic features of speech spectrum," IEEE Trans. Acoust. Speech Audio Processing, vol. 34, no. 1, pp. 52-59, 1986.
    • (1986) IEEE Trans. Acoust. Speech Audio Processing , vol.34 , Issue.1 , pp. 52-59
    • Furui, S.1
  • 17
    • 0027239233
    • "Improvements in connected digit recognition using linear discriminant analysis and mixture densities"
    • Adelaide, Australia
    • R. Haeb-Umbach, D. Geller, and H. Ney, "Improvements in connected digit recognition using linear discriminant analysis and mixture densities," Proc. IEEE Int. Conf. Acoustics Speech Signal Processing, Adelaide, Australia, 1994, vol. 2, pp. 239-242.
    • (1994) Proc. IEEE Int. Conf. Acoustics Speech Signal Processing , vol.2 , pp. 239-242
    • Haeb-Umbach, R.1    Geller, D.2    Ney, H.3
  • 19
    • 85009254284
    • "TRAPS - Classifiers of temporal patterns"
    • Sydney
    • H. Hermansky and S. Sharma, "TRAPS - Classifiers of temporal patterns," in Proc. ICSLP-98, Sydney, 1998, vol. 3, pp. 1003-1006.
    • (1998) Proc. ICSLP-98 , vol.3 , pp. 1003-1006
    • Hermansky, H.1    Sharma, S.2
  • 20
    • 27144439262
    • "Data-derived nonlinear mapping for feature extraction in HMM"
    • Keystone, CO
    • H. Hermansky, S. Sharma, and P. Jain, "Data-derived nonlinear mapping for feature extraction in HMM," in Proc. ASRU-99, Keystone, CO, 1999, pp. I-63-66.
    • (1999) Proc. ASRU-99
    • Hermansky, H.1    Sharma, S.2    Jain, P.3
  • 21
    • 0024905238
    • "A comparison of several acoustic representations for speech recognition with degraded and undegraded speech"
    • Glasgow, Scotland
    • M. Hunt and C. Lefebvre, "A comparison of several acoustic representations for speech recognition with degraded and undegraded speech," in Proc. IEEE Conf. Acoustics, Speech, Signal Processing, Glasgow, Scotland, 1989, pp. 262-265.
    • (1989) Proc. IEEE Conf. Acoustics, Speech, Signal Processing , pp. 262-265
    • Hunt, M.1    Lefebvre, C.2
  • 22
    • 85009233038
    • "Improving word accuracy with Gabor feature extraction"
    • Denver, CO, Sept
    • M. Kleinschmidt and D. Gelbart, "Improving word accuracy with Gabor feature extraction," in Proc. ICSLP-2002, Denver, CO, Sept. 2002, pp. 25-28.
    • (2002) Proc. ICSLP-2002 , pp. 25-28
    • Kleinschmidt, M.1    Gelbart, D.2
  • 23
    • 0346262152
    • "Real-time probabilistic segmentation for segment-based speech recognition"
    • Sydney
    • S. Lee and J. Glass, "Real-time probabilistic segmentation for segment-based speech recognition," in Proc. ICSLP-1998, Sydney, 1998, pp. 1803-1806.
    • (1998) Proc. ICSLP-1998 , pp. 1803-1806
    • Lee, S.1    Glass, J.2
  • 24
    • 0030245363
    • "From HMMs to segment models: A unified view of stochastic modeling for speech recognition"
    • M. Ostendorf, V. Digalakis, and O. Kimball, "From HMMs to segment models: A unified view of stochastic modeling for speech recognition," IEEE Trans. Acoustics, Speech, Signal Processing, vol. 4, no. 5, pp. 369-378, 1996.
    • (1996) IEEE Trans. Acoustics, Speech, Signal Processing , vol.4 , Issue.5 , pp. 369-378
    • Ostendorf, M.1    Digalakis, V.2    Kimball, O.3
  • 26
    • 85079097438
    • "IPA: Improved modelling with recurrent neural networks"
    • Apr
    • A. Robinson, M. Hochberg, and S. Renals, "IPA: Improved modelling with recurrent neural networks," in Proc. ICASSP-94, Apr. 1994, pp. 37-40.
    • (1994) Proc. ICASSP-94 , pp. 37-40
    • Robinson, A.1    Hochberg, M.2    Renals, S.3
  • 27
    • 85009115694
    • "Consonant discrimination in elicited and spontaneous speech: A case for signal-adaptive front ends in ASR"
    • Beijing, China, Oct
    • M. Sonmez, M. Plauche, E. Shriberg, and H. Franco, "Consonant discrimination in elicited and spontaneous speech: A case for signal-adaptive front ends in ASR," in Proc. ICSLP-2000, Beijing, China, Oct. 2000, pp. 548-551.
    • (2000) Proc. ICSLP-2000 , pp. 548-551
    • Sonmez, M.1    Plauche, M.2    Shriberg, E.3    Franco, H.4
  • 28
    • 0002915083
    • "Relevance of time-frequency features for phonetic and speaker-channel classification"
    • H. Yang, S. Van Vuuren, S. Sharma and H. Hermansky, "Relevance of time-frequency features for phonetic and speaker-channel classification," Speech Commun., vol. 31, no. 1, pp. 35-50, 2000.
    • (2000) Speech Commun. , vol.31 , Issue.1 , pp. 35-50
    • Yang, H.1    Van Vuuren, S.2    Sharma, S.3    Hermansky, H.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.