메뉴 건너뛰기




Volumn 45, Issue 1, 2005, Pages 63-87

A schema-based model for phonemic restoration

Author keywords

Computational auditory scene analysis; Dynamic time warping; Missing data ASR; Phonemic restoration; Prediction; Speech schemas; Top down model

Indexed keywords

COMPUTATIONAL AUDITORY SCENE ANALYSIS; DYNAMIC TIME WARPING; MISSING DATA ASR; PHONEMIC RESTORATION; PREDICTION; SPEECH SCHEMES; TOP-DOWN MODELS;

EID: 11144339352     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2004.09.002     Document Type: Article
Times cited : (31)

References (53)
  • 2
    • 0026826028 scopus 로고
    • Increasing the intelligibility of speech through multiple phonemic restorations
    • J.A. Bashford, K.R. Riener, and R.M. Warren Increasing the intelligibility of speech through multiple phonemic restorations Percept. Psychophys. 51 1992 211 217
    • (1992) Percept. Psychophys. , vol.51 , pp. 211-217
    • Bashford, J.A.1    Riener, K.R.2    Warren, R.M.3
  • 4
    • 85131244943 scopus 로고
    • Asking the "what for" question in auditory perception
    • M. Kubovy J.R. Pomerantz Lawrence Erlbaum Associates Hillsdale, NJ
    • A.S. Bregman Asking the "what for" question in auditory perception M. Kubovy J.R. Pomerantz Perceptual Organization 1981 Lawrence Erlbaum Associates Hillsdale, NJ 99 118
    • (1981) Perceptual Organization , pp. 99-118
    • Bregman, A.S.1
  • 6
    • 0028531926 scopus 로고
    • Computational auditory scene analysis
    • G.J. Brown, and M.P. Cooke Computational auditory scene analysis Comp. Speech Lang. 8 1994 297 336
    • (1994) Comp. Speech Lang. , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.P.2
  • 7
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho Robust automatic speech recognition with missing and unreliable acoustic data Speech Commun. 34 2001 267 285
    • (2001) Speech Commun. , vol.34 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 8
    • 0027879087 scopus 로고
    • Computational auditory scence analysis: Exploiting principles of perceived continuity
    • M.P. Cooke, and G.J. Brown Computational auditory scence analysis: exploiting principles of perceived continuity Speech Commun. 13 1993 391 399
    • (1993) Speech Commun. , vol.13 , pp. 391-399
    • Cooke, M.P.1    Brown, G.J.2
  • 9
    • 0031619912 scopus 로고    scopus 로고
    • Speaker verification in noisy environment with combined spectral subtraction and missing data theory
    • Drygajlo, A., El-Maliki, M., 1998. Speaker verification in noisy environment with combined spectral subtraction and missing data theory. In: Proc. ICASSP '98, Vol. 1, pp. 121-124
    • (1998) Proc. ICASSP '98 , vol.1 , pp. 121-124
    • Drygajlo, A.1    El-Maliki, M.2
  • 10
    • 0032626792 scopus 로고    scopus 로고
    • Using knowledge to orgnaize sound: The prediction-driven approach to computational auditory scene analysis, and its application to speech/non-speech mixtures
    • D.P.W. Ellis Using knowledge to orgnaize sound: the prediction-driven approach to computational auditory scene analysis, and its application to speech/non-speech mixtures Speech Commun. 27 1999 281 298
    • (1999) Speech Commun. , vol.27 , pp. 281-298
    • Ellis, D.P.W.1
  • 11
    • 0030237557 scopus 로고    scopus 로고
    • Words and voices: Episodic traces in spoken word identification and recognition memory
    • S.D. Goldinger Words and voices: episodic traces in spoken word identification and recognition memory J. Exp. Psychol. Learn. 22 1996 1166 1183
    • (1996) J. Exp. Psychol. Learn. , vol.22 , pp. 1166-1183
    • Goldinger, S.D.1
  • 12
    • 0345443173 scopus 로고    scopus 로고
    • Puzzle-solving science: The quixotic quest for units in speech perception
    • S.D. Goldinger, and T. Azuma Puzzle-solving science: the quixotic quest for units in speech perception J. Phonetics 31 2003 305 320
    • (2003) J. Phonetics , vol.31 , pp. 305-320
    • Goldinger, S.D.1    Azuma, T.2
  • 14
    • 0034172725 scopus 로고    scopus 로고
    • Internet telephony: Services, technical challenges, and products
    • M. Hassan, A. Nayandoro, and M. Atiquzzaman Internet telephony: services, technical challenges, and products IEEE Commun. 38 2000 96 103
    • (2000) IEEE Commun. , vol.38 , pp. 96-103
    • Hassan, M.1    Nayandoro, A.2    Atiquzzaman, M.3
  • 16
    • 4644265990 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • G. Hu, and D.L. Wang Monaural speech segregation based on pitch tracking and amplitude modulation IEEE Trans. Neural Networks 15 2004 1135 1150
    • (2004) IEEE Trans. Neural Networks , vol.15 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 17
    • 85009210599 scopus 로고    scopus 로고
    • New model-based HMM distances with applications to run-time ASR error estimation and model tuning
    • Huang, C.S., Lee, C.H., Wang, H.C., 2003. New model-based HMM distances with applications to run-time ASR error estimation and model tuning. In: Proc. Eurospeech '03, pp. 457-460
    • (2003) Proc. Eurospeech '03 , pp. 457-460
    • Huang, C.S.1    Lee, C.H.2    Wang, H.C.3
  • 18
    • 11144249505 scopus 로고    scopus 로고
    • DALL: Davidson's algorithm for log likelihood maximization - A subroutine for statistical model builders
    • The Institute of Statistical Mathematics
    • Ishiguro, M., Akaike, H., 1999. DALL: Davidson's algorithm for log likelihood maximization - a subroutine for statistical model builders. In: Computer Science Monographs, No. 25. The Institute of Statistical Mathematics
    • (1999) Computer Science Monographs , Issue.25
    • Ishiguro, M.1    Akaike, H.2
  • 20
    • 0032118931 scopus 로고    scopus 로고
    • An application of the Bayesian time series model and statistical system analysis for F0 control
    • H. Kato, and H. Kawahara An application of the Bayesian time series model and statistical system analysis for F0 control Speech Commun. 24 1998 325 339
    • (1998) Speech Commun. , vol.24 , pp. 325-339
    • Kato, H.1    Kawahara, H.2
  • 21
    • 0002560960 scopus 로고
    • A database for speaker-independent digit recognition
    • Leonard, R.G., 1984. A database for speaker-independent digit recognition. In: Proc. ICASSP '84. pp. 111-114
    • (1984) Proc. ICASSP '84 , pp. 111-114
    • Leonard, G.R.1
  • 22
    • 0032651334 scopus 로고    scopus 로고
    • Dynamic sound stream formation based on continuity of spectral change
    • I. Masuda-Katsuse, and H. Kawahara Dynamic sound stream formation based on continuity of spectral change Speech Commun. 27 1999 235 259
    • (1999) Speech Commun. , vol.27 , pp. 235-259
    • Masuda-Katsuse, I.1    Kawahara, H.2
  • 23
    • 84933250500 scopus 로고
    • The intelligibility of interrupted speech
    • G.A. Miller, and J.C.R. Licklider The intelligibility of interrupted speech J. Acoust. Soc. Am. 22 1950 167 173
    • (1950) J. Acoust. Soc. Am. , vol.22 , pp. 167-173
    • Miller, G.A.1    Licklider, J.C.R.2
  • 25
    • 0025543906 scopus 로고
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines, and F. Charpentier Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones Speech Commun. 9 1990 453 467
    • (1990) Speech Commun. , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 26
    • 0032630841 scopus 로고    scopus 로고
    • Harmonic sound stream segregation using localization and its application to speech stream segregation
    • T. Nakatani, and H.G. Okuno Harmonic sound stream segregation using localization and its application to speech stream segregation Speech Commun. 27 1999 209 222
    • (1999) Speech Commun. , vol.27 , pp. 209-222
    • Nakatani, T.1    Okuno, H.G.2
  • 27
    • 0001825373 scopus 로고
    • Visual surface representation: A critical link between lower-level and higher-level vision
    • S.M. Kosslyn D.N. Osherson The MIT Press Cambridge
    • K. Nakayama, Z.J. He, and S. Shimojo Visual surface representation: a critical link between lower-level and higher-level vision S.M. Kosslyn D.N. Osherson An Invitation to Cognitive Science 1995 The MIT Press Cambridge 1 70
    • (1995) An Invitation to Cognitive Science , pp. 1-70
    • Nakayama, K.1    He, Z.J.2    Shimojo, S.3
  • 28
    • 0032044939 scopus 로고    scopus 로고
    • Talker-specific learning in speech perception
    • L.C. Nygaard, and D.B. Pisoni Talker-specific learning in speech perception Percept. Psychophys. 60 1998 335 376
    • (1998) Percept. Psychophys. , vol.60 , pp. 335-376
    • Nygaard, L.C.1    Pisoni, D.B.2
  • 30
    • 0032155046 scopus 로고    scopus 로고
    • A survey of packet loss recovery techniques for streaming audio
    • C. Perkins, O. Hodson, and V. Hardman A survey of packet loss recovery techniques for streaming audio IEEE Network 12 1998 40 48
    • (1998) IEEE Network , vol.12 , pp. 40-48
    • Perkins, C.1    Hodson, O.2    Hardman, V.3
  • 35
    • 0026436413 scopus 로고
    • Perceptual restoration of a "missing" speech sound: Auditory induction or illusion?
    • B.H. Repp Perceptual restoration of a "missing" speech sound: Auditory induction or illusion? Percept. Psychophys. 51 1992 14 32
    • (1992) Percept. Psychophys. , vol.51 , pp. 14-32
    • Repp, B.H.1
  • 36
    • 0019624814 scopus 로고
    • The role of bottom-up confirmation in the phonemic restoration illusion
    • A.G. Samuel The role of bottom-up confirmation in the phonemic restoration illusion J. Exp. Psychol.: Hum. Percept. Perform. 7 1981 1124 1131
    • (1981) J. Exp. Psychol.: Hum. Percept. Perform. , vol.7 , pp. 1124-1131
    • Samuel, A.G.1
  • 37
    • 0031090373 scopus 로고    scopus 로고
    • Lexical activation produces potent phonemic percepts
    • A.G. Samuel Lexical activation produces potent phonemic percepts Cogn. Psychol. 32 1997 97 127
    • (1997) Cogn. Psychol. , vol.32 , pp. 97-127
    • Samuel, A.G.1
  • 38
    • 85009180557 scopus 로고    scopus 로고
    • A harmonic-model-based front end for robust speech recognition
    • Seltzer, M.L., Droppo, J., Acero, A., 2003. A harmonic-model-based front end for robust speech recognition. In: Proc. Eurospeech '03. pp. 1277-1280
    • (2003) Proc. Eurospeech '03 , pp. 1277-1280
    • Seltzer, M.L.1    Droppo, J.2    Acero, A.3
  • 40
    • 85009193720 scopus 로고    scopus 로고
    • Schema-based modeling of phonemic restoration
    • Srinivasan, S., Wang, D.L., 2003. Schema-based modeling of phonemic restoration. In: Proc. Eurospeech '03. pp. 2053-2056
    • (2003) Proc. Eurospeech '03 , pp. 2053-2056
    • Srinivasan, S.1    Wang, D.L.2
  • 43
    • 0020731187 scopus 로고
    • Intelligibility of interrupted meaningful and nonsense speech with and without intervening noise
    • J. Verschuure, and M.P. Brocaar Intelligibility of interrupted meaningful and nonsense speech with and without intervening noise Percept. Psychophys. 33 1983 232 240
    • (1983) Percept. Psychophys. , vol.33 , pp. 232-240
    • Verschuure, J.1    Brocaar, M.P.2
  • 44
    • 0032682770 scopus 로고    scopus 로고
    • Separation of speech from interfering sounds based on oscillatory correlation
    • D.L. Wang, and G.J. Brown Separation of speech from interfering sounds based on oscillatory correlation IEEE Trans. Neural Networks 10 3 1999 684 697
    • (1999) IEEE Trans. Neural Networks , vol.10 , Issue.3 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 45
    • 0014959117 scopus 로고
    • Perceptual restoration of missing speech sounds
    • R.M. Warren Perceptual restoration of missing speech sounds Science 167 1970 392 393
    • (1970) Science , vol.167 , pp. 392-393
    • Warren, R.M.1
  • 48
    • 0000145053 scopus 로고
    • Speech perception and phonemic restorations
    • R.M. Warren, and C.J. Obusek Speech perception and phonemic restorations Percept. Psychophys. 9 1971 358 362
    • (1971) Percept. Psychophys. , vol.9 , pp. 358-362
    • Warren, R.M.1    Obusek, C.J.2
  • 49
    • 0016301459 scopus 로고
    • Phonemic restorations based on subsequent context
    • R.M. Warren, and G.L. Sherman Phonemic restorations based on subsequent context Percept. Psychophys. 16 1974 150 156
    • (1974) Percept. Psychophys. , vol.16 , pp. 150-156
    • Warren, R.M.1    Sherman, G.L.2
  • 52
    • 0030244826 scopus 로고    scopus 로고
    • A review of large-vocabulary continuous-speech recognition
    • S. Young A review of large-vocabulary continuous-speech recognition IEEE Signal Process. Mag. 13 1996 45 57
    • (1996) IEEE Signal Process. Mag. , vol.13 , pp. 45-57
    • Young, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.