SCOPUS Information Retrieval Platform

Volume 45, Issue 1, 2005, Pages 63-87

A schema-based model for phonemic restoration

Authors (2): Srinivasan, Soundararajan (a); Wang, Deliang (a)

Author keywords

Computational auditory scene analysis; Dynamic time warping; Missing data ASR; Phonemic restoration; Prediction; Speech schemas; Top down model

Indexed keywords

COMPUTATIONAL AUDITORY SCENE ANALYSIS; DYNAMIC TIME WARPING; MISSING DATA ASR; PHONEMIC RESTORATION; PREDICTION; SPEECH SCHEMES; TOP-DOWN MODELS;

AUDITION; COMPUTATIONAL LINGUISTICS; DATA REDUCTION; IMAGE ANALYSIS; ITERATIVE METHODS; SPEECH RECOGNITION;

SPEECH ANALYSIS;

EID: 11144339352 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2004.09.002 Document Type: Article

Times cited : (31)

References (53)

1
- 0004093046
- Prentice-Hall, Inc. Englewood Cliffs, NJ
- B.D.O. Anderson, and J.B. Moore Optimal Filtering 1979 Prentice-Hall, Inc. Englewood Cliffs, NJ
- (1979) Optimal Filtering
- Anderson, B.D.O.¹ Moore, J.B.²

2
- 0026826028
- Increasing the intelligibility of speech through multiple phonemic restorations
- J.A. Bashford, K.R. Riener, and R.M. Warren Increasing the intelligibility of speech through multiple phonemic restorations Percept. Psychophys. 51 1992 211 217
- (1992) Percept. Psychophys. , vol.51 , pp. 211-217
- Bashford, J.A.¹ Riener, K.R.² Warren, R.M.³

3
- 0038120523
- Boersma, P., Weenink, D., 2002. Praat: Doing Phonetics by Computer, Version 4.0.26. Available from:
- (2002) Praat: Doing Phonetics by Computer, Version 4.0.26
- Boersma, P.¹ Weenink, D.²

4
- 85131244943
- Asking the "what for" question in auditory perception
- M. Kubovy J.R. Pomerantz Lawrence Erlbaum Associates Hillsdale, NJ
- A.S. Bregman Asking the "what for" question in auditory perception M. Kubovy J.R. Pomerantz Perceptual Organization 1981 Lawrence Erlbaum Associates Hillsdale, NJ 99 118
- (1981) Perceptual Organization , pp. 99-118
- Bregman, A.S.¹

5
- 0003684441
- The MIT Press Cambridge, MA
- A.S. Bregman Auditory Scene Analysis 1990 The MIT Press Cambridge, MA
- (1990) Auditory Scene Analysis
- Bregman, A.S.¹

6
- 0028531926
- Computational auditory scene analysis
- G.J. Brown, and M.P. Cooke Computational auditory scene analysis Comp. Speech Lang. 8 1994 297 336
- (1994) Comp. Speech Lang. , vol.8 , pp. 297-336
- Brown, G.J.¹ Cooke, M.P.²

7
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho Robust automatic speech recognition with missing and unreliable acoustic data Speech Commun. 34 2001 267 285
- (2001) Speech Commun. , vol.34 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

8
- 0027879087
- Computational auditory scene analysis: Exploiting principles of perceived continuity
- M.P. Cooke, and G.J. Brown Computational auditory scene analysis: exploiting principles of perceived continuity Speech Commun. 13 1993 391 399
- (1993) Speech Commun. , vol.13 , pp. 391-399
- Cooke, M.P.¹ Brown, G.J.²

9
- 0031619912
- Speaker verification in noisy environment with combined spectral subtraction and missing data theory
- Drygajlo, A., El-Maliki, M., 1998. Speaker verification in noisy environment with combined spectral subtraction and missing data theory. In: Proc. ICASSP '98, Vol. 1, pp. 121-124
- (1998) Proc. ICASSP '98 , vol.1 , pp. 121-124
- Drygajlo, A.¹ El-Maliki, M.²

10
- 0032626792
- Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis, and its application to speech/non-speech mixtures
- D.P.W. Ellis Using knowledge to organize sound: the prediction-driven approach to computational auditory scene analysis, and its application to speech/non-speech mixtures Speech Commun. 27 1999 281 298
- (1999) Speech Commun. , vol.27 , pp. 281-298
- Ellis, D.P.W.¹

11
- 0030237557
- Words and voices: Episodic traces in spoken word identification and recognition memory
- S.D. Goldinger Words and voices: episodic traces in spoken word identification and recognition memory J. Exp. Psychol. Learn. 22 1996 1166 1183
- (1996) J. Exp. Psychol. Learn. , vol.22 , pp. 1166-1183
- Goldinger, S.D.¹

12
- 0345443173
- Puzzle-solving science: The quixotic quest for units in speech perception
- S.D. Goldinger, and T. Azuma Puzzle-solving science: the quixotic quest for units in speech perception J. Phonetics 31 2003 305 320
- (2003) J. Phonetics , vol.31 , pp. 305-320
- Goldinger, S.D.¹ Azuma, T.²

13
- 0017097474
- Distance measures for speech processing
- A.H. Gray, and J.D. Markel Distance measures for speech processing IEEE Trans. Acoust. Speech Signal Process. ASSP-24 5 1976 380 391
- (1976) IEEE Trans. Acoust. Speech Signal Process. , vol.24 , Issue.5 , pp. 380-391
- Gray, A.H.¹ Markel, J.D.²

14
- 0034172725
- Internet telephony: Services, technical challenges, and products
- M. Hassan, A. Nayandoro, and M. Atiquzzaman Internet telephony: services, technical challenges, and products IEEE Commun. 38 2000 96 103
- (2000) IEEE Commun. , vol.38 , pp. 96-103
- Hassan, M.¹ Nayandoro, A.² Atiquzzaman, M.³

15
- 0035688755
- Robust matching of audio signals using spectral flatness features
- Herre, J., Allamanche, E., Hellmuth, O., 2001. Robust matching of audio signals using spectral flatness features. In: Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics '01. pp. 127-130
- (2001) Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics '01 , pp. 127-130
- Herre, J.¹ Allamanche, E.² Hellmuth, O.³

16
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- G. Hu, and D.L. Wang Monaural speech segregation based on pitch tracking and amplitude modulation IEEE Trans. Neural Networks 15 2004 1135 1150
- (2004) IEEE Trans. Neural Networks , vol.15 , pp. 1135-1150
- Hu, G.¹ Wang, D.L.²

17
- 85009210599
- New model-based HMM distances with applications to run-time ASR error estimation and model tuning
- Huang, C.S., Lee, C.H., Wang, H.C., 2003. New model-based HMM distances with applications to run-time ASR error estimation and model tuning. In: Proc. Eurospeech '03, pp. 457-460
- (2003) Proc. Eurospeech '03 , pp. 457-460
- Huang, C.S.¹ Lee, C.H.² Wang, H.C.³

18
- 11144249505
- DALL: Davidson's algorithm for log likelihood maximization - A subroutine for statistical model builders
- The Institute of Statistical Mathematics
- Ishiguro, M., Akaike, H., 1999. DALL: Davidson's algorithm for log likelihood maximization - a subroutine for statistical model builders. In: Computer Science Monographs, No. 25. The Institute of Statistical Mathematics
- (1999) Computer Science Monographs , Issue.25
- Ishiguro, M.¹ Akaike, H.²

19
- 0003579084
- Prentice-Hall, Inc. Englewood Cliffs, NJ
- N.S. Jayant, and P. Noll Digital Coding of Waveforms 1984 Prentice-Hall, Inc. Englewood Cliffs, NJ
- (1984) Digital Coding of Waveforms
- Jayant, N.S.¹ Noll, P.²

20
- 0032118931
- An application of the Bayesian time series model and statistical system analysis for F0 control
- H. Kato, and H. Kawahara An application of the Bayesian time series model and statistical system analysis for F0 control Speech Commun. 24 1998 325 339
- (1998) Speech Commun. , vol.24 , pp. 325-339
- Kato, H.¹ Kawahara, H.²

21
- 0002560960
- A database for speaker-independent digit recognition
- Leonard, R.G., 1984. A database for speaker-independent digit recognition. In: Proc. ICASSP '84. pp. 111-114
- (1984) Proc. ICASSP '84 , pp. 111-114
- Leonard, R.G.¹

22
- 0032651334
- Dynamic sound stream formation based on continuity of spectral change
- I. Masuda-Katsuse, and H. Kawahara Dynamic sound stream formation based on continuity of spectral change Speech Commun. 27 1999 235 259
- (1999) Speech Commun. , vol.27 , pp. 235-259
- Masuda-Katsuse, I.¹ Kawahara, H.²

23
- 84933250500
- The intelligibility of interrupted speech
- G.A. Miller, and J.C.R. Licklider The intelligibility of interrupted speech J. Acoust. Soc. Am. 22 1950 167 173
- (1950) J. Acoust. Soc. Am. , vol.22 , pp. 167-173
- Miller, G.A.¹ Licklider, J.C.R.²

24
- 11144303109
- Diphone synthesis using a multipulse LPC technique
- Moulines, E., Charpentier, F., 1988. Diphone synthesis using a multipulse LPC technique. In: Proc. The Federation of Acoustical Societies of Europe International Conference '88, pp. 47-55
- (1988) Proc. the Federation of Acoustical Societies of Europe International Conference '88 , pp. 47-55
- Moulines, E.¹ Charpentier, F.²

25
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- E. Moulines, and F. Charpentier Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones Speech Commun. 9 1990 453 467
- (1990) Speech Commun. , vol.9 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

26
- 0032630841
- Harmonic sound stream segregation using localization and its application to speech stream segregation
- T. Nakatani, and H.G. Okuno Harmonic sound stream segregation using localization and its application to speech stream segregation Speech Commun. 27 1999 209 222
- (1999) Speech Commun. , vol.27 , pp. 209-222
- Nakatani, T.¹ Okuno, H.G.²

27
- 0001825373
- Visual surface representation: A critical link between lower-level and higher-level vision
- S.M. Kosslyn D.N. Osherson The MIT Press Cambridge
- K. Nakayama, Z.J. He, and S. Shimojo Visual surface representation: a critical link between lower-level and higher-level vision S.M. Kosslyn D.N. Osherson An Invitation to Cognitive Science 1995 The MIT Press Cambridge 1 70
- (1995) An Invitation to Cognitive Science , pp. 1-70
- Nakayama, K.¹ He, Z.J.² Shimojo, S.³

28
- 0032044939
- Talker-specific learning in speech perception
- L.C. Nygaard, and D.B. Pisoni Talker-specific learning in speech perception Percept. Psychophys. 60 1998 335 376
- (1998) Percept. Psychophys. , vol.60 , pp. 335-376
- Nygaard, L.C.¹ Pisoni, D.B.²

29
- 0003513556
- second ed. Prentice-Hall, Inc. Upper Saddle River, NJ
- A.V. Oppenheim, R.W. Schafer, and J.R. Buck Discrete-Time Signal Processing second ed. 1999 Prentice-Hall, Inc. Upper Saddle River, NJ
- (1999) Discrete-Time Signal Processing
- Oppenheim, A.V.¹ Schafer, R.W.² Buck, J.R.³

30
- 0032155046
- A survey of packet loss recovery techniques for streaming audio
- C. Perkins, O. Hodson, and V. Hardman A survey of packet loss recovery techniques for streaming audio IEEE Network 12 1998 40 48
- (1998) IEEE Network , vol.12 , pp. 40-48
- Perkins, C.¹ Hodson, O.² Hardman, V.³

31
- 0004197136
- John Wiley and Sons, Inc. New York, NY
- J.C. Principe, N.R. Euliano, and W.C. Lefebvre Neural and Adaptive Systems 2000 John Wiley and Sons, Inc. New York, NY
- (2000) Neural and Adaptive Systems
- Principe, J.C.¹ Euliano, N.R.² Lefebvre, W.C.³

32
- 0004244302
- second ed. Prentice-Hall, Inc. Englewood Cliffs, NJ
- L.R. Rabiner, and B.H. Juang Fundamentals of Speech Recognition second ed. 1993 Prentice-Hall, Inc. Englewood Cliffs, NJ
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.H.²

33
- 85009104896
- Reconstruction of damaged spectrographic features for robust speech recognition
- Raj, B., Seltzer, M.L., Stern, R.M., 2000. Reconstruction of damaged spectrographic features for robust speech recognition. In: Proc. International Conference on Spoken Language Processing '00. pp. 1491-1494
- (2000) Proc. International Conference on Spoken Language Processing '00 , pp. 1491-1494
- Raj, B.¹ Seltzer, M.L.² Stern, R.M.³

34
- 11144343436
- Detection of reliable features for speech recognition in noisy conditions using a statistical criterion
- Renevey, P., Drygajlo, A., 2001. Detection of reliable features for speech recognition in noisy conditions using a statistical criterion. In: Proc. Consistent and Reliable Acoustic Cues for Sound Analysis Workshop '01. pp. 71-74
- (2001) Proc. Consistent and Reliable Acoustic Cues for Sound Analysis Workshop '01 , pp. 71-74
- Renevey, P.¹ Drygajlo, A.²

35
- 0026436413
- Perceptual restoration of a "missing" speech sound: Auditory induction or illusion?
- B.H. Repp Perceptual restoration of a "missing" speech sound: Auditory induction or illusion? Percept. Psychophys. 51 1992 14 32
- (1992) Percept. Psychophys. , vol.51 , pp. 14-32
- Repp, B.H.¹

36
- 0019624814
- The role of bottom-up confirmation in the phonemic restoration illusion
- A.G. Samuel The role of bottom-up confirmation in the phonemic restoration illusion J. Exp. Psychol.: Hum. Percept. Perform. 7 1981 1124 1131
- (1981) J. Exp. Psychol.: Hum. Percept. Perform. , vol.7 , pp. 1124-1131
- Samuel, A.G.¹

37
- 0031090373
- Lexical activation produces potent phonemic percepts
- A.G. Samuel Lexical activation produces potent phonemic percepts Cogn. Psychol. 32 1997 97 127
- (1997) Cogn. Psychol. , vol.32 , pp. 97-127
- Samuel, A.G.¹

38
- 85009180557
- A harmonic-model-based front end for robust speech recognition
- Seltzer, M.L., Droppo, J., Acero, A., 2003. A harmonic-model-based front end for robust speech recognition. In: Proc. Eurospeech '03. pp. 1277-1280
- (2003) Proc. Eurospeech '03 , pp. 1277-1280
- Seltzer, M.L.¹ Droppo, J.² Acero, A.³

39
- 85009089485
- Classifier-based mask estimation for missing feature methods of robust speech recognition
- Seltzer, M.L., Raj, B., Stern, R.M., 2000. Classifier-based mask estimation for missing feature methods of robust speech recognition. In: Proc. International Conference on Spoken Language Processing '00. pp. 538-541
- (2000) Proc. International Conference on Spoken Language Processing '00 , pp. 538-541
- Seltzer, M.L.¹ Raj, B.² Stern, R.M.³

40
- 85009193720
- Schema-based modeling of phonemic restoration
- Srinivasan, S., Wang, D.L., 2003. Schema-based modeling of phonemic restoration. In: Proc. Eurospeech '03. pp. 2053-2056
- (2003) Proc. Eurospeech '03 , pp. 2053-2056
- Srinivasan, S.¹ Wang, D.L.²

41
- 0004129646
- The MIT Press Cambridge, MA
- K.N. Stevens Acoustic Phonetics 1998 The MIT Press Cambridge, MA
- (1998) Acoustic Phonetics
- Stevens, K.N.¹

42
- 0004197863
- Prentice-Hall Upper Saddle River, NJ
- P. Stoica, and R.L. Moses Introduction to Spectral Analysis 1997 Prentice-Hall Upper Saddle River, NJ
- (1997) Introduction to Spectral Analysis
- Stoica, P.¹ Moses, R.L.²

43
- 0020731187
- Intelligibility of interrupted meaningful and nonsense speech with and without intervening noise
- J. Verschuure, and M.P. Brocaar Intelligibility of interrupted meaningful and nonsense speech with and without intervening noise Percept. Psychophys. 33 1983 232 240
- (1983) Percept. Psychophys. , vol.33 , pp. 232-240
- Verschuure, J.¹ Brocaar, M.P.²

44
- 0032682770
- Separation of speech from interfering sounds based on oscillatory correlation
- D.L. Wang, and G.J. Brown Separation of speech from interfering sounds based on oscillatory correlation IEEE Trans. Neural Networks 10 3 1999 684 697
- (1999) IEEE Trans. Neural Networks , vol.10 , Issue.3 , pp. 684-697
- Wang, D.L.¹ Brown, G.J.²

45
- 0014959117
- Perceptual restoration of missing speech sounds
- R.M. Warren Perceptual restoration of missing speech sounds Science 167 1970 392 393
- (1970) Science , vol.167 , pp. 392-393
- Warren, R.M.¹

46
- 0003775799
- Cambridge University Press Cambridge, UK
- R.M. Warren Auditory Perception: A New Analysis and Synthesis 1999 Cambridge University Press Cambridge, UK
- (1999) Auditory Perception: A New Analysis and Synthesis
- Warren, R.M.¹

47
- 0028394425
- Auditory induction: Reciprocal changes in alternating sounds
- R.M. Warren, J.A. Bashford, E.W. Healy, and B.S. Brubaker Auditory induction: reciprocal changes in alternating sounds Percept. Psychophys. 55 1994 313 322
- (1994) Percept. Psychophys. , vol.55 , pp. 313-322
- Warren, R.M.¹ Bashford, J.A.² Healy, E.W.³ Brubaker, B.S.⁴

48
- 0000145053
- Speech perception and phonemic restorations
- R.M. Warren, and C.J. Obusek Speech perception and phonemic restorations Percept. Psychophys. 9 1971 358 362
- (1971) Percept. Psychophys. , vol.9 , pp. 358-362
- Warren, R.M.¹ Obusek, C.J.²

49
- 0016301459
- Phonemic restorations based on subsequent context
- R.M. Warren, and G.L. Sherman Phonemic restorations based on subsequent context Percept. Psychophys. 16 1974 150 156
- (1974) Percept. Psychophys. , vol.16 , pp. 150-156
- Warren, R.M.¹ Sherman, G.L.²

50
- 0012715410
- Comparison of distance measures in discrete spectral modeling
- Wei, B., Gibson, J.D., 2000. Comparison of distance measures in discrete spectral modeling. In: Proc. IEEE Digital Signal Processing Workshop '00
- (2000) Proc. IEEE Digital Signal Processing Workshop '00
- Wei, B.¹ Gibson, J.D.²

51
- 11144286121
- The spectral autocorrelation peak valley ratio (SAPVR)-a usable speech measure employed as a co-channel detection system
- Yantorno, R.E., Krishnamachari, K.R., Lovekin, J.M., Benincasa, D.S., Wenndt, S.J., 2001. The spectral autocorrelation peak valley ratio (SAPVR)-a usable speech measure employed as a co-channel detection system. In: Proc. IEEE International Workshop on Intelligent Signal Processing '01. pp. 193-197
- (2001) Proc. IEEE International Workshop on Intelligent Signal Processing '01 , pp. 193-197
- Yantorno, R.E.¹ Krishnamachari, K.R.² Lovekin, J.M.³ Benincasa, D.S.⁴ Wenndt, S.J.⁵

52
- 0030244826
- A review of large-vocabulary continuous-speech recognition
- S. Young A review of large-vocabulary continuous-speech recognition IEEE Signal Process. Mag. 13 1996 45 57
- (1996) IEEE Signal Process. Mag. , vol.13 , pp. 45-57
- Young, S.¹

53
- 0003571976
- Microsoft Corporation
- Young, S., Kershaw, D., Odell, J., Valtchev, V., Woodland, P., 2000. The HTK Book (for HTK Version 3.0). Microsoft Corporation
- (2000) The HTK Book (For HTK Version 3.0)
- Young, S.¹ Kershaw, D.² Odell, J.³ Valtchev, V.⁴ Woodland, P.⁵

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.