SCOPUS 정보 검색 플랫폼

Trends in Amplification

Volumn 12, Issue 4, 2008, Pages 332-353

Time-Frequency Masking for Speech Separation and Its Potential for Hearing Aid Design

(1) Wang, Deliang a

a Ohio State University (United States)

Author keywords

computational auditory scene analysis; hearing aids; ideal binary mask; time frequency masking

Indexed keywords

ALGORITHM; AUDITORY DISCRIMINATION; AUDITORY MASKING; HEARING AID; HUMAN; NOISE REDUCTION; REVIEW; SPEECH ANALYSIS; SPEECH DISCRIMINATION;

ALGORITHMS; AUDITORY THRESHOLD; EQUIPMENT DESIGN; HEARING AIDS; HEARING IMPAIRED PERSONS; HEARING LOSS; HUMANS; MODELS, BIOLOGICAL; PERCEPTUAL MASKING; PITCH PERCEPTION; REHABILITATION OF HEARING IMPAIRED; SIGNAL PROCESSING, COMPUTER-ASSISTED; SPEECH INTELLIGIBILITY; SPEECH PERCEPTION; TIME PERCEPTION;

EID: 56249144201 PISSN: 10847138 EISSN: 19405588 Source Type: Journal
DOI: 10.1177/1084713808326455 Document Type: Article

Times cited : (156)

References (88)

1
- 3442876970
- Phase-based dual-microphone robust speech enhancement
- Aarabi P. Shi G. (2004). Phase-based dual-microphone robust speech enhancement. IEEE Transactions on Systems, Man, and Cybernetics—Part B: Cybernetics, 34, 1763–1773.
- (2004) IEEE Transactions on Systems, Man, and Cybernetics—Part B: Cybernetics , vol.34 , pp. 1763-1773
- Aarabi, P.¹ Shi, G.²

2
- 33748523481
- Determination of the potential benefit of time-frequency gain manipulation
- Anzalone M. C. Calandruccio L. Doherty K. A. Carney L. H. (2006). Determination of the potential benefit of time-frequency gain manipulation. Ear and Hearing, 27, 480–492.
- (2006) Ear and Hearing , vol.27 , pp. 480-492
- Anzalone, M.C.¹ Calandruccio, L.² Doherty, K.A.³ Carney, L.H.⁴

3
- 4544333241
- Underdetermined blind separation for speech in speech in real environments with sparseness and ICA
- (May) Montreal, Quebec, Canada.
- Araki S. Makino S. Blin A. Mukai R. Sawada H. (2004, May). Underdetermined blind separation for speech in speech in real environments with sparseness and ICA. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal processing (Vol. III, pp. 881–884), Montreal, Quebec, Canada.
- (2004) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal processing , vol.3 , pp. 881-884
- Araki, S.¹ Makino, S.² Blin, A.³ Mukai, R.⁴ Sawada, H.⁵

4
- 35048881485
- Underdetermined blind separation of convolutive mixtures of speech with directivity pattern based mask and ICA
- In Puntonet C. G. Prieto A. (Eds.) Berlin: Springer
- Araki S. Makino S. Sawada H. Mukai R. (2004). Underdetermined blind separation of convolutive mixtures of speech with directivity pattern based mask and ICA. In Puntonet C. G. Prieto A. (Eds.), Lecture notes in computer science: 3195. Independent component analysis and blind signal separation: Proceedings of the Fifth International Congress, ICA 2004 (pp. 898–905). Berlin: Springer.
- (2004) Lecture notes in computer science: 3195. Independent component analysis and blind signal separation: Proceedings of the Fifth International Congress, ICA 2004 , pp. 898-905
- Araki, S.¹ Makino, S.² Sawada, H.³ Mukai, R.⁴

5
- 33646759922
- Reducing musical noise by a fine-shift overlap-and-add method applied to source separation using a time-frequency mask
- (March) Philadelphia, PA.
- Araki S. Makino S. Sawada H. Mukai R. (2005, March). Reducing musical noise by a fine-shift overlap-and-add method applied to source separation using a time-frequency mask. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. III, pp. 81–84), Philadelphia, PA.
- (2005) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing , vol.3 , pp. 81-84
- Araki, S.¹ Makino, S.² Sawada, H.³ Mukai, R.⁴

6
- 78249236527
- Blind sparse source separation with spatially smoothed time-frequency masking
- (September) Paris, France
- Araki S. Sawada H. Mukai R. Makino S. (2006, September). Blind sparse source separation with spatially smoothed time-frequency masking. In Proceedings of the 10th International Workshop Acoustic Echo and Noise Control, Paris, France.
- (2006) Proceedings of the 10th International Workshop Acoustic Echo and Noise Control
- Araki, S.¹ Sawada, H.² Mukai, R.³ Makino, S.⁴

7
- 34247223586
- Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors
- Araki S. Sawada H. Mukai R. Makino S. (2007). Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors. Signal Processing, 87, 1833–1847.
- (2007) Signal Processing , vol.87 , pp. 1833-1847
- Araki, S.¹ Sawada, H.² Mukai, R.³ Makino, S.⁴

8
- 85009063707
- Soft decisions in missing data techniques for robust automatic speech recognition
- (October) Beijing, China.
- Barker J. Josifovski L. Cooke M. Green P. (2000, October). Soft decisions in missing data techniques for robust automatic speech recognition. In Proceedings of Sixth International Conference on Spoken Language Processing (Vol. 1, pp. 373–376), Beijing, China.
- (2000) Proceedings of Sixth International Conference on Spoken Language Processing , vol.1 , pp. 373-376
- Barker, J.¹ Josifovski, L.² Cooke, M.³ Green, P.⁴

9
- 0004076845
- London: Academic Press
- Bench J. Bamford J. (1979). Speech hearing tests and the spoken language of hearing-impaired children. London: Academic Press.
- (1979) Speech hearing tests and the spoken language of hearing-impaired children
- Bench, J.¹ Bamford, J.²

10
- 26044451875
- Underdetermined blind separation of convolutive mixtures of speech using time-frequency mask and mixing matrix estimation
- Blin A. Araki S. Makino S. (2005). Underdetermined blind separation of convolutive mixtures of speech using time-frequency mask and mixing matrix estimation. IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences, E88, 88–1700.
- (2005) IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences , vol.E88 , pp. 88-1700
- Blin, A.¹ Araki, S.² Makino, S.³

11
- 0002706411
- Modeling human sound-source localization and the cocktail-party-effect
- Bodden M. (1993). Modeling human sound-source localization and the cocktail-party-effect. Acta Acustica, 1, 43–55.
- (1993) Acta Acustica , vol.1 , pp. 43-55
- Bodden, M.¹

12
- 79953826416
- Estimation of the ideal binary mask using directional systems
- (September) Seattle, WA
- Boldt J. B. Kjems U. Pedersen M. S. Lunner T. Wang D. L. (2008, September). Estimation of the ideal binary mask using directional systems. In Proceedings of the 11th International Workshop on Acoustic Echo and Noise Control, Seattle, WA.
- (2008) Proceedings of the 11th International Workshop on Acoustic Echo and Noise Control
- Boldt, J.B.¹ Kjems, U.² Pedersen, M.S.³ Lunner, T.⁴ Wang, D.L.⁵

13
- 0033965381
- A speech corpus for multitalker communications research
- Bolia R. S. Nelson W. T. Ericson M. A. Simpson B. D. (2000). A speech corpus for multitalker communications research. Journal of the Acoustical Society of America, 107, 1065–1066.
- (2000) Journal of the Acoustical Society of America , vol.107 , pp. 1065-1066
- Bolia, R.S.¹ Nelson, W.T.² Ericson, M.A.³ Simpson, B.D.⁴

14
- 0003684441
- Cambridge: MIT Press
- Bregman A. S. (1990). Auditory scene analysis. Cambridge: MIT Press.
- (1990) Auditory scene analysis
- Bregman, A.S.¹

15
- 0028531926
- Computational auditory scene analysis
- Brown G. J. Cooke M. (1994). Computational auditory scene analysis. Computer Speech and Language, 8, 297–336.
- (1994) Computer Speech and Language , vol.8 , pp. 297-336
- Brown, G.J.¹ Cooke, M.²

16
- 85008004589
- Reverberation
- In Wang D. L. Brown G. J. (Eds.) Hoboken, NJ: Wiley/IEEE Press
- Brown G. J. Palomäki K. J. (2006). Reverberation. In Wang D. L. Brown G. J. (Eds.), Computational auditory scene analysis: Principles, algorithms, and applications (pp. 209–250). Hoboken, NJ: Wiley/IEEE Press.
- (2006) Computational auditory scene analysis: Principles, algorithms, and applications , pp. 209-250
- Brown, G.J.¹ Palomäki, K.J.²

17
- 33845354768
- Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
- Brungart D. Chang P. S. Simpson B. D. Wang D. L. (2006). Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation. Journal of the Acoustical Society of America, 120, 4007–4018.
- (2006) Journal of the Acoustical Society of America , vol.120 , pp. 4007-4018
- Brungart, D.¹ Chang, P.S.² Simpson, B.D.³ Wang, D.L.⁴

18
- 33751303502
- Sound classification in hearing aids inspired by auditory scene analysis
- Buchler M. Allegro S. Launer S. Dillier N. (2005). Sound classification in hearing aids inspired by auditory scene analysis. EURASIP Journal on Applied Signal Processing, 18, 2991–3002.
- (2005) EURASIP Journal on Applied Signal Processing , vol.18 , pp. 2991-3002
- Buchler, M.¹ Allegro, S.² Launer, S.³ Dillier, N.⁴

19
- 0016522748
- Anthropometric manikin for acoustic research
- Burkhard M. D. Sachs R. M. (1975). Anthropometric manikin for acoustic research. Journal of the Acoustical Society of America, 58, 214–222.
- (1975) Journal of the Acoustical Society of America , vol.58 , pp. 214-222
- Burkhard, M.D.¹ Sachs, R.M.²

20
- 0028413241
- Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor
- Cappe O. (1994). Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor. IEEE Transactions on Speech and Audio Processing, 2, 345–349.
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 345-349
- Cappe, O.¹

21
- 34547526788
- Blind speech separation by combining beamformers and a time frequency binary mask
- (September) Paris, France
- Cermak J. Araki S. Sawada H. Makino S. (2006, September). Blind speech separation by combining beamformers and a time frequency binary mask. In Proceedings of the 10th International Workshop Acoustic Echo and Noise Control, Paris, France.
- (2006) Proceedings of the 10th International Workshop Acoustic Echo and Noise Control
- Cermak, J.¹ Araki, S.² Sawada, H.³ Makino, S.⁴

22
- 33745217651
- Unpublished master's thesis, Department of Computer Science and Engineering, The Ohio State University, Columbus
- Chang P. (2004). Exploration of behavioral, physiological, and computational approaches to auditory scene analysis. Unpublished master's thesis, Department of Computer Science and Engineering, The Ohio State University, Columbus.
- (2004) Exploration of behavioral, physiological, and computational approaches to auditory scene analysis
- Chang, P.¹

23
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- Cooke M. Green P. Josifovski L. Vizinho A. (2001). Robust automatic speech recognition with missing and unreliable acoustic data. Speech Communication, 34, 267–285.
- (2001) Speech Communication , vol.34 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

24
- 34249884500
- Speech enhancement using the modified phase-opponency model
- Deshmukh O. D. Espy-Wilson C. Y. Carney L. H. (2007). Speech enhancement using the modified phase-opponency model. Journal of the Acoustical Society of America, 121, 3886–3898.
- (2007) Journal of the Acoustical Society of America , vol.121 , pp. 3886-3898
- Deshmukh, O.D.¹ Espy-Wilson, C.Y.² Carney, L.H.³

25
- 0004191790
- New York: Thieme
- Dillon H. (2001). Hearing aids. New York: Thieme.
- (2001) Hearing aids
- Dillon, H.¹

26
- 4544247268
- A method for directionally-disjoint source separation in convolutive environment
- (May) Montreal, Quebec, Canada.
- Dubnov S. Tabrikian J. Arnon-Targan M. (2004, May). A method for directionally-disjoint source separation in convolutive environment. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. V, pp. 489–492), Montreal, Quebec, Canada.
- (2004) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing , vol.5 , pp. 489-492
- Dubnov, S.¹ Tabrikian, J.² Arnon-Targan, M.³

27
- 33746589116
- Speech source separation in convolutive environments using space-time-frequency analysis
- Article 38412
- Dubnov S. Tabrikian J. Arnon-Targan M. (2006). Speech source separation in convolutive environments using space-time-frequency analysis. EURASIP Journal on Applied Signal Processing, 2006, Article 38412, 11 pages.
- (2006) EURASIP Journal on Applied Signal Processing , vol.2006 , pp. 11
- Dubnov, S.¹ Tabrikian, J.² Arnon-Targan, M.³

28
- 0003922190
- (2nd ed.). New York: Wiley
- Duda R. O. Hart P. E. Stork D. G. (2001). Pattern classification (2nd ed.). New York: Wiley.
- (2001) Pattern classification
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

29
- 84873856136
- Model-based scene analysis
- In Wang D. L. Brown G. J. (Eds.) Hoboken, NJ: Wiley/IEEE Press
- Ellis D. (2006). Model-based scene analysis. In Wang D. L. Brown G. J. (Eds.), Computational auditory scene analysis: Principles, algorithms, and applications (pp. 115–146). Hoboken, NJ: Wiley/IEEE Press.
- (2006) Computational auditory scene analysis: Principles, algorithms, and applications , pp. 115-146
- Ellis, D.¹

30
- 0023922474
- Excess masking among listeners with a sensorineural hearing loss
- Gagne J.-P. (1988). Excess masking among listeners with a sensorineural hearing loss. Journal of the Acoustical Society of America, 83, 2311–2321.
- (1988) Journal of the Acoustical Society of America , vol.83 , pp. 2311-2321
- Gagne, J.-P.¹

31
- 0142127061
- Microphone-array hearing aids
- In Brandstein M. Ward D. (Eds.) New York: Springer
- Greenberg J. E. Zurek P. M. (2001). Microphone-array hearing aids. In Brandstein M. Ward D. (Eds.), Microphone arrays: Signal processing techniques and applications (pp. 229–253). New York: Springer.
- (2001) Microphone arrays: Signal processing techniques and applications , pp. 229-253
- Greenberg, J.E.¹ Zurek, P.M.²

32
- 33744971131
- Mask estimation for missing data speech recognition based on statistics of binaural interaction
- Harding S. Barker J. Brown G. J. (2006). Mask estimation for missing data speech recognition based on statistics of binaural interaction. IEEE Transactions on Audio, Speech, and Language Processing, 14, 58–67.
- (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , pp. 58-67
- Harding, S.¹ Barker, J.² Brown, G.J.³

33
- 84998077855
- (Ellis A. J., Trans., 2nd English ed.). New York: Dover
- Helmholtz H. (1863). On the sensation of tone (Ellis A. J., Trans., 2nd English ed.). New York: Dover.
- (1863) On the sensation of tone
- Helmholtz, H.¹

34
- 0035681924
- Speech segregation based on pitch tracking and amplitude modulation
- (October) New Paltz, NY.
- Hu G. Wang D. L. (2001, October). Speech segregation based on pitch tracking and amplitude modulation. In Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (pp. 79–82), New Paltz, NY.
- (2001) Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics , pp. 79-82
- Hu, G.¹ Wang, D.L.²

35
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- Hu G. Wang D. L. (2004). Monaural speech segregation based on pitch tracking and amplitude modulation. IEEE Transactions on Neural Networks, 15, 1135–1150.
- (2004) IEEE Transactions on Neural Networks , vol.15 , pp. 1135-1150
- Hu, G.¹ Wang, D.L.²

36
- 46049084696
- An auditory scene analysis approach to monaural speech segregation
- In Hansler E. Schmidt G. (Eds.) Heidelberg, Germany: Springer
- Hu G. Wang D. L. (2006). An auditory scene analysis approach to monaural speech segregation. In Hansler E. Schmidt G. (Eds.), Topics in acoustic echo and noise control (pp. 485–515). Heidelberg, Germany: Springer.
- (2006) Topics in acoustic echo and noise control , pp. 485-515
- Hu, G.¹ Wang, D.L.²

37
- 49249107353
- Segregation of unvoiced speech from nonspeech interference
- Hu G. Wang D. L. (2008). Segregation of unvoiced speech from nonspeech interference. Journal of the Acoustical Society of America, 124, 1306–1319.
- (2008) Journal of the Acoustical Society of America , vol.124 , pp. 1306-1319
- Hu, G.¹ Wang, D.L.²

38
- 0003905759
- New York: Wiley
- Hyvärinen A. Karhunen J. Oja E. (2001). Independent component analysis. New York: Wiley.
- (2001) Independent component analysis
- Hyvärinen, A.¹ Karhunen, J.² Oja, E.³

39
- 0014568991
- IEEE recommended practice for speech quality measurements
- IEEE. (1969). IEEE recommended practice for speech quality measurements. IEEE Transactions on Audio and Electroacoustics, 17, 225–246.
- (1969) IEEE Transactions on Audio and Electroacoustics , vol.17 , pp. 225-246

40
- 0033692661
- Blind separation of disjoint orthogonal signals: Demixing N sources from 2 mixtures
- (June) Istanbul, Turkey.
- Jourjine A. Rickard S. Yilmaz O. (2000, June). Blind separation of disjoint orthogonal signals: Demixing N sources from 2 mixtures. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. 5, pp. 2985–2988), Istanbul, Turkey.
- (2000) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing , vol.5 , pp. 2985-2988
- Jourjine, A.¹ Rickard, S.² Yilmaz, O.³

41
- 33751279943
- Multichannel dynamic-range compression using digital frequency warping
- Kates J. M. Arehart K. H. (2005). Multichannel dynamic-range compression using digital frequency warping. EURASIP Journal on Applied Signal Processing, 18, 3003–3014.
- (2005) EURASIP Journal on Applied Signal Processing , vol.18 , pp. 3003-3014
- Kates, J.M.¹ Arehart, K.H.²

42
- 0027445368
- Real-time multiband dynamic compression and noise reduction for binaural hearing aids
- Kollmeier B. Peissig J. Hohmann V. (1993). Real-time multiband dynamic compression and noise reduction for binaural hearing aids. Journal of Rehabilitation Research and Development, 30, 82–94.
- (1993) Journal of Rehabilitation Research and Development , vol.30 , pp. 82-94
- Kollmeier, B.¹ Peissig, J.² Hohmann, V.³

43
- 33749058582
- Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques
- (October) New Paltz, NY.
- Kolossa D. Klimas A. Orglmeister R. (2005, October). Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques. In Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (pp. 82–85), New Paltz, NY.
- (2005) Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics , pp. 82-85
- Kolossa, D.¹ Klimas, A.² Orglmeister, R.³

44
- 33749041814
- Nonlinear postprocessing for blind speech separation
- In Puntonet C. G. Prieto A. (Eds.) Berlin: Springer
- Kolossa D. Orglmeister R. (2004). Nonlinear postprocessing for blind speech separation. In Puntonet C. G. Prieto A. (Eds.), Lecture notes in computer science: 3195. Independent component analysis and blind signal separation: Proceedings of the Fifth International Congress, ICA 2004 (pp. 832–839). Berlin: Springer.
- (2004) Lecture notes in computer science: 3195. Independent component analysis and blind signal separation: Proceedings of the Fifth International Congress, ICA 2004 , pp. 832-839
- Kolossa, D.¹ Orglmeister, R.²

45
- 0034892786
- Perceptual time-frequency subtraction algorithm for noise reduction in hearing aids
- Li M. McAllister H. G. Black N. D. Perez T. A. D. (2001). Perceptual time-frequency subtraction algorithm for noise reduction in hearing aids. IEEE Transactions on Biomedical Engineering, 48, 979–988.
- (2001) IEEE Transactions on Biomedical Engineering , vol.48 , pp. 979-988
- Li, M.¹ McAllister, H.G.² Black, N.D.³ Perez, T.A.D.⁴

46
- 41849093721
- Effect of spectral resolution on the intelligibility of ideal binary masked speech
- Li N. Loizou P. C. (2008a). Effect of spectral resolution on the intelligibility of ideal binary masked speech. Journal of the Acoustical Society of America, 123, EL59–EL64.
- (2008) Journal of the Acoustical Society of America , vol.123 , pp. EL59-EL64
- Li, N.¹ Loizou, P.C.²

47
- 40749125179
- Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
- Li N. Loizou P. C. (2008b). Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction. Journal of the Acoustical Society of America, 123, 1673–1682.
- (2008) Journal of the Acoustical Society of America , vol.123 , pp. 1673-1682
- Li, N.¹ Loizou, P.C.²

48
- 40949108726
- Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech
- Li P. Guan Y. Xu B. Liu W. (2006). Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech. IEEE Transactions on Audio, Speech, and Language Processing, 14, 2014–2023.
- (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , pp. 2014-2023
- Li, P.¹ Guan, Y.² Xu, B.³ Liu, W.⁴

49
- 84997954247
- (in press) Speech Communication.
- Li Y. Wang D. L. (in press). On the optimality of ideal binary time-frequency masks. Speech Communication.
- On the optimality of ideal binary time-frequency masks
- Li, Y.¹ Wang, D.L.²

50
- 33746850812
- Separating more sources than sensors using time-frequency distributions
- Linh-Trung N. Belouchrani A. Abed-Meraim K. Boashush B. (2005). Separating more sources than sensors using time-frequency distributions. EURASIP Journal on Applied Signal Processing, 17, 2828–2847.
- (2005) EURASIP Journal on Applied Signal Processing , vol.17 , pp. 2828-2847
- Linh-Trung, N.¹ Belouchrani, A.² Abed-Meraim, K.³ Boashush, B.⁴

51
- 34447100796
- Boca Raton, FL: CRC Press
- Loizou P. C. (2007). Speech enhancement: Theory and practice. Boca Raton, FL: CRC Press.
- (2007) Speech enhancement: Theory and practice
- Loizou, P.C.¹

52
- 0020497765
- A computational model of binaural localization and separation
- (April) (pp. 1148–1148), Boston, MA
- Lyon R. F. (1983, April). A computational model of binaural localization and separation. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (pp. 1148–1148), Boston, MA.
- (1983) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
- Lyon, R.F.¹

53
- 51449112795
- Temporal smoothing of spectral masks in the cepstral domain for speech separation
- Las Vegas, NV.
- Madhu N. Breithaupt C. Martin R. (2008). Temporal smoothing of spectral masks in the cepstral domain for speech separation. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (pp. 45–48), Las Vegas, NV.
- (2008) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing , pp. 45-48
- Madhu, N.¹ Breithaupt, C.² Martin, R.³

54
- 33745683217
- Separation of mixed audio signals by source localization and binary masking with Hilbert spectrum
- Berlin: Springer
- Molla M. K. I. Hirose K. Minematsu N. (2006). Separation of mixed audio signals by source localization and binary masking with Hilbert spectrum. Lecture notes in computer science: 3889. Independent component analysis and blind signal separation: Proceedings of the Sixth International Congress, ICA 2006 (pp. 641–648). Berlin: Springer.
- (2006) Lecture notes in computer science: 3889. Independent component analysis and blind signal separation: Proceedings of the Sixth International Congress, ICA 2006 , pp. 641-648
- Molla, M.K.I.¹ Hirose, K.² Minematsu, N.³

55
- 0003789815
- (5th ed.). San Diego, CA: Academic Press
- Moore B. C. J. (2003). An introduction to the psychology of hearing (5th ed.). San Diego, CA: Academic Press.
- (2003) An introduction to the psychology of hearing
- Moore, B.C.J.¹

56
- 36348991585
- (2nd ed.). Chichester, UK: Wiley
- Moore B. C. J. (2007). Cochlear hearing loss (2nd ed.). Chichester, UK: Wiley.
- (2007) Cochlear hearing loss
- Moore, B.C.J.¹

57
- 33749539632
- Blind separation of acoustic signals combining SIMO-model-based independent component analysis and binary masking
- Mori Y. Saruwatari H. Takatani T. Ukai S. Shikano K. Hiekata T. et al., (2006). Blind separation of acoustic signals combining SIMO-model-based independent component analysis and binary masking. EURASIP Journal on Applied Signal Processing, 2006(20), 1–17
- (2006) EURASIP Journal on Applied Signal Processing , vol.2006 , Issue.20 , pp. 1-17
- Mori, Y.¹ Saruwatari, H.² Takatani, T.³ Ukai, S.⁴ Shikano, K.⁵ Hiekata, T.⁶

58
- 0024753593
- Speech recognition using noise-adaptive prototypes
- Nadas A. Nahamoo D. Picheny M. A. (1989). Speech recognition using noise-adaptive prototypes. IEEE Transactions on Acoustics, Speech, and Signal Processing, 37, 1495–1503.
- (1989) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.37 , pp. 1495-1503
- Nadas, A.¹ Nahamoo, D.² Picheny, M.A.³

59
- 0028012490
- Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and in noise
- Nilsson M. Soli S. Sullivan J. A. (1994). Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and in noise. Journal of the Acoustical Society of America, 95, 1085–1099.
- (1994) Journal of the Acoustical Society of America , vol.95 , pp. 1085-1099
- Nilsson, M.¹ Soli, S.² Sullivan, J.A.³

60
- 2942539074
- Techniques for handling convolutional distortion with “missing data” automatic speech recognition
- Palomäki K. J. Brown G. J. Barker J. (2004). Techniques for handling convolutional distortion with “missing data” automatic speech recognition. Speech Communication, 43, 123–142.
- (2004) Speech Communication , vol.43 , pp. 123-142
- Palomäki, K.J.¹ Brown, G.J.² Barker, J.³

61
- 33745715346
- Overcomplete blind source separation by combining ICA and binary time-frequency masking
- (September) Mystic, CT.
- Pedersen M. S. Wang D. L. Larsen J. Kjems U. (2005, September). Overcomplete blind source separation by combining ICA and binary time-frequency masking. In Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing (pp. 15–20), Mystic, CT.
- (2005) Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing , pp. 15-20
- Pedersen, M.S.¹ Wang, D.L.² Larsen, J.³ Kjems, U.⁴

62
- 40949096929
- Two-microphone separation of speech mixtures
- Pedersen M. S. Wang D. L. Larsen J. Kjems U. (2008). Two-microphone separation of speech mixtures. IEEE Transactions on Neural Networks, 19, 475–492.
- (2008) IEEE Transactions on Neural Networks , vol.19 , pp. 475-492
- Pedersen, M.S.¹ Wang, D.L.² Larsen, J.³ Kjems, U.⁴

63
- 35648992055
- Monophonic sound source separation with an unsupervised network of spiking neurones
- Pichevar R. Rouat J. (2007). Monophonic sound source separation with an unsupervised network of spiking neurones. Neurocomputing, 71, 109–120.
- (2007) Neurocomputing , vol.71 , pp. 109-120
- Pichevar, R.¹ Rouat, J.²

64
- 33845940172
- A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation
- Article 84186
- Radfar M. H. Dansereau R. M. Sayadiyan A. (2007). A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation. EURASIP Journal on Audio, Speech, and Music Processing, 2007, Article 84186, 15 pages.
- (2007) EURASIP Journal on Audio, Speech, and Music Processing , vol.2007 , pp. 15
- Radfar, M.H.¹ Dansereau, R.M.² Sayadiyan, A.³

65
- 56249144712
- Soft mask methods for single-channel speaker separation
- Reddy A. M. Raj B. (2007). Soft mask methods for single-channel speaker separation. IEEE Transactions on Audio, Speech, and Language Processing, 15, 1766–1776.
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , pp. 1766-1776
- Reddy, A.M.¹ Raj, B.²

66
- 0038028633
- December). Real-time time-frequency based blind source separation
- San Diego, CA.
- Rickard S. Balan R. Rosca J. (2001, December). Real-time time-frequency based blind source separation. In Proceedings of the Third International Conference on Independent Component Analysis and Blind Source Separation (pp. 651–656), San Diego, CA.
- (2001) Proceedings of the Third International Conference on Independent Component Analysis and Blind Source Separation , pp. 651-656
- Rickard, S.¹ Balan, R.² Rosca, J.³

67
- 33845361885
- Binaural segregation in multisource reverberant environments
- Roman N. Srinivasan S. Wang D. L. (2006). Binaural segregation in multisource reverberant environments. Journal of the Acoustical Society of America, 120, 4040–4051.
- (2006) Journal of the Acoustical Society of America , vol.120 , pp. 4040-4051
- Roman, N.¹ Srinivasan, S.² Wang, D.L.³

68
- 4644328243
- Binaural sound separation for multisource reverberant environments
- Montreal Quebec, Canada
- Roman N. Wang D. L. (2004). Binaural sound separation for multisource reverberant environments. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. II, pp. 373–376), Montreal Quebec, Canada.
- (2004) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing , vol.2 , pp. 373-376
- Roman, N.¹ Wang, D.L.²

69
- 0142026377
- Speech segregation based on sound localization
- Roman N. Wang D. L. Brown G. J. (2003). Speech segregation based on sound localization. Journal of the Acoustical Society of America, 114, 2236–2252.
- (2003) Journal of the Acoustical Society of America , vol.114 , pp. 2236-2252
- Roman, N.¹ Wang, D.L.² Brown, G.J.³

70
- 84898946024
- One microphone source separation
- Cambridge, MA: MIT Press
- Roweis S. T. (2001). One microphone source separation. In Advances in Neural Information Processing Systems (NIPS'00) (Vol. 13, pp. 793–799). Cambridge, MA: MIT Press.
- (2001) Advances in Neural Information Processing Systems (NIPS'00) , vol.13 , pp. 793-799
- Roweis, S.T.¹

71
- 34250639628
- Two-stage blind source separation based on ICA and binary masking for real-time robot audition system
- (August) Edmont, Alberta, Canada.
- Saruwatari H. Mori Y. Takatani T. Ukai S. Shikano K. Hiekata T. et al., (2005, August). Two-stage blind source separation based on ICA and binary masking for real-time robot audition system. Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2005 (pp. 2303–2308), Edmont, Alberta, Canada.
- (2005) Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2005 , pp. 2303-2308
- Saruwatari, H.¹ Mori, Y.² Takatani, T.³ Ukai, S.⁴ Shikano, K.⁵ Hiekata, T.⁶

72
- 33645163182
- Blind extraction of a dominant source signal from mixtures of many sources
- Philadelphia, PA.
- Sawada H. Araki S. Mukai R. Makino S. (2005). Blind extraction of a dominant source signal from mixtures of many sources. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. III, pp. 61–64), Philadelphia, PA.
- (2005) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing , vol.3 , pp. 61-64
- Sawada, H.¹ Araki, S.² Mukai, R.³ Makino, S.⁴

73
- 33847771459
- Blind extraction of dominant target sources using ICA and time-frequency masking
- Sawada H. Araki S. Mukai R. Makino S. (2006). Blind extraction of dominant target sources using ICA and time-frequency masking. IEEE Transactions on Audio, Speech, and Language Processing, 14, 2165–2173.
- (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , pp. 2165-2173
- Sawada, H.¹ Araki, S.² Mukai, R.³ Makino, S.⁴

74
- 0028823541
- Speech recognition with primarily temporal cues
- Shannon R. V. Zeng F.-G. Kamath V. Wygonski J. Ekelid M. (1995). Speech recognition with primarily temporal cues. Science, 270, 303–304.
- (1995) Science , vol.270 , pp. 303-304
- Shannon, R.V.¹ Zeng, F.-G.² Kamath, V.³ Wygonski, J.⁴ Ekelid, M.⁵

75
- 33750311718
- Binary and ratio time-frequency masks for robust speech recognition
- Srinivasan S. Roman N. Wang D. L. (2006). Binary and ratio time-frequency masks for robust speech recognition. Speech Communication, 48, 1486–1501.
- (2006) Speech Communication , vol.48 , pp. 1486-1501
- Srinivasan, S.¹ Roman, N.² Wang, D.L.³

76
- 56249136428
- Transforming binary uncertainties for robust speech recognition
- Srinivasan S. Wang D. L. (2007). Transforming binary uncertainties for robust speech recognition. IEEE Transactions on Audio, Speech, and Language Processing, 15, 2130–2140.
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , pp. 2130-2140
- Srinivasan, S.¹ Wang, D.L.²

77
- 33847208897
- Time-frequency masking for BSS problem using equilateral triangular microphone array
- (December) Hong Kong.
- Takenouchi Y. Hamada N. (2005, December). Time-frequency masking for BSS problem using equilateral triangular microphone array. In Proceedings of the 2005 International Symposium on Intelligent Signal Processing and Communication Systems (pp. 185–188), Hong Kong.
- (2005) Proceedings of the 2005 International Symposium on Intelligent Signal Processing and Communication Systems , pp. 185-188
- Takenouchi, Y.¹ Hamada, N.²

78
- 26744446157
- (Tech. Rep.). Itasca, IL: Knowles Electronics
- Thompson S. C. (2000). Directional patterns obtained from two or three microphones (Tech. Rep.). Itasca, IL: Knowles Electronics.
- (2000) Directional patterns obtained from two or three microphones
- Thompson, S.C.¹

79
- 77955691145
- Sound source separation of overcomplete convolutive mixtures using generalized sparseness
- (September) Paris, France
- Togami M. Sumiyoshi T. Amano A. (2006, September). Sound source separation of overcomplete convolutive mixtures using generalized sparseness. In Proceedings of the 10th International Workshop Acoustic Echo and Noise Control, Paris, France.
- (2006) Proceedings of the 10th International Workshop Acoustic Echo and Noise Control
- Togami, M.¹ Sumiyoshi, T.² Amano, A.³

80
- 0023985457
- Beamforming: A versatile approach to spatial filtering
- (April)
- van Veen B. D. Buckley K. M. (1988, April). Beamforming: A versatile approach to spatial filtering. IEEE ASSP Magazine, pp. 4–24.
- (1988) IEEE ASSP Magazine , pp. 4-24
- van Veen, B.D.¹ Buckley, K.M.²

81
- 0037504237
- Design, optimization and evaluation of a Danish sentence test in noise
- Wagener K. Josvassen J. L. Ardenkjær R. (2003). Design, optimization and evaluation of a Danish sentence test in noise. International Journal of Audiology, 42, 10–17.
- (2003) International Journal of Audiology , vol.42 , pp. 10-17
- Wagener, K.¹ Josvassen, J.L.² Ardenkjær, R.³

82
- 84892233308
- On ideal binary mask as the computational goal of auditory scene analysis
- In Divenyi P. (Ed.) Norwell, MA: Kluwer Academic
- Wang D. L. (2005). On ideal binary mask as the computational goal of auditory scene analysis. In Divenyi P. (Ed.), Speech separation by humans and machines (pp. 181–197). Norwell, MA: Kluwer Academic.
- (2005) Speech separation by humans and machines , pp. 181-197
- Wang, D.L.¹

83
- 0032682770
- Separation of speech from interfering sounds based on oscillatory correlation
- Wang D. L. Brown G. J. (1999). Separation of speech from interfering sounds based on oscillatory correlation. IEEE Transactions on Neural Networks, 10, 684–697.
- (1999) IEEE Transactions on Neural Networks , vol.10 , pp. 684-697
- Wang, D.L.¹ Brown, G.J.²

84
- 82255178542
- Hoboken, NJ: Wiley/IEEE Press
- Wang D. L. Brown G. J. (Eds.). (2006). Computational auditory scene analysis: Principles, algorithms, and applications. Hoboken, NJ: Wiley/IEEE Press.
- (2006) Computational auditory scene analysis: Principles, algorithms, and applications
- Wang, D.L.¹ Brown, G.J.²

85
- 56249127018
- Speech intelligibility in background noise with ideal binary time-frequency masking
- Wang D. L. Kjems U. Pedersen M. S. Boldt J. B. Lunner T. (2008). Speech intelligibility in background noise with ideal binary time-frequency masking. Journal of the Acoustical Society of America, conditionally accepted.
- (2008) Journal of the Acoustical Society of America, conditionally accepted
- Wang, D.L.¹ Kjems, U.² Pedersen, M.S.³ Boldt, J.B.⁴ Lunner, T.⁵

86
- 84998179088
- Speech perception of noise with binary gains
- (in press)
- Wang D. L. Kjems U. Pedersen M. S. Boldt J. B. Lunner T. (in press). Speech perception of noise with binary gains. Journal of the Acoustical Society of America.
- Journal of the Acoustical Society of America
- Wang, D.L.¹ Kjems, U.² Pedersen, M.S.³ Boldt, J.B.⁴ Lunner, T.⁵

87
- 0003982501
- Unpublished doctoral dissertation, Department of Electrical Engineering, Stanford University, CA
- Weintraub M. (1985). A theory and computational model of auditory monaural sound separation. Unpublished doctoral dissertation, Department of Electrical Engineering, Stanford University, CA.
- (1985) A theory and computational model of auditory monaural sound separation
- Weintraub, M.¹

88
- 3142694930
- Blind separation of speech mixtures via time-frequency masking
- Yilmaz O. Rickard S. (2004). Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing, 52, 1830–1847.
- (2004) IEEE Transactions on Signal Processing , vol.52 , pp. 1830-1847
- Yilmaz, O.¹ Rickard, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.