메뉴 건너뛰기




Volumn 15, Issue 5, 2004, Pages 1135-1150

Monaural speech segregation based on pitch tracking and amplitude modulation

Author keywords

[No Author keywords available]

Indexed keywords

AMPLITUDE MODULATION; AUDITION; CORRELATION METHODS; HARMONIC ANALYSIS; MATHEMATICAL MODELS; SOUND RECORDING; SPEECH SYNTHESIS;

EID: 4644265990     PISSN: 10459227     EISSN: None     Source Type: Journal    
DOI: 10.1109/TNN.2004.832812     Document Type: Article
Times cited : (359)

References (35)
  • 1
    • 0036649241 scopus 로고    scopus 로고
    • Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets
    • July
    • A. K. Barros, T. Rutkowski, F. Itakura, and N. Ohnishi, "Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets," IEEE Trans. Neural Networks, vol. 13, pp. 888-893, July 2002.
    • (2002) IEEE Trans. Neural Networks , vol.13 , pp. 888-893
    • Barros, A.K.1    Rutkowski, T.2    Itakura, F.3    Ohnishi, N.4
  • 2
    • 0002888637 scopus 로고    scopus 로고
    • Effects of a difference in fundamental frequency in separating two sentences
    • A. R. Palmer, A. Rees, A. Q. Summerfield, and R. Meddis, Eds. London, U.K.: Whurr
    • J. Bird and C. J. Darwin, "Effects of a difference in fundamental frequency in separating two sentences," in Psychophysical and Physiological Advances in Hearing, A. R. Palmer, A. Rees, A. Q. Summerfield, and R. Meddis, Eds. London, U.K.: Whurr, 1997.
    • (1997) Psychophysical and Physiological Advances in Hearing
    • Bird, J.1    Darwin, C.J.2
  • 3
    • 0017804799 scopus 로고
    • On cochlear encoding: Potentialities and limitations of the reverse-correlation techniques
    • E. de Boer and H. R. de Jongh, "On cochlear encoding: potentialities and limitations of the reverse-correlation techniques," J. Acoust. Soc. Amer., vol. 63, pp. 115-135, 1978.
    • (1978) J. Acoust. Soc. Amer. , vol.63 , pp. 115-135
    • de Boer, E.1    de Jongh, H.R.2
  • 5
    • 0028531926 scopus 로고
    • Computational auditory scene analysis
    • G. J. Brown and M. P. Cooke, "Computational auditory scene analysis," Comput. Speech Language, vol. 8, pp. 297-336, 1994.
    • (1994) Comput. Speech Language , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.P.2
  • 6
    • 0032702589 scopus 로고    scopus 로고
    • Temporal coding of periodicity pitch in the auditory system: An overview
    • P. Cariani, "Temporal coding of periodicity pitch in the auditory system: an overview," Neural Plasticity, vol. 6, pp. 147-172, 1999.
    • (1999) Neural Plasticity , vol.6 , pp. 147-172
    • Cariani, P.1
  • 7
    • 0028264314 scopus 로고
    • Comparing the fundamental frequencies of resolved and unresolved harmonics: Evidence for two pitch mechanisms?
    • R. P. Carlyon and T. M. Shackleton, "Comparing the fundamental frequencies of resolved and unresolved harmonics: evidence for two pitch mechanisms?," J. Acoust. Soc. Amer., vol. 95, pp. 3541-3554, 1994.
    • (1994) J. Acoust. Soc. Amer. , vol.95 , pp. 3541-3554
    • Carlyon, R.P.1    Shackleton, T.M.2
  • 9
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. P. Cooke, P. D. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, pp. 267-285, 2001.
    • (2001) Speech Commun. , vol.34 , pp. 267-285
    • Cooke, M.P.1    Green, P.D.2    Josifovski, L.3    Vizinho, A.4
  • 10
    • 0037750051 scopus 로고    scopus 로고
    • Sound Source Separation Via Computational Auditory Scene Analysis (CASA )-enhanced Beamforming
    • Ph.D. dissertation, Dept. Elect. Comput. Eng., Northwestern Univ., Evanston, IL
    • L. A. Drake, "Sound source separation via computational auditory scene analysis (CASA)-enhanced beamforming," Ph.D. dissertation, Dept. Elect. Comput. Eng., Northwestern Univ., Evanston, IL, 2001.
    • (2001)
    • Drake, L.A.1
  • 13
    • 0029345417 scopus 로고
    • A signal subspace approach for speech enhancement
    • July
    • Y. Ephraim and H. L. Trees, "A signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Processing, vol. 3, pp. 251-266, July 1995.
    • (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 251-266
    • Ephraim, Y.1    Trees, H.L.2
  • 14
    • 0028312802 scopus 로고
    • Auditory models and human performance in tasks related to speech coding and speech recognition
    • Jan
    • O. Ghitza, "Auditory models and human performance in tasks related to speech coding and speech recognition," IEEE Trans. Speech Audio Processing, vol. 2, pp. 115-132, Jan. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 115-132
    • Ghitza, O.1
  • 16
    • 17544384941 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • G. Hu and D. L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," in Int. Conf. Acoustics, Speech and Signal Processing, 2002, pp. 553-556.
    • (2002) Int. Conf. Acoustics, Speech and Signal Processing , pp. 553-556
    • Hu, G.1    Wang, D.L.2
  • 18
    • 0001463644 scopus 로고
    • A duplex theory of pitch perception
    • J. C. R. Licklider, "A duplex theory of pitch perception," Experientia, vol. 7, pp. 128-134, 1951.
    • (1951) Experientia , vol.7 , pp. 128-134
    • Licklider, J.C.R.1
  • 19
    • 0029345416 scopus 로고
    • A comparison of signal processing front ends for automatic word recognition
    • July
    • C. R. Jankowski, H. H. Vo, and R. P. Lippmann, "A comparison of signal processing front ends for automatic word recognition," IEEE Trans. Speech Audio Processing, vol. 3, pp. 286-293, July 1995.
    • (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 286-293
    • Jankowski, C.R.1    Vo, H.H.2    Lippmann, R.P.3
  • 20
    • 0035472866 scopus 로고    scopus 로고
    • Speech enhancement using a constrained iterative sinusoidal model
    • Oct
    • J. Jensen and J. H. L. Hansen, "Speech enhancement using a constrained iterative sinusoidal model," IEEE Trans. Speech Audio Processing, vol. 9, pp. 731-740, Oct. 2001.
    • (2001) IEEE Trans. Speech Audio Processing , vol.9 , pp. 731-740
    • Jensen, J.1    Hansen, J.H.L.2
  • 21
    • 0030193445 scopus 로고    scopus 로고
    • Two decades of array signal processing research: The parametric approach
    • July
    • H. Krim and M. Viberg, "Two decades of array signal processing research: the parametric approach," IEEE Signal Processing Mag., vol. 13, pp. 67-94, July 1996.
    • (1996) IEEE Signal Processing Mag. , vol.13 , pp. 67-94
    • Krim, H.1    Viberg, M.2
  • 23
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • July
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Processing, vol. 9, pp. 504-512, July 2001.
    • (2001) IEEE Trans. Speech Audio Processing , vol.9 , pp. 504-512
    • Martin, R.1
  • 24
    • 0023944462 scopus 로고
    • Simulation of auditory-neural transduction: Further studies
    • R. Meddis, "Simulation of auditory-neural transduction: further studies," J. Acoust. Soc. Amer., vol. 83, pp. 1056-1063, 1988.
    • (1988) J. Acoust. Soc. Amer. , vol.83 , pp. 1056-1063
    • Meddis, R.1
  • 25
    • 0030846123 scopus 로고    scopus 로고
    • A unitary model of pitch perception
    • R. Meddis and L. O'Mard, "A unitary model of pitch perception," J. Acoust. Soc. Amer., vol. 102, pp. 1811-1820, 1997.
    • (1997) J. Acoust. Soc. Amer. , vol.102 , pp. 1811-1820
    • Meddis, R.1    O'Mard, L.2
  • 28
    • 0014271904 scopus 로고
    • The ear as a frequency analyzer II
    • R. Plomp and A. M. Mimpen, "The ear as a frequency analyzer II," J. Acoust. Soc. Amer., vol. 43, pp. 764-767, 1968.
    • (1968) J. Acoust. Soc. Amer. , vol.43 , pp. 764-767
    • Plomp, R.1    Mimpen, A.M.2
  • 30
    • 0032166087 scopus 로고    scopus 로고
    • HMM-based strategies for enhancement of speech signals embedded in nonstationary noise
    • Sept
    • H. Sameti, H. Sheikhzadeh, L. Deng, and R. L. Brennan, "HMM-based strategies for enhancement of speech signals embedded in nonstationary noise," IEEE Trans. Speech Audio Processing, vol. 6, pp. 445-455, Sept. 1998.
    • (1998) IEEE Trans. Speech Audio Processing , vol.6 , pp. 445-455
    • Sameti, H.1    Sheikhzadeh, H.2    Deng, L.3    Brennan, R.L.4
  • 31
    • 0002296637 scopus 로고
    • On the importance of time - A temporal representation of sound
    • M. P. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley
    • M. Slaney and R. F. Lyon, "On the importance of time - a temporal representation of sound," in Visual Representations of Speech Signals, M. P. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley, 1993, pp. 95-116.
    • (1993) Visual Representations of Speech Signals , pp. 95-116
    • Slaney, M.1    Lyon, R.F.2
  • 32
    • 0030188146 scopus 로고    scopus 로고
    • Primitive auditory segregation based on oscillatory correlation
    • D. L. Wang, "Primitive auditory segregation based on oscillatory correlation," Cogn. Sci., vol. 20, pp. 409-456, 1996.
    • (1996) Cogn. Sci. , vol.20 , pp. 409-456
    • Wang, D.L.1
  • 33
    • 0032682770 scopus 로고    scopus 로고
    • Separation of speech from interfering sounds based on oscillatory correlation
    • May
    • D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Networks, vol. 10, pp. 684-697, May 1999.
    • (1999) IEEE Trans. Neural Networks , vol.10 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 35
    • 0037767686 scopus 로고    scopus 로고
    • A multipitch tracking algorithm for noisy speech
    • May
    • M. Wu, D. L. Wang, and G. J. Brown, "A multipitch tracking algorithm for noisy speech," IEEE Trans. Speech Audio Processing, vol. 11, pp. 229-241, May 2003.
    • (2003) IEEE Trans. Speech Audio Processing , vol.11 , pp. 229-241
    • Wu, M.1    Wang, D.L.2    Brown, G.J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.