메뉴 건너뛰기




Volumn 10, Issue 3, 1999, Pages 684-697

Separation of speech from interfering sounds based on oscillatory correlation

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC WAVES; AUDITION; COMPUTER SIMULATION; PERFORMANCE; RELAXATION OSCILLATORS; SIGNAL TO NOISE RATIO; SPEECH RECOGNITION;

EID: 0032682770     PISSN: 10459227     EISSN: None     Source Type: Journal    
DOI: 10.1109/72.761727     Document Type: Article
Times cited : (248)

References (57)
  • 1
    • 0025003184 scopus 로고
    • Modeling the perception of concurrent vowels: Vowels with different fundamental frequencies
    • P. F. Assmann and Q. Summerfield, "Modeling the perception of concurrent vowels: Vowels with different fundamental frequencies," J. Acoust. Soc. Am., vol. 88, pp. 680-697, 1990.
    • (1990) J. Acoust. Soc. Am. , vol.88 , pp. 680-697
    • Assmann, P.F.1    Summerfield, Q.2
  • 2
    • 0029845962 scopus 로고    scopus 로고
    • Thalamic modulation of high-frequency oscillating potentials in auditory cortex
    • D. S. Barth and K. D. MacDonald, "Thalamic modulation of high-frequency oscillating potentials in auditory cortex," Nature, vol. 383, pp. 78-81, 1996.
    • (1996) Nature , vol.383 , pp. 78-81
    • Barth, D.S.1    MacDonald, K.D.2
  • 3
    • 0030009733 scopus 로고    scopus 로고
    • Computer simulation of auditory stream segregation in alternating-tone sequences
    • M. W. Beauvois and R. Meddis, "Computer simulation of auditory stream segregation in alternating-tone sequences," J. Acoust. Soc. Am., vol. 99, pp. 2270-2280, 1996.
    • (1996) J. Acoust. Soc. Am. , vol.99 , pp. 2270-2280
    • Beauvois, M.W.1    Meddis, R.2
  • 4
    • 33749711919 scopus 로고
    • Automatic speech recognition using a reduced auditory representation and position-tolerant discrimination
    • S. W. Beet, "Automatic speech recognition using a reduced auditory representation and position-tolerant discrimination," Computer Speech and Language, vol. 4, pp. 17-33, 1990.
    • (1990) Computer Speech and Language , vol.4 , pp. 17-33
    • Beet, S.W.1
  • 5
    • 41849119997 scopus 로고
    • Intonation and the perceptual separation of simultaneous voices
    • J. P. L. Brokx and S. G. Nooteboom, "Intonation and the perceptual separation of simultaneous voices," J. Phonetics, vol. 10, pp. 23-36, 1982.
    • (1982) J. Phonetics , vol.10 , pp. 23-36
    • Brokx, J.P.L.1    Nooteboom, S.G.2
  • 7
    • 0028531926 scopus 로고
    • Computational auditory scene analysis
    • G. J. Brown and M. P. Cooke, "Computational auditory scene analysis," Computer Speech and Language, vol. 8, pp. 297-336, 1994.
    • (1994) Computer Speech and Language , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.P.2
  • 8
    • 84963179243 scopus 로고
    • Perceptual grouping of musical sounds: A computational model
    • G. J. Brown and M. P. Cooke, "Perceptual grouping of musical sounds: A computational model," J. New Music Research, vol. 23, pp. 107-132, 1994.
    • (1994) J. New Music Research , vol.23 , pp. 107-132
    • Brown, G.J.1    Cooke, M.P.2
  • 9
    • 0039106507 scopus 로고    scopus 로고
    • Temporal synchronization in a neural oscillator model of primitive auditory stream segregation
    • D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum
    • G. J. Brown and M. P. Cooke, "Temporal synchronization in a neural oscillator model of primitive auditory stream segregation," in Computational Auditory Scene Analysis, D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum, pp. 87-103, 1998.
    • (1998) Computational Auditory Scene Analysis , pp. 87-103
    • Brown, G.J.1    Cooke, M.P.2
  • 10
    • 0031442487 scopus 로고    scopus 로고
    • Modeling the perceptual segregation of double vowels with a network of neural oscillators
    • G. J. Brown and D. L. Wang, "Modeling the perceptual segregation of double vowels with a network of neural oscillators," Neural Networks, vol. 10, pp. 1547-1558, 1997.
    • (1997) Neural Networks , vol.10 , pp. 1547-1558
    • Brown, G.J.1    Wang, D.L.2
  • 11
    • 0025036303 scopus 로고
    • Sensitivities of cells in the anteroventral cochlear nucleus of cat to spatiotemporal discharge patterns across primary afferents
    • L. H. Carney, "Sensitivities of cells in the anteroventral cochlear nucleus of cat to spatiotemporal discharge patterns across primary afferents," J. Neurophysiol., vol. 64, pp. 437-456, 1990.
    • (1990) J. Neurophysiol. , vol.64 , pp. 437-456
    • Carney, L.H.1
  • 13
    • 33749790807 scopus 로고
    • Separating simultaneous sound sources: Issues, challenges and models
    • E. Keller (Ed.), London: John Wiley and Sons
    • M. P. Cooke and G. J. Brown, "Separating simultaneous sound sources: Issues, challenges and models," in Speech Recognition and Speech Synthesis, E. Keller (Ed.), London: John Wiley and Sons, 1994.
    • (1994) Speech Recognition and Speech Synthesis
    • Cooke, M.P.1    Brown, G.J.2
  • 14
    • 0017804799 scopus 로고
    • On cochlear encoding: Potentialities and limitations of the reverse correlation technique
    • E. de Boer and H. D. de Jongh, "On cochlear encoding: Potentialities and limitations of the reverse correlation technique," J. Acoust. Soc. Am., vol. 63, pp. 115-135, 1978.
    • (1978) J. Acoust. Soc. Am. , vol.63 , pp. 115-135
    • De Boer, E.1    De Jongh, H.D.2
  • 15
    • 15844431539 scopus 로고    scopus 로고
    • Primary cortical representation of sounds by the coordination of action-potential timing
    • R. C. deCharms and M. M. Merzenich, "Primary cortical representation of sounds by the coordination of action-potential timing," Nature, vol. 381, pp. 610-613, 1996.
    • (1996) Nature , vol.381 , pp. 610-613
    • DeCharms, R.C.1    Merzenich, M.M.2
  • 17
    • 0345470067 scopus 로고    scopus 로고
    • Mid-level representations for computational auditory scene analysis: The weft element
    • D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum
    • D. P. W. Ellis and D. Rosenthal, "Mid-level representations for computational auditory scene analysis: The weft element," in Computational Auditory Scene Analysis, D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum, pp. 257-272, 1998.
    • (1998) Computational Auditory Scene Analysis , pp. 257-272
    • Ellis, D.P.W.1    Rosenthal, D.2
  • 18
    • 0002955515 scopus 로고
    • Nonlinear dynamics in olfactory information processing
    • J. L. Davis and H. Eichenbaum (Eds.), Cambridge, MA: MIT Press
    • W. J. Freeman, "Nonlinear dynamics in olfactory information processing," in Olfaction, J. L. Davis and H. Eichenbaum (Eds.), Cambridge, MA: MIT Press, pp. 225-249, 1991.
    • (1991) Olfaction , pp. 225-249
    • Freeman, W.J.1
  • 19
    • 0025332026 scopus 로고
    • Encoding of amplitude-modulation in the gerbil cochlear nucleus. 1. A hierarchy of enhancement
    • R. D. Frisina, R. L. Smith, and S. C. Chamberlain, "Encoding of amplitude-modulation in the gerbil cochlear nucleus. 1. A hierarchy of enhancement," Hearing Research, vol. 44, pp. 99-122, 1990.
    • (1990) Hearing Research , vol.44 , pp. 99-122
    • Frisina, R.D.1    Smith, R.L.2    Chamberlain, S.C.3
  • 20
    • 0039058777 scopus 로고
    • A 40-Hz auditory potential recorded from the human scalp
    • USA
    • R. Galambos, S. Makeig, and P. J. Talmachoff, "A 40-Hz auditory potential recorded from the human scalp," in Proc. Natl. Acad. Sci., USA, 1981, vol. 78, pp. 2643-2647.
    • (1981) Proc. Natl. Acad. Sci. , vol.78 , pp. 2643-2647
    • Galambos, R.1    Makeig, S.2    Talmachoff, P.J.3
  • 21
    • 0025110885 scopus 로고
    • Derivation of auditory filter shapes from notched-noise data
    • B. R. Glasberg and B. C. J. Moore, "Derivation of auditory filter shapes from notched-noise data," Hearing Research, vol. 47, pp. 103-138, 1990.
    • (1990) Hearing Research , vol.47 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 22
    • 0344607947 scopus 로고    scopus 로고
    • Context-sensitive selection of competing auditory organizations: A blackboard model
    • D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum
    • D. J. Godsmark and G. J. Brown, "Context-sensitive selection of competing auditory organizations: A blackboard model," in Computational Auditory Scene Analysis, D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum, pp. 139-155, 1998.
    • (1998) Computational Auditory Scene Analysis , pp. 139-155
    • Godsmark, D.J.1    Brown, G.J.2
  • 23
    • 33749691178 scopus 로고    scopus 로고
    • Pitch-based streaming in auditory perception
    • N. Griffith and P. Todd (Eds.), Cambridge, MA: MIT Press
    • S. Grossberg, "Pitch-based streaming in auditory perception," in Creative Networks, N. Griffith and P. Todd (Eds.), Cambridge, MA: MIT Press, 1998.
    • (1998) Creative Networks
    • Grossberg, S.1
  • 25
    • 0028134901 scopus 로고
    • Human oscillatory brain activity near to 40 Hz coexists with cognitive temporal binding
    • USA
    • M. Joliot, U. Ribary, and R. Llinas, "Human oscillatory brain activity near to 40 Hz coexists with cognitive temporal binding," in Proc. Natl. Acad. Sci., USA, 1994, vol. 91, pp. 11748-11751.
    • (1994) Proc. Natl. Acad. Sci. , vol.91 , pp. 11748-11751
    • Joliot, M.1    Ribary, U.2    Llinas, R.3
  • 26
    • 0003275634 scopus 로고    scopus 로고
    • Application of the Bayesian probability network to music scene analysis
    • D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum
    • K. Kashino, K. Nakadai, T. Kinoshita and H. Tanaka, "Application of the Bayesian probability network to music scene analysis," in Computational Auditory Scene Analysis, D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum, pp. 115-137, 1998.
    • (1998) Computational Auditory Scene Analysis , pp. 115-137
    • Kashino, K.1    Nakadai, K.2    Kinoshita, T.3    Tanaka, H.4
  • 27
    • 33749780714 scopus 로고    scopus 로고
    • The IPUS blackboard architecture as a framework for computational auditory scene analysis
    • D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum
    • F. Klassner, V. Lesser, and S. H. Nawab, "The IPUS blackboard architecture as a framework for computational auditory scene analysis," in Computational Auditory Scene Analysis, D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum, pp. 105-114, 1998.
    • (1998) Computational Auditory Scene Analysis , pp. 105-114
    • Klassner, F.1    Lesser, V.2    Nawab, S.H.3
  • 28
    • 0032074995 scopus 로고    scopus 로고
    • Fast numerical integration of relaxation oscillator networks based on singular limit solutions
    • P. S. Linsay and D. L. Wang, "Fast numerical integration of relaxation oscillator networks based on singular limit solutions," IEEE Trans. Neural Networks, vol. 9, pp. 523-532, 1998.
    • (1998) IEEE Trans. Neural Networks , vol.9 , pp. 523-532
    • Linsay, P.S.1    Wang, D.L.2
  • 29
    • 0027461767 scopus 로고
    • Coherent 40-Hz oscillation characterizes dream state in humans
    • USA
    • R. Llinás and U. Ribary, "Coherent 40-Hz oscillation characterizes dream state in humans," in Proc. Natl. Acad. Sci., USA, 1993, vol. 90, pp. 2078-2082.
    • (1993) Proc. Natl. Acad. Sci. , vol.90 , pp. 2078-2082
    • Llinás, R.1    Ribary, U.2
  • 30
    • 0030464182 scopus 로고    scopus 로고
    • Neuronal assembly dynamics in the rat auditory cortex during reorganization induced by intracortical microstimulation
    • P. E. Maldonado and G. L. Gerstein, "Neuronal assembly dynamics in the rat auditory cortex during reorganization induced by intracortical microstimulation," Exp. Brain Res., vol. 112, pp. 431-441, 1996.
    • (1996) Exp. Brain Res. , vol.112 , pp. 431-441
    • Maldonado, P.E.1    Gerstein, G.L.2
  • 31
    • 0031043484 scopus 로고    scopus 로고
    • A model of auditory streaming
    • S. L. McCabe and M. J. Denham, "A model of auditory streaming," J. Acoust. Soc. Am., vol. 101, pp. 1611-1621, 1997.
    • (1997) J. Acoust. Soc. Am. , vol.101 , pp. 1611-1621
    • McCabe, S.L.1    Denham, M.J.2
  • 32
    • 0023944462 scopus 로고
    • Simulation of auditory-neural transduction: Further studies
    • R. Meddis, "Simulation of auditory-neural transduction: Further studies," J. Acoust. Soc. Am., vol. 83, pp. 1056-1063, 1988.
    • (1988) J. Acoust. Soc. Am. , vol.83 , pp. 1056-1063
    • Meddis, R.1
  • 33
    • 0030846123 scopus 로고    scopus 로고
    • A unitary model of pitch perception
    • R. Meddis and L. O'Mard, "A unitary model of pitch perception," J. Acoust. Soc. Am., vol. 102, pp. 1811-1820, 1997.
    • (1997) J. Acoust. Soc. Am. , vol.102 , pp. 1811-1820
    • Meddis, R.1    O'Mard, L.2
  • 34
    • 0026654967 scopus 로고
    • Modeling the identification of concurrent vowels with different fundamental frequencies
    • R. Meddis and M. Hewitt, "Modeling the identification of concurrent vowels with different fundamental frequencies," J. Acoust. Soc. Am., vol. 91, pp. 233-245, 1992.
    • (1992) J. Acoust. Soc. Am. , vol.91 , pp. 233-245
    • Meddis, R.1    Hewitt, M.2
  • 37
    • 0345038879 scopus 로고    scopus 로고
    • Multiagent based binaural sound stream segregation
    • D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum
    • T. Nakatani, H. Okuno, M. Goto, and T. Ito, "Multiagent based binaural sound stream segregation," in Computational Auditory Scene Analysis, D. F. Rosenthal and H. Okuno (Eds.), Mahwah, NJ: Lawrence Erlbaum, pp. 195-214, 1998.
    • (1998) Computational Auditory Scene Analysis , pp. 195-214
    • Nakatani, T.1    Okuno, H.2    Goto, M.3    Ito, T.4
  • 38
    • 0021361187 scopus 로고
    • The central nucleus of the inferior colliculus in the cat
    • D. L. Oliver and D. K. Morest, "The central nucleus of the inferior colliculus in the cat," J. Comp. Neurol., vol. 222, pp. 237-264, 1984.
    • (1984) J. Comp. Neurol. , vol.222 , pp. 237-264
    • Oliver, D.L.1    Morest, D.K.2
  • 39
    • 33749776242 scopus 로고
    • Physiology of the cochlear nerve and cochlear nucleus
    • M. P. Haggard and E. F. Evans, Eds., Edinburgh: Churchill Livingstone
    • A. R. Palmer, "Physiology of the cochlear nerve and cochlear nucleus," in Hearing, M. P. Haggard and E. F. Evans, Eds., Edinburgh: Churchill Livingstone, 1987.
    • (1987) Hearing
    • Palmer, A.R.1
  • 40
    • 0017004953 scopus 로고
    • Separation of speech from interfering speech by means of harmonic selection
    • T. W. Parsons, "Separation of speech from interfering speech by means of harmonic selection," J. Acoust. Soc. Am., vol. 60, no. 4, pp. 911-918, 1976.
    • (1976) J. Acoust. Soc. Am. , vol.60 , Issue.4 , pp. 911-918
    • Parsons, T.W.1
  • 42
    • 0026335720 scopus 로고
    • Magnetic field tomography of coherent thalamocortical 40-Hz oscillations in humans
    • USA
    • U. Ribary et al., "Magnetic field tomography of coherent thalamocortical 40-Hz oscillations in humans," in Proc. Natl. Acad. Sci., USA, 1991, vol. 88, pp. 11037-11041.
    • (1991) Proc. Natl. Acad. Sci. , vol.88 , pp. 11037-11041
    • Ribary, U.1
  • 44
    • 0024263459 scopus 로고
    • Periodicity coding in the inferior colliculus of the cat. II. Topographical organization
    • C. E. Schreiner and G. Langner, "Periodicity coding in the inferior colliculus of the cat. II. Topographical organization," J. Neurophysiology, vol. 60, pp. 1823-1840, 1988.
    • (1988) J. Neurophysiology , vol.60 , pp. 1823-1840
    • Schreiner, C.E.1    Langner, G.2
  • 45
    • 0022341184 scopus 로고
    • Speech processing in the auditory system I: The representation of speech sounds in the responses of the auditory nerve
    • S. A. Shamma, "Speech processing in the auditory system I: The representation of speech sounds in the responses of the auditory nerve," J. Acoust. Soc. Am., vol. 78, pp. 1613-1621, 1985.
    • (1985) J. Acoust. Soc. Am. , vol.78 , pp. 1613-1621
    • Shamma, S.A.1
  • 46
    • 0028969330 scopus 로고
    • Visual feature integration and the temporal correlation hypothesis
    • W. Singer and C. M. Gray, "Visual feature integration and the temporal correlation hypothesis," Ann. Rev. Neurosci., vol. 18, pp. 555-586, 1995.
    • (1995) Ann. Rev. Neurosci. , vol.18 , pp. 555-586
    • Singer, W.1    Gray, C.M.2
  • 47
    • 0025623060 scopus 로고
    • A perceptual pitch detector
    • M. Slaney and R. F. Lyon, "A perceptual pitch detector," in Proc. ICASSP, 1990, pp. 357-360.
    • (1990) Proc. ICASSP , pp. 357-360
    • Slaney, M.1    Lyon, R.F.2
  • 48
    • 0025343521 scopus 로고
    • Algorithms for separating the speech of interfering talkers: Evaluations with voiced sentences, and normalhearing and hearing-impaired listeners
    • R. J. Stubbs and Q. Summerfield, "Algorithms for separating the speech of interfering talkers: Evaluations with voiced sentences, and normalhearing and hearing-impaired listeners," J. Acoust. Soc. Am., vol. 87, pp. 359-372, 1990.
    • (1990) J. Acoust. Soc. Am. , vol.87 , pp. 359-372
    • Stubbs, R.J.1    Summerfield, Q.2
  • 49
    • 58149210859 scopus 로고
    • Global competition and local cooperation in a network of neural oscillators
    • D. Terman and D. L. Wang, "Global competition and local cooperation in a network of neural oscillators," Physica D, vol. 81, pp. 148-176, 1995.
    • (1995) Physica D , vol.81 , pp. 148-176
    • Terman, D.1    Wang, D.L.2
  • 50
    • 0000873906 scopus 로고
    • On relaxation oscillations
    • B. van der Pol, "On relaxation oscillations," Philosophical Mag., vol. 2, no. 11, pp. 978-992, 1926.
    • (1926) Philosophical Mag. , vol.2 , Issue.11 , pp. 978-992
    • Van Der Pol, B.1
  • 51
    • 0003708920 scopus 로고
    • The correlation theory of brain function
    • Max-Planck-Institute for Biophysical Chemistry
    • C. von der Malsburg, "The correlation theory of brain function," Internal Report 81-2, Max-Planck-Institute for Biophysical Chemistry, 1981.
    • (1981) Internal Report 81-2
    • Von Der Malsburg, C.1
  • 52
    • 0022633118 scopus 로고
    • A neural cocktail-party processor
    • C. von der Malsburg and W. Schneider, "A neural cocktail-party processor," Biol. Cybern., vol. 54, pp. 29-40, 1986.
    • (1986) Biol. Cybern. , vol.54 , pp. 29-40
    • Von Der Malsburg, C.1    Schneider, W.2
  • 54
    • 0038838363 scopus 로고    scopus 로고
    • Object selection based on oscillatory correlation
    • OSU Department of Computer and Information Science
    • D. L. Wang, "Object selection based on oscillatory correlation," Tech. Rep. 96-67, OSU Department of Computer and Information Science, 1996.
    • (1996) Tech. Rep. 96-67
    • Wang, D.L.1
  • 55
    • 0030188146 scopus 로고    scopus 로고
    • Primitive auditory segregation based on oscillatory correlation
    • D. L. Wang, "Primitive auditory segregation based on oscillatory correlation," Cognit. Sci., vol. 20, pp. 409-456, 1996.
    • (1996) Cognit. Sci. , vol.20 , pp. 409-456
    • Wang, D.L.1
  • 56
    • 0031570072 scopus 로고    scopus 로고
    • Image segmentation based on oscillatory correlation
    • for errata see Neural Comp., vol. 9, pp. 1623-1626, 1997
    • D. L. Wang and D. Terman, "Image segmentation based on oscillatory correlation," Neural Comp., vol. 9, pp. 805-836, 1997 (for errata see Neural Comp., vol. 9, pp. 1623-1626, 1997).
    • (1997) Neural Comp. , vol.9 , pp. 805-836
    • Wang, D.L.1    Terman, D.2
  • 57
    • 0022907820 scopus 로고
    • A computational model for separating two simultaneous talkers
    • M. Weintraub, "A computational model for separating two simultaneous talkers," in Proc. IEEE ICASSP, 1986, pp. 81-84.
    • (1986) Proc. IEEE ICASSP , pp. 81-84
    • Weintraub, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.