-
1
-
-
33749051687
-
Blind one-microphone speech separation: A spectral learning approach
-
R R. Bach and M. I. Jordan. Blind one-microphone speech separation: A spectral learning approach. Proc. NIPS (2004).
-
(2004)
Proc. NIPS
-
-
Bach, R.R.1
Jordan, M.I.2
-
2
-
-
34948856380
-
Harmony in Motion
-
Tech. Rep. CCIT #620, Dep. of Electrical Engineering, Technion
-
Z. Barzelay and Y. Y. Schechner. Harmony in Motion. Tech. Rep. CCIT #620, Dep. of Electrical Engineering, Technion (2007).
-
(2007)
-
-
Barzelay, Z.1
Schechner, Y.Y.2
-
3
-
-
27644583688
-
A tutorial on onset detection in music signals
-
J. Bello, L. Daudet, S. Abdallah, C. Duxbury, M. Davies, and M. Sandler. A tutorial on onset detection in music signals. In IEEE Trans. Speech and Audio Process., 5:1035-1047 (2005).
-
(2005)
IEEE Trans. Speech and Audio Process
, vol.5
, pp. 1035-1047
-
-
Bello, J.1
Daudet, L.2
Abdallah, S.3
Duxbury, C.4
Davies, M.5
Sandler, M.6
-
6
-
-
0026967479
-
Survey of image registration techniques
-
L. S. Brown. Survey of image registration techniques. ACM Comput. Surv., 24:325-376 (1992).
-
(1992)
ACM Comput. Surv
, vol.24
, pp. 325-376
-
-
Brown, L.S.1
-
7
-
-
75649112752
-
Relating audio-visual events caused by multiple movements: In the case of entire object movement
-
J. Chen, T. Mukai, Y. Takeuchi, T. Matsumoto, H. Kudo, T. Yamamura, and N. Ohnishi. Relating audio-visual events caused by multiple movements: in the case of entire object movement. Proc. Inf. Fusion, pp. 213-219 (2002).
-
(2002)
Proc. Inf. Fusion
, pp. 213-219
-
-
Chen, J.1
Mukai, T.2
Takeuchi, Y.3
Matsumoto, T.4
Kudo, H.5
Yamamura, T.6
Ohnishi, N.7
-
8
-
-
4544386970
-
Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection
-
T. Choudhury, J. Rehg, V. Pavlovic, and A. Pentland. Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection. In Proc. ICPR., vol. 3, pp. 789-794 (2002).
-
(2002)
Proc. ICPR
, vol.3
, pp. 789-794
-
-
Choudhury, T.1
Rehg, J.2
Pavlovic, V.3
Pentland, A.4
-
9
-
-
21544435145
-
Efficient pitch detection techniques for interactive music using harmonic model
-
P. Cuadra, A. Master, and C. Sapp Efficient pitch detection techniques for interactive music using harmonic model. Proc. ICMI, (2001).
-
(2001)
Proc. ICMI
-
-
Cuadra, P.1
Master, A.2
Sapp, C.3
-
10
-
-
1842830672
-
Audio-visual segmentation and the cocktail party effect
-
T. Darrell, J. W. Fisher, , P. A. Viola, and W. T. Freeman. Audio-visual segmentation and the cocktail party effect. In Proc. ICMI2000, pp. 1611-3349 (2000).
-
(2000)
Proc. ICMI2000
, pp. 1611-3349
-
-
Darrell, T.1
Fisher, J.W.2
Viola, P.A.3
Freeman, W.T.4
-
11
-
-
27144543541
-
Temporal frequency characteristics of synchrony-asynchrony discrimination of audio-visual signals
-
W. Fujisaki and S. Nishida. Temporal frequency characteristics of synchrony-asynchrony discrimination of audio-visual signals. J. Exp. Brain Res., 166:455-464 (2005).
-
(2005)
J. Exp. Brain Res
, vol.166
, pp. 455-464
-
-
Fujisaki, W.1
Nishida, S.2
-
12
-
-
0037199954
-
Gated visual input to the central auditory system
-
Y. Gutfreund, W. Zheng, and E. I. Knudsen. Gated visual input to the central auditory system. Science 297:1556-1559 (2002).
-
(2002)
Science
, vol.297
, pp. 1556-1559
-
-
Gutfreund, Y.1
Zheng, W.2
Knudsen, E.I.3
-
13
-
-
10044285992
-
Canonical correlation analysis: An overview with application to learning methods
-
D. Hardoon, S. Szedmak, and J. Shawe-Taylor. Canonical correlation analysis: An overview with application to learning methods. Neural Computation, 16:2639-2664 (2004).
-
(2004)
Neural Computation
, vol.16
, pp. 2639-2664
-
-
Hardoon, D.1
Szedmak, S.2
Shawe-Taylor, J.3
-
14
-
-
34948876301
-
Audio-visual sound separation via hidden markov models
-
J. Hershey and M. Casey. Audio-visual sound separation via hidden markov models. Proc. NIPS, pp. 1173-1180 (2001).
-
(2001)
Proc. NIPS
, pp. 1173-1180
-
-
Hershey, J.1
Casey, M.2
-
15
-
-
84899028297
-
Audio vision: Using audio-visual synchrony to locate sounds
-
J. Hershey and J. R. Movellan. Audio vision: Using audio-visual synchrony to locate sounds. Proc. NIPS, pp. 813-819 (1999).
-
(1999)
Proc. NIPS
, pp. 813-819
-
-
Hershey, J.1
Movellan, J.R.2
-
16
-
-
0032315531
-
Robust multi-sensor image alignment
-
M. Irani and P. Anandan. Robust multi-sensor image alignment. Proc. IEEE ICCV,pp. 959-966 (1998).
-
(1998)
Proc. IEEE ICCV
, pp. 959-966
-
-
Irani, M.1
Anandan, P.2
-
17
-
-
33745127045
-
Computer vision for music identification
-
Y. Ke, D. Hoiem, and R. Sukthankar. Computer vision for music identification. Proc. IEEE CVPR, vol. 1, pp. 597-604 (2005).
-
(2005)
Proc. IEEE CVPR
, vol.1
, pp. 597-604
-
-
Ke, Y.1
Hoiem, D.2
Sukthankar, R.3
-
20
-
-
0032649117
-
Sound onset detection by applying psychoacoustic knowledge
-
A. Klapuri. Sound onset detection by applying psychoacoustic knowledge. Proc. IEEE ICASSP, vol. 6, pp. 3089-3092 (1999).
-
(1999)
Proc. IEEE ICASSP
, vol.6
, pp. 3089-3092
-
-
Klapuri, A.1
-
22
-
-
0027842081
-
Matching pursuits with time-frequency dictionaries
-
S. Mallat and Z. Zhang. Matching pursuits with time-frequency dictionaries. Proc. IEEE Trans. Sig. Process., 41:3397-3415 (1993).
-
(1993)
Proc. IEEE Trans. Sig. Process
, vol.41
, pp. 3397-3415
-
-
Mallat, S.1
Zhang, Z.2
-
24
-
-
0036058193
-
Real-time speaker localization and speech separation by audio-visual integration
-
K. Nakadai, K. Hidai, H. Okuno, and H. Kitano. Real-time speaker localization and speech separation by audio-visual integration. IEEE Conf. Robotics & Auto., vol. 1, pp. 1043-1049 (2002).
-
(2002)
IEEE Conf. Robotics & Auto
, vol.1
, pp. 1043-1049
-
-
Nakadai, K.1
Hidai, K.2
Okuno, H.3
Kitano, H.4
-
25
-
-
13344250690
-
Data fusion for visual tracking with particles
-
P. Perez, J. Vermaak, and A. Blake. Data fusion for visual tracking with particles. Proc. IEEE, 92:495-513 (2004).
-
(2004)
Proc. IEEE
, vol.92
, pp. 495-513
-
-
Perez, P.1
Vermaak, J.2
Blake, A.3
-
26
-
-
4544247264
-
Bayesian separation of audio-visual speech sources
-
S. Rajaram, A. Nefian, and T. Huang. Bayesian separation of audio-visual speech sources. Proc. IEEE ICAASP, vol. 5, pp. 657-660 (2004).
-
(2004)
Proc. IEEE ICAASP
, vol.5
, pp. 657-660
-
-
Rajaram, S.1
Nefian, A.2
Huang, T.3
-
27
-
-
84898946024
-
One microphone source separation
-
S. T. Roweis. One microphone source separation. Proc. NIPS, pp. 793-799 (2001).
-
(2001)
Proc. NIPS
, pp. 793-799
-
-
Roweis, S.T.1
-
28
-
-
33745936208
-
Separating transparent layers of repetitive dynamic behaviors
-
B. Sarel and M. Irani. Separating transparent layers of repetitive dynamic behaviors. Proc. IEEE ICCV, vol. 1, pp. 26-32 (2005).
-
(2005)
Proc. IEEE ICCV
, vol.1
, pp. 26-32
-
-
Sarel, B.1
Irani, M.2
-
29
-
-
0028112849
-
Good features to track
-
J. Shi and C. Tomasi. Good features to track. Proc. IEEE CVPR, pp. 593-600 (1994).
-
(1994)
Proc. IEEE CVPR
, pp. 593-600
-
-
Shi, J.1
Tomasi, C.2
-
30
-
-
13444275916
-
Audio/visual independent components
-
P. Smaragdis and M. Casey. Audio/visual independent components. Proc. ICA, pp. 709-714 (2003).
-
(2003)
Proc. ICA
, pp. 709-714
-
-
Smaragdis, P.1
Casey, M.2
-
31
-
-
0036296012
-
A multi-pitch tracking algorithm for noisy speech
-
M. Wu, D. Wang, and G. Brown. A multi-pitch tracking algorithm for noisy speech. Proc. IEEE ICAASP, vol. 2, pp. 229-241 (2002).
-
(2002)
Proc. IEEE ICAASP
, vol.2
, pp. 229-241
-
-
Wu, M.1
Wang, D.2
Brown, G.3
-
32
-
-
3142694930
-
Blind separation of speech mixtures via time-frequency masking
-
O. Yilmaz and S. Rickard. Blind separation of speech mixtures via time-frequency masking. IEEE Trans. Sig. Process., 52:1830-1847 (2004).
-
(2004)
IEEE Trans. Sig. Process
, vol.52
, pp. 1830-1847
-
-
Yilmaz, O.1
Rickard, S.2
|