SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

Volumn , Issue , 2012, Pages 486-493

Example-based cross-modal denoising

(3) Segev, Dana a Schechner, Yoav Y a Elad, Michael a

a TECHNION ISRAEL INSTITUTE OF TECHNOLOGY (Israel)

Author keywords

[No Author keywords available]

Indexed keywords

CROSS-MODAL; DE-NOISE; DE-NOISING; MULTISENSORY SYSTEMS; NOISE SOURCE; NOISY AUDIO; NONSTATIONARY; TRAINING EXAMPLE; UNIMODAL; VIDEO CHANNELS;

AUDIO ACOUSTICS;

COMPUTER VISION;

EID: 84866696873 PISSN: 10636919 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2012.6247712 Document Type: Conference Paper

Times cited : (7)

References (40)

1
- 33646697511
- Representation analysis and synthesis of lip images using dimensionality reduction
- Aharon M., Kimmel R.: Representation analysis and synthesis of lip images using dimensionality reduction. IJCV 67:297-312, 2006.
- (2006) IJCV , vol.67 , pp. 297-312
- Aharon, M.¹ Kimmel, R.²

2
- 34948829598
- Harmony in motion
- Barzelay Z., Schechner Y.Y.: Harmony in motion. Proc. IEEE CVPR, 2007.
- (2007) Proc. IEEE CVPR
- Barzelay, Z.¹ Schechner, Y.Y.²

3
- 85012688561
- Princeton University Press
- Bellman R. Dynamic Programming. 1957, Princeton University Press
- (1957) Dynamic Programming
- Bellman, R.¹

4
- 0030677313
- Video rewrite: Driving visual speech with audio
- Bregler C., Covell M., Slaney M. Video Rewrite: Driving Visual Speech with Audio. Proc. ACM SIGGRAPH. 353-360, 1997.
- (1997) Proc. ACM SIGGRAPH , pp. 353-360
- Bregler, C.¹ Covell, M.² Slaney, M.³

5
- 33744968614
- Audio source separation with a single sensor
- Bimbot F., Benaroya L., Gribonval R.: Audio source separation with a single sensor. IEEE Trans. ASSP 14:191-199, 2006.
- (2006) IEEE Trans. ASSP , vol.14 , pp. 191-199
- Bimbot, F.¹ Benaroya, L.² Gribonval, R.³

6
- 84866701010
- Blind audio-visual source separation using sparse representations
- Casanovas A. L., Monaci G., Vandergheynst P.: Blind audio-visual source separation using sparse representations. IEEE ICIP 2007.
- (2007) IEEE ICIP
- Casanovas, A.L.¹ Monaci, G.² Vandergheynst, P.³

7
- 84863054350
- Multi-view 3D reconstruction for scenes under the refractive plane with known vertical direction
- Chang Y. J., Chen T: Multi-View 3D Reconstruction for Scenes under the Refractive Plane with Known Vertical Direction. In Proc. ICCV, 2011.
- (2011) Proc. ICCV
- Chang, Y.J.¹ Chen, T.²

8
- 4544386970
- Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection
- Choudhury T., Rehg J., Pavlovic V., Pentland A.: Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection. In Proc. ICPR pp. 789-794, 2002.
- (2002) Proc. ICPR , pp. 789-794
- Choudhury, T.¹ Rehg, J.² Pavlovic, V.³ Pentland, A.⁴

9
- 0035500783
- Speech enhancement for non-stationary noise environments
- Cohen I., Berdugo B.: Speech enhancement for non-stationary noise environments. Signal Processing 81:2403-2418, 2001.
- (2001) Signal Processing , vol.81 , pp. 2403-2418
- Cohen, I.¹ Berdugo, B.²

10
- 4344675264
- Region filling and object removal by exemplar-based image inpainting
- Criminisi A., Perez P., Toyama K.: Region filling and object removal by exemplar-based image inpainting. IEEE Trans. IP 13:1200-1212, 2004.
- (2004) IEEE Trans. IP , vol.13 , pp. 1200-1212
- Criminisi, A.¹ Perez, P.² Toyama, K.³

11
- 24644433991
- Audio-visual speech enhancement with AVCDCN (audio-visual codebock dependent cepstral normalization
- Deligne S., Potamianos G., Neti C.: Audio-visual speech enhancement with AVCDCN (audio-visual codebock dependent cepstral normalization). IEEEWorksh. Sensor Array & Multichannel SP, 68-71, 2002.
- (2002) IEEE Worksh. Sensor Array & Multichannel SP , pp. 68-71
- Deligne, S.¹ Potamianos, G.² Neti, C.³

12
- 0029935458
- Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading
- Driver J.: Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading. Nature 381:66-68, 1996.
- (1996) Nature , vol.381 , pp. 66-68
- Driver, J.¹

13
- 0034270644
- Audio-visual speech modeling for continuous speech recognition
- Dupont S., Luettin J.:Audio-visual speech modeling for continuous speech recognition. IEEE Trans. Multimedia 2:141-151, 2000.
- (2000) IEEE Trans. Multimedia , vol.2 , pp. 141-151
- Dupont, S.¹ Luettin, J.²

14
- 84892329327
- Springer New-York
- Elad M.: Sparse and Redundant Representations - From Theory to Applications in Signal and Image Processing. Springer New-York, 2010.
- (2010) Sparse and Redundant Representations - From Theory to Applications in Signal and Image Processing
- Elad, M.¹

15
- 47749117221
- Example-based regularization deployed to super-resolution reconstruction of a single image
- Elad M., Datsenko D.: Example-based regularization deployed to super-resolution reconstruction of a single image. The Computer Journal, 50:1-16, 2007.
- (2007) The Computer Journal , vol.50 , pp. 1-16
- Elad, M.¹ Datsenko, D.²

16
- 84866701009
- Sparse regression with structured priors: Application to audio denoising
- Fevotte C., Daudet L., Godsill S. J., Torresani B.: Sparse regression with structured priors: application to audio denoising. IEEE ICASSP, 2006.
- (2006) IEEE ICASSP
- Fevotte, C.¹ Daudet, L.² Godsill, S.J.³ Torresani, B.⁴

17
- 84898954418
- Learning joint statistical models for audio-visual fusion and Segregation
- Fisher III J.W., Darrell T., FreemanW. T., Viola P.: Learning joint statistical models for audio-visual fusion and Segregation. in Proc. NIPS 13, 772-778, 2001.
- (2001) Proc NIPS , vol.13 , pp. 772-778
- Fisher Iii, J.W.¹ Darrell, T.² Freeman, W.T.³ Viola, P.⁴

18
- 0034291933
- Learning low-level vision
- Freeman W. T., Pasztor E. C., Carmichael O. T.: Learning low-level vision. IJCV 40:25-47, 2000.
- (2000) IJCV , vol.40 , pp. 25-47
- Freeman, W.T.¹ Pasztor, E.C.² Carmichael, O.T.³

19
- 0037199954
- Gated visual input to the central auditory system
- Gutfreund Y., Zheng W., Knudsen E. I.: Gated visual input to the central auditory system. Science 297:1556-1559, 2002.
- (2002) Science , vol.297 , pp. 1556-1559
- Gutfreund, Y.¹ Zheng, W.² Knudsen, E.I.³

20
- 0003419545
- Gaithersburg, MD: National Inst. of Standards and Technol. (NIST
- Garofolo J. S.: Getting StartedWith the DARPA TIMIT CDROM: An Acoustic-Phonetic Continuous Speech Database. Gaithersburg, MD: National Inst. of Standards and Technol. (NIST) 1993.
- (1993) Getting Started with the DARPA TIMIT CDROM: An Acoustic-Phonetic Continuous Speech Database
- Garofolo, J.S.¹

21
- 34948876301
- Audio-visual sound separation via hidden markov models
- Hershey J., Casey M.: Audio-visual sound separation via hidden markov models. in Proc. NIPS pp. 1173-1180, 2001.
- (2001) Proc NIPS , pp. 1173-1180
- Hershey, J.¹ Casey, M.²

22
- 84863643959
- Digital Video Stabilization and Rolling Shutter Correction using Gyroscopes
- Karpenko A., Jacobs D. E.,Baek J., Levoy .M,: Digital Video Stabilization and Rolling Shutter Correction using Gyroscopes. Stanford CSTR pp. 2011-03, 2011.
- (2011) Stanford CSTR , pp. 2011-03
- Karpenko, A.¹ Jacobs, D.E.² Baek, J.³ Levoy, M.⁴

23
- 33745127045
- Computer vision for music identification
- Ke Y., Hoiem D., Sukthankar R.: Computer vision for music identification. Proc. IEEE CVPR pp. 597- 604, 2005
- (2005) Proc IEEE CVPR , pp. 597-604
- Ke, Y.¹ Hoiem, D.² Sukthankar, R.³

24
- 84866701011
- Audio-Visual clustering for 3D speaker localization
- Khalidov V., Forbes F., Hansard M., Arnaud E., Horaud R.: Audio-Visual clustering for 3D speaker localization. Proc. MLMI Workshop, 2008.
- (2008) Proc. MLMI Workshop
- Khalidov, V.¹ Forbes, F.² Hansard, M.³ Arnaud, E.⁴ Horaud, R.⁵

25
- 24644451644
- Pixels that sound
- Kidron E., Schechner Y. Y., Elad M.: Pixels that sound. Proc. IEEE CVPR pp. 88-95, 2005.
- (2005) Proc IEEE CVPR , pp. 88-95
- Kidron, E.¹ Schechner, Y.Y.² Elad, M.³

26
- 84866679630
- Visual localization of non-stationary sound sources
- Liu Y., Sato Y.: Visual localization of non-stationary sound sources. In Proc. Multimedia, 2009.
- (2009) Proc. Multimedia
- Liu, Y.¹ Sato, Y.²

27
- 84866679635
- Speechreading using Shape and Intensity Information
- Luettin J., Thacker N. A., Beet S. W.: Speechreading using Shape and Intensity Information. In ISCA, 1996.
- (1996) ISCA
- Luettin, J.¹ Thacker, N.A.² Beet, S.W.³

28
- 85135379452
- An efficient algorithm to estimate the instantaneous SNR of speech signals
- Martin R.: An efficient algorithm to estimate the instantaneous SNR of speech signals, Proc. EUROSPEECH:1093-1096, 1993.
- (1993) Proc. EUROSPEECH , pp. 1093-1096
- Martin, R.¹

29
- 72149118713
- Learning bimodal structure in audio-visual data
- NN
- Monaci G., Sommer F., Vandergheynst P.: Learning bimodal structure in audio-visual data. IEEE Trans. NN, 2009.
- (2009) IEEE Trans
- Monaci, G.¹ Sommer, F.² Vandergheynst, P.³

30
- 34948889993
- Microphone arrays as generalized cameras for integrated audio visual processing
- O'Donovan A., Duraiswami R., Neumann J.: Microphone arrays as generalized cameras for integrated audio visual processing. Proc. IEEE CVPR pp. :1-8, 2007.
- (2007) Proc IEEE CVPR , pp. 1-8
- O'donovan, A.¹ Duraiswami, R.² Neumann, J.³

31
- 4544290191
- Recent advances in the automatic recognition of audio-visual speech
- Potamianos G., Neti C., Gravier G., Garg A., Senior A.: Recent advances in the automatic recognition of audio-visual speech. Proc. IEEE, 91:1306-1326, 2003.
- (2003) Proc IEEE , vol.91 , pp. 1306-1326
- Potamianos, G.¹ Neti, C.² Gravier, G.³ Garg, A.⁴ Senior, A.⁵

32
- 0003927842
- Prentice Hall, chapter 14
- Quatieri T. F.: Discrete Time Speech Signal Processing, Principles and Practice. Prentice Hall, chapter 14, 2002.
- (2002) Discrete Time Speech Signal Processing, Principles and Practice
- Quatieri, T.F.¹

33
- 33745936208
- Separating transparent layers of repetitive dynamic behaviors
- Sarel B., Irani M.: Separating transparent layers of repetitive dynamic behaviors. Proc. IEEE ICCV pp. 26-32, 2005.
- (2005) Proc IEEE ICCV , pp. 26-32
- Sarel, B.¹ Irani, M.²

34
- 44949110218
- Single-channel speech separation using sparse non-negative matrix factorization
- Schmidt M. N., Olsson R. K.: Single-channel speech separation using sparse non-negative matrix factorization. Conf. Spoken Language Processing, 2006.
- (2006) Conf. Spoken Language Processing
- Schmidt, M.N.¹ Olsson, R.K.²

35
- 84866679631
- Segev D., Schechner Y. Y., Elad M.: Crossmodal denoising: supplemental online material. www.ee.technion.ac.il/~yoav/research/CM-denoising.html
- Crossmodal Denoising: Supplemental Online Material
- Segev, D.¹ Schechner, Y.Y.² Elad, M.³

36
- 84858719009
- A sparse non-parameteric approach for single channel separation of known sounds
- Smaragdis P., Shashanka R. and Raj B.: A Sparse Non-Parameteric Approach for Single Channel Separation of Known Sounds. In Proc. NIPS pp. 1705-1713, 2009.
- (2009) Proc. NIPS , pp. 1705-1713
- Smaragdis, P.¹ Shashanka, R.² Raj, B.³

37
- 0032762471
- A statistical model-based voice activitydetector
- Sohn J., Kim N.S, Sung W.: A statistical model-based voice activitydetector. IEEE SP Lett 6:1-3, 1999.
- (1999) IEEE SP Lett , vol.6 , pp. 1-3
- Sohn, J.¹ Kim, N.S.² Sung, W.³

38
- 5044226917
- Audio-visual based emotion recognition - A new approach
- Song M., Bu J., Chen C., Li N.: Audio-visual based emotion recognition-a new approach. Proc. IEEE CVPR 2004.
- (2004) Proc IEEE CVPR
- Song, M.¹ Bu, J.² Chen, C.³ Li, N.⁴

39
- 0033693215
- Quantile based noise estimation for spectral subtraction and Wiener filtering
- Stahl V., Fischer A., Bippus R.: Quantile based noise estimation for spectral subtraction and Wiener filtering. Proc. ICASSP pp. 1875-1878, 2000.
- (2000) Proc. ICASSP , pp. 1875-1878
- Stahl, V.¹ Fischer, A.² Bippus, R.³

40
- 34047223614
- Audio segmentation and speaker localization in meeting videos
- Vajaria H., Islam T., Sarkar S., Sankar R., Kasturi R.: Audio segmentation and speaker localization in meeting videos. In ICPR pp. 1150-1153, 2006.
- (2006) ICPR , pp. 1150-1153
- Vajaria, H.¹ Islam, T.² Sarkar, S.³ Sankar, R.⁴ Kasturi, R.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.