메뉴 건너뛰기




Volumn , Issue , 2012, Pages 486-493

Example-based cross-modal denoising

Author keywords

[No Author keywords available]

Indexed keywords

CROSS-MODAL; DE-NOISE; DE-NOISING; MULTISENSORY SYSTEMS; NOISE SOURCE; NOISY AUDIO; NONSTATIONARY; TRAINING EXAMPLE; UNIMODAL; VIDEO CHANNELS;

EID: 84866696873     PISSN: 10636919     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CVPR.2012.6247712     Document Type: Conference Paper
Times cited : (7)

References (40)
  • 1
    • 33646697511 scopus 로고    scopus 로고
    • Representation analysis and synthesis of lip images using dimensionality reduction
    • Aharon M., Kimmel R.: Representation analysis and synthesis of lip images using dimensionality reduction. IJCV 67:297-312, 2006.
    • (2006) IJCV , vol.67 , pp. 297-312
    • Aharon, M.1    Kimmel, R.2
  • 4
    • 0030677313 scopus 로고    scopus 로고
    • Video rewrite: Driving visual speech with audio
    • Bregler C., Covell M., Slaney M. Video Rewrite: Driving Visual Speech with Audio. Proc. ACM SIGGRAPH. 353-360, 1997.
    • (1997) Proc. ACM SIGGRAPH , pp. 353-360
    • Bregler, C.1    Covell, M.2    Slaney, M.3
  • 5
    • 33744968614 scopus 로고    scopus 로고
    • Audio source separation with a single sensor
    • Bimbot F., Benaroya L., Gribonval R.: Audio source separation with a single sensor. IEEE Trans. ASSP 14:191-199, 2006.
    • (2006) IEEE Trans. ASSP , vol.14 , pp. 191-199
    • Bimbot, F.1    Benaroya, L.2    Gribonval, R.3
  • 6
    • 84866701010 scopus 로고    scopus 로고
    • Blind audio-visual source separation using sparse representations
    • Casanovas A. L., Monaci G., Vandergheynst P.: Blind audio-visual source separation using sparse representations. IEEE ICIP 2007.
    • (2007) IEEE ICIP
    • Casanovas, A.L.1    Monaci, G.2    Vandergheynst, P.3
  • 7
    • 84863054350 scopus 로고    scopus 로고
    • Multi-view 3D reconstruction for scenes under the refractive plane with known vertical direction
    • Chang Y. J., Chen T: Multi-View 3D Reconstruction for Scenes under the Refractive Plane with Known Vertical Direction. In Proc. ICCV, 2011.
    • (2011) Proc. ICCV
    • Chang, Y.J.1    Chen, T.2
  • 8
    • 4544386970 scopus 로고    scopus 로고
    • Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection
    • Choudhury T., Rehg J., Pavlovic V., Pentland A.: Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection. In Proc. ICPR pp. 789-794, 2002.
    • (2002) Proc. ICPR , pp. 789-794
    • Choudhury, T.1    Rehg, J.2    Pavlovic, V.3    Pentland, A.4
  • 9
    • 0035500783 scopus 로고    scopus 로고
    • Speech enhancement for non-stationary noise environments
    • Cohen I., Berdugo B.: Speech enhancement for non-stationary noise environments. Signal Processing 81:2403-2418, 2001.
    • (2001) Signal Processing , vol.81 , pp. 2403-2418
    • Cohen, I.1    Berdugo, B.2
  • 10
    • 4344675264 scopus 로고    scopus 로고
    • Region filling and object removal by exemplar-based image inpainting
    • Criminisi A., Perez P., Toyama K.: Region filling and object removal by exemplar-based image inpainting. IEEE Trans. IP 13:1200-1212, 2004.
    • (2004) IEEE Trans. IP , vol.13 , pp. 1200-1212
    • Criminisi, A.1    Perez, P.2    Toyama, K.3
  • 11
    • 24644433991 scopus 로고    scopus 로고
    • Audio-visual speech enhancement with AVCDCN (audio-visual codebock dependent cepstral normalization
    • Deligne S., Potamianos G., Neti C.: Audio-visual speech enhancement with AVCDCN (audio-visual codebock dependent cepstral normalization). IEEEWorksh. Sensor Array & Multichannel SP, 68-71, 2002.
    • (2002) IEEE Worksh. Sensor Array & Multichannel SP , pp. 68-71
    • Deligne, S.1    Potamianos, G.2    Neti, C.3
  • 12
    • 0029935458 scopus 로고    scopus 로고
    • Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading
    • Driver J.: Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading. Nature 381:66-68, 1996.
    • (1996) Nature , vol.381 , pp. 66-68
    • Driver, J.1
  • 13
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modeling for continuous speech recognition
    • Dupont S., Luettin J.:Audio-visual speech modeling for continuous speech recognition. IEEE Trans. Multimedia 2:141-151, 2000.
    • (2000) IEEE Trans. Multimedia , vol.2 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 15
    • 47749117221 scopus 로고    scopus 로고
    • Example-based regularization deployed to super-resolution reconstruction of a single image
    • Elad M., Datsenko D.: Example-based regularization deployed to super-resolution reconstruction of a single image. The Computer Journal, 50:1-16, 2007.
    • (2007) The Computer Journal , vol.50 , pp. 1-16
    • Elad, M.1    Datsenko, D.2
  • 16
    • 84866701009 scopus 로고    scopus 로고
    • Sparse regression with structured priors: Application to audio denoising
    • Fevotte C., Daudet L., Godsill S. J., Torresani B.: Sparse regression with structured priors: application to audio denoising. IEEE ICASSP, 2006.
    • (2006) IEEE ICASSP
    • Fevotte, C.1    Daudet, L.2    Godsill, S.J.3    Torresani, B.4
  • 17
    • 84898954418 scopus 로고    scopus 로고
    • Learning joint statistical models for audio-visual fusion and Segregation
    • Fisher III J.W., Darrell T., FreemanW. T., Viola P.: Learning joint statistical models for audio-visual fusion and Segregation. in Proc. NIPS 13, 772-778, 2001.
    • (2001) Proc NIPS , vol.13 , pp. 772-778
    • Fisher Iii, J.W.1    Darrell, T.2    Freeman, W.T.3    Viola, P.4
  • 19
    • 0037199954 scopus 로고    scopus 로고
    • Gated visual input to the central auditory system
    • Gutfreund Y., Zheng W., Knudsen E. I.: Gated visual input to the central auditory system. Science 297:1556-1559, 2002.
    • (2002) Science , vol.297 , pp. 1556-1559
    • Gutfreund, Y.1    Zheng, W.2    Knudsen, E.I.3
  • 21
    • 34948876301 scopus 로고    scopus 로고
    • Audio-visual sound separation via hidden markov models
    • Hershey J., Casey M.: Audio-visual sound separation via hidden markov models. in Proc. NIPS pp. 1173-1180, 2001.
    • (2001) Proc NIPS , pp. 1173-1180
    • Hershey, J.1    Casey, M.2
  • 22
    • 84863643959 scopus 로고    scopus 로고
    • Digital Video Stabilization and Rolling Shutter Correction using Gyroscopes
    • Karpenko A., Jacobs D. E.,Baek J., Levoy .M,: Digital Video Stabilization and Rolling Shutter Correction using Gyroscopes. Stanford CSTR pp. 2011-03, 2011.
    • (2011) Stanford CSTR , pp. 2011-03
    • Karpenko, A.1    Jacobs, D.E.2    Baek, J.3    Levoy, M.4
  • 23
    • 33745127045 scopus 로고    scopus 로고
    • Computer vision for music identification
    • Ke Y., Hoiem D., Sukthankar R.: Computer vision for music identification. Proc. IEEE CVPR pp. 597- 604, 2005
    • (2005) Proc IEEE CVPR , pp. 597-604
    • Ke, Y.1    Hoiem, D.2    Sukthankar, R.3
  • 26
    • 84866679630 scopus 로고    scopus 로고
    • Visual localization of non-stationary sound sources
    • Liu Y., Sato Y.: Visual localization of non-stationary sound sources. In Proc. Multimedia, 2009.
    • (2009) Proc. Multimedia
    • Liu, Y.1    Sato, Y.2
  • 27
    • 84866679635 scopus 로고    scopus 로고
    • Speechreading using Shape and Intensity Information
    • Luettin J., Thacker N. A., Beet S. W.: Speechreading using Shape and Intensity Information. In ISCA, 1996.
    • (1996) ISCA
    • Luettin, J.1    Thacker, N.A.2    Beet, S.W.3
  • 28
    • 85135379452 scopus 로고
    • An efficient algorithm to estimate the instantaneous SNR of speech signals
    • Martin R.: An efficient algorithm to estimate the instantaneous SNR of speech signals, Proc. EUROSPEECH:1093-1096, 1993.
    • (1993) Proc. EUROSPEECH , pp. 1093-1096
    • Martin, R.1
  • 30
    • 34948889993 scopus 로고    scopus 로고
    • Microphone arrays as generalized cameras for integrated audio visual processing
    • O'Donovan A., Duraiswami R., Neumann J.: Microphone arrays as generalized cameras for integrated audio visual processing. Proc. IEEE CVPR pp. :1-8, 2007.
    • (2007) Proc IEEE CVPR , pp. 1-8
    • O'donovan, A.1    Duraiswami, R.2    Neumann, J.3
  • 31
    • 4544290191 scopus 로고    scopus 로고
    • Recent advances in the automatic recognition of audio-visual speech
    • Potamianos G., Neti C., Gravier G., Garg A., Senior A.: Recent advances in the automatic recognition of audio-visual speech. Proc. IEEE, 91:1306-1326, 2003.
    • (2003) Proc IEEE , vol.91 , pp. 1306-1326
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.5
  • 33
    • 33745936208 scopus 로고    scopus 로고
    • Separating transparent layers of repetitive dynamic behaviors
    • Sarel B., Irani M.: Separating transparent layers of repetitive dynamic behaviors. Proc. IEEE ICCV pp. 26-32, 2005.
    • (2005) Proc IEEE ICCV , pp. 26-32
    • Sarel, B.1    Irani, M.2
  • 34
    • 44949110218 scopus 로고    scopus 로고
    • Single-channel speech separation using sparse non-negative matrix factorization
    • Schmidt M. N., Olsson R. K.: Single-channel speech separation using sparse non-negative matrix factorization. Conf. Spoken Language Processing, 2006.
    • (2006) Conf. Spoken Language Processing
    • Schmidt, M.N.1    Olsson, R.K.2
  • 36
    • 84858719009 scopus 로고    scopus 로고
    • A sparse non-parameteric approach for single channel separation of known sounds
    • Smaragdis P., Shashanka R. and Raj B.: A Sparse Non-Parameteric Approach for Single Channel Separation of Known Sounds. In Proc. NIPS pp. 1705-1713, 2009.
    • (2009) Proc. NIPS , pp. 1705-1713
    • Smaragdis, P.1    Shashanka, R.2    Raj, B.3
  • 37
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activitydetector
    • Sohn J., Kim N.S, Sung W.: A statistical model-based voice activitydetector. IEEE SP Lett 6:1-3, 1999.
    • (1999) IEEE SP Lett , vol.6 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 38
    • 5044226917 scopus 로고    scopus 로고
    • Audio-visual based emotion recognition - A new approach
    • Song M., Bu J., Chen C., Li N.: Audio-visual based emotion recognition-a new approach. Proc. IEEE CVPR 2004.
    • (2004) Proc IEEE CVPR
    • Song, M.1    Bu, J.2    Chen, C.3    Li, N.4
  • 39
    • 0033693215 scopus 로고    scopus 로고
    • Quantile based noise estimation for spectral subtraction and Wiener filtering
    • Stahl V., Fischer A., Bippus R.: Quantile based noise estimation for spectral subtraction and Wiener filtering. Proc. ICASSP pp. 1875-1878, 2000.
    • (2000) Proc. ICASSP , pp. 1875-1878
    • Stahl, V.1    Fischer, A.2    Bippus, R.3
  • 40
    • 34047223614 scopus 로고    scopus 로고
    • Audio segmentation and speaker localization in meeting videos
    • Vajaria H., Islam T., Sarkar S., Sankar R., Kasturi R.: Audio segmentation and speaker localization in meeting videos. In ICPR pp. 1150-1153, 2006.
    • (2006) ICPR , pp. 1150-1153
    • Vajaria, H.1    Islam, T.2    Sarkar, S.3    Sankar, R.4    Kasturi, R.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.