Volume 3, 2012, Pages 1975-1978

Speech/nonspeech segmentation in web videos

Author keywords

Segmentation; Speech detection; Video; Voice activity detection

Indexed keywords

DISCRIMINATIVE CLASSIFIERS; GAUSSIAN MIXTURE MODEL; SEGMENTATION TECHNIQUES; SPEECH DETECTION; SPEECH TRANSCRIPTIONS; SPEECH/NON-SPEECH SEGMENTATIONS; VIDEO; VOICE ACTIVITY DETECTION

EID: 84878610785     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 28

References (16)
  • 1
    • 85071069033
    • Segmentation and classification of broadcast news audio
    • T. Hain and P. C. Woodland, "Segmentation and classification of broadcast news audio," in Proc. of ICSLP, 1998, pp. 2727-2730.
    • (1998) Proc. of ICSLP , pp. 2727-2730
    • Hain, T.1    Woodland, P.C.2
  • 4
    • 84867208777
    • Study of integration of statistical model-based voice activity detection and noise suppression
    • M. Fujimoto, K. Ishizuka, and T. Nakatani, "Study of integration of statistical model-based voice activity detection and noise suppression," in Proc. of Interspeech, 2008, pp. 2008-2011.
    • (2008) Proc. of Interspeech , pp. 2008-2011
    • Fujimoto, M.1    Ishizuka, K.2    Nakatani, T.3
  • 5
    • 84867225559
    • DySANA: Dynamic speech and noise adaptation for voice activity detection
    • R. J. Weiss and T. Kristjansson, "DySANA: Dynamic speech and noise adaptation for voice activity detection," in Proc. of Interspeech, 2008, pp. 127-130.
    • (2008) Proc. of Interspeech , pp. 127-130
    • Weiss, R.J.1    Kristjansson, T.2
  • 7
    • 84865706680
    • On noise robust voice activity detection
    • T. Dekens and W. Verhelst, "On noise robust voice activity detection," in Proc. of Interspeech, 2011, pp. 2649-2652.
    • (2011) Proc. of Interspeech , pp. 2649-2652
    • Dekens, T.1    Verhelst, W.2
  • 8
    • 79959838316
    • Voice activity detection based on conditional random fields using multiple features
    • A. Saito, Y. Nankaku, A. Lee, and K. Tokuda, "Voice activity detection based on conditional random fields using multiple features," in Proc. of Interspeech, 2010, pp. 2086-2089.
    • (2010) Proc. of Interspeech , pp. 2086-2089
    • Saito, A.1    Nankaku, Y.2    Lee, A.3    Tokuda, K.4
  • 9
    • 0030648077
    • Construction and evaluation of a robust multifeature speech/music discriminator
    • E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. of ICASSP, 1997, pp. 1331-1334.
    • (1997) Proc. of ICASSP , pp. 1331-1334
    • Scheirer, E.1    Slaney, M.2
  • 10
    • 0032638667
    • A comparison of features for speech, music discrimination
    • M. J. Carey, E. S. Parris, and H. Lloyd-Thomas, "A comparison of features for speech, music discrimination," in Proc. of ICASSP, 1999, pp. 149-152.
    • (1999) Proc. of ICASSP , pp. 149-152
    • Carey, M.J.1    Parris, E.S.2    Lloyd-Thomas, H.3
  • 11
    • 0036648502
    • Musical genre classification of audio signals
    • G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," IEEE Trans. on Speech and Audio Proc., vol. 10, no. 5, pp. 293-302, July 2002.
    • (2002) IEEE Trans. on Speech and Audio Proc. , vol.10 , Issue.5 , pp. 293-302
    • Tzanetakis, G.1    Cook, P.2
  • 12
    • 0036816475
    • Content analysis for audio classification and segmentation
    • L. Lu, H.-J. Zhang, and H. Jiang, "Content analysis for audio classification and segmentation," IEEE Trans. on Speech and Audio Proc., vol. 10, no. 7, pp. 504-516, October 2002.
    • (2002) IEEE Trans. on Speech and Audio Proc. , vol.10 , Issue.7 , pp. 504-516
    • Lu, L.1    Zhang, H.-J.2    Jiang, H.3
  • 13
    • 80051623447
    • Speaker diarization of heterogeneous web video files: A preliminary study
    • P. Clement, T. Bazillon, and C. Fredouille, "Speaker diarization of heterogeneous web video files: A preliminary study," in Proc. of ICASSP, 2011, pp. 4432-4435.
    • (2011) Proc. of ICASSP , pp. 4432-4435
    • Clement, P.1    Bazillon, T.2    Fredouille, C.3
  • 15
    • 33645690579
    • Fast binary feature selection with conditional mutual information
    • F. Fleuret, "Fast binary feature selection with conditional mutual information," Journal of Machine Learning Research, vol. 5, pp. 1531-1535, December 2004.
    • (2004) Journal of Machine Learning Research , vol.5 , pp. 1531-1535
    • Fleuret, F.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.