SCOPUS Information Retrieval Platform

13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012

Volume 3, 2012, Pages 1975-1978

Speech/nonspeech segmentation in web videos

Author keywords

Segmentation; Speech detection; Video; Voice activity detection

Indexed keywords

DISCRIMINATIVE CLASSIFIERS; GAUSSIAN MIXTURE MODEL; SEGMENTATION TECHNIQUES; SPEECH DETECTION; SPEECH TRANSCRIPTIONS; SPEECH/NON-SPEECH SEGMENTATIONS; VIDEO; VOICE ACTIVITY DETECTION;

IMAGE SEGMENTATION; SPEECH RECOGNITION; SPEECH TRANSMISSION;

WEBSITES;

EID: 84878610785 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited: 28

References (16)

1
- 85071069033
- Segmentation and classification of broadcast news audio
- T. Hain and P. C. Woodland, "Segmentation and classification of broadcast news audio," in Proc. of ICSLP, 1998, pp. 2727-2730.
- (1998) Proc. of ICSLP , pp. 2727-2730
- Hain, T.¹ Woodland, P.C.²

2
- 84865783453
- Hierarchical audio segmentation with HMM and factor analysis in broadcast news domain
- D. Castán, C. Vaquero, A. Ortega, D. Martínez, J. Villalba, and E. Lleida, "Hierarchical audio segmentation with HMM and factor analysis in broadcast news domain," in Proc. of Interspeech, 2011, pp. 421-424.
- (2011) Proc. of Interspeech , pp. 421-424
- Castán, D.¹ Vaquero, C.² Ortega, A.³ Martínez, D.⁴ Villalba, J.⁵ Lleida, E.⁶

3
- 84865772202
- Online speech activity detection in broadcast news
- C. Gao, G. Saikumar, S. Khanwalkar, A. Herscovici, A. Kumar, A. Srivastava, and P. Natarajan, "Online speech activity detection in broadcast news," in Proc. of Interspeech, 2011, pp. 2637-2640.
- (2011) Proc. of Interspeech , pp. 2637-2640
- Gao, C.¹ Saikumar, G.² Khanwalkar, S.³ Herscovici, A.⁴ Kumar, A.⁵ Srivastava, A.⁶ Natarajan, P.⁷

4
- 84867208777
- Study of integration of statistical model-based voice activity detection and noise suppression
- M. Fujimoto, K. Ishizuka, and T. Nakatani, "Study of integration of statistical model-based voice activity detection and noise suppression," in Proc. of Interspeech, 2008, pp. 2008-2011.
- (2008) Proc. of Interspeech , pp. 2008-2011
- Fujimoto, M.¹ Ishizuka, K.² Nakatani, T.³

5
- 84867225559
- DySANA: Dynamic speech and noise adaptation for voice activity detection
- R. J. Weiss and T. Kristjansson, "DySANA: Dynamic speech and noise adaptation for voice activity detection," in Proc. of Interspeech, 2008, pp. 127-130.
- (2008) Proc. of Interspeech , pp. 127-130
- Weiss, R.J.¹ Kristjansson, T.²

6
- 33745218538
- Voicing features for robust speech detection
- T. Kristjansson, S. Deligne, and P. Olsen, "Voicing features for robust speech detection," in Proc. of Interspeech, 2005, pp. 369-372.
- (2005) Proc. of Interspeech , pp. 369-372
- Kristjansson, T.¹ Deligne, S.² Olsen, P.³

7
- 84865706680
- On noise robust voice activity detection
- T. Dekens and W. Verhelst, "On noise robust voice activity detection," in Proc. of Interspeech, 2011, pp. 2649-2652.
- (2011) Proc. of Interspeech , pp. 2649-2652
- Dekens, T.¹ Verhelst, W.²

8
- 79959838316
- Voice activity detection based on conditional random fields using multiple features
- A. Saito, Y. Nankaku, A. Lee, and K. Tokuda, "Voice activity detection based on conditional random fields using multiple features," in Proc. of Interspeech, 2010, pp. 2086-2089.
- (2010) Proc. of Interspeech , pp. 2086-2089
- Saito, A.¹ Nankaku, Y.² Lee, A.³ Tokuda, K.⁴

9
- 0030648077
- Construction and evaluation of a robust multifeature speech/music discriminator
- E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. of ICASSP, 1997, pp. 1331-1334.
- (1997) Proc. of ICASSP , pp. 1331-1334
- Scheirer, E.¹ Slaney, M.²

10
- 0032638667
- A comparison of features for speech, music discrimination
- M. J. Carey, E. S. Parris, and H. Lloyd-Thomas, "A comparison of features for speech, music discrimination," in Proc. of ICASSP, 1999, pp. 149-152.
- (1999) Proc. of ICASSP , pp. 149-152
- Carey, M.J.¹ Parris, E.S.² Lloyd-Thomas, H.³

11
- 0036648502
- Musical genre classification of audio signals
- July
- G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," IEEE Trans. on Speech and Audio Proc., vol. 10, no. 5, pp. 293-302, July 2002.
- (2002) IEEE Trans. on Speech and Audio Proc. , vol.10 , Issue.5 , pp. 293-302
- Tzanetakis, G.¹ Cook, P.²

12
- 0036816475
- Content analysis for audio classification and segmentation
- October
- L. Lu, H.-J. Zhang, and H. Jiang, "Content analysis for audio classification and segmentation," IEEE Trans. on Speech and Audio Proc., vol. 10, no. 7, pp. 504-516, October 2002.
- (2002) IEEE Trans. on Speech and Audio Proc. , vol.10 , Issue.7 , pp. 504-516
- Lu, L.¹ Zhang, H.-J.² Jiang, H.³

13
- 80051623447
- Speaker diarization of heterogeneous web video files: A preliminary study
- P. Clement, T. Bazillon, and C. Fredouille, "Speaker diarization of heterogeneous web video files: A preliminary study," in Proc. of ICASSP, 2011, pp. 4432-4435.
- (2011) Proc. of ICASSP , pp. 4432-4435
- Clement, P.¹ Bazillon, T.² Fredouille, C.³

14
- 80052652249
- Efficient large-scale distributed training of conditional maximum entropy models
- G. Mann, R. McDonald, M. Mohri, N. Silberman, and D. Walker, "Efficient large-scale distributed training of conditional maximum entropy models," in Advances in Neural Information Processing Systems, vol. 22, pp. 1231-1239, 2009.
- (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 1231-1239
- Mann, G.¹ McDonald, R.² Mohri, M.³ Silberman, N.⁴ Walker, D.⁵

15
- 33645690579
- Fast binary feature selection with conditional mutual information
- December
- F. Fleuret, "Fast binary feature selection with conditional mutual information," Journal of Machine Learning Research, vol. 5, pp. 1531-1535, December 2004.
- (2004) Journal of Machine Learning Research , vol.5 , pp. 1531-1535
- Fleuret, F.¹

16
- 0032021555
- On combining classifiers
- March
- J. Kittler, M. Hatef, R. P. Duin, and J. Matas, "On combining classifiers," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 20, no. 3, pp. 226-239, March 1998.
- (1998) IEEE Trans. on Pattern Analysis and Machine Intelligence , vol.20 , Issue.3 , pp. 226-239
- Kittler, J.¹ Hatef, M.² Duin, R.P.³ Matas, J.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.