SCOPUS 정보 검색 플랫폼

2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014

Volumn , Issue , 2014, Pages

Robust anchorperson detection based on audio streams using a hybrid I-vector and DNN system

(9) Chang, Yun Fan a Lin, Payton a Cheng, Shao Hua b Chan, Kai Hsuan b Zeng, Yi Chong b Liao, Chia Wei b Chang, Wen Tsung b Wang, Yu Chiang a Tsao, Yu a

a RESEARCH CENTER FOR INFORMATION TECHNOLOGY INNOVATION (Taiwan)

b INSTITUTE FOR INFORMATION INDUSTRY (Taiwan)

Author keywords

[No Author keywords available]

Indexed keywords

COMPLEX NETWORKS; FEATURE EXTRACTION; HYBRID SYSTEMS; SUPPORT VECTOR MACHINES; VIDEO RECORDING;

ANCHORPERSON DETECTION; DEEP NEURAL NETWORKS; EQUAL ERROR RATE; FEATURE NORMALIZATION; RECORDING DEVICES; ROBUST FEATURE EXTRACTIONS; STATE OF THE ART; VIDEO CONTENTS;

VECTORS;

EID: 84949926132 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/APSIPA.2014.7041717 Document Type: Conference Paper

Times cited : (3)

References (21)

1
- 34247568961
- Audiovisual anchorperson detection for topic oriented navigation in broadcast news
- Haller, M., Kim, H. G. and Sikora, T., "Audiovisual anchorperson detection for topic oriented navigation in broadcast news," in proc. ICME, pp. 1817 1820, 2006.
- (2006) Proc. ICME , pp. 1817-1820
- Haller, M.¹ Kim, H.G.² Sikora, T.³

2
- 0033896657
- Adaptive anchor detection using online trained audio/visual model
- Liu, Z. and Huang, Q., "Adaptive anchor detection using online trained audio/visual model," Electronic Imaging, pp. 156 167, 1999.
- (1999) Electronic Imaging , pp. 156-167
- Liu, Z.¹ Huang, Q.²

3
- 0034444712
- Integrating visual, audio and text analysis for news video
- Qi, W., Gu, L., Jiang, H. and Chen, X. R., "Integrating visual, audio and text analysis for news video," in proc. ICIP, pp. 520 523, 2000.
- (2000) Proc. ICIP , pp. 520-523
- Qi, W.¹ Gu, L.² Jiang, H.³ Chen, X.R.⁴

4
- 0028516097
- Text independent speaker identification
- Gish, H. and Schmidt, M., "Text independent speaker identification," IEEE Signal Processing Magazine, vol. 11, pp. 18 32, 1994.
- (1994) IEEE Signal Processing Magazine , vol.11 , pp. 18-32
- Gish, H.¹ Schmidt, M.²

5
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- Reynolds, D. A., Quatieri, T. F. and Dunn, R. B., "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing, vol. 10, pp. 19 41, 2000.
- (2000) Digital Signal Processing , vol.10 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

6
- 0036293830
- An overview of automatic speaker recognition technology
- Reynolds, D. A., "An overview of automatic speaker recognition technology," in proc. ICASSP, pp. 4072 4075, 2002.
- (2002) Proc. ICASSP , pp. 4072-4075
- Reynolds, D.A.¹

7
- 0029209272
- Robust text independent speaker identification using Gaussian mixture speaker models
- Reynolds, D. A. and Rose, R., "Robust text independent speaker identification using Gaussian mixture speaker models," IEEE Transactions, Speech and Audio Processing, vol. 3, pp. 72 83, 1995.
- (1995) IEEE Transactions, Speech and Audio Processing , vol.3 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.²

8
- 0023211850
- On the automatic segmentation of speech signals
- Svendsen, T. and Soong, F. K., "On the automatic segmentation of speech signals," in proc. ICASSP, pp. 77 80, 1987.
- (1987) Proc. ICASSP , pp. 77-80
- Svendsen, T.¹ Soong, F.K.²

9
- 0023800699
- A segment model based approach to speech recognition
- Lee, C. H., Soong, F. K. and Juang, B. H., "A segment model based approach to speech recognition," in proc. ICASSP, pp. 501 541, 1988.
- (1988) Proc. ICASSP , pp. 501-541
- Lee, C.H.¹ Soong, F.K.² Juang, B.H.³

10
- 0024906979
- Speaker verification over long distance telephone lines
- Naik, J., Netsch, L. P. and Doddington, G. R., "Speaker verification over long distance telephone lines," in Proc. ICASSP, pp. 524 527, 1989.
- (1989) Proc. ICASSP , pp. 524-527
- Naik, J.¹ Netsch, L.P.² Doddington, G.R.³

11
- 29044444825
- Support vector machines for speaker and language recognition
- Campbell, W. M., Campbell, J. P., Reynolds, D. A., Singer, E. and Torres Carrasquillo, P. A., "Support vector machines for speaker and language recognition," Computer Speech & Language, vol. 20, pp. 210229, 2006.
- (2006) Computer Speech & Language , vol.20 , pp. 210229
- Campbell, W.M.¹ Campbell, J.P.² Reynolds, D.A.³ Singer, E.⁴ Torres Carrasquillo, P.A.⁵

12
- 0027636611
- Learning and development in neural networks: The importance of starting small
- Elman, J. L., "Learning and development in neural networks: The importance of starting small," Cognition, vol. 48, pp. 71 99, 1993.
- (1993) Cognition , vol.48 , pp. 71-99
- Elman, J.L.¹

13
- 79951609039
- Front end factor analysis for speaker verification
- Dehak, N., Kenny, P., Dehak, R., Dumouchel, P. and Ouellet, P., "Front end factor analysis for speaker verification," IEEE Transactions, Audio, Speech, and Language Processing, vol. 19, pp. 788 798, 2011.
- (2011) IEEE Transactions, Audio, Speech, and Language Processing , vol.19 , pp. 788-798
- Dehak, N.¹ Kenny, P.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

14
- 84873907352
- Boosting the performance of i vector based speaker verification via utterance partitioning
- Rao, W. and Mak, M. W., "Boosting the performance of I vector based speaker verification via utterance partitioning," IEEE Transactions, Audio, Speech, and Language Processing, vol. 21, pp. 1012 1022, 2013.
- (2013) IEEE Transactions, Audio, Speech, and Language Processing , vol.21 , pp. 1012-1022
- Rao, W.¹ Mak, M.W.²

15
- 71249120659
- A recursive feature vector normalization approach for robust speech recognition in noise
- Viikki, O. and Laurila, K., "A recursive feature vector normalization approach for robust speech recognition in noise," in proc. ICASSP, pp. 733 736,1998.
- (1998) Proc. ICASSP , pp. 733-736
- Viikki, O.¹ Laurila, K.²

16
- 85135190755
- Multiband and adaptation approaches to robust speech recognition
- Tibrewala, S. and Hermansky, H., " Multiband and adaptation approaches to robust speech recognition," in Proc. Eurospeech, pp. 2619 2622, 1997.
- (1997) Proc. Eurospeech , pp. 2619-2622
- Tibrewala, S.¹ Hermansky, H.²

17
- 34047249084
- Quantile based histogram equalization for noise robust large vocabulary speech recognition
- Hilger, F. and Ney, H., "Quantile based histogram equalization for noise robust large vocabulary speech recognition," IEEE Transactions, Audio, Speech and Language Processing, vol. 14, pp. 845 854, 2006.
- (2006) IEEE Transactions, Audio, Speech and Language Processing , vol.14 , pp. 845-854
- Hilger, F.¹ Ney, H.²

18
- 84874495144
- A study on cepstral sub band normalization for robust ASR
- Wang, S. S., Hung, J. W. and Tsao, Y., "A study on cepstral sub band normalization for robust ASR," in proc. ISCSLP, pp. 141 145, 2012.
- (2012) Proc. ISCSLP , pp. 141-145
- Wang, S.S.¹ Hung, J.W.² Tsao, Y.³

19
- 0036733224
- Unsupervised video shot segmentation and model free anchorperson detection for news video story parsing
- Gao, X. and Tang, X., "Unsupervised video shot segmentation and model free anchorperson detection for news video story parsing," IEEE Transactions, Circuits and Systems for Video Technology, vol. 12, pp. 765 776, 2002.
- (2002) IEEE Transactions, Circuits and Systems for Video Technology , vol.12 , pp. 765-776
- Gao, X.¹ Tang, X.²

20
- 59449087310
- Exploring strategies for training deep neural networks
- Larochelle, H., Benqio, Y, Louradour, J. and Lamblin, P., "Exploring strategies for training deep neural networks," Machine Learning, vol. 10, pp. 1 40, 2009.
- (2009) Machine Learning , vol.10 , pp. 1-40
- Larochelle, H.¹ Benqio, Y.² Louradour, J.³ Lamblin, P.⁴

21
- 69349090197
- Learning deep architectures for AI
- Bengio, Y, "Learning deep architectures for AI," Foundation and Trends in Machine Learning, vol. 2, pp. 1 127, 2009.
- (2009) Foundation and Trends in Machine Learning , vol.2 , pp. 1-127
- Bengio, Y.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.