



Volume , Issue , 2014, Pages

Robust anchorperson detection based on audio streams using a hybrid I-vector and DNN system

Author keywords

[No Author keywords available]

Indexed keywords

COMPLEX NETWORKS; FEATURE EXTRACTION; HYBRID SYSTEMS; SUPPORT VECTOR MACHINES; VIDEO RECORDING;

EID: 84949926132     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/APSIPA.2014.7041717     Document Type: Conference Paper
Times cited : (3)

References (21)
  • 1
    • 34247568961
    • Audiovisual anchorperson detection for topic oriented navigation in broadcast news
    • Haller, M., Kim, H. G. and Sikora, T., "Audiovisual anchorperson detection for topic oriented navigation in broadcast news," in Proc. ICME, pp. 1817-1820, 2006.
    • (2006) Proc. ICME , pp. 1817-1820
    • Haller, M.1    Kim, H.G.2    Sikora, T.3
  • 2
    • 0033896657
    • Adaptive anchor detection using online trained audio/visual model
    • Liu, Z. and Huang, Q., "Adaptive anchor detection using online trained audio/visual model," Electronic Imaging, pp. 156-167, 1999.
    • (1999) Electronic Imaging , pp. 156-167
    • Liu, Z.1    Huang, Q.2
  • 3
    • 0034444712
    • Integrating visual, audio and text analysis for news video
    • Qi, W., Gu, L., Jiang, H. and Chen, X. R., "Integrating visual, audio and text analysis for news video," in Proc. ICIP, pp. 520-523, 2000.
    • (2000) Proc. ICIP , pp. 520-523
    • Qi, W.1    Gu, L.2    Jiang, H.3    Chen, X.R.4
  • 4
    • 0028516097
    • Text independent speaker identification
    • Gish, H. and Schmidt, M., "Text independent speaker identification," IEEE Signal Processing Magazine, vol. 11, pp. 18-32, 1994.
    • (1994) IEEE Signal Processing Magazine , vol.11 , pp. 18-32
    • Gish, H.1    Schmidt, M.2
  • 5
    • 0033884858
    • Speaker verification using adapted Gaussian mixture models
    • Reynolds, D. A., Quatieri, T. F. and Dunn, R. B., "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing, vol. 10, pp. 19-41, 2000.
    • (2000) Digital Signal Processing , vol.10 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 6
    • 0036293830
    • An overview of automatic speaker recognition technology
    • Reynolds, D. A., "An overview of automatic speaker recognition technology," in Proc. ICASSP, pp. 4072-4075, 2002.
    • (2002) Proc. ICASSP , pp. 4072-4075
    • Reynolds, D.A.1
  • 7
    • 0029209272
    • Robust text independent speaker identification using Gaussian mixture speaker models
    • Reynolds, D. A. and Rose, R., "Robust text independent speaker identification using Gaussian mixture speaker models," IEEE Transactions, Speech and Audio Processing, vol. 3, pp. 72-83, 1995.
    • (1995) IEEE Transactions, Speech and Audio Processing , vol.3 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.2
  • 8
    • 0023211850
    • On the automatic segmentation of speech signals
    • Svendsen, T. and Soong, F. K., "On the automatic segmentation of speech signals," in Proc. ICASSP, pp. 77-80, 1987.
    • (1987) Proc. ICASSP , pp. 77-80
    • Svendsen, T.1    Soong, F.K.2
  • 9
    • 0023800699
    • A segment model based approach to speech recognition
    • Lee, C. H., Soong, F. K. and Juang, B. H., "A segment model based approach to speech recognition," in Proc. ICASSP, pp. 501-541, 1988.
    • (1988) Proc. ICASSP , pp. 501-541
    • Lee, C.H.1    Soong, F.K.2    Juang, B.H.3
  • 10
    • 0024906979
    • Speaker verification over long distance telephone lines
    • Naik, J., Netsch, L. P. and Doddington, G. R., "Speaker verification over long distance telephone lines," in Proc. ICASSP, pp. 524-527, 1989.
    • (1989) Proc. ICASSP , pp. 524-527
    • Naik, J.1    Netsch, L.P.2    Doddington, G.R.3
  • 12
    • 0027636611
    • Learning and development in neural networks: The importance of starting small
    • Elman, J. L., "Learning and development in neural networks: The importance of starting small," Cognition, vol. 48, pp. 71-99, 1993.
    • (1993) Cognition , vol.48 , pp. 71-99
    • Elman, J.L.1
  • 14
    • 84873907352
    • Boosting the performance of i vector based speaker verification via utterance partitioning
    • Rao, W. and Mak, M. W., "Boosting the performance of I vector based speaker verification via utterance partitioning," IEEE Transactions, Audio, Speech, and Language Processing, vol. 21, pp. 1012-1022, 2013.
    • (2013) IEEE Transactions, Audio, Speech, and Language Processing , vol.21 , pp. 1012-1022
    • Rao, W.1    Mak, M.W.2
  • 15
    • 71249120659
    • A recursive feature vector normalization approach for robust speech recognition in noise
    • Viikki, O. and Laurila, K., "A recursive feature vector normalization approach for robust speech recognition in noise," in Proc. ICASSP, pp. 733-736, 1998.
    • (1998) Proc. ICASSP , pp. 733-736
    • Viikki, O.1    Laurila, K.2
  • 16
    • 85135190755
    • Multiband and adaptation approaches to robust speech recognition
    • Tibrewala, S. and Hermansky, H., "Multiband and adaptation approaches to robust speech recognition," in Proc. Eurospeech, pp. 2619-2622, 1997.
    • (1997) Proc. Eurospeech , pp. 2619-2622
    • Tibrewala, S.1    Hermansky, H.2
  • 17
    • 34047249084
    • Quantile based histogram equalization for noise robust large vocabulary speech recognition
    • Hilger, F. and Ney, H., "Quantile based histogram equalization for noise robust large vocabulary speech recognition," IEEE Transactions, Audio, Speech and Language Processing, vol. 14, pp. 845-854, 2006.
    • (2006) IEEE Transactions, Audio, Speech and Language Processing , vol.14 , pp. 845-854
    • Hilger, F.1    Ney, H.2
  • 18
    • 84874495144
    • A study on cepstral sub band normalization for robust ASR
    • Wang, S. S., Hung, J. W. and Tsao, Y., "A study on cepstral sub band normalization for robust ASR," in Proc. ISCSLP, pp. 141-145, 2012.
    • (2012) Proc. ISCSLP , pp. 141-145
    • Wang, S.S.1    Hung, J.W.2    Tsao, Y.3
  • 19
    • 0036733224
    • Unsupervised video shot segmentation and model free anchorperson detection for news video story parsing
    • Gao, X. and Tang, X., "Unsupervised video shot segmentation and model free anchorperson detection for news video story parsing," IEEE Transactions, Circuits and Systems for Video Technology, vol. 12, pp. 765-776, 2002.
    • (2002) IEEE Transactions, Circuits and Systems for Video Technology , vol.12 , pp. 765-776
    • Gao, X.1    Tang, X.2
  • 20


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.