



Volume , Issue , 2013, Pages 3647-3651

Speaker trait characterization in web videos: Uniting speech, language, and facial features

Author keywords

computational paralinguistics; multi modal fusion; speaker classification

Indexed keywords

AUTOMATIC FEATURE EXTRACTION; AUTOMATIC SPEECH RECOGNITION; LINGUISTIC FEATURES; MULTI-MODAL APPROACH; MULTI-MODAL FUSION; PARALINGUISTICS; RACE CLASSIFICATION; SPEAKER CLASSIFICATION;

EID: 84890532851     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6638338     Document Type: Conference Paper
Times cited: 6

References (26)
  • 2
    • 77949917470
    • Estimation of unknown speakers height from speech
    • I. Mporas and T. Ganchev, "Estimation of unknown speakers height from speech," International Journal of Speech Technology, vol. 12, no. 4, pp. 149-160, 2009.
    • (2009) International Journal of Speech Technology , vol.12 , Issue.4 , pp. 149-160
    • Mporas, I.1    Ganchev, T.2
  • 3
    • 84867336059
    • Semantic speech tagging: Towards combined analysis of speaker traits
    • B. Schuller, M. Wollmer, F. Eyben, G. Rigoll, and D. Arsic, "Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits," in Proceedings AES 42nd International Conference, K. Brandenburg and M. Sandler, Eds., Ilmenau, Germany, 2011, pp. 89-97, Audio Engineering Society.
    • (2011) Proceedings AES 42nd International Conference , pp. 89-97
    • Schuller, B.1    Wollmer, M.2    Eyben, F.3    Rigoll, G.4    Arsic, D.5
  • 4
    • 85032750851
    • The computational paralinguistics challenge
    • B. Schuller, "The Computational Paralinguistics Challenge," IEEE Signal Processing Magazine, vol. 29, no. 4, pp. 97-101, July 2012.
    • (2012) IEEE Signal Processing Magazine , vol.29 , Issue.4 , pp. 97-101
    • Schuller, B.1
  • 5
    • 84872229639
    • The voice of leadership: Models and performances of automatic analysis in on-line speeches
    • F. Weninger, J. Krajewski, A. Batliner, and B. Schuller, "The Voice of Leadership: Models and Performances of Automatic Analysis in On-Line Speeches," IEEE Transactions on Affective Computing, 2012, http://doi.ieeecomputersociety.org/10.1109/T-AFFC.2012.15.
    • (2012) IEEE Transactions on Affective Computing
    • Weninger, F.1    Krajewski, J.2    Batliner, A.3    Schuller, B.4
  • 6
    • 84878398325
    • Age estimation from telephone speech using ivectors
    • M. H. Bahari, M. McLaren, H. Van hamme, and D. Van Leeuwen, "Age Estimation from Telephone Speech using ivectors," in Proc. of INTERSPEECH, Portland, OR, USA, 2012, no pagination.
    • (2012) Proc. of INTERSPEECH
    • Bahari, M.H.1    McLaren, M.2    Van Hamme, H.3    Van Leeuwen, D.4
  • 12
    • 81855180780
    • Analyzing facial behavioral features from videos
    • A. Hadid, "Analyzing facial behavioral features from videos," in Human Behavior Understanding, A. A. Salah and B. Lepri, Eds., vol. 7065 of Lecture Notes in Computer Science, pp. 52-61. Springer Berlin Heidelberg, 2011.
    • (2011) Human Behavior Understanding , pp. 52-61
    • Hadid, A.1
  • 15
    • 79959816279
    • Can conversational word usage be used to predict speaker demographics?
    • D. Gillick, "Can conversational word usage be used to predict speaker demographics?," in Proc. of Interspeech, Makuhari, Japan, 2010, pp. 1381-1384.
    • (2010) Proc. of Interspeech , pp. 1381-1384
    • Gillick, D.1
  • 20
    • 78650977476
    • openSMILE - The Munich versatile and fast open-source audio feature extractor
    • F. Eyben, M. Wollmer, and B. Schuller, "openSMILE - The Munich versatile and fast open-source audio feature extractor," in Proc. of ACM Multimedia, Florence, Italy, October 2010, pp. 1459-1462, ACM.
    • (2010) Proc. of ACM Multimedia , pp. 1459-1462
    • Eyben, F.1    Wollmer, M.2    Schuller, B.3
  • 21
    • 0042609977
    • Psychological aspects of natural language use: Our words, our selves
    • J.W. Pennebaker, M. R. Mehl, and K. G. Niederhoffer, "Psychological aspects of natural language use: Our words, our selves," Annual Review of Psychology, vol. 54, no. 1, pp. 547-577, 2003.
    • (2003) Annual Review of Psychology , vol.54 , Issue.1 , pp. 547-577
    • Pennebaker, J.W.1    Mehl, M.R.2    Niederhoffer, K.G.3
  • 22
    • 77649253939
    • The psychological meaning of words: LIWC and computerized text analysis methods
    • Y. R. Tausczik and J. W. Pennebaker, "The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods," Journal of Language and Social Psychology, vol. 29, no. 1, pp. 24-54, 2010.
    • (2010) Journal of Language and Social Psychology , vol.29 , Issue.1 , pp. 24-54
    • Tausczik, Y.R.1    Pennebaker, J.W.2
  • 23
    • 84948481845
    • An algorithm for suffix stripping
    • M. F. Porter, "An algorithm for suffix stripping," Program, vol. 14, no. 3, pp. 130-137, October 1980.
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.F.1
  • 26
    • 33747115683
    • Normative standards for vocal tract dimensions by race as measured by acoustic pharyngometry
    • S. A. Xue and J. G. Hao, "Normative standards for vocal tract dimensions by race as measured by acoustic pharyngometry," Journal of Voice, vol. 20, no. 3, pp. 391-400, 2006.
    • (2006) Journal of Voice , vol.20 , Issue.3 , pp. 391-400
    • Xue, S.A.1    Hao, J.G.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.