2014, Pages 3027-3031

UBM fused total variability modeling for language identification

Author keywords

I vector representation; Language identification; Noise robustness; RATS; Short duration

Indexed keywords

RATING; RATS; SPEECH COMMUNICATION;

EID: 84910070752     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 7

References (32)
  • 1
    • 0028996643
    • Language identification using phoneme recognition and phonotactic language modeling
    • M. A. Zissman, "Language identification using phoneme recognition and phonotactic language modeling," in Proc. ICASSP, vol. 5. IEEE, 1995, pp. 3503-3506.
    • (1995) Proc. ICASSP , vol.5 , pp. 3503-3506
    • Zissman, M.A.1
  • 2
    • 0028996642
    • An approach to automatic language identification based on language-dependent phone recognition
    • Y. Yan and E. Barnard, "An approach to automatic language identification based on language-dependent phone recognition," in Proc. ICASSP, vol. 5. IEEE, 1995, pp. 3511-3514.
    • (1995) Proc. ICASSP , vol.5 , pp. 3511-3514
    • Yan, Y.1    Barnard, E.2
  • 4
    • 33745190265
    • Phonotactic language identification using high quality phoneme recognition
    • P. Matejka, P. Schwarz, J. Cernocky, and P. Chytil, "Phonotactic language identification using high quality phoneme recognition," in Proc. Interspeech, 2005, pp. 2237-2240.
    • (2005) Proc. Interspeech , pp. 2237-2240
    • Matejka, P.1    Schwarz, P.2    Cernocky, J.3    Chytil, P.4
  • 5
    • 17444453660
    • Language identification using Gaussian mixture model tokenization
    • P. A. Torres-Carrasquillo, D. A. Reynolds, and J. Deller Jr, "Language identification using Gaussian mixture model tokenization," in Proc. ICASSP, vol. 1. IEEE, 2002, pp. I-757.
    • (2002) Proc. ICASSP , vol.1 , pp. I-757
    • Torres-Carrasquillo, P.A.1    Reynolds, D.A.2    Deller, J.3
  • 6
    • 84910087367
    • Methods to improve Gaussian mixture model based language identification system
    • E. Wong and S. Sridharan, "Methods to improve Gaussian mixture model based language identification system," in Proc. Interspeech, 2002.
    • (2002) Proc. Interspeech
    • Wong, E.1    Sridharan, S.2
  • 7
    • 33947696754
    • SVM based speaker verification using a GMM supervector kernel and NAP variability compensation
    • W. M. Campbell, D. E. Sturim, D. A. Reynolds, and A. Solomonoff, "SVM based speaker verification using a GMM supervector kernel and NAP variability compensation," in Proc. ICASSP, vol. 1, 2006.
    • (2006) Proc. ICASSP , vol.1
    • Campbell, W.M.1    Sturim, D.E.2    Reynolds, D.A.3    Solomonoff, A.4
  • 15
    • 84900522099
    • Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification
    • M. Li and S. Narayanan, "Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification," Computer Speech & Language, 2014.
    • (2014) Computer Speech & Language
    • Li, M.1    Narayanan, S.2
  • 16
    • 84890466219
    • Speaker verification using simplified and supervised i-vector modeling
    • M. Li, A. Tsiartas, M. Segbroeck, and S. Narayanan, "Speaker verification using simplified and supervised i-vector modeling," in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Li, M.1    Tsiartas, A.2    Segbroeck, M.3    Narayanan, S.4
  • 19
    • 84867598363
    • I-vectors in the context of phonetically-constrained short utterances for speaker verification
    • A. Larcher, P. Bousquet, K. A. Lee, D. Matrouf, H. Li, and J.-F. Bonastre, "I-vectors in the context of phonetically-constrained short utterances for speaker verification," in Proc. ICASSP. IEEE, 2012, pp. 4773-4776.
    • (2012) Proc. ICASSP IEEE , pp. 4773-4776
    • Larcher, A.1    Bousquet, P.2    Lee, K.A.3    Matrouf, D.4    Li, H.5    Bonastre, J.-F.6
  • 20
    • 84906217020
    • Improving language identification robustness to highly channel-degraded speech through multiple system fusion
    • A. Lawson, M. McLaren, Y. Lei, V. Mitra, N. Scheffer, L. Ferrer, and M. Graciarena, "Improving language identification robustness to highly channel-degraded speech through multiple system fusion," in Proc. Interspeech, 2013.
    • (2013) Proc. Interspeech
    • Lawson, A.1    McLaren, M.2    Lei, Y.3    Mitra, V.4    Scheffer, N.5    Ferrer, L.6    Graciarena, M.7
  • 22
    • 84910028543
    • Modified-prior i-vector estimation for language identification of short duration utterances
    • R. Travadi, M. Van Segbroeck, and S. S. Narayanan, "Modified-prior i-vector estimation for language identification of short duration utterances," in Proc. Interspeech, 2014, submitted.
    • (2014) Proc. Interspeech
    • Travadi, R.1    Van Segbroeck, M.2    Narayanan, S.S.3
  • 23
    • 44949114401
    • Within-class covariance normalization for SVM-based speaker recognition
    • A. O. Hatch, S. S. Kajarekar, and A. Stolcke, "Within-class covariance normalization for SVM-based speaker recognition," in Proc. Interspeech, 2006.
    • (2006) Proc. Interspeech
    • Hatch, A.O.1    Kajarekar, S.S.2    Stolcke, A.3
  • 24
    • 33947637189
    • Joint factor analysis of speaker and session variability: Theory and algorithms
    • P. Kenny, "Joint factor analysis of speaker and session variability: Theory and algorithms," CRIM, Montreal, (Report) CRIM-06/08-13, 2005.
    • (2005) CRIM, Montreal, (Report)
    • Kenny, P.1
  • 27
    • 84906246377
    • A robust frontend for VAD: Exploiting contextual, discriminative and spectral cues of human voice
    • M. Van Segbroeck, A. Tsiartas, and S. Narayanan, "A robust frontend for VAD: Exploiting contextual, discriminative and spectral cues of human voice," in Proc. Interspeech, 2013.
    • (2013) Proc. Interspeech
    • Van Segbroeck, M.1    Tsiartas, A.2    Narayanan, S.3
  • 28
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 29
    • 0025041264
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752, Apr. 1990.
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 30
    • 34547499683
    • Incorporating auditory feature uncertainties in robust speaker identification
    • Y. Shao, S. Srinivasan, and D. Wang, "Incorporating auditory feature uncertainties in robust speaker identification," in Proc. ICASSP, 2002, pp. 277-280.
    • (2002) Proc. ICASSP , pp. 277-280
    • Shao, Y.1    Srinivasan, S.2    Wang, D.3
  • 31
    • 84890447859
    • Spectro-temporal Gabor features as a front end for ASR
    • M. Kleinschmidt, "Spectro-temporal Gabor features as a front end for ASR," in Proc. Forum Acusticum Sevilla, 2002.
    • (2002) Proc. Forum Acusticum Sevilla
    • Kleinschmidt, M.1
  • 32
    • 79959850251
    • The NIST 2010 speaker recognition evaluation
    • A. F. Martin and C. S. Greenberg, "The NIST 2010 speaker recognition evaluation," in Proc. Interspeech, 2010, pp. 2726-2729.
    • (2010) Proc. Interspeech , pp. 2726-2729
    • Martin, A.F.1    Greenberg, C.S.2


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.