메뉴 건너뛰기




Volumn , Issue , 2014, Pages 3007-3011

Speaker adaptation of DNN-based ASR with i-vectors: Does it actually adapt models to speakers?

Author keywords

[No Author keywords available]

Indexed keywords

SPEECH COMMUNICATION;

EID: 84910073132     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (26)

References (23)
  • 1
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate gaussian mixture observations of markov chains
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate gaussian mixture observations of markov chains, " Speech and Audio Processing, IEEE Transactions on, vol. 2, no. 2, pp. 291-298, 1994.
    • (1994) Speech and Audio Processing, IEEE Transactions on , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 2
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the mllr framework
    • M. J. Gales and P. Woodland, "Mean and variance adaptation within the mllr framework, " Computer Speech & Language, vol. 10, no. 4, pp. 249-264, 1996.
    • (1996) Computer Speech & Language , vol.10 , Issue.4 , pp. 249-264
    • Gales, M.J.1    Woodland, P.2
  • 4
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 5
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks." in Interspeech, 2011, pp. 437-440.
    • (2011) Interspeech , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 10
    • 70450180849 scopus 로고    scopus 로고
    • Support vector machines versus fast scoring in the lowdimensional total variability space for speaker verification
    • N. Dehak, R. Dehak, P. Kenny, N. Brümmer, P. Ouellet, and P. Dumouchel, "Support vector machines versus fast scoring in the lowdimensional total variability space for speaker verification." in INTERSPEECH, vol. 9, 2009, pp. 1559-1562.
    • (2009) INTERSPEECH , vol.9 , pp. 1559-1562
    • Dehak, N.1    Dehak, R.2    Kenny, P.3    Brümmer, N.4    Ouellet, P.5    Dumouchel, P.6
  • 11
    • 84865753339 scopus 로고    scopus 로고
    • Intersession compensation and scoring methods in the i-vectors space for speaker recognition
    • P.-M. Bousquet, D. Matrouf, and J.-F. Bonastre, "Intersession compensation and scoring methods in the i-vectors space for speaker recognition." in InterSpeech, 2011, pp. 485-488.
    • (2011) InterSpeech , pp. 485-488
    • Bousquet, P.-M.1    Matrouf, D.2    Bonastre, J.-F.3
  • 12
    • 84865733857 scopus 로고    scopus 로고
    • Analysis of i-vector length normalization in speaker recognition systems
    • D. Garcia-Romero and C. Y. Espy-Wilson, "Analysis of i-vector length normalization in speaker recognition systems." in Interspeech, 2011, pp. 249-252.
    • (2011) Interspeech , pp. 249-252
    • Garcia-Romero, D.1    Espy-Wilson, C.Y.2
  • 16
    • 70450180496 scopus 로고    scopus 로고
    • The ester 2 evaluation campaign for the rich transcription of french radio broadcasts
    • S. Galliano, G. Gravier, and L. Chaubard, "The ester 2 evaluation campaign for the rich transcription of french radio broadcasts." in Interspeech, vol. 9, 2009, pp. 2583-2586.
    • (2009) Interspeech , vol.9 , pp. 2583-2586
    • Galliano, S.1    Gravier, G.2    Chaubard, L.3
  • 17
    • 85016241152 scopus 로고    scopus 로고
    • The epac corpus: Manual and automatic annotations of conversational speech in french broadcast news
    • Y. Esteve, T. Bazillon, J.-Y. Antoine, F. Béchet, and J. Farinas, "The epac corpus: Manual and automatic annotations of conversational speech in french broadcast news." in LREC, 2010.
    • (2010) LREC
    • Esteve, Y.1    Bazillon, T.2    Antoine, J.-Y.3    Béchet, F.4    Farinas, J.5
  • 19
    • 84907937611 scopus 로고    scopus 로고
    • Srilm-an extensible language modeling toolkit
    • A. Stolcke et al., "Srilm-an extensible language modeling toolkit." in InterSpeech, 2002.
    • (2002) InterSpeech
    • Stolcke, A.1
  • 22
    • 84865783736 scopus 로고    scopus 로고
    • Mixture of plda models in i-vector space for genderindependent speaker recognition
    • M. Senoussaoui, P. Kenny, N. Brümmer, E. De Villiers, and P. Dumouchel, "Mixture of plda models in i-vector space for genderindependent speaker recognition." in InterSpeech, 2011, pp. 25- 28.
    • (2011) InterSpeech , pp. 25-28
    • Senoussaoui, M.1    Kenny, P.2    Brümmer, N.3    De Villiers, E.4    Dumouchel, P.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.