



Volume 22, Issue 1, 2014, Pages 17-27

Cross-lingual subspace Gaussian mixture models for low-resource speech recognition

Author keywords

Acoustic modeling; Adaptation; Cross lingual speech recognition; Regularization; Subspace Gaussian mixture model

Indexed keywords

COMMUNICATION CHANNELS (INFORMATION THEORY); HIDDEN MARKOV MODELS; OBJECT RECOGNITION; SPEECH RECOGNITION; AUDIO ACOUSTICS; COMPUTATIONAL LINGUISTICS; GAUSSIAN DISTRIBUTION; MARKOV PROCESSES; TRELLIS CODES;

EID: 84897937578     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2013.2281575     Document Type: Article
Times cited : (25)

References (40)
  • 1
    • 85135166225
    • Fast bootstrapping of LVCSR systems with multilingual phoneme sets
    • T. Schultz and A. Waibel, "Fast bootstrapping of LVCSR systems with multilingual phoneme sets," in Proc. Eurospeech, 1997, pp. 371-374.
    • Proc. Eurospeech, 1997 , pp. 371-374
    • Schultz, T.1    Waibel, A.2
  • 4
    • 33646764228
    • First steps in fast acoustic modeling for a new target language: Application to Vietnamese
    • V. B. Le and L. Besacier, "First steps in fast acoustic modeling for a new target language: Application to Vietnamese," in Proc. ICASSP, 2005, pp. 821-824.
    • Proc. ICASSP, 2005 , pp. 821-824
    • Le, V.B.1    Besacier, L.2
  • 5
    • 79959819891
    • Cross-lingual and multi-stream posterior features for low resource LVCSR systems
    • S. Thomas, S. Ganapathy, and H. Hermansky, "Cross-lingual and multi-stream posterior features for low resource LVCSR systems," in Proc. INTERSPEECH, 2010, pp. 877-880.
    • Proc. INTERSPEECH, 2010 , pp. 877-880
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 6
    • 0030363039
    • Dictionary learning for spontaneous speech recognition
    • T. Sloboda and A. Waibel, "Dictionary learning for spontaneous speech recognition," in Proc. ICSLP, 1996, pp. 2328-2331.
    • Proc. ICSLP, 1996 , pp. 2328-2331
    • Sloboda, T.1    Waibel, A.2
  • 7
    • 0033708114
    • Automatic generation of phone sets and lexical transcriptions
    • R. Singh, B. Raj, and R. M. Stern, "Automatic generation of phone sets and lexical transcriptions," in Proc. ICASSP, 2000, pp. 1691-1694.
    • Proc. ICASSP, 2000 , pp. 1691-1694
    • Singh, R.1    Raj, B.2    Stern, R.M.3
  • 9
    • 0035426931
    • Language-independent and language-adaptive acoustic modeling for speech recognition
    • T. Schultz and A. Waibel, "Language-independent and language-adaptive acoustic modeling for speech recognition," Speech Commun., vol. 35, no. 1, pp. 31-52, 2001.
    • (2001) Speech Commun. , vol.35 , Issue.1 , pp. 31-52
    • Schultz, T.1    Waibel, A.2
  • 10
    • 85009274666
    • GlobalPhone: A multilingual speech and text database developed at Karlsruhe University
    • T. Schultz, "GlobalPhone: A multilingual speech and text database developed at Karlsruhe University," in Proc. ICSLP, 2002, pp. 345-348.
    • Proc. ICSLP, 2002 , pp. 345-348
    • Schultz, T.1
  • 11
    • 0030371812
    • Multi-lingual phoneme recognition exploiting acoustic-phonetic similarities of sounds
    • J. Kohler, "Multi-lingual phoneme recognition exploiting acoustic-phonetic similarities of sounds," in Proc. ICSLP, 1996, pp. 2195-2198.
    • Proc. ICSLP, 1996 , pp. 2195-2198
    • Kohler, J.1
  • 12
    • 84862931515
    • Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data
    • S. M. Siniscalchi, D. C. Lyu, T. Svendsen, and C. H. Lee, "Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 3, pp. 875-887, Mar. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.3 , pp. 875-887
    • Siniscalchi, S.M.1    Lyu, D.C.2    Svendsen, T.3    Lee, C.H.4
  • 13
    • 51449101990
    • Robust phone set mapping using decision tree clustering for cross-lingual phone recognition
    • K. C. Sim and H. Li, "Robust phone set mapping using decision tree clustering for cross-lingual phone recognition," in Proc. ICASSP, 2008, pp. 4309-4312.
    • Proc. ICASSP, 2008 , pp. 4309-4312
    • Sim, K.C.1    Li, H.2
  • 14
    • 77949342620
    • Discriminative product-of-expert acoustic mapping for cross-lingual phone recognition
    • K. C. Sim, "Discriminative product-of-expert acoustic mapping for cross-lingual phone recognition," in Proc. ASRU, 2009, pp. 546-551.
    • Proc. ASRU, 2009 , pp. 546-551
    • Sim, K.C.1
  • 15
    • 0033709098
    • Tandem connectionist feature extraction for conventional HMM systems
    • H. Hermansky, D. P. W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. ICASSP, 2000, pp. 1635-1638.
    • Proc. ICASSP, 2000 , pp. 1635-1638
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 16
    • 33947619591
    • Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons
    • A. Stolcke, F. Grézl, M. Y. Hwang, X. Lei, N. Morgan, and D. Vergyri, "Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons," in Proc. ICASSP, 2006, pp. 321-324.
    • Proc. ICASSP, 2006 , pp. 321-324
    • Stolcke, A.1    Grézl, F.2    Hwang, M.Y.3    Lei, X.4    Morgan, N.5    Vergyri, D.6
  • 18
    • 84858971854
    • Strategies for using MLP based features with limited target-language training data
    • Y. Qian, J. Xu, D. Povey, and L. Jia, "Strategies for using MLP based features with limited target-language training data," in Proc. ASRU, 2011, pp. 354-358.
    • Proc. ASRU, 2011 , pp. 354-358
    • Qian, Y.1    Xu, J.2    Povey, D.3    Jia, L.4
  • 19
    • 84858976609
    • Cross-lingual portability of Chinese and English neural network features for French and German LVCSR
    • C. Plahl, R. Schluter, and H. Ney, "Cross-lingual portability of Chinese and English neural network features for French and German LVCSR," in Proc. ASRU, 2011, pp. 371-376.
    • Proc. ASRU, 2011 , pp. 371-376
    • Plahl, C.1    Schluter, R.2    Ney, H.3
  • 21
    • 84055222005
    • Context-dependent pretrained deep neural networks for large-vocabulary speech recognition
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pretrained deep neural networks for large-vocabulary speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 30-42, Jan. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 22
    • 84874278045
    • Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR
    • P. Swietojanski, A. Ghoshal, and S. Renals, "Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR," in Proc. IEEE SLT, 2012, pp. 246-251.
    • Proc. IEEE SLT, 2012 , pp. 246-251
    • Swietojanski, P.1    Ghoshal, A.2    Renals, S.3
  • 24
    • 84858976352
    • Fast and flexible Kullback-Leibler divergence based acoustic modelling for non-native speech recognition
    • D. Imseng, R. Rasipuram, and M. Magimai-Doss, "Fast and flexible Kullback-Leibler divergence based acoustic modelling for non-native speech recognition," in Proc. ASRU, 2011, pp. 348-353.
    • Proc. ASRU, 2011 , pp. 348-353
    • Imseng, D.1    Rasipuram, R.2    Magimai-Doss, M.3
  • 25
    • 84867616349
    • Using KL-divergence and multilingual information to improve ASR for under-resourced languages
    • D. Imseng, H. Bourlard, and P. N. Garner, "Using KL-divergence and multilingual information to improve ASR for under-resourced languages," in Proc. ICASSP, 2012, pp. 4869-4872.
    • Proc. ICASSP, 2012 , pp. 4869-4872
    • Imseng, D.1    Bourlard, H.2    Garner, P.N.3
  • 28
    • 84858952433
    • Regularized subspace Gaussian mixture models for cross-lingual speech recognition
    • L. Lu, A. Ghoshal, and S. Renals, "Regularized subspace Gaussian mixture models for cross-lingual speech recognition," in Proc. IEEE ASRU, 2011, pp. 922-932.
    • Proc. IEEE ASRU, 2011 , pp. 922-932
    • Lu, L.1    Ghoshal, A.2    Renals, S.3
  • 29
    • 84867597584
    • Maximum a posteriori adaptation of subspace Gaussian mixture models for cross-lingual speech recognition
    • L. Lu, A. Ghoshal, and S. Renals, "Maximum a posteriori adaptation of subspace Gaussian mixture models for cross-lingual speech recognition," in Proc. ICASSP, 2012, pp. 4877-4880.
    • Proc. ICASSP, 2012 , pp. 4877-4880
    • Lu, L.1    Ghoshal, A.2    Renals, S.3
  • 30
    • 79958067294
    • Regularized subspace Gaussian mixture models for speech recognition
    • L. Lu, A. Ghoshal, and S. Renals, "Regularized subspace Gaussian mixture models for speech recognition," IEEE Signal Process. Lett., vol. 18, no. 7, pp. 419-422, 2011.
    • (2011) IEEE Signal Process. Lett. , vol.18 , Issue.7 , pp. 419-422
    • Lu, L.1    Ghoshal, A.2    Renals, S.3
  • 34
    • 33645712892
    • Compressed sensing
    • D. L. Donoho, "Compressed sensing," IEEE Trans. Inf. Theory, vol. 52, no. 4, pp. 1289-1306, Apr. 2006.
    • (2006) IEEE Trans. Inf. Theory , vol.52 , Issue.4 , pp. 1289-1306
    • Donoho, D.L.1
  • 35
    • 39449126969
    • Gradient projection for sparse reconstruction: Application to compressed sensing and other inverse problems
    • M. A. T. Figueiredo, R. D. Nowak, and S. J. Wright, "Gradient projection for sparse reconstruction: Application to compressed sensing and other inverse problems," IEEE J. Sel. Topics Signal Process., vol. 1, no. 4, pp. 586-597, Dec. 2007.
    • (2007) IEEE J. Sel. Topics Signal Process. , vol.1 , Issue.4 , pp. 586-597
    • Figueiredo, M.A.T.1    Nowak, R.D.2    Wright, S.J.3
  • 37
    • 0035341086
    • Joint Maximum a Posteriori adaptation of transformation and HMM parameters
    • DOI 10.1109/89.917687, PII S1063667601027419
    • O. Siohan, C. Chesta, and C. H. Lee, "Joint maximum a posteriori adaptation of transformation and HMM parameters," IEEE Trans. Speech Audio Process., vol. 9, no. 4, pp. 417-428, May 2001. (Pubitemid 32372183)
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.4 , pp. 417-428
    • Siohan, O.1    Chesta, C.2    Lee, C.-H.3
  • 38
    • 0033236298
    • The MLE algorithm for the matrix normal distribution
    • P. Dutilleul, "The MLE algorithm for the matrix normal distribution," J. Statist. Comput. Simulat., vol. 64, no. 2, pp. 105-123, 1999.
    • (1999) J. Statist. Comput. Simulat. , vol.64 , Issue.2 , pp. 105-123
    • Dutilleul, P.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.