2013, Pages 6704-6708

Deep neural network features and semi-supervised training for low resource speech recognition

Author keywords

bottleneck features; deep neural networks; low resource; semi-supervised training; speech recognition

Indexed keywords

ACOUSTIC MODEL; BOTTLENECK FEATURES; DEEP NEURAL NETWORKS; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; LOW RESOURCE; LOW-RESOURCE SETTINGS; LOW-RESOURCE SPEECH RECOGNITION; SEMI-SUPERVISED TRAININGS

EID: 84890474716     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6638959     Document Type: Conference Paper
Times cited: 164

References (38)
  • 1
    • 70349220094
    • A study on multilingual acoustic modeling for large vocabulary ASR
    • H. Lin, L. Deng, D. Yu, Y. Gong, A. Acero, and C.H. Lee, "A study on multilingual acoustic modeling for large vocabulary ASR," in IEEE ICASSP, 2009
    • (2009) IEEE ICASSP
    • Lin, H.1    Deng, L.2    Yu, D.3    Gong, Y.4    Acero, A.5    Lee, C.H.6
  • 4
    • 84890456495
    • Regularized subspace Gaussian mixture models for cross-lingual speech recognition
    • L. Lu, A. Ghoshal, and S. Renals, "Regularized subspace Gaussian mixture models for cross-lingual speech recognition," in IEEE ASRU, 2011
    • (2011) IEEE ASRU
    • Lu, L.1    Ghoshal, A.2    Renals, S.3
  • 5
    • 84865804486
    • State-level data borrowing for low-resource speech recognition based on subspace GMMs
    • Y. Qian, D. Povey, and J. Liu, "State-level data borrowing for low-resource speech recognition based on subspace GMMs," in ISCA Interspeech, 2011
    • (2011) ISCA Interspeech
    • Qian, Y.1    Povey, D.2    Liu, J.3
  • 6
    • 84890500781
    • On use of task independent training data in tandem feature extraction
    • S. Sivadas and H. Hermansky, "On use of task independent training data in tandem feature extraction," in IEEE ICASSP, 2004
    • (2004) IEEE ICASSP
    • Sivadas, S.1    Hermansky, H.2
  • 7
    • 84890483790
    • Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons
    • A. Stolcke, F. Grezl, M.Y. Hwang, X. Lei, N. Morgan, and D. Vergyri, "Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons," in IEEE ICASSP, 2006
    • (2006) IEEE ICASSP
    • Stolcke, A.1    Grezl, F.2    Hwang, M.Y.3    Lei, X.4    Morgan, N.5    Vergyri, D.6
  • 8
    • 79959819891
    • Cross-lingual and multistream posterior features for low resource LVCSR systems
    • S. Thomas, S. Ganapathy, and H. Hermansky, "Cross-lingual and multistream posterior features for low resource LVCSR systems," in ISCA Interspeech, 2010
    • (2010) ISCA Interspeech
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 9
    • 84878582419
    • Cross-lingual and ensemble MLPs-Strategies for low-resource speech recognition
    • Y. Qian and J. Liu, "Cross-lingual and ensemble MLPs-Strategies for low-resource speech recognition," in ISCA Interspeech, 2012
    • (2012) ISCA Interspeech
    • Qian, Y.1    Liu, J.2
  • 10
    • 84890458274
    • Initialization schemes for multilayer perceptron training and their impact on ASR performance using multilingual data
    • N. Thang, B. Wojtek, F. Metze, and T. Schultz, "Initialization schemes for multilayer perceptron training and their impact on ASR performance using multilingual data," in ISCA Interspeech, 2012
    • (2012) ISCA Interspeech
    • Thang, N.1    Wojtek, B.2    Metze, F.3    Schultz, T.4
  • 11
    • 84890513744
    • Multilingual MLP features for low-resource LVCSR systems
    • S. Thomas, S. Ganapathy, and H. Hermansky, "Multilingual MLP features for low-resource LVCSR systems," in IEEE ICASSP, 2012
    • (2012) IEEE ICASSP
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 12
    • 84878392008
    • Data-driven posterior features for low resource speech recognition applications
    • S. Thomas, S. Ganapathy, A. Jansen, and H. Hermansky, "Data-driven posterior features for low resource speech recognition applications," in ISCA Interspeech, 2012
    • (2012) ISCA Interspeech
    • Thomas, S.1    Ganapathy, S.2    Jansen, A.3    Hermansky, H.4
  • 13
    • 84890453097
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in IEEE ASRU, 2011
    • (2011) IEEE ASRU
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 16
    • 84055163920
    • Roles of pre-training and fine-tuning in context-dependent DBN-HMMs for real-world speech recognition
    • D. Yu, L. Deng, and G. Dahl, "Roles of pre-training and fine-tuning in context-dependent DBN-HMMs for real-world speech recognition," in NIPS Workshop, 2010
    • (2010) NIPS Workshop
    • Yu, D.1    Deng, L.2    Dahl, G.3
  • 17
    • 84055222005
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • G.E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE TASLP, 2012
    • (2012) IEEE TASLP
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 18
    • 84865785753
    • Improved bottleneck features using pretrained deep neural networks
    • D. Yu and M.L. Seltzer, "Improved bottleneck features using pretrained deep neural networks," ISCA Interspeech, 2011
    • (2011) ISCA Interspeech
    • Yu, D.1    Seltzer, M.L.2
  • 19
    • 84890515212
    • Autoencoder bottleneck features using deep belief networks
    • T.N. Sainath, B. Kingsbury, and B. Ramabhadran, "Autoencoder bottleneck features using deep belief networks," in IEEE ICASSP, 2012
    • (2012) IEEE ICASSP
    • Sainath, T.N.1    Kingsbury, B.2    Ramabhadran, B.3
  • 23
    • 34547548235
    • Probabilistic and bottle-neck features for LVCSR of meetings
    • F. Grezl, M. Karafiat, S. Kontar, and J. Cernocky, "Probabilistic and bottle-neck features for LVCSR of meetings," in IEEE ICASSP, 2007
    • (2007) IEEE ICASSP
    • Grezl, F.1    Karafiat, M.2    Kontar, S.3    Cernocky, J.4
  • 24
    • 84890500819
    • Learning long-term temporal features in LVCSR using neural networks
    • B. Chen, Q. Zhu, and N. Morgan, "Learning long-term temporal features in LVCSR using neural networks," in ISCA ICSLP, 2004
    • (2004) ISCA ICSLP
    • Chen, B.1    Zhu, Q.2    Morgan, N.3
  • 27
  • 28
    • 0002035663
    • Switchboard: Telephone speech corpus for research and development
    • J. Godfrey, E. Holliman, and J. McDaniel, "Switchboard: Telephone speech corpus for research and development," in IEEE ICASSP, 1992
    • (1992) IEEE ICASSP
    • Godfrey, J.1    Holliman, E.2    McDaniel, J.3
  • 31
    • 84890474252
    • Phoneme recognition using spectral envelope and modulation frequency features
    • S. Thomas, S. Ganapathy, and H. Hermansky, "Phoneme recognition using spectral envelope and modulation frequency features," in IEEE ICASSP, 2009
    • (2009) IEEE ICASSP
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 33
    • 85135261720
    • Unsupervised training of a speech recognizer: Recent experiments
    • T. Kemp and A. Waibel, "Unsupervised training of a speech recognizer: Recent experiments," in ISCA Eurospeech, 1999
    • (1999) ISCA Eurospeech
    • Kemp, T.1    Waibel, A.2
  • 35
    • 84890521566
    • Unsupervised training on large amounts of broadcast news data
    • J. Ma, S. Matsoukas, O. Kimball, and R. Schwartz, "Unsupervised training on large amounts of broadcast news data," in IEEE ICASSP, 2006
    • (2006) IEEE ICASSP
    • Ma, J.1    Matsoukas, S.2    Kimball, O.3    Schwartz, R.4
  • 36
    • 70450189191
    • Analysis of low-resource acoustic model self-training
    • S. Novotney and R. Schwartz, "Analysis of low-resource acoustic model self-training," in ISCA Interspeech, 2009
    • (2009) ISCA Interspeech
    • Novotney, S.1    Schwartz, R.2
  • 37
    • 85135146711
    • Estimating confidence using word lattices
    • T. Kemp and T. Schaaf, "Estimating confidence using word lattices," in ISCA Eurospeech, 1997
    • (1997) ISCA Eurospeech
    • Kemp, T.1    Schaaf, T.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.