메뉴 건너뛰기




Volumn , Issue , 2013, Pages 6975-6979

Multi-level adaptive networks in tandem and hybrid ASR systems

Author keywords

BBC; deep neural networks; hybrid; MLAN; tandem; TED

Indexed keywords

BBC; DEEP NEURAL NETWORKS; HYBRID; MLAN; TANDEM; TED;

EID: 84890537527     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6639014     Document Type: Conference Paper
Times cited : (38)

References (34)
  • 3
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • H. Hermanksy, D.P.W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. ICASSP, 2000, pp. 1635-1630
    • (2000) Proc. ICASSP , pp. 1635-1630
    • Hermanksy, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 4
    • 33745528628 scopus 로고    scopus 로고
    • Using MLP features in SRIs conversational speech recognition system
    • Q. Zhu, A. Stolcke, B.Y. Chen, and N. Morgan, "Using MLP features in SRIs conversational speech recognition system," in Proc. Interspeech, 2005
    • (2005) Proc. Interspeech
    • Zhu, Q.1    Stolcke, A.2    Chen, B.Y.3    Morgan, N.4
  • 8
    • 0028530231 scopus 로고
    • State clustering in hidden Markov model-based continuous speech recognition
    • S. J. Young and P. C. Woodland, "State clustering in hidden Markov model-based continuous speech recognition," Computer Speech &Language, vol. 8, no. 4, pp. 369-383, 1994
    • (1994) Computer Speech &Language , vol.8 , Issue.4 , pp. 369-383
    • Young, S.J.1    Woodland, P.C.2
  • 9
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transforms for HMM-based speech recognition
    • "Maximum likelihood linear transforms for HMM-based speech recognition," Computer Speech and Language, vol. 12, no. 75-98, 1998
    • (1998) Computer Speech and Language , vol.12 , Issue.75-98
  • 10
    • 0036296863 scopus 로고    scopus 로고
    • Minimum phone error and I-smoothing for improved discriminative training
    • D. Povey and P.C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP. IEEE, 2002, vol. I, pp. 105-108
    • (2002) Proc. ICASSP. IEEE , vol.1 , pp. 105-108
    • Povey, D.1    Woodland, P.C.2
  • 13
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • G.E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp. 30-42, 2012
    • (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 14
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU, 2011
    • (2011) Proc. ASRU
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 16
    • 85009078709 scopus 로고
    • CDNN: A context dependent neural network for continuous speech recognition
    • H. Bourlard, N. Morgan, C.Wooters, and S. Renals, "CDNN: A context dependent neural network for continuous speech recognition," in Proc. ICASSP, 1992, vol. 2, pp. 349-352
    • (1992) Proc. ICASSP , vol.2 , pp. 349-352
    • Bourlard, H.1    Morgan, N.2    Wooters, C.3    Renals, S.4
  • 17
    • 0030371791 scopus 로고    scopus 로고
    • The 1995 ABBOT LVCSR system for multiple unknown microphones
    • D. Kershaw, T. Robinson, and S. Renals, "The 1995 ABBOT LVCSR system for multiple unknown microphones," in Proc. ICSLP, 1996, pp. 1325-1328
    • (1996) Proc. ICSLP , pp. 1325-1328
    • Kershaw, D.1    Robinson, T.2    Renals, S.3
  • 18
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • G. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets," Neural Computation, vol. 18, pp. 1527-1554, 2006
    • (2006) Neural Computation , vol.18 , pp. 1527-1554
    • Hinton, G.1    Osindero, S.2    Teh, Y.3
  • 19
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech, 2011
    • (2011) Proc. Interspeech
    • Seide, F.1    Li, G.2    Yu, D.3
  • 21
    • 84867593213 scopus 로고    scopus 로고
    • Auto-encoder bottleneck features using deep belief networks
    • T. Sainath, B. Kingsbury, and B. Ramabhadran, "Auto-encoder bottleneck features using deep belief networks," in Proc ICASSP, 2012
    • (2012) Proc ICASSP
    • Sainath, T.1    Kingsbury, B.2    Ramabhadran, B.3
  • 22
    • 84878392008 scopus 로고    scopus 로고
    • Data-driven posterior features for low resource speech recognition applications
    • S. Thomas, S. Ganapathy, A. Jansen, and H. Hermansky, "Data-driven posterior features for low resource speech recognition applications," in Proc. Interspeech, 2012
    • (2012) Proc. Interspeech
    • Thomas, S.1    Ganapathy, S.2    Jansen, A.3    Hermansky, H.4
  • 23
    • 84858955616 scopus 로고    scopus 로고
    • Study of probabilistic and bottle-neck features in multilingual environment
    • F. Grezl, M. Karafiat, and M. Janda, "Study of probabilistic and bottle-neck features in multilingual environment," in Proc. ASRU, 2011
    • (2011) Proc. ASRU
    • Grezl, F.1    Karafiat, M.2    Janda, M.3
  • 24
    • 84867606552 scopus 로고    scopus 로고
    • Multilingual MLP features for low-resource LVCSR systems
    • S. Thomas, S. Ganapathy, and H. Hermansky, "Multilingual MLP features for low-resource LVCSR systems," in Proc. ICASSP, 2012
    • (2012) Proc. ICASSP
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 25
    • 4544236237 scopus 로고    scopus 로고
    • On use of task independent training data in tandem feature extraction
    • S. Sivadas and H. Hermansky, "On use of task independent training data in tandem feature extraction," in Proc. ICASSP, 2004
    • (2004) Proc. ICASSP
    • Sivadas, S.1    Hermansky, H.2
  • 26
    • 33947619591 scopus 로고    scopus 로고
    • Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons
    • A. Stolcke, F. Grezl, M.-Y. Hwang, X Lei, N. Morgan, and D. Vergyri, "Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons," in Proc. ICASSP, 2006
    • (2006) Proc. ICASSP
    • Stolcke, A.1    Grezl, F.2    Hwang, M.-Y.3    Lei, X.4    Morgan, N.5    Vergyri, D.6
  • 27
    • 78049384951 scopus 로고    scopus 로고
    • Multi-style MLP features for BN transcription
    • V.-B. Le, L. Lamel, and J.-L. Gauvain, "Multi-style MLP features for BN transcription," in Proc. ICASSP, 2010, pp. 4866-4869
    • (2010) Proc. ICASSP , pp. 4866-4869
    • Le, V.-B.1    Lamel, L.2    Gauvain, J.-L.3
  • 28
    • 77949374930 scopus 로고    scopus 로고
    • MLP based hierachical system for task adaptation in ASR
    • J. Pinto, M. Magimai-Doss, and H. Bourlard, "MLP based hierachical system for task adaptation in ASR," in Proc. ASRU, 2009
    • (2009) Proc. ASRU
    • Pinto, J.1    Magimai-Doss, M.2    Bourlard, H.3
  • 30
    • 33947620115 scopus 로고    scopus 로고
    • Hierarchical structures of neural networks for phoneme recognition
    • P. Schwarz, Matejka P., and J. Cernokcy, "Hierarchical structures of neural networks for phoneme recognition," in Proc. ICASSP, 2006
    • (2006) Proc. ICASSP
    • Schwarz, P.1    Matejka, P.2    Cernokcy, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.