메뉴 건너뛰기




Volumn , Issue , 2014, Pages 800-804

Improving language-universal feature extraction with deep maxout and convolutional neural networks

Author keywords

Deep convolutional networks; Deep maxout networks; Language universal feature extraction

Indexed keywords

COMPLEX NETWORKS; CONVOLUTION; EXTRACTION; FEATURE EXTRACTION; NEURAL NETWORKS; SPEECH COMMUNICATION;

EID: 84910028405     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (22)

References (35)
  • 1
    • 84055222005 scopus 로고    scopus 로고
    • Contextdependent pre-trained deep neural networks for large vocabulary speech recognition
    • G. Dahl, D. Yu, L. Deng, and A. Acero, "Contextdependent pre-trained deep neural networks for large vocabulary speech recognition, " IEEE Transactions on Audio, Speech and Language Processing, vol. 20(1), pp. 30-42, 2012.
    • (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 2
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription, " in Proc. ASRU, pp. 24-29, 2011.
    • (2011) Proc. ASRU , pp. 24-29
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 4
    • 84890527497 scopus 로고    scopus 로고
    • Crosslanguage knowledge transfer using multilingual deep neural network with shared hidden layers
    • J. Huang, J. Li, D. Yu, L. Deng, and Y. Gong, "Crosslanguage knowledge transfer using multilingual deep neural network with shared hidden layers, " in Proc. ICASSP, pp. 7304-7308, 2013.
    • (2013) Proc. ICASSP , pp. 7304-7308
    • Huang, J.1    Li, J.2    Yu, D.3    Deng, L.4    Gong, Y.5
  • 5
    • 84893701756 scopus 로고    scopus 로고
    • Deep maxout networks for low-resource speech recognition
    • Y. Miao, F. Metze, and S. Rawat, "Deep maxout networks for low-resource speech recognition, " in Proc. ASRU, 2013.
    • (2013) Proc. ASRU
    • Miao, Y.1    Metze, F.2    Rawat, S.3
  • 6
    • 84910068044 scopus 로고    scopus 로고
    • Distributed learning of multilingual DNN feature extractors using GPUs
    • Y. Miao, H. Zhang, and F. Metze, "Distributed learning of multilingual DNN feature extractors using GPUs, " to appear in Proc. Interspeech, 2014.
    • (2014) Proc. Interspeech
    • Miao, Y.1    Zhang, H.2    Metze, F.3
  • 7
    • 85161980001 scopus 로고    scopus 로고
    • Sparse deep belief net model for visual area V2
    • H. Lee, C. Ekanadham, and A. Y. Ng, "Sparse deep belief net model for visual area V2, " in Proc. NIPS, 2008.
    • (2008) Proc. NIPS
    • Lee, H.1    Ekanadham, C.2    Ng, A.Y.3
  • 8
    • 71149119164 scopus 로고    scopus 로고
    • Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations
    • H. Lee, R. Grosse, R. Ranganath, and A. Y. Ng, "Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations, " in Proc. ICML, pp. 609-616, 2009.
    • (2009) Proc. ICML , pp. 609-616
    • Lee, H.1    Grosse, R.2    Ranganath, R.3    Ng, A.Y.4
  • 11
    • 84863380535 scopus 로고    scopus 로고
    • Unsupervised feature learning for audio classification using convolutional deep belief networks
    • H. Lee, Y. Largman, P. Pham, and A. Y. Ng, "Unsupervised feature learning for audio classification using convolutional deep belief networks, " in Proc. NIPS, 2009.
    • (2009) Proc. NIPS
    • Lee, H.1    Largman, Y.2    Pham, P.3    Ng, A.Y.4
  • 12
    • 80051612464 scopus 로고    scopus 로고
    • Multilayer perceptron with sparse hidden outputs for phoneme recognition
    • G. Sivaram, and H. Hermansky, "Multilayer perceptron with sparse hidden outputs for phoneme recognition, " in Proc. ICASSP, pp. 5336-5339, 2011.
    • (2011) Proc. ICASSP , pp. 5336-5339
    • Sivaram, G.1    Hermansky, H.2
  • 13
    • 84878538214 scopus 로고    scopus 로고
    • Are sparse representations rich enough for acoustic modeling?
    • O. Vinyals, and L. Deng, "Are sparse representations rich enough for acoustic modeling?, " in Proc. Interspeech, 2012.
    • (2012) Proc. Interspeech
    • Vinyals, O.1    Deng, L.2
  • 15
    • 84905239342 scopus 로고    scopus 로고
    • Improving deep neural network acoustic models using generalized maxout networks
    • X. Zhang, J. Trmal, D. Povey, and S. Khudanpur, "Improving deep neural network acoustic models using generalized maxout networks, " in Proc. ICASSP, 2014.
    • (2014) Proc. ICASSP
    • Zhang, X.1    Trmal, J.2    Povey, D.3    Khudanpur, S.4
  • 16
    • 84867605836 scopus 로고    scopus 로고
    • Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition
    • O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition, " in Proc. ICASSP, pp. 4277-4280, 2012.
    • (2012) Proc. ICASSP , pp. 4277-4280
    • Abdel-Hamid, O.1    Mohamed, A.2    Jiang, H.3    Penn, G.4
  • 18
    • 84906214784 scopus 로고    scopus 로고
    • Exploring convolutional neural network structures and optimization techniques for speech recognition
    • O. Abdel-Hamid, L. Deng, and D. Yu, "Exploring convolutional neural network structures and optimization techniques for speech recognition, " in Proc. Interspeech, pp. 3366-3370, 2013.
    • (2013) Proc. Interspeech , pp. 3366-3370
    • Abdel-Hamid, O.1    Deng, L.2    Yu, D.3
  • 21
    • 84890482429 scopus 로고    scopus 로고
    • Extracting deep bottleneck features using stacked autoencoders
    • J. Gehring, Y. Miao, F. Metze, and A. Waibel, "Extracting deep bottleneck features using stacked autoencoders, " in Proc. ICASSP, pp. 3377-3381, 2013.
    • (2013) Proc. ICASSP , pp. 3377-3381
    • Gehring, J.1    Miao, Y.2    Metze, F.3    Waibel, A.4
  • 22
    • 84906283232 scopus 로고    scopus 로고
    • Using conversational word bursts in spoken term detection
    • J. Chiu, and A. Rudnicky, "Using conversational word bursts in spoken term detection, " in Proc. Interspeech, 2013.
    • (2013) Proc. Interspeech
    • Chiu, J.1    Rudnicky, A.2
  • 24
    • 84906273501 scopus 로고    scopus 로고
    • Improving low-resource CDDNN- HMM using dropout and multilingual DNN training
    • Y. Miao, and F. Metze, "Improving low-resource CDDNN- HMM using dropout and multilingual DNN training, " in Proc. Interspeech, pp. 2237-2241, 2013.
    • (2013) Proc. Interspeech , pp. 2237-2241
    • Miao, Y.1    Metze, F.2
  • 25
    • 84890495545 scopus 로고    scopus 로고
    • Subspace mixture model for low-resource speech recognition in crosslingual settings
    • Y. Miao, F. Metze, and A. Waibel, "Subspace mixture model for low-resource speech recognition in crosslingual settings, " in Proc. ICASSP, pp. 7339-7342, 2013.
    • (2013) Proc. ICASSP , pp. 7339-7342
    • Miao, Y.1    Metze, F.2    Waibel, A.3
  • 27
    • 84890527827 scopus 로고    scopus 로고
    • Improving deep neural networks for LVCSR using rectified linear units and dropout
    • G. Dahl, T. N. Sainath, and G. E. Hinton, "Improving deep neural networks for LVCSR using rectified linear units and dropout, " in Proc. ICASSP, pp. 8609-8613, 2013.
    • (2013) Proc. ICASSP , pp. 8609-8613
    • Dahl, G.1    Sainath, T.N.2    Hinton, G.E.3
  • 28
    • 84890451371 scopus 로고    scopus 로고
    • Phone recognition with deep sparse rectifier neural networks
    • L. Toth, "Phone recognition with deep sparse rectifier neural networks, " in Proc. ICASSP, pp. 6985-6989, 2013.
    • (2013) Proc. ICASSP , pp. 6985-6989
    • Toth, L.1
  • 31
    • 79551480483 scopus 로고    scopus 로고
    • Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
    • P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P. Manzagol, "Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion, " Journal of Machine Learning Research, vol. 11, 2010.
    • (2010) Journal of Machine Learning Research , vol.11
    • Vincent, P.1    Larochelle, H.2    Lajoie, I.3    Bengio, Y.4    Manzagol, P.5
  • 32
    • 84892908371 scopus 로고    scopus 로고
    • A practical guide to training restricted Boltzmann machines
    • G. E. Hinton, "A practical guide to training restricted Boltzmann machines, " UTML TR., 2010.
    • (2010) UTML TR
    • Hinton, G.E.1
  • 34
    • 84910031119 scopus 로고    scopus 로고
    • Towards speaker adaptive training of deep neural network acoustic models
    • Y. Miao, H. Zhang, and F. Metze, "Towards speaker adaptive training of deep neural network acoustic models, " to appear in Proc. Interspeech, 2014.
    • (2014) Proc. Interspeech
    • Miao, Y.1    Zhang, H.2    Metze, F.3
  • 35
    • 84893668797 scopus 로고    scopus 로고
    • Neighbour selection and adaptation for rapid speaker-dependent ASR
    • U. Nallasamy, M. Fuhs, M. Woszczyna, F. Metze, and T. Schultz, "Neighbour selection and adaptation for rapid speaker-dependent ASR, " in Proc. ASRU, pp. 60-65, 2013.
    • (2013) Proc. ASRU , pp. 60-65
    • Nallasamy, U.1    Fuhs, M.2    Woszczyna, M.3    Metze, F.4    Schultz, T.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.