메뉴 건너뛰기




Volumn , Issue , 2013, Pages 138-143

Investigation of multilingual deep neural networks for spoken term detection

Author keywords

keyword search; Multilingual; neural networks; speech recognition; spoken term detection

Indexed keywords

BOTTLENECK FEATURES; DEEP NEURAL NETWORKS; KEYWORD SEARCH; LANGUAGE INDEPENDENTS; MULTILINGUAL; MULTILINGUAL APPROACH; SPOKEN TERM DETECTIONS; TANDEM CONFIGURATION;

EID: 84893668957     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2013.6707719     Document Type: Conference Paper
Times cited : (89)

References (34)
  • 2
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • H. Hermansky, D. Ellis, and S. Sharma, Tandem Connectionist Feature Extraction for Conventional HMM Systems, in Proc. ICASSP, 2000.
    • (2000) Proc. ICASSP
    • Hermansky, H.1    Ellis, D.2    Sharma, S.3
  • 3
    • 69349090197 scopus 로고    scopus 로고
    • Learning deep architectures for AI
    • Y Bengio, Learning deep architectures for AI, Found. Trends Mach. Learn., vol. 2, pp. 1-127, 2009.
    • (2009) Found. Trends Mach. Learn. , vol.2 , pp. 1-127
    • Bengio, Y.1
  • 4
    • 84856243373 scopus 로고    scopus 로고
    • Current trends in multilingual speech processing
    • H. Bourlard et al., Current trends in multilingual speech processing, Sadhana, vol. 36, pp. 885-915, 2011.
    • (2011) Sadhana , vol.36 , pp. 885-915
    • Bourlard, H.1
  • 5
    • 34547548235 scopus 로고    scopus 로고
    • Probabilistic and bottle-neck features for LVCSR of meetings
    • Frantisek Grezl et al., Probabilistic and bottle-neck features for LVCSR of meetings, in Proc. ICASSP, 2007.
    • (2007) Proc. ICASSP
    • Grezl, F.1
  • 6
    • 79959819891 scopus 로고    scopus 로고
    • Cross-lingual and multi-stream posterior features for lowresource LVCSR systems
    • Samuel Thomas, Sriram Ganapathy, and Hynek Hermansky, Cross-lingual and multi-stream posterior features for lowresource LVCSR systems, in Proc. Interspeech, 2010.
    • (2010) Proc. Interspeech
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 8
    • 84867606552 scopus 로고    scopus 로고
    • Multilingual MLP features for low-resource LVCSR systems
    • Samuel Thomas, Sriram Ganapathy, and Hynek Hermansky, Multilingual MLP features for low-resource LVCSR systems, in Proc. ICASSP, 2012.
    • (2012) Proc. ICASSP
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 9
    • 84890474441 scopus 로고    scopus 로고
    • Investigation on cross- And multilingual MLP features under matched and mismatched acoustical conditions
    • Zoltan Tuske et al., Investigation on cross- And multilingual MLP features under matched and mismatched acoustical conditions, in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Tuske, Z.1
  • 11
    • 84867224965 scopus 로고    scopus 로고
    • On the use of a multilingual neural network front-end
    • Stefano Scanzio et al., On the use of a multilingual neural network front-end, in Proc. Interspeech, 2008.
    • (2008) Proc. Interspeech
    • Scanzio, S.1
  • 13
    • 84890527497 scopus 로고    scopus 로고
    • Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers
    • Jui-Ting Huang et al., Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers, in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Huang, J.-T.1
  • 14
    • 84890539009 scopus 로고    scopus 로고
    • Multilingual acoustic models using distributed deep neural networks
    • Georg Heigold et al., Multilingual acoustic models using distributed deep neural networks, in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Heigold, G.1
  • 15
    • 33947619591 scopus 로고    scopus 로고
    • Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons
    • Andreas Stolcke et al., Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons, in Proc. ICASSP, 2006.
    • (2006) Proc. ICASSP
    • Stolcke, A.1
  • 16
    • 85135166225 scopus 로고    scopus 로고
    • Fast bootstrapping of LVCSR systems with multilingual phoneme sets
    • Tanja Schultz and AlexWaibel, Fast bootstrapping of LVCSR systems with multilingual phoneme sets, in Proc. Eurospeech, 1997.
    • (1997) Proc. Eurospeech
    • Schultz, T.1    Waibel, A.2
  • 17
    • 33846194657 scopus 로고    scopus 로고
    • Feature extraction and acoustic modeling: An approach for improved generalization across languages and accents
    • Stephane Dupont et al., Feature extraction and acoustic modeling: An approach for improved generalization across languages and accents, in Proc. ASRU, 2005.
    • (2005) Proc. ASRU
    • Dupont, S.1
  • 18
    • 0033690885 scopus 로고    scopus 로고
    • Towards language independent acoustic modeling
    • W. Byrne et al., Towards language independent acoustic modeling, in Proc. ICASSP, 2000.
    • (2000) Proc. ICASSP
    • Byrne, W.1
  • 19
    • 51449101990 scopus 로고    scopus 로고
    • Robust phone set mapping using decision tree clustering for cross-lingual phone recognition
    • Khe Chai Sim and Haizhou Li, Robust phone set mapping using decision tree clustering for cross-lingual phone recognition, in Proc. ICASSP, 2008.
    • (2008) Proc. ICASSP
    • Sim, K.C.1    Li, H.2
  • 20
    • 84893645881 scopus 로고    scopus 로고
    • The language-independent bottleneck features
    • Karel Vesely et al., The language-independent bottleneck features, in Proc. SLT, 2012.
    • (2012) Proc. SLT
    • Vesely, K.1
  • 21
    • 84858955616 scopus 로고    scopus 로고
    • Study of probabilistic and bottle-neck features in multilingual environment
    • Frantisek Grezl, Martin Karafiat, and Milos Janda, Study of probabilistic and bottle-neck features in multilingual environment, in Proc. ASRU, 2011.
    • (2011) Proc. ASRU
    • Grezl, F.1    Karafiat, M.2    Janda, M.3
  • 22
    • 85009274666 scopus 로고    scopus 로고
    • Globalphone: A multilingual speech and text database developed at karlsruhe univ
    • T. Schultz, GlobalPhone: A multilingual speech and text database developed at Karlsruhe Univ., in Proc. ICSLP, 2002.
    • (2002) Proc. ICSLP
    • Schultz, T.1
  • 24
    • 79251574977 scopus 로고    scopus 로고
    • The efficient incorporation of MLP features into automatic speech recognition systems
    • J. Park et al., The Efficient Incorporation of MLP Features into Automatic Speech Recognition Systems, Computer Speech and Language, vol. 25, pp. 519-534, 2010.
    • (2010) Computer Speech and Language , vol.25 , pp. 519-534
    • Park, J.1
  • 25
    • 33646788786 scopus 로고    scopus 로고
    • FMPE: Discriminatively trained features for speech recognition
    • Daniel Povey et al., fMPE: Discriminatively trained features for speech recognition, in Proc. ICASSP, 2005.
    • (2005) Proc. ICASSP
    • Povey, D.1
  • 27
    • 85032751458 scopus 로고    scopus 로고
    • Deep neural networks for acoustic modeling in speech recognition
    • IEEE, Nov
    • G. Hinton, L. Deng, et al., Deep Neural Networks for Acoustic Modeling in Speech Recognition, Signal Processing Mag- Azine, IEEE, vol. 29, no. 6, pp. 82-97, Nov 2012.
    • (2012) Signal Processing Mag- Azine , vol.29 , Issue.6 , pp. 82-97
    • Hinton, G.1    Deng, L.2
  • 28
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • Dec
    • F. Seide, G. Li, X. Chen, and D. Yu, Feature engineering in context-dependent deep neural networks for conversational speech transcription, in Proc. ASRU, Dec 2011.
    • (2011) Proc. ASRU
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 30
    • 84862931515 scopus 로고    scopus 로고
    • Experiments on cross-language attribute detection and phone recognition with minimal targetspecific training data
    • S.M. Siniscalchi et al., Experiments on cross-language attribute detection and phone recognition with minimal targetspecific training data, IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 3, pp. 875-887, 2012.
    • (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.3 , pp. 875-887
    • Siniscalchi, S.M.1
  • 31
    • 0003822743 scopus 로고    scopus 로고
    • (for HTK version 3.4), Cambridge University
    • S. J. Young et al., The HTK Book (for HTK version 3.4), Cambridge University, 2006.
    • (2006) The HTK Book
    • Young, S.J.1
  • 32
    • 84893712779 scopus 로고    scopus 로고
    • David Johnson et al., QuickNet, http://www1.icsi.berkeley.edu/Speech/qn. html.
    • QuickNet
    • Johnson, D.1
  • 33
    • 84891308106 scopus 로고    scopus 로고
    • Srilm - An extensible language modeling toolkit
    • A. Stolcke, SRILM - An Extensible Language Modeling Toolkit, in Proc. ICSLP, 2002.
    • (2002) Proc. ICSLP
    • Stolcke, A.1
  • 34
    • 84890537373 scopus 로고    scopus 로고
    • A high-performance cantonese keyword search system
    • B. Kingsbury et al., A high-performance Cantonese keyword search system, in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Kingsbury, B.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.