Volume 2015-January, 2015, Pages 3660-3664

Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages

Author keywords

Deep neural network; Hybrid; Joint decoding; Keyword spotting; Tandem

Indexed keywords

COMPUTATIONAL LINGUISTICS; DECODING; HYBRID SYSTEMS; SEARCH ENGINES; SPEECH COMMUNICATION;

EID: 84959166110     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (42)

References (43)
  • 3
    • 84910031125
    • Data augmentation for low resource languages
    • A. Ragni, K. Knill, S. Rath, and M. Gales, "Data augmentation for low resource languages," in Proc. Interspeech, 2014, pp. 810-814.
    • (2014) Proc. Interspeech , pp. 810-814
    • Ragni, A.1    Knill, K.2    Rath, S.3    Gales, M.4
  • 4
    • 84910067354
    • Language independent and unsupervised acoustic models for speech recognition and keyword spotting
    • K. Knill, M. Gales, A. Ragni, and S. Rath, "Language independent and unsupervised acoustic models for speech recognition and keyword spotting," in Proc. INTERSPEECH, 2014, pp. 20-26.
    • (2014) Proc. INTERSPEECH , pp. 20-26
    • Knill, K.1    Gales, M.2    Ragni, A.3    Rath, S.4
  • 6
    • 0030657238
    • Analyses of multiple evidence combination
    • J. Lee, "Analyses of multiple evidence combination," in ACM SIGIR, 1997, pp. 267-276.
    • (1997) ACM SIGIR , pp. 267-276
    • Lee, J.1
  • 8
    • 84946036768
    • Low-resource keyword search strategies for Tamil
    • N. Chen et al., "Low-resource keyword search strategies for Tamil," in Proc. ICASSP, 2015, pp. 5366-5370.
    • (2015) Proc. ICASSP , pp. 5366-5370
    • Chen, N.1
  • 9
    • 0030638031
    • A Post-processing System to Yield Reduced Word Error Rates: Recogniser Output Voting Error Reduction (ROVER)
    • J. G. Fiscus, "A Post-processing System to Yield Reduced Word Error Rates: Recogniser Output Voting Error Reduction (ROVER)," in Proc. ASRU, 1997, pp. 347-354.
    • (1997) Proc. ASRU , pp. 347-354
    • Fiscus, J.G.1
  • 10
    • 4544253834
    • Posterior probability decoding, confidence estimation and system combination
    • G. Evermann and P. Woodland, "Posterior Probability Decoding, Confidence Estimation and System Combination," in Proc. Speech Transcription Workshop, vol. 27, 2000.
    • (2000) Proc. Speech Transcription Workshop , vol.27
    • Evermann, G.1    Woodland, P.2
  • 13
    • 67649518727
    • Sub-word modeling of out of vocabulary words in spoken term detection
    • I. Szoke, L. Burget, J. Cernocky, and M. Fapso, "Sub-word modeling of out of vocabulary words in spoken term detection," Proc. SLT, 2008, pp. 273-276.
    • (2008) Proc. SLT , pp. 273-276
    • Szoke, I.1    Burget, L.2    Cernocky, J.3    Fapso, M.4
  • 14
    • 84890537373
    • A high-performance Cantonese keyword search system
    • B. Kingsbury et al., "A high-performance Cantonese keyword search system," in Proc. ICASSP, 2013, pp. 8277-8281.
    • (2013) Proc. ICASSP , pp. 8277-8281
    • Kingsbury, B.1
  • 15
    • 84910068314
    • Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages
    • S. Rath, K. Knill, A. Ragni, and M. Gales, "Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages," in Proc. Interspeech, 2014, pp. 835-839.
    • (2014) Proc. Interspeech , pp. 835-839
    • Rath, S.1    Knill, K.2    Ragni, A.3    Gales, M.4
  • 16
    • 79251574977
    • The efficient incorporation of MLP features into automatic speech recognition systems
    • J. Park, F. Diehl, M. Gales, M. Tomalin, and P. C. Woodland, "The efficient incorporation of MLP features into automatic speech recognition systems," Computer Speech and Language, vol. 25, no. 3, pp. 519-534, 2010.
    • (2010) Computer Speech and Language , vol.25 , Issue.3 , pp. 519-534
    • Park, J.1    Diehl, F.2    Gales, M.3    Tomalin, M.4    Woodland, P.C.5
  • 20
    • 0034825241
    • Multi-stream adaptive evidence combination for noise robust ASR
    • A. Morris, A. Hagen, H. Glotin, and H. Bourlard, "Multi-stream adaptive evidence combination for noise robust ASR," Speech Communication, vol. 34, no. 1, pp. 25-40, 2001.
    • (2001) Speech Communication , vol.34 , Issue.1 , pp. 25-40
    • Morris, A.1    Hagen, A.2    Glotin, H.3    Bourlard, H.4
  • 21
    • 0141676589
    • New entropy based combination rules in HMM/ANN multi-stream ASR
    • H. Misra, H. Bourlard, and V. Tyagi, "New entropy based combination rules in HMM/ANN multi-stream ASR," in Proc. ICASSP, 2003, pp. 738-741.
    • (2003) Proc. ICASSP , pp. 738-741
    • Misra, H.1    Bourlard, H.2    Tyagi, V.3
  • 23
    • 0028204660
    • Combining TDNN and HMM in a hybrid system for improved continuous-speech recognition
    • C. Dugast, L. Devillers, and X. Aubert, "Combining TDNN and HMM in a hybrid system for improved continuous-speech recognition," IEEE Trans. Speech and Audio Processing, vol. 2, no. 1, pp. 217-223, 1994.
    • (1994) IEEE Trans. Speech and Audio Processing , vol.2 , Issue.1 , pp. 217-223
    • Dugast, C.1    Devillers, L.2    Aubert, X.3
  • 24
    • 84890492591
    • Revisiting hybrid and GMM-HMM system combination techniques
    • P. Swietojanski, A. Ghoshal, and S. Renals, "Revisiting hybrid and GMM-HMM system combination techniques," in Proc. ICASSP, 2013, pp. 6744-6748.
    • (2013) Proc. ICASSP , pp. 6744-6748
    • Swietojanski, P.1    Ghoshal, A.2    Renals, S.3
  • 25
    • 80053417853
    • Joint optimization for machine translation system combination
    • X. He and K. Toutanova, "Joint optimization for machine translation system combination," in Proc. EMNLP, 2009, pp. 1202-1211.
    • (2009) Proc. EMNLP , pp. 1202-1211
    • He, X.1    Toutanova, K.2
  • 26
    • 84905265980
    • Joint training of convolutional and non-convolutional neural networks
    • H. Soltau, G. Saon, and T. Sainath, "Joint training of convolutional and non-convolutional neural networks," Proc. ICASSP, 2014.
    • (2014) Proc. ICASSP
    • Soltau, H.1    Saon, G.2    Sainath, T.3
  • 27
    • 84976253431
    • Results of the 2006 spoken term detection evaluation
    • J. Fiscus, J. Ajot, J. Garofolo, and G. Doddington, "Results of the 2006 Spoken Term Detection Evaluation," in Proc. SIGIR, 2007, pp. 51-57.
    • (2007) Proc. SIGIR , pp. 51-57
    • Fiscus, J.1    Ajot, J.2    Garofolo, J.3    Doddington, G.4
  • 29
    • 84959142742
    • A general artificial neural network extension for HTK
    • C. Zhang and P. Woodland, "A general artificial neural network extension for HTK," in Submission to InterSpeech, 2015.
    • (2015) Submission to InterSpeech
    • Zhang, C.1    Woodland, P.2
  • 30
    • 84946055405
    • Unicode-based graphemic systems for limited resource languages
    • M. Gales, K. Knill, and A. Ragni, "Unicode-based graphemic systems for limited resource languages," in Proc. ICASSP, 2015.
    • (2015) Proc. ICASSP
    • Gales, M.1    Knill, K.2    Ragni, A.3
  • 31
    • 0036460908
    • Lightly supervised and unsupervised acoustic model training
    • L. Lamel and J.-L. Gauvain, "Lightly supervised and unsupervised acoustic model training," Computer Speech and Language, vol. 16, pp. 115-129, 2002.
    • (2002) Computer Speech and Language , vol.16 , pp. 115-129
    • Lamel, L.1    Gauvain, J.-L.2
  • 32
    • 84890474716
    • Deep neural network features and semi-supervised training for low resource speech recognition
    • S. Thomas, M. L. Seltzer, K. Church, and H. Hermansky, "Deep neural network features and semi-supervised training for low resource speech recognition," in Proc. ICASSP, 2013, pp. 6704-6708.
    • (2013) Proc. ICASSP , pp. 6704-6708
    • Thomas, S.1    Seltzer, M.L.2    Church, K.3    Hermansky, H.4
  • 33
    • 84893705111
    • Discriminative semi-supervised training for keyword search in low resource languages
    • R. Hsiao, T. Ng, F. Grézl, D. Karakos, S. Tsakalidis, L. Nguyen, and R. Schwartz, "Discriminative semi-supervised training for keyword search in low resource languages," in Proc. ASRU, 2013, pp. 440-445.
    • (2013) Proc. ASRU , pp. 440-445
    • Hsiao, R.1    Ng, T.2    Grézl, F.3    Karakos, D.4    Tsakalidis, S.5    Nguyen, L.6    Schwartz, R.7
  • 34
    • 84890474441
    • Investigation on cross- and multilingual MLP features under matched and mismatched acoustical conditions
    • Z. Tuske, J. Pinto, D. Willett, and R. Schluter, "Investigation on cross- and multilingual MLP features under matched and mismatched acoustical conditions," in Proc. ICASSP, 2013, pp. 6970-6974.
    • (2013) Proc. ICASSP , pp. 6970-6974
    • Tuske, Z.1    Pinto, J.2    Willett, D.3    Schluter, R.4
  • 35
    • 84905215475
    • Multilingual MRASTA features for low-resource keyword search and speech recognition systems
    • Z. Tuske, D. Nolden, R. Schluter, and H. Ney, "Multilingual MRASTA features for low-resource keyword search and speech recognition systems," in Proc. ICASSP, 2014, pp. 7854-7858.
    • (2014) Proc. ICASSP , pp. 7854-7858
    • Tuske, Z.1    Nolden, D.2    Schluter, R.3    Ney, H.4
  • 36
    • 84858953642
    • The Kaldi speech recognition toolkit
    • D. Povey et al., "The Kaldi speech recognition toolkit," in Proc. ASRU, 2011.
    • (2011) Proc. ASRU
    • Povey, D.1
  • 37
    • 0032638856
    • Semi-tied covariance matrices for hidden Markov models
    • M. Gales, "Semi-tied covariance matrices for hidden Markov models," Speech and Audio Processing, IEEE Transactions on, vol. 7, no. 3, pp. 272-281, 1999.
    • (1999) Speech and Audio Processing, IEEE Transactions on , vol.7 , Issue.3 , pp. 272-281
    • Gales, M.1
  • 39
    • 0036296863
    • Minimum Phone Error and I-smoothing for improved discriminative training
    • D. Povey and P. C. Woodland, "Minimum Phone Error and I-smoothing for improved discriminative training," in Proc. ICASSP, 2002, pp. 101-105.
    • (2002) Proc. ICASSP , pp. 101-105
    • Povey, D.1    Woodland, P.C.2
  • 41
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer speech & language, vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Computer Speech & Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.1
  • 42
    • 84906274730
    • Sequence-discriminative training of deep neural networks
    • K. Vesely, A. Ghoshal, L. Burget, and D. Povey, "Sequence-discriminative training of deep neural networks," in Proc. Interspeech, 2013, pp. 2345-2349.
    • (2013) Proc. Interspeech , pp. 2345-2349
    • Vesely, K.1    Ghoshal, A.2    Burget, L.3    Povey, D.4
  • 43
    • 33745219793
    • General indexation of weighted automata - application to spoken utterance retrieval
    • M. Mohri, C. Allauzen, and M. Saraclar, "General indexation of weighted automata - application to spoken utterance retrieval," Proc. HLT/NAACL, 2004, pp. 33-40.
    • (2004) Proc. HLT/NAACL , pp. 33-40
    • Mohri, M.1    Allauzen, C.2    Saraclar, M.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.