메뉴 건너뛰기




Volumn , Issue , 2012, Pages 324-329

Transcription of multi-genre media archives using out-of-domain data

Author keywords

cross domain adaptation; media archives; speech recognition; tandem

Indexed keywords

ADAPTIVE NETWORKS; CROSS-DOMAIN; DEEP NEURAL NETWORKS; MEDIA ARCHIVES; NOVEL TECHNIQUES; POSTERIOR FEATURES; SPEECH RECOGNITION SYSTEMS; SUBSTANTIAL REDUCTION; TANDEM;

EID: 84874245054     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SLT.2012.6424244     Document Type: Conference Paper
Times cited : (36)

References (22)
  • 1
    • 70450190034 scopus 로고    scopus 로고
    • PodCastle: Collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription
    • J. Ogata and M. Goto, "PodCastle: Collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription," in Proc. Interspeech, 2009.
    • (2009) Proc. Interspeech
    • Ogata, J.1    Goto, M.2
  • 6
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • H. Hermanksy, D.P.W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. ICASSP, 2000, pp. 1635-1630.
    • (2000) Proc. ICASSP , pp. 1635-1630
    • Hermanksy, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 7
    • 84055222005 scopus 로고    scopus 로고
    • Contextdependent pre-trained deep neural networks for largevocabulary speech recognition
    • G.E. Dahl, D. Yu, L. Deng, and A. Acero, "Contextdependent pre-trained deep neural networks for largevocabulary speech recognition," IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 9
    • 84867593213 scopus 로고    scopus 로고
    • Auto-encoder bottleneck features using deep belief networks
    • T. N. Sainath, B. Kingsbury, and B. Ramabhadran, "Auto-encoder bottleneck features using deep belief networks," in Proc. ICASSP, 2012.
    • (2012) Proc. ICASSP
    • Sainath, T.N.1    Kingsbury, B.2    Ramabhadran, B.3
  • 10
    • 4544236237 scopus 로고    scopus 로고
    • On use of task independent training data in tandem feature extraction
    • S. Sivadas and H. Hermansky, "On use of task independent training data in tandem feature extraction," in Proc. ICASSP, 2004.
    • (2004) Proc. ICASSP
    • Sivadas, S.1    Hermansky, H.2
  • 11
    • 33947619591 scopus 로고    scopus 로고
    • Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons
    • A. Stolcke, F. Gŕezl, M.-Y. Hwang, X Lei, N. Morgan, and D. Vergyri, "Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons," in Proc. ICASSP, 2006.
    • (2006) Proc. ICASSP
    • Stolcke, A.1    Gŕezl, F.2    Hwang, M.-Y.3    Lei, X.4    Morgan, N.5    Vergyri, D.6
  • 12
    • 78049384951 scopus 로고    scopus 로고
    • Multi-style MLP features for BN transcription
    • V.-B. Le, L. Lamel, and J.-L. Gauvain, "Multi-style MLP features for BN transcription," in Proc. ICASSP, 2010, pp. 4866-4869.
    • (2010) Proc. ICASSP , pp. 4866-4869
    • Le, V.-B.1    Lamel, L.2    Gauvain, J.-L.3
  • 13
    • 79959819891 scopus 로고    scopus 로고
    • Crosslingual and multi-stream posterior features for low resource LVCSR systems
    • S. Thomas, S. Ganapathy, and H. Hermansky, "Crosslingual and multi-stream posterior features for low resource LVCSR systems," in Proc. Interspeech, 2010.
    • (2010) Proc. Interspeech
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 14
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • G. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets," Neural Computation, vol. 18, pp. 1527-1554, 2006.
    • (2006) Neural Computation , vol.18 , pp. 1527-1554
    • Hinton, G.1    Osindero, S.2    Teh, Y.3
  • 16
    • 34547530011 scopus 로고    scopus 로고
    • Combining discriminative feature, transform, and model training for large vocabulary speech recognition
    • J. Zheng, O. Cetin, M.-Y. Hwang, X. Lei, A. Stolcke, and N. Morgan, "Combining discriminative feature, transform, and model training for large vocabulary speech recognition," in Proc. ICASSP, 2007.
    • (2007) Proc. ICASSP
    • Zheng, J.1    Cetin, O.2    Hwang, M.-Y.3    Lei, X.4    Stolcke, A.5    Morgan, N.6
  • 17
    • 0036296863 scopus 로고    scopus 로고
    • Minimum phone error and I-smoothing for improved discriminative training
    • D. Povey and P.C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP. IEEE, 2002, vol. I, pp. 105-108.
    • (2002) Proc. ICASSP IEEE , vol.1 , pp. 105-108
    • Povey, D.1    Woodland, P.C.2
  • 18
    • 84878392008 scopus 로고    scopus 로고
    • Data-driven posterior features for low resource speech recognition applications
    • to appear
    • S. Thomas, S. Ganapathy, A. Jansen, and H. Hermansky, "Data-driven posterior features for low resource speech recognition applications," in Proc. Interspeech, 2012, to appear.
    • (2012) Proc. Interspeech
    • Thomas, S.1    Ganapathy, S.2    Jansen, A.3    Hermansky, H.4
  • 19
    • 79959817774 scopus 로고    scopus 로고
    • Lightly supervised recognition for automatic alignment of large coherent speech recordings
    • N. Braunschweiler, M.J.F. Gales, and S. Buchholz, "Lightly supervised recognition for automatic alignment of large coherent speech recordings," in Proc. Interspeech, 2010, pp. 2222-2225.
    • (2010) Proc. Interspeech , pp. 2222-2225
    • Braunschweiler, N.1    Gales, M.J.F.2    Buchholz, S.3
  • 22
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden Markov models
    • May
    • M.J.F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. on Speech and Audio Processing, vol. 7, pp. 272-281, May 1999.
    • (1999) IEEE Trans. on Speech and Audio Processing , vol.7 , pp. 272-281
    • Gales, M.J.F.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.