메뉴 건너뛰기




Volumn 2015-August, Issue , 2015, Pages 5818-5822

Unsupervised neural network based feature extraction using weak top-down constraints

Author keywords

deep neural networks; top down constraints; Unsupervised feature extraction; zero resource speech processing

Indexed keywords

AUDIO SIGNAL PROCESSING; DYNAMIC PROGRAMMING; EXTRACTION; FEATURE EXTRACTION; SPEECH COMMUNICATION; SPEECH PROCESSING;

EID: 84946101387     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7179087     Document Type: Conference Paper
Times cited : (142)

References (26)
  • 1
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Trans. Audio, Speech, Language Process., vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) IEEE Trans. Audio, Speech, Language Process , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 3
    • 84901502980 scopus 로고    scopus 로고
    • Feature learning in deep neural networks-studies on speech recognition
    • D. Yu, M. Seltzer, J. Li, J.-T. Huang, and F. Seide, "Feature learning in deep neural networks-studies on speech recognition," in Proc. ICLR, 2013.
    • (2013) Proc. ICLR
    • Yu, D.1    Seltzer, M.2    Li, J.3    Huang, J.-T.4    Seide, F.5
  • 4
    • 79959851706 scopus 로고    scopus 로고
    • Towards spoken term discovery at scale with zero resources
    • A. Jansen, K. Church, and H. Hermansky, "Towards spoken term discovery at scale with zero resources," in Proc. Interspeech, 2010.
    • (2010) Proc. Interspeech
    • Jansen, A.1    Church, K.2    Hermansky, H.3
  • 5
    • 84867809023 scopus 로고    scopus 로고
    • A nonparametric Bayesian approach to acoustic model discovery
    • C. Lee and J. R. Glass, "A nonparametric Bayesian approach to acoustic model discovery," in Proc. ACL, 2012.
    • (2012) Proc. ACL
    • Lee, C.1    Glass, J.R.2
  • 6
    • 70450158585 scopus 로고    scopus 로고
    • Unsupervised training of an HMM-based speech recognizer for topic classification
    • H. Gish, M.-H. Siu, A. Chan, and B. Belfield, "Unsupervised training of an HMM-based speech recognizer for topic classification," in Proc. Interspeech, 2009.
    • (2009) Proc. Interspeech
    • Gish, H.1    Siu, M.-H.2    Chan, A.3    Belfield, B.4
  • 7
    • 77949473673 scopus 로고    scopus 로고
    • Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams
    • Y. Zhang and J. R. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams," in Proc. ASRU, 2009.
    • (2009) Proc. ASRU
    • Zhang, Y.1    Glass, J.R.2
  • 10
    • 84905227103 scopus 로고    scopus 로고
    • An autoencoder based approach to unsupervised learning of subword units
    • L. Badino, C. Canevari, L. Fadiga, and G. Metta, "An autoencoder based approach to unsupervised learning of subword units," in Proc. ICASSP, 2014.
    • (2014) Proc. ICASSP
    • Badino, L.1    Canevari, C.2    Fadiga, L.3    Metta, G.4
  • 11
    • 33947643715 scopus 로고    scopus 로고
    • Unsupervised word acquisition from speech using pattern discovery
    • A. Park and J. R. Glass, "Unsupervised word acquisition from speech using pattern discovery," in Proc. ICASSP, 2006.
    • (2006) Proc. ICASSP
    • Park, A.1    Glass, J.R.2
  • 12
    • 84858987768 scopus 로고    scopus 로고
    • Efficient spoken term discovery using randomized algorithms
    • A. Jansen and B. Van Durme, "Efficient spoken term discovery using randomized algorithms," in Proc. ASRU, 2011.
    • (2011) Proc. ASRU
    • Jansen, A.1    Van Durme, B.2
  • 13
    • 84893673786 scopus 로고    scopus 로고
    • A hierarchical system for word discovery exploiting DTW-based initialization
    • O. Walter, T. Korthals, R. Haeb-Umbach, and B. Raj, "A hierarchical system for word discovery exploiting DTW-based initialization," in Proc. ASRU, 2013.
    • (2013) Proc. ASRU
    • Walter, O.1    Korthals, T.2    Haeb-Umbach, R.3    Raj, B.4
  • 14
    • 84865770260 scopus 로고    scopus 로고
    • Towards unsupervised training of speaker independent acoustic models
    • A. Jansen and K. Church, "Towards unsupervised training of speaker independent acoustic models," in Proc. Interspeech, 2011.
    • (2011) Proc. Interspeech
    • Jansen, A.1    Church, K.2
  • 15
    • 84890467020 scopus 로고    scopus 로고
    • Weak top-down constraints for unsupervised acoustic model training
    • A. Jansen, S. Thomas, and H. Hermansky, "Weak top-down constraints for unsupervised acoustic model training," in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Jansen, A.1    Thomas, S.2    Hermansky, H.3
  • 16
    • 0026400245 scopus 로고
    • An investigation of PLP and IMELDA acoustic representations and of their potential for combination
    • M. Hunt, S. M. Richardson, D. C. Bateman, and A. Piau, "An investigation of PLP and IMELDA acoustic representations and of their potential for combination," in Proc. ICASSP, 1991.
    • (1991) Proc. ICASSP
    • Hunt, M.1    Richardson, S.M.2    Bateman, D.C.3    Piau, A.4
  • 17
    • 84946685733 scopus 로고    scopus 로고
    • Phonetics embedding learning with side information
    • G. Synnaeve1, T. Schatz, and E. Dupoux, "Phonetics embedding learning with side information," in Proc. SLT, 2014.
    • (2014) Proc. SLT
    • Synnaevel, G.1    Schatz, T.2    Dupoux, E.3
  • 18
    • 69349090197 scopus 로고    scopus 로고
    • Learning deep architectures for AI
    • Y. Bengio, "Learning deep architectures for AI," Found. Trends Mach. Learning, vol. 2, no. 1, pp. 1-127, 2009.
    • (2009) Found. Trends Mach. Learning , vol.2 , Issue.1 , pp. 1-127
    • Bengio, Y.1
  • 19
    • 33746600649 scopus 로고    scopus 로고
    • Reducing the dimensionality of data with neural networks
    • G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 21
    • 0017930815 scopus 로고
    • Dynamic programming algorithm optimization for spoken word recognition
    • H. Sakoe and S. Chiba, "Dynamic programming algorithm optimization for spoken word recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. 26, no. 1, pp. 43-49, 1978.
    • (1978) IEEE Trans. Acoust., Speech, Signal Process , vol.26 , Issue.1 , pp. 43-49
    • Sakoe, H.1    Chiba, S.2
  • 22
    • 56449089103 scopus 로고    scopus 로고
    • Extracting and composing robust features with denoising autoencoders
    • P. Vincent, H. Larochelle, Y. Bengio, and P.-A. Manzagol, "Extracting and composing robust features with denoising autoencoders," in Proc. ICML, 2008.
    • (2008) Proc. ICML
    • Vincent, P.1    Larochelle, H.2    Bengio, Y.3    Manzagol, P.-A.4
  • 25
    • 84905240834 scopus 로고    scopus 로고
    • Recurrent deep neural networks for robust speech recognition
    • C. Weng, D. Yu, S. Watanabe, and B.-H. Juang, "Recurrent deep neural networks for robust speech recognition," in Proc. ICASSP, 2014.
    • (2014) Proc. ICASSP
    • Weng, C.1    Yu, D.2    Watanabe, S.3    Juang, B.-H.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.