



Volume 21, Issue 4, 2013, Pages 697-710

Deep belief networks based voice activity detection

Author keywords

Deep learning; information fusion; voice activity detection

Indexed keywords

ACOUSTIC FEATURES; DEEP BELIEF NETWORKS; DEEP LEARNING; EMPIRICAL COMPARISON; GENERATIVE MODEL; HIDDEN LAYERS; INPUT LAYERS; LINEAR CLASSIFIERS; MACHINE-LEARNING; MULTIPLE FEATURE FUSION; MULTIPLE FEATURES; PERFORMANCE ANALYSIS; REAL-TIME DETECTION; VOICE ACTIVITY DETECTION;

EID: 84872300403     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2229986     Document Type: Article
Times cited : (319)

References (74)
  • 1
    • 0031238211
    • ITU-T recommendation G.729 annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications
    • A. Benyassine, E. Shlomot, H. Y. Su, D. Massaloux, C. Lamblin, and J. P. Petit, "ITU-T recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications," IEEE Commun. Mag., vol. 35, no. 9, pp. 64-73, Sep. 1997. (Pubitemid 127557050)
    • (1997) IEEE Communications Magazine , vol.35 , Issue.9 , pp. 64-73
    • Benyassine, A.1    Shlomot, E.2    Su, H.-Y.3    Massaloux, D.4    Lamblin, C.5    Petit, J.-P.6
  • 2
    • 77957272576
    • Speech processing, transmission and quality aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms
    • "Speech processing, transmission and quality aspects (STQ); distributed speech recognition; advanced front-end feature extraction algorithm; compression algorithms," ETSI ES, vol. 202, no. 050.
    • ETSI ES , vol.202 , Issue.50
  • 3
    • 84869416544
    • Towards generalizing classification based speech separation
    • Jan.
    • K. Han and D. L. Wang, "Towards generalizing classification based speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 1, pp. 1-27, Jan. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.21 , Issue.1 , pp. 1-27
    • Han, K.1    Wang, D.L.2
  • 4
    • 79959840616
    • Investigation of full-sequence training of deep belief networks for speech recognition
    • A. Mohamed, D. Yu, and L. Deng, "Investigation of full-sequence training of deep belief networks for speech recognition," in Proc. Interspeech-10, 2010, pp. 2846-2849.
    • (2010) Proc. Interspeech-10 , pp. 2846-2849
    • Mohamed, A.1    Yu, D.2    Deng, L.3
  • 5
    • 79959828814
    • Deep-structured hidden conditional random fields for phonetic recognition
    • D. Yu and L. Deng, "Deep-structured hidden conditional random fields for phonetic recognition," in Proc. Interspeech-10, 2010, pp. 2986-2989.
    • (2010) Proc. Interspeech-10 , pp. 2986-2989
    • Yu, D.1    Deng, L.2
  • 7
    • 84055222005
    • Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
    • G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 8
    • 84867131826
    • Conversational speech transcription using context-dependent deep neural networks
    • D. Yu, F. Seide, and G. Li, "Conversational speech transcription using context-dependent deep neural networks," in Proc. 29th Int. Conf. Mach. Learn., 2012, pp. 1-2.
    • (2012) Proc. 29th Int. Conf. Mach. Learn , pp. 1-2
    • Yu, D.1    Seide, F.2    Li, G.3
  • 9
    • 84867732862
    • Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
    • Nov.
    • G. Hinton et al., "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Process. Mag., vol. 11, no. 3, pp. 229-241, Nov. 2012.
    • (2012) IEEE Signal Process. Mag , vol.11 , Issue.3 , pp. 229-241
    • Hinton, G.1
  • 11
    • 67650137747
    • Discriminative weight training for a statistical model-based voice activity detection
    • S. I. Kang, Q. H. Jo, and J. H. Chang, "Discriminative weight training for a statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 15, pp. 170-173, 2008.
    • (2008) IEEE Signal Process. Lett , vol.15 , pp. 170-173
    • Kang, S.I.1    Jo, Q.H.2    Chang, J.H.3
  • 12
    • 65549106422
    • Statistical model-based voice activity detection using support vector machine
    • Q. H. Jo, J. H. Chang, J. W. Shin, and N. S. Kim, "Statistical model-based voice activity detection using support vector machine," IET Signal Process., vol. 3, no. 3, pp. 205-210, 2009.
    • (2009) IET Signal Process , vol.3 , Issue.3 , pp. 205-210
    • Jo, Q.H.1    Chang, J.H.2    Shin, J.W.3    Kim, N.S.4
  • 13
    • 77950091897
    • Voice activity detection based on statistical models and machine learning approaches
    • J. W. Shin, J. H. Chang, and N. S. Kim, "Voice activity detection based on statistical models and machine learning approaches," Comput. Speech Lang., vol. 24, no. 3, pp. 515-530, 2010.
    • (2010) Comput. Speech Lang , vol.24 , Issue.3 , pp. 515-530
    • Shin, J.W.1    Chang, J.H.2    Kim, N.S.3
  • 14
    • 77956289831
    • Discriminative training for multiple observation likelihood ratio based voice activity detection
    • T. Yu and J. H. L. Hansen, "Discriminative training for multiple observation likelihood ratio based voice activity detection," IEEE Signal Process. Lett., vol. 17, no. 11, pp. 897-900, 2010.
    • (2010) IEEE Signal Process. Lett , vol.17 , Issue.11 , pp. 897-900
    • Yu, T.1    Hansen, J.H.L.2
  • 15
    • 79952611095
    • Maximum margin clustering based statistical VAD with multiple observation compound feature
    • J. Wu and X. L. Zhang, "Maximum margin clustering based statistical VAD with multiple observation compound feature," IEEE Signal Process. Lett., vol. 18, no. 5, pp. 283-286, 2011.
    • (2011) IEEE Signal Process. Lett , vol.18 , Issue.5 , pp. 283-286
    • Wu, J.1    Zhang, X.L.2
  • 16
    • 79959756010
    • Efficient multiple kernel support vector machine based voice activity detection
    • J. Wu and X. L. Zhang, "Efficient multiple kernel support vector machine based voice activity detection," IEEE Signal Process. Lett., vol. 18, no. 8, pp. 466-499, 2011.
    • (2011) IEEE Signal Process. Lett , vol.18 , Issue.8 , pp. 466-499
    • Wu, J.1    Zhang, X.L.2
  • 17
    • 84869505051
    • Linearithmic time sparse and convex maximum margin clustering
    • Dec.
    • X. L. Zhang and J. Wu, "Linearithmic time sparse and convex maximum margin clustering," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 42, no. 6, pp. 1669-1692, Dec. 2012.
    • (2012) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.42 , Issue.6 , pp. 1669-1692
    • Zhang, X.L.1    Wu, J.2
  • 18
    • 85008579584
    • Multiple acoustic model-based discriminative likelihood ratio weighting for voice activity detection
    • Y. Suh and H. Kim, "Multiple acoustic model-based discriminative likelihood ratio weighting for voice activity detection," IEEE Signal Process. Lett., vol. 19, no. 8, pp. 507-510, 2012.
    • (2012) IEEE Signal Process. Lett , vol.19 , Issue.8 , pp. 507-510
    • Suh, Y.1    Kim, H.2
  • 19
    • 33750216968
    • SVM-based speech endpoint detection using contextual speech features
    • J. Ramírez, P. Yélamos, J. M. Górriz, and J. C. Segura, "SVM-based speech endpoint detection using contextual speech features," Electron. Lett., vol. 42, no. 7, pp. 426-428, 2006.
    • (2006) Electron. Lett , vol.42 , Issue.7 , pp. 426-428
    • Ramírez, J.1    Yélamos, P.2    Górriz, J.M.3    Segura, J.C.4
  • 20
    • 78649271854
    • Online unsupervised classification with model comparison in the variational bayes framework for voice activity detection
    • Dec.
    • D. Cournapeau, S. Watanabe, A. Nakamura, and T. Kawahara, "Online unsupervised classification with model comparison in the variational bayes framework for voice activity detection," IEEE J. Sel. Topics Signal Process., vol. 4, no. 6, pp. 1071-1083, Dec. 2010.
    • (2010) IEEE J. Sel. Topics Signal Process , vol.4 , Issue.6 , pp. 1071-1083
    • Cournapeau, D.1    Watanabe, S.2    Nakamura, A.3    Kawahara, T.4
  • 21
    • 80053614636
    • Voice activity detection based on an unsupervised learning framework
    • Nov.
    • D. Ying, Y. Yan, J. Dang, and F. Soong, "Voice activity detection based on an unsupervised learning framework," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2624-2644, Nov. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.8 , pp. 2624-2644
    • Ying, D.1    Yan, Y.2    Dang, J.3    Soong, F.4
  • 22
    • 0032762471
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, 1999.
    • (1999) IEEE Signal Process. Lett , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 23
    • 0042863279
    • A soft voice activity detector based on a Laplacian-Gaussian model
    • Sep
    • S. Gazor and W. Zhang, "A soft voice activity detector based on a Laplacian-Gaussian model," IEEE Trans. Speech, Audio Process., vol. 11, no. 5, pp. 498-505, Sep. 2003.
    • (2003) IEEE Trans. Speech, Audio Process , vol.11 , Issue.5 , pp. 498-505
    • Gazor, S.1    Zhang, W.2
  • 24
    • 1842476689
    • Efficient voice activity detection algorithms using long-term speech information
    • J. Ramírez, J. C. Segura, C. Benitez, A. D. L. Torre, and A. Rubio, "Efficient voice activity detection algorithms using long-term speech information," Speech Commun., vol. 42, no. 3-4, pp. 271-287, 2004.
    • (2004) Speech Commun , vol.42 , Issue.3-4 , pp. 271-287
    • Ramírez, J.1    Segura, J.C.2    Benitez, C.3    Torre, A.D.L.4    Rubio, A.5
  • 25
    • 23344452899
    • Statistical voice activity detection using a multiple observation likelihood ratio test
    • DOI 10.1109/LSP.2005.855551
    • J. Ramírez, J. C. Segura, C. Benítez, L. García, and A. Rubio, "Statistical voice activity detection using a multiple observation likelihood ratio test," IEEE Signal Process. Lett., vol. 12, no. 10, pp. 689-692, Oct. 2005. (Pubitemid 41448576)
    • (2005) IEEE Signal Processing Letters , vol.12 , Issue.10 , pp. 689-692
    • Ramirez, J.1    Segura, J.C.2    Benitez, C.3    Garcia, L.4    Rubio, A.5
  • 26
    • 33744532633
    • Voice activity detection based on multiple statistical models
    • DOI 10.1109/TSP.2006.874403
    • J. H. Chang, N. S. Kim, and S. K. Mitra, "Voice activity detection based on multiple statistical models," IEEE Trans. Signal Process., vol. 54, no. 6, pp. 1965-1976, Jun. 2006. (Pubitemid 43811393)
    • (2006) IEEE Transactions on Signal Processing , vol.54 , Issue.6 , pp. 1965-1976
    • Chang, J.-H.1    Kim, N.S.2    Mitra, S.K.3
  • 27
    • 64149119904
    • Improved voice activity detection using contextual multiple hypothesis testing for robust speech recognition
    • Nov
    • J. Ramírez, J. Segura, J. Górriz, and L. García, "Improved voice activity detection using contextual multiple hypothesis testing for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2177-2189, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2177-2189
    • Ramírez, J.1    Segura, J.2    Górriz, J.3    García, L.4
  • 28
    • 44149083948
    • A soft voice activity detection using GARCH filter and variance Gamma distribution
    • May
    • R. Tahmasbi and S. Rezaei, "A soft voice activity detection using GARCH filter and variance Gamma distribution," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1129-1134, May 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.4 , pp. 1129-1134
    • Tahmasbi, R.1    Rezaei, S.2
  • 30
    • 33745805403
    • A fast learning algorithm for deep belief nets
    • DOI 10.1162/neco.2006.18.7.1527
    • G. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets," Neural Comput., vol. 18, no. 7, pp. 1527-1554, 2006. (Pubitemid 44024729)
    • (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.-W.3
  • 31
    • 33746600649
    • Reducing the dimensionality of data with neural networks
    • DOI 10.1126/science.1127647
    • G. Hinton and R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006. (Pubitemid 44148451)
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 32
    • 69349090197
    • Learning deep architectures for AI
    • Y. Bengio, "Learning deep architectures for AI," Foundat. Trends® in Mach. Learn., vol. 2, no. 1, pp. 1-127, 2009.
    • (2009) Foundat. Trends® in Mach. Learn , vol.2 , Issue.1 , pp. 1-127
    • Bengio, Y.1
  • 35
    • 84861125212
    • A practical guide to training restricted Boltzmann machines
    • G. Hinton, "A practical guide to training restricted Boltzmann machines," Momentum, vol. 9, pp. 1-19, 2010.
    • Momentum , vol.9 , Issue.2010 , pp. 1-19
    • Hinton, G.1
  • 36
    • 79959858900
    • Learning in the deep-structured conditional random fields
    • D. Yu, L. Deng, and S. Wang, "Learning in the deep-structured conditional random fields," in Proc. NIPS Workshop, 2009, pp. 1-8.
    • (2009) Proc. NIPS Workshop , pp. 1-8
    • Yu, D.1    Deng, L.2    Wang, S.3
  • 37
    • 85032782045
    • Deep learning and its applications to signal and information processing [exploratory dsp]
    • Jan.
    • D. Yu and L. Deng, "Deep learning and its applications to signal and information processing [exploratory dsp]," IEEE Signal Process. Mag., vol. 28, no. 1, pp. 145-154, Jan. 2011.
    • (2011) IEEE Signal Process. Mag. , vol.28 , Issue.1 , pp. 145-154
    • Yu, D.1    Deng, L.2
  • 38
    • 56449095373
    • A unified architecture for natural language processing: Deep neural networks with multitask learning
    • R. Collobert and J. Weston, "A unified architecture for natural language processing: Deep neural networks with multitask learning," in Proc. 25th Int. Conf. Mach. Learn., 2008, pp. 160-167.
    • (2008) Proc. 25th Int. Conf. Mach. Learn , pp. 160-167
    • Collobert, R.1    Weston, J.2
  • 39
    • 80052067786
    • Reverberant speech segregation based on multipitch tracking and classification
    • Nov.
    • Z. Jin and D. L. Wang, "Reverberant speech segregation based on multipitch tracking and classification," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2328-2337, Nov. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.8 , pp. 2328-2337
    • Jin, Z.1    Wang, D.L.2
  • 40
    • 84863281307
    • A tandem algorithm for singing pitch extraction and voice separation from music accompaniment
    • Jul.
    • C. L. Hsu, D. L. Wang, J. S. R. Jang, and K. Hu, "A tandem algorithm for singing pitch extraction and voice separation from music accompaniment," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 5, pp. 1482-1491, Jul. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.5 , pp. 1482-1491
    • Hsu, C.L.1    Wang, D.L.2    Jang, J.S.R.3    Hu, K.4
  • 41
    • 84870477511
    • Exploring monaural features for classification-based speech segregation
    • Jan.
    • Y. X. Wang, K. Han, and D. L. Wang, "Exploring monaural features for classification-based speech segregation," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 2, pp. 270-279, Jan. 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process , vol.21 , Issue.2 , pp. 270-279
    • Wang, Y.X.1    Han, K.2    Wang, D.L.3
  • 42
    • 79959823361
    • A new VAD framework using statistical model and human knowledge based empirical rule
    • J. Wu, X. L. Zhang, and W. Li, "A new VAD framework using statistical model and human knowledge based empirical rule," in Proc. Interspeech-10, 2010, pp. 3090-3093.
    • (2010) Proc. Interspeech-10 , pp. 3090-3093
    • Wu, J.1    Zhang, X.L.2    Li, W.3
  • 43
    • 84869496026
    • An efficient voice activity detection algorithm by combining statistical model and energy detection
    • J. Wu and X. L. Zhang, "An efficient voice activity detection algorithm by combining statistical model and energy detection," EURASIP J. Adv. Signal Process., vol. 2011, no. 1, pp. 18-27, 2011.
    • (2011) EURASIP J. Adv. Signal Process , vol.2011 , Issue.1 , pp. 18-27
    • Wu, J.1    Zhang, X.L.2
  • 45
    • 84987702417
    • The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • D. Pearce et al., "The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions," Proc. ICSLP-00, vol. 4, pp. 29-32, 2000.
    • (2000) Proc. ICSLP-00 , vol.4 , pp. 29-32
    • Pearce, D.1
  • 47
    • 0038712550
    • SNR estimation based on amplitude modulation analysis with applications to noise suppression
    • May
    • J. Tchorz and B. Kollmeier, "SNR estimation based on amplitude modulation analysis with applications to noise suppression," IEEE Trans. Speech, Audio Process., vol. 11, no. 3, pp. 184-192, May 2003.
    • (2003) IEEE Trans. Speech, Audio Process , vol.11 , Issue.3 , pp. 184-192
    • Tchorz, J.1    Kollmeier, B.2
  • 48
    • 70349093614
    • An algorithm that improves speech intelligibility in noise for normal-hearing listeners
    • G. Kim, Y. Lu, Y. Hu, and P. C. Loizou, "An algorithm that improves speech intelligibility in noise for normal-hearing listeners," J. Acoust. Soc. Amer., vol. 126, pp. 1486-1494, 2009.
    • (2009) J. Acoust. Soc. Amer , vol.126 , pp. 1486-1494
    • Kim, G.1    Lu, Y.2    Hu, Y.3    Loizou, P.C.4
  • 49
    • 0037767686
    • A multipitch tracking algorithm for noisy speech
    • May
    • M. Wu, D. L. Wang, and G. J. Brown, "A multipitch tracking algorithm for noisy speech," IEEE Trans. Speech, Audio Process., vol. 11, no. 3, pp. 229-241, May 2003.
    • (2003) IEEE Trans. Speech, Audio Process , vol.11 , Issue.3 , pp. 229-241
    • Wu, M.1    Wang, D.L.2    Brown, G.J.3
  • 50
    • 4644265990
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • Sep
    • G. Hu and D. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
    • (2004) IEEE Trans. Neural Netw , vol.15 , Issue.5 , pp. 1135-1150
    • Hu, G.1    Wang, D.2
  • 51
    • 65249103478
    • A supervised learning approach to monaural segregation of reverberant speech
    • May
    • Z. Jin and D. L. Wang, "A supervised learning approach to monaural segregation of reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 4, pp. 625-638, May 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.4 , pp. 625-638
    • Jin, Z.1    Wang, D.L.2
  • 52
    • 77955695149
    • A tandem algorithm for pitch estimation and voiced speech segregation
    • Nov.
    • G. Hu and D. L. Wang, "A tandem algorithm for pitch estimation and voiced speech segregation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 2067-2079, Nov. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process , vol.18 , Issue.8 , pp. 2067-2079
    • Hu, G.1    Wang, D.L.2
  • 53
    • 85008056718
    • HMM-based multipitch tracking for noisy and reverberant speech
    • Jul.
    • Z. Jin and D. L. Wang, "HMM-based multipitch tracking for noisy and reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 5, pp. 1091-1102, Jul. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.5 , pp. 1091-1102
    • Jin, Z.1    Wang, D.L.2
  • 54
    • 84863281307
    • A tandem algorithm for singing pitch extraction and voice separation from music accompaniment
    • Jul.
    • C. L. Hsu, D. L. Wang, J. S. R. Jang, and K. Hu, "A tandem algorithm for singing pitch extraction and voice separation from music accompaniment," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 5, pp. 1482-1491, Jul. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.5 , pp. 1482-1491
    • Hsu, C.L.1    Wang, D.L.2    Jang, J.S.R.3    Hu, K.4
  • 55
    • 0036299273
    • Pitch determination and voice quality analysis using subharmonic-to-harmonic ratio
    • X. Sun, "Pitch determination and voice quality analysis using subharmonic-to-harmonic ratio," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2002, vol. 1, pp. 333-336.
    • (2002) Proc. Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 333-336
    • Sun, X.1
  • 56
    • 84872355668
    • Enhanced variable rate codec, speech service option 3 for wideband spectrum digital systems
    • 3GPP2 C.S0014-A
    • "Enhanced variable rate codec, speech service option 3 for wideband spectrum digital systems," TIA/EIA/IS-127, 2004, 3GPP2 C.S0014-A.
    • (2004) TIA/EIA/IS-127
  • 58
    • 68949154453
    • Sparse kernel SVMs via cutting-plane training
    • T. Joachims and C. N. J. Yu, "Sparse kernel SVMs via cutting-plane training," Mach. Learn., vol. 76, no. 2, pp. 179-193, 2009.
    • (2009) Mach. Learn , vol.76 , Issue.2 , pp. 179-193
    • Joachims, T.1    Yu, C.N.J.2
  • 60
    • 84872343315
    • Deep learning of representations for unsupervised and transfer learning
    • Y. Bengio, "Deep learning of representations for unsupervised and transfer learning," in Proc. ICML Workshop Unsupervised Transfer Learn., 2011, vol. 7, pp. 1-20.
    • (2011) Proc. ICML Workshop Unsupervised Transfer Learn. , vol.7 , pp. 1-20
    • Bengio, Y.1
  • 63
    • 0029206489
    • Locally excitatory globally inhibitory oscillator networks
    • Jan
    • D. L. Wang and D. Terman, "Locally excitatory globally inhibitory oscillator networks," IEEE Trans. Neural Netw., vol. 6, no. 1, pp. 283-286, Jan. 1995.
    • (1995) IEEE Trans. Neural Netw , vol.6 , Issue.1 , pp. 283-286
    • Wang, D.L.1    Terman, D.2
  • 64
    • 28244470718
    • The time dimension for scene analysis
    • Nov
    • D. L. Wang, "The time dimension for scene analysis," IEEE Trans. Neural Netw., vol. 16, no. 6, pp. 1401-1426, Nov. 2005.
    • (2005) IEEE Trans. Neural Netw , vol.16 , Issue.6 , pp. 1401-1426
    • Wang, D.L.1
  • 65
    • 85162494200
    • Selecting receptive fields in deep networks
    • A. Coates and A. Y. Ng, "Selecting receptive fields in deep networks," Proc. Adv. Neural Inf. Process. Syst., vol. 24, pp. 2528-2536, 2011.
    • Proc. Adv. Neural Inf. Process. Syst , vol.24 , Issue.2011 , pp. 2528-2536
    • Coates, A.1    Ng, A.Y.2
  • 66
    • 85008054377
    • Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction
    • Aug.
    • K. Hu and D. L. Wang, "Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 6, pp. 1600-1609, Aug. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.6 , pp. 1600-1609
    • Hu, K.1    Wang, D.L.2
  • 67
    • 84867946385
    • An unsupervised approach to cochannel speech separation
    • Jan.
    • K. Hu and D. L. Wang, "An unsupervised approach to cochannel speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 1, pp. 122-131, Jan. 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process , vol.21 , Issue.1 , pp. 122-131
    • Hu, K.1    Wang, D.L.2
  • 69
    • 77956031473
    • A survey on transfer learning
    • Oct.
    • S. J. Pan and Q. Yang, "A survey on transfer learning," IEEE Trans. Knowl. Data Eng., vol. 22, no. 10, pp. 1345-1359, Oct. 2010.
    • (2010) IEEE Trans. Knowl. Data Eng , vol.22 , Issue.10 , pp. 1345-1359
    • Pan, S.J.1    Yang, Q.2
  • 70
    • 85006786586
    • Domain adaptation in machine learning and speech processing
    • F. Sha and B. Kingsbury, "Domain adaptation in machine learning and speech processing," in Tutorial of Interspeech-12, 2012, pp. 1-214.
    • (2012) Tutorial of Interspeech-12 , pp. 1-214
    • Sha, F.1    Kingsbury, B.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.