Volume 2015-August, 2015, Pages 4390-4394

A deep neural network for time-domain signal reconstruction

Author keywords

Deep neural network; speech separation; time domain signal; time frequency masking

Indexed keywords

AUDIO SIGNAL PROCESSING; DEEP NEURAL NETWORKS; FACTORIZATION; FAST FOURIER TRANSFORMS; INVERSE PROBLEMS; SEPARATION; SIGNAL RECONSTRUCTION; SOURCE SEPARATION; SPEECH ANALYSIS; SPEECH COMMUNICATION;

EID: 84946014781     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178800     Document Type: Conference Paper
Times cited: 129

References (20)
  • 1
    • 33845354768
    • Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
    • D. Brungart, P. Chang, B. Simpson, and D.L. Wang, Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation, Journal of the Acoustical Society of America, vol. 120, pp. 4007-4018, 2006
    • (2006) Journal of the Acoustical Society of America, vol. 120, pp. 4007-4018
    • Brungart, D.; Chang, P.; Simpson, B.; Wang, D.L.
  • 2
    • 84892233308
    • On ideal binary mask as the computational goal of auditory scene analysis
    • P. Divenyi, Ed., Kluwer Academic, Norwell, MA
    • D.L. Wang, On ideal binary mask as the computational goal of auditory scene analysis, in Speech Separation by Humans and Machines, P. Divenyi, Ed., Kluwer Academic, Norwell, MA, 2005, pp. 181-197
    • (2005) Speech Separation by Humans and Machines, pp. 181-197
    • Wang, D.L.
  • 3
    • 80052250414
    • Adaptive subgradient methods for online learning and stochastic optimization
    • J. Duchi, E. Hazan, and Y. Singer, Adaptive subgradient methods for online learning and stochastic optimization, Journal of Machine Learning Research, pp. 2121-2159, 2011
    • (2011) Journal of Machine Learning Research, pp. 2121-2159
    • Duchi, J.; Hazan, E.; Singer, Y.
  • 6
    • 44149106061
    • Evaluation of objective quality measures for speech enhancement
    • Y. Hu and P. C. Loizou, Evaluation of objective quality measures for speech enhancement, IEEE Trans. Audio, Speech, Lang. Process., pp. 229-238, 2008
    • (2008) IEEE Trans. Audio, Speech, Lang. Process., pp. 229-238
    • Hu, Y.; Loizou, P.C.
  • 7
    • 0014568991
    • IEEE recommended practice for speech quality measurements
    • IEEE
    • IEEE, IEEE recommended practice for speech quality measurements, IEEE Trans. Audio Electroacoust., vol. 17, pp. 225-246, 1969
    • (1969) IEEE Trans. Audio Electroacoust , vol.17 , pp. 225-246
  • 9
    • 70349093614
    • An algorithm that improves speech intelligibility in noise for normal-hearing listeners
    • G. Kim, Y. Lu, Y. Hu, and P. Loizou, An algorithm that improves speech intelligibility in noise for normal-hearing listeners, Journal of the Acoustical Society of America, pp. 1486-1494, 2009
    • (2009) Journal of the Acoustical Society of America, pp. 1486-1494
    • Kim, G.; Lu, Y.; Hu, Y.; Loizou, P.
  • 10
    • 40749125179
    • Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
    • N. Li and P. Loizou, Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction, Journal of the Acoustical Society of America, vol. 123, no. 3, pp. 1673-1682, 2008
    • (2008) Journal of the Acoustical Society of America, vol. 123, no. 3, pp. 1673-1682
    • Li, N.; Loizou, P.
  • 12
    • 84905252792
    • Joint noise adaptive training for robust automatic speech recognition
    • A. Narayanan and D. Wang, Joint noise adaptive training for robust automatic speech recognition, in Proc. ICASSP, 2014, pp. 2523-2527
    • (2014) Proc. ICASSP, pp. 2523-2527
    • Narayanan, A.; Wang, D.
  • 14
    • 0024876950
    • An analysis of a noise reduction neural network
    • S. Tamura, An analysis of a noise reduction neural network, in Proc. ICASSP, 1989, pp. 2001-2004
    • (1989) Proc. ICASSP, pp. 2001-2004
    • Tamura, S.
  • 15
    • 84886818613
    • Active-set Newton algorithm for overcomplete non-negative representations of audio
    • T. Virtanen, J. Gemmeke, and B. Raj, Active-set Newton algorithm for overcomplete non-negative representations of audio, IEEE Trans. Audio, Speech, Lang. Process., pp. 2277-2289, 2013
    • (2013) IEEE Trans. Audio, Speech, Lang. Process., pp. 2277-2289
    • Virtanen, T.; Gemmeke, J.; Raj, B.
  • 17
    • 84875678689
    • Towards scaling up classification-based speech separation
    • Y. Wang and D.L. Wang, Towards scaling up classification-based speech separation, IEEE Trans. Audio, Speech, Lang. Process., pp. 1381-1390, 2013
    • (2013) IEEE Trans. Audio, Speech, Lang. Process., pp. 1381-1390
    • Wang, Y.; Wang, D.L.
  • 18
    • 84870477511
    • Exploring monaural features for classification-based speech segregation
    • Y. Wang, K. Han, and D.L. Wang, Exploring monaural features for classification-based speech segregation, IEEE Trans. Audio, Speech, Lang. Process., pp. 270-279, 2013
    • (2013) IEEE Trans. Audio, Speech, Lang. Process., pp. 270-279
    • Wang, Y.; Han, K.; Wang, D.L.
  • 20
    • 84889257121
    • An experimental study on speech enhancement based on deep neural networks
    • Y. Xu, J. Du, L. Dai, and C. Lee, An experimental study on speech enhancement based on deep neural networks, IEEE Signal Processing Letters, pp. 66-68, 2014
    • (2014) IEEE Signal Processing Letters, pp. 66-68
    • Xu, Y.; Du, J.; Dai, L.; Lee, C.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.