SCOPUS Information Retrieval Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume 2015-August, 2015, Pages 4390-4394

A deep neural network for time-domain signal reconstruction

(2) Wang, Yuxuan a; Wang, Deliang a,b

a The Ohio State University (United States)

b Ohio State University (United States)

Author keywords

Deep neural network; speech separation; time domain signal; time frequency masking

Indexed keywords

AUDIO SIGNAL PROCESSING; DEEP NEURAL NETWORKS; FACTORIZATION; FAST FOURIER TRANSFORMS; INVERSE PROBLEMS; SEPARATION; SIGNAL RECONSTRUCTION; SOURCE SEPARATION; SPEECH ANALYSIS; SPEECH COMMUNICATION;

INVERSE FAST FOURIER TRANSFORMS; NONNEGATIVE MATRIX FACTORIZATION; OBJECTIVE QUALITIES; SEPARATION SYSTEMS; SPEECH RESYNTHESIS; SPEECH SEPARATION; TIME-DOMAIN SIGNAL; TIME-FREQUENCY MASKING;

SPEECH INTELLIGIBILITY;

EID: 84946014781 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178800 Document Type: Conference Paper

Times cited : (129)

References (20)

1
- 33845354768
- Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
- D. Brungart, P. Chang, B. Simpson, and D.L. Wang, Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation, Journal of the Acoustical Society of America, vol. 120, pp. 4007-4018, 2006
- (2006) Journal of the Acoustical Society of America , vol.120 , pp. 4007-4018
- Brungart, D.¹ Chang, P.² Simpson, B.³ Wang, D.L.⁴

2
- 84892233308
- On ideal binary mask as the computational goal of auditory scene analysis
- Divenyi P., Ed. Kluwer Academic, Norwell MA
- D.L. Wang, On ideal binary mask as the computational goal of auditory scene analysis, in Speech Separation by Humans and Machines, Divenyi P., Ed. Kluwer Academic, Norwell, MA, 2005, pp. 181-197
- (2005) Speech Separation by Humans and Machines , pp. 181-197
- Wang, D.L.¹

3
- 80052250414
- Adaptive subgradient methods for online learning and stochastic optimization
- J. Duchi, E. Hazan, and Y. Singer, Adaptive subgradient methods for online learning and stochastic optimization, Journal of Machine Learning Research, pp. 2121-2159, 2011
- (2011) Journal of Machine Learning Research , pp. 2121-2159
- Duchi, J.¹ Hazan, E.² Singer, Y.³

4
- 84885412715
- An algorithm to improve speech recognition in noise for hearing-impaired listeners
- E. Healy, S. Yoho, Y. Wang, and D.L. Wang, An algorithm to improve speech recognition in noise for hearing-impaired listeners, Journal of the Acoustical Society of America, pp. 3029-3038, 2013
- (2013) Journal of the Acoustical Society of America , pp. 3029-3038
- Healy, E.¹ Yoho, S.² Wang, Y.³ Wang, D.L.⁴

5
- 84867720412
- arXiv preprint arXiv:1207.0580
- G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. R. Salakhutdinov, Improving neural networks by preventing co-adaptation of feature detectors, arXiv preprint arXiv:1207.0580, 2012
- (2012) Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
- Hinton, G.E.¹ Srivastava, N.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.R.⁵

6
- 44149106061
- Evaluation of objective quality measures for speech enhancement
- Y. Hu and P. C. Loizou, Evaluation of objective quality measures for speech enhancement, IEEE Trans. Audio, Speech, Lang. Process., pp. 229-238, 2008
- (2008) IEEE Trans. Audio, Speech, Lang. Process , pp. 229-238
- Hu, Y.¹ Loizou, P.C.²

7
- 0014568991
- IEEE recommended practice for speech quality measurements
- IEEE
- IEEE, IEEE recommended practice for speech quality measurements, IEEE Trans. Audio Electroacoust., vol. 17, pp. 225-246, 1969
- (1969) IEEE Trans. Audio Electroacoust , vol.17 , pp. 225-246

8
- 34248183857
- DARPA TIMIT acoustic-phonetic continuous speech corpus
- J. Garofolo et al., DARPA TIMIT acoustic-phonetic continuous speech corpus, National Inst. of Standards and Technology, 1993
- (1993) National Inst. of Standards and Technology
- Garofolo, J.¹

9
- 70349093614
- An algorithm that improves speech intelligibility in noise for normal-hearing listeners
- G. Kim, Y. Lu, Y. Hu, and P. Loizou, An algorithm that improves speech intelligibility in noise for normal-hearing listeners, Journal of the Acoustical Society of America, pp. 1486-1494, 2009
- (2009) Journal of the Acoustical Society of America , pp. 1486-1494
- Kim, G.¹ Lu, Y.² Hu, Y.³ Loizou, P.⁴

10
- 40749125179
- Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
- N. Li and P. Loizou, Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction, Journal of the Acoustical Society of America, vol. 123, no. 3, pp. 1673-1682, 2008
- (2008) Journal of the Acoustical Society of America , vol.123 , Issue.3 , pp. 1673-1682
- Li, N.¹ Loizou, P.²

11
- 34447100796
- CRC press
- P. C. Loizou, Speech enhancement: theory and practice. CRC press, 2007
- (2007) Speech Enhancement: Theory and Practice
- Loizou, P.C.¹

12
- 84905252792
- Joint noise adaptive training for robust automatic speech recognition
- A. Narayanan and D. Wang, Joint noise adaptive training for robust automatic speech recognition, in Proc. ICASSP, 2014, pp. 2523-2527
- (2014) Proc. ICASSP , pp. 2523-2527
- Narayanan, A.¹ Wang, D.²

13
- 79960916745
- An algorithm for intelligibility prediction of time-frequency weighted noisy speech
- C. Taal, R. Hendriks, R. Heusdens, and J. Jensen, An algorithm for intelligibility prediction of time-frequency weighted noisy speech, IEEE Trans. Audio, Speech, Lang. Process., pp. 2125-2136, 2011
- (2011) IEEE Trans. Audio, Speech, Lang. Process , pp. 2125-2136
- Taal, C.¹ Hendriks, R.² Heusdens, R.³ Jensen, J.⁴

14
- 0024876950
- An analysis of a noise reduction neural network
- S. Tamura, An analysis of a noise reduction neural network, in Proc. ICASSP, 1989, pp. 2001-2004
- (1989) Proc. ICASSP , pp. 2001-2004
- Tamura, S.¹

15
- 84886818613
- Active-set Newton algorithm for overcomplete non-negative representations of audio
- T. Virtanen, J. Gemmeke, and B. Raj, Active-set Newton algorithm for overcomplete non-negative representations of audio, IEEE Trans. Audio, Speech, Lang. Process., pp. 2277-2289, 2013
- (2013) IEEE Trans. Audio, Speech, Lang. Process , pp. 2277-2289
- Virtanen, T.¹ Gemmeke, J.² Raj, B.³

16
- 0009766947
- Networks for speech enhancement
- Artech House, Boston, USA
- E. A. Wan and A. T. Nelson, Networks for speech enhancement, Handbook of neural networks for speech processing. Artech House, Boston, USA, 1999
- (1999) Handbook of Neural Networks for Speech Processing
- Wan, E.A.¹ Nelson, A.T.²

17
- 84875678689
- Towards scaling up classification-based speech separation
- Y. Wang and D.L. Wang, Towards scaling up classification-based speech separation, IEEE Trans. Audio, Speech, Lang. Process., pp. 1381-1390, 2013
- (2013) IEEE Trans. Audio, Speech, Lang. Process , pp. 1381-1390
- Wang, Y.¹ Wang, D.L.²

18
- 84870477511
- Exploring monaural features for classification-based speech segregation
- Y. Wang, K. Han, and D.L. Wang, Exploring monaural features for classification-based speech segregation, IEEE Trans. Audio, Speech, Lang. Process., pp. 270-279, 2013
- (2013) IEEE Trans. Audio, Speech, Lang. Process , pp. 270-279
- Wang, Y.¹ Han, K.² Wang, D.L.³

19
- 84921740463
- On training targets for supervised speech separation
- Y. Wang, A. Narayanan, and D. Wang, On training targets for supervised speech separation, IEEE/ACM Trans. Audio, Speech, Lang. Process., pp. 1849-1858, 2014
- (2014) IEEE/ACM Trans. Audio, Speech, Lang. Process , pp. 1849-1858
- Wang, Y.¹ Narayanan, A.² Wang, D.³

20
- 84889257121
- An experimental study on speech enhancement based on deep neural networks
- Y. Xu, J. Du, L. Dai, and C. Lee, An experimental study on speech enhancement based on deep neural networks, IEEE Signal Processing Letters, pp. 66-68, 2014
- (2014) IEEE Signal Processing Letters , pp. 66-68
- Xu, Y.¹ Du, J.² Dai, L.³ Lee, C.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.