SCOPUS 정보 검색 플랫폼

2009 IEEE-RIVF International Conference on Computing and Communication Technologies: Research, Innovation and Vision for the Future, RIVF 2009

Volumn , Issue , 2009, Pages

Using artificial neural network For robust voice activity detection under adverse conditions

(3) Pham, Tuan V a Tang, Chien T a Stadtschnitzer, Michael b

a University of Technology (United States)

b GRAZ UNIVERSITY OF TECHNOLOGY (Austria)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL NEURAL NETWORK; DEVELOPED MODEL; EMPIRICAL RESULTS; HARSH ENVIRONMENT; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MODEL-BASED; NEURAL NETWORK CLASSIFIER; NEURAL NETWORK TRAINING; NOISY SPEECH; OPTIMIZATION PROCEDURES; RECENT STATE; RELIABLE MODELS; VOICE ACTIVITY DETECTION;

BACKPROPAGATION; COMPUTER SCIENCE; NETWORK PERFORMANCE; SIGNAL TO NOISE RATIO; SPEECH RECOGNITION; SYSTEMS ENGINEERING;

NEURAL NETWORKS;

EID: 71049181730 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/RIVF.2009.5174662 Document Type: Conference Paper

Times cited : (20)

References (20)

1
- 0031238211
- ITU-T recommendation G.729 annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications
- A. Benyassine, E. Shlomot, H.-Y. Su, D. Massaloux, C. Lamblin, and J.-P. Petit, "ITU-T Recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications," IEEE Communications Magazine, vol. 35, no. 9, pp. 64-73, 1997.
- (1997) IEEE Communications Magazine , vol.35 , Issue.9 , pp. 64-73
- Benyassine, A.¹ Shlomot, E.² Su, H.-Y.³ Massaloux, D.⁴ Lamblin, C.⁵ Petit, J.-P.⁶

2
- 0041360463
- Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
- I. Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," IEEE Trans. on Speech and Audio Processing, vol. 11, no. 5, pp. 466-475, 2003.
- (2003) IEEE Trans. on Speech and Audio Processing , vol.11 , Issue.5 , pp. 466-475
- Cohen, I.¹

3
- 0442317754
- ETSI, ETSI ES 202 050 V1.1.3
- ETSI, ETSI ES 202 050 V1.1.3 Speech Processing, Transmission and Quality Aspects (STQ), Distributed speech recognition, Advanced frontend feature extraction algorithm, Compression algorithms, 2003.
- (2003) Speech Processing, Transmission and Quality Aspects (STQ), Distributed Speech Recognition, Advanced Frontend Feature Extraction Algorithm, Compression Algorithms

4
- 38149039412
- chapter Speaker Segmentation for Air Traffic Control, Springer
- M. Neffe, T. V. Pham, H. Hering, and G. Kubin, Speaker Classification II, LNCS, vol. 4441, chapter Speaker Segmentation for Air Traffic Control, pp. 177-191, Springer, 2007.
- (2007) Speaker Classification II, LNCS , vol.4441 , pp. 177-191
- Neffe, M.¹ Pham, T.V.² Hering, H.³ Kubin, G.⁴

5
- 0032762471
- A statistical model-based voice activity detection
- J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Processing Letters, vol. 6, no. 1, pp. 1-3, 1999.
- (1999) IEEE Signal Processing Letters , vol.6 , Issue.1 , pp. 1-3
- Sohn, J.¹ Kim, N.S.² Sung, W.³

6
- 0042863279
- A soft voice activity detector based on a laplacian-gaussian model
- S. Gazor and W. Zhang, "A soft voice activity detector based on a Laplacian-Gaussian model," IEEE Trans. on Speech and Audio Processing, vol. 11, no. 5, pp. 498-505, 2003.
- (2003) IEEE Trans. on Speech and Audio Processing , vol.11 , Issue.5 , pp. 498-505
- Gazor, S.¹ Zhang, W.²

7
- 33744532633
- Voice activity detection based on multiple statistical models
- J.H. Chang, N.S. Kim, and S.K. Mitra, "Voice activity detection based on multiple statistical models," IEEE Trans. on Signal Processing, vol. 54, no. 6, pp. 1965-1976, 2006.
- (2006) IEEE Trans. on Signal Processing , vol.54 , Issue.6 , pp. 1965-1976
- Chang, J.H.¹ Kim, N.S.² Mitra, S.K.³

8
- 34249676923
- Robust voice activity detection using perceptual wavelet-packet transform and Teager energy operator
- S.H. Chen, H.T. Wu, Y. Chang, and T. K. Truong, "Robust voice activity detection using perceptual wavelet-packet transform and Teager energy operator," Pattern Recognition Letters, vol. 28, no. 11, pp. 1327-1332,2007.
- (2007) Pattern Recognition Letters , vol.28 , Issue.11 , pp. 1327-1332
- Chen, S.H.¹ Wu, H.T.² Chang, Y.³ Truong, T.K.⁴

9
- 33745619765
- Statistical model-based vad algorithm with wavelet transform
- Y. C. Lee and S. S. Ahn, "Statistical model-based vad algorithm with wavelet transform," IEICE Trans. on Fundamentals of Electronics, Communications and Computer Sciences, vol. E89-A, pp. 1594-1600,2006.
- (2006) IEICE Trans. on Fundamentals of Electronics, Communications and Computer Sciences , vol.E89-A , pp. 1594-1600
- Lee, Y.C.¹ Ahn, S.S.²

10
- 66149186195
- Voice activity detection based on conditional MAP criterion
- J. W. Shin, H. J. Kwon, S. H. Jin, and N. S. Kim, "Voice activity detection based on conditional MAP criterion," Signal Processing Letters, vol. 15, pp. 257-260, 2008.
- (2008) Signal Processing Letters , vol.15 , pp. 257-260
- Shin, J.W.¹ Kwon, H.J.² Jin, S.H.³ Kim, N.S.⁴

11
- 27744483317
- An effective subband OSF-based VAD with noise reduction for robust speech recognition
- J. Ramirez, J.C. Segura, C. Benitez, A. de la Torre, and A. Rubio, "An effective subband OSF-based VAD with noise reduction for robust speech recognition," IEEE Trans. on Speech and Audio Processing, vol. 13, no. 6, pp. 1119-1129, 2005.
- (2005) IEEE Trans. on Speech and Audio Processing , vol.13 , Issue.6 , pp. 1119-1129
- Ramirez, J.¹ Segura, J.C.² Benitez, C.³ De La Torre, A.⁴ Rubio, A.⁵

12
- 84867193135
- Voice activity detection algorithms using subband power distance feature for noisy environments
- Brisbane, Australia
- T. V. Pham, M. Stadtschnitzer, F. Pernkopf, and G. Kubin, "Voice activity detection algorithms using subband power distance feature for noisy environments," in Proc. Interspeech, Brisbane, Australia, 2008.
- (2008) Proc. Interspeech
- Pham, T.V.¹ Stadtschnitzer, M.² Pernkopf, F.³ Kubin, G.⁴

13
- 0004217877
- 2nd edition, Butterworth-Heinemann
- C.J. Van Rijsbergen Newton, Information Retrieval, 2nd edition, Butterworth-Heinemann, 1979.
- (1979) Information Retrieval
- Van Rijsbergen Newton, C.J.¹

14
- 38749086536
- Voice/nonvoice classification using reliable fundamental frequency estimator for voice activated powered wheelchair control
- Soo-Young Suk, Hyun-Yeol Chung, and Hiroaki Kojima, "Voice/nonvoice classification using reliable fundamental frequency estimator for voice activated powered wheelchair control," Lecture Notes in Computer Science, vol. 4523/2007, pp. 347-357, 2007.
- (2007) Lecture Notes in Computer Science , vol.4523 , Issue.2007 , pp. 347-357
- Suk, S.-Y.¹ Chung, H.-Y.² Kojima, H.³

15
- 0003548585
- TIMIT acoustic - Phonetic - Continuous speech corpus
- National Institute of Standards and Technology
- J. S. Garofolo, L. F. Lamel,W. M. Fisher, J. G. Fiscus, D. S. Pallett, N. L. Dahlgren, and V. Zue, "TIMIT acoustic - phonetic - continuous speech corpus," Tech. Rep., National Institute of Standards and Technology, 1993.
- (1993) Tech. Rep.
- Garofolo, J.S.¹ Lamel, L.F.² Fisher, W.M.³ Fiscus, J.G.⁴ Pallett, D.S.⁵ Dahlgren, N.L.⁶ Zue, V.⁷

16
- 71049141519
- The Rice University, "Noisex-92 database,"
- The Rice University, "Noisex-92 database," http://spib.rice. edu/spib/.

17
- 44949128271
- Evaluation of objective measures for speech enhancement
- Philadelphia, PA
- Y. Hu and P. Loizou, "Evaluation of objective measures for speech enhancement," in Proceedings of INTERSPEECH-2006, Philadelphia, PA, 2006.
- (2006) Proceedings of INTERSPEECH-2006
- Hu, Y.¹ Loizou, P.²

18
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition
- Aug
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition," Trans. Acoust., Speech, Signal Processing, Vol. 28, pp. 357-366, Aug. 1980.
- (1980) Trans. Acoust., Speech, Signal Processing , vol.28 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

19
- 69049105093
- A brief description of the levenberg-marquardt algorithm implemened
- Foundation for Research and Technology, Hellas
- M. I. A. Lourakis, "A brief description of the Levenberg-Marquardt algorithm implemened," Tech. Rep., Foundation for Research and Technology, Hellas, 2007.
- (2007) Tech. Rep.
- Lourakis, M.I.A.¹

20
- 71049188723
- Strategic Targeted Research Project in the 6th Frame Program of the European Union, FP6-511587
- "Services for NOmadic Workers (snow)," Strategic Targeted Research Project in the 6th Frame Program of the European Union, FP6-511587.
- Services for NOmadic Workers (snow)

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.