
Volume 129, 2014, Pages 199-207

Real-time frequency-based noise-robust Automatic Speech Recognition using Multi-Nets Artificial Neural Networks: A multi-views multi-learners approach

(2) Shahamiri, Seyed Reza (a); Binti Salim, Siti Salwah (a)

(a) University of Malaya (Malaysia)

Author keywords

Artificial neural network; Automatic Speech Recognition; Frequency based noise; Multiple views multiple learners; Noise robustness

Indexed keywords

NEURAL NETWORKS; SPEECH;

ACOUSTIC SIGNALS; AUTOMATIC SPEECH RECOGNITION; FREQUENCY-BASED NOISE; MULTIPLE VIEWS; NOISE ROBUST ASR; NOISE ROBUSTNESS; NOISE-ROBUST AUTOMATIC SPEECH RECOGNITION; NOISY CONDITIONS;

SPEECH RECOGNITION;

ACCURACY; ANALYTICAL ERROR; ARTICLE; ARTIFICIAL NEURAL NETWORK; AUTOMATIC SPEECH RECOGNITION; CLASSIFIER; LINGUISTICS; MULTI NETS ARTIFICIAL NEURAL NETWORK; NOISE; PRIORITY JOURNAL; SPEECH DISCRIMINATION;

EID: 84893792365
PISSN: 09252312
EISSN: 18728286
Source Type: Journal
DOI: 10.1016/j.neucom.2013.09.040
Document Type: Article

Times cited: 51

References (40)

1
- 79952360782
- Variational noise model composition through model perturbation for robust speech recognition with time-varying background noise
- Kim W., Hansen J.H.L. Variational noise model composition through model perturbation for robust speech recognition with time-varying background noise. Speech Commun. 2011, 53:451-464.
- (2011) Speech Commun. , vol.53 , pp. 451-464
- Kim, W.¹ Hansen, J.H.L.²

2
- 45549104630
- Invited paper: automatic speech recognition: history, methods and challenges
- O'Shaughnessy D. Invited paper: automatic speech recognition: history, methods and challenges. Pattern Recognition 2008, 41:2965-2979.
- (2008) Pattern Recognition , vol.41 , pp. 2965-2979
- O'Shaughnessy, D.¹

3
- 0003889532
- Artificial Neural Networks
- Schalkoff R.J. Artificial Neural Networks 1997, McGraw-Hill.
- (1997) Artificial Neural Networks
- Schalkoff, R.J.¹

4
- 0029288202
- Speech recognition in noisy environments: a survey
- Gong Y.F. Speech recognition in noisy environments: a survey. Speech Commun. 1995, 16:261-291.
- (1995) Speech Commun. , vol.16 , pp. 261-291
- Gong, Y.F.¹

5
- 84855519943
- Multiple-view multiple-learner semi-supervised learning
- Sun S., Zhang Q. Multiple-view multiple-learner semi-supervised learning. Neural Process. Lett. 2011, 34:229-240.
- (2011) Neural Process. Lett. , vol.34 , pp. 229-240
- Sun, S.¹ Zhang, Q.²

6
- 78650197593
- Multiple-view multiple-learner active learning
- Zhang Q., Sun S. Multiple-view multiple-learner active learning. Pattern Recognition 2010, 43:3113-3119.
- (2010) Pattern Recognition , vol.43 , pp. 3113-3119
- Zhang, Q.¹ Sun, S.²

7
- 84887452388
- A survey of multi-view machine learning
- Sun S. A survey of multi-view machine learning. Neural Comput. Appl. 2013, 1-8.
- (2013) Neural Comput. Appl. , pp. 1-8
- Sun, S.¹

8
- 79955055107
- An automated framework for software test oracle
- Shahamiri S.R., Kadir W.M.N.W., Ibrahim S., Hashim S.Z.B. An automated framework for software test oracle. Inf. Software Technol. 2011, 53:774-788.
- (2011) Inf. Software Technol. , vol.53 , pp. 774-788
- Shahamiri, S.R.¹ Kadir, W.M.N.W.² Ibrahim, S.³ Hashim, S.Z.B.⁴

9
- 77949491279
- Speech recognition with artificial neural networks
- Dede G., Sazli M.H. Speech recognition with artificial neural networks. Digital Signal Process. 2010, 20:763-768.
- (2010) Digital Signal Process. , vol.20 , pp. 763-768
- Dede, G.¹ Sazli, M.H.²

10
- 0025254722
- A time-delay neural network architecture for isolated word recognition
- Lang K.J., Waibel A.H., Hinton G.E. A time-delay neural network architecture for isolated word recognition. Neural Networks 1990, 3:23-43.
- (1990) Neural Networks , vol.3 , pp. 23-43
- Lang, K.J.¹ Waibel, A.H.² Hinton, G.E.³

11
- 0024939480
- Modularity and scaling in large phonemic neural networks
- Waibel A., Sawai H., Shikano K. Modularity and scaling in large phonemic neural networks. IEEE Trans. Acoust. Speech Signal Process. 1989, 37:1888-1898.
- (1989) IEEE Trans. Acoust. Speech Signal Process. , vol.37 , pp. 1888-1898
- Waibel, A.¹ Sawai, H.² Shikano, K.³

12
- 79959942839
- Speaker-independent vowel recognition for Malay children using time-delay neural network
- B.F. Yong, H.N. Ting, Speaker-independent vowel recognition for Malay children using time-delay neural network, in: 5th Kuala Lumpur International Conference on Biomedical Engineering (BIOMED 2011), Kuala Lumpur, 2011, pp. 565-568.
- (2011) 5th Kuala Lumpur International Conference on Biomedical Engineering (BIOMED 2011) , pp. 565-568
- Yong, B.F.¹ Ting, H.N.²

13
- 54349099783
- Effect of retroflex sounds on the recognition of Hindi voiced and unvoiced stops
- Dev A. Effect of retroflex sounds on the recognition of Hindi voiced and unvoiced stops. AI Soc. 2009, 23:603-612.
- (2009) AI Soc. , vol.23 , pp. 603-612
- Dev, A.¹

14
- 42449125707
- Categorization of Hindi phonemes by neural networks
- Dev A., Agrawal S.S., Choudhury D.R. Categorization of Hindi phonemes by neural networks. AI Soc. 2003, 17:375-382.
- (2003) AI Soc. , vol.17 , pp. 375-382
- Dev, A.¹ Agrawal, S.S.² Choudhury, D.R.³

15
- 70449644569
- Distributed TDNN-fuzzy vector quantization for HMM speech recognition
- M. Debyeche, A. Amrouche, J.P. Haton, Distributed TDNN-fuzzy vector quantization for HMM speech recognition, in: 2009 International Conference on Multimedia Computing and Systems (ICMCS 2009), Ouarzazate, 2009, pp. 72-76.
- (2009) International Conference on Multimedia Computing and Systems (ICMCS 2009) , pp. 72-76
- Debyeche, M.¹ Amrouche, A.² Haton, J.P.³

16
- 58549096367
- Nonlinear normalization of input patterns to speaker variability in speech recognition neural networks
- Nejadgholi I., Seyyedsalehi S.A. Nonlinear normalization of input patterns to speaker variability in speech recognition neural networks. Neural Comput. Appl. 2009, 18:45-55.
- (2009) Neural Comput. Appl. , vol.18 , pp. 45-55
- Nejadgholi, I.¹ Seyyedsalehi, S.A.²

17
- 4644317224
- A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
- Seltzer M.L., Raj B., Stern R.M. A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition. Speech Commun. 2004, 43:379-393.
- (2004) Speech Commun. , vol.43 , pp. 379-393
- Seltzer, M.L.¹ Raj, B.² Stern, R.M.³

18
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- Cooke M., Green P., Josifovski L., Vizinho A. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Commun. 2001, 34:267-285.
- (2001) Speech Commun. , vol.34 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

19
- 78650691589
- Reconstruction of missing features by means of multivariate Laplace distribution (MLD) for noise robust speech recognition
- Mohammadi A., Almasganj F. Reconstruction of missing features by means of multivariate Laplace distribution (MLD) for noise robust speech recognition. Expert Syst. Appl. 2011, 38:3918-3930.
- (2011) Expert Syst. Appl. , vol.38 , pp. 3918-3930
- Mohammadi, A.¹ Almasganj, F.²

20
- 78049527664
- Sparse imputation for large vocabulary noise robust ASR
- Gemmeke J.F., Cranen B., Remes U. Sparse imputation for large vocabulary noise robust ASR. Comput. Speech Lang. 2011, 25:462-479.
- (2011) Comput. Speech Lang. , vol.25 , pp. 462-479
- Gemmeke, J.F.¹ Cranen, B.² Remes, U.³

21
- 78649325568
- Mask classification for missing-feature reconstruction for robust speech recognition in unknown background noise
- Kim W., Stern R.M. Mask classification for missing-feature reconstruction for robust speech recognition in unknown background noise. Speech Commun. 2011, 53:1-11.
- (2011) Speech Commun. , vol.53 , pp. 1-11
- Kim, W.¹ Stern, R.M.²

22
- 84893774533
- Robust speech recognition based on independent vector analysis using harmonic frequency dependency
- Jun S., Kim M., Oh M., Park H.-M. Robust speech recognition based on independent vector analysis using harmonic frequency dependency. Neural Comput. Appl. 2012, 1-7.
- (2012) Neural Comput. Appl. , pp. 1-7
- Jun, S.¹ Kim, M.² Oh, M.³ Park, H.-M.⁴

23
- 77953696646
- On the recognition of cochlear implant-like spectrally reduced speech with MFCC and HMM-based ASR
- Do C.T., Pastor D., Goalic A. On the recognition of cochlear implant-like spectrally reduced speech with MFCC and HMM-based ASR. IEEE Trans. Audio Speech Lang. Process. 2010, 18:1065-1068.
- (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , pp. 1065-1068
- Do, C.T.¹ Pastor, D.² Goalic, A.³

24
- 80052737228
- A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech
- Do C.T., Pastor D., Goalic A. A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech. Speech Commun. 2012, 54:119-133.
- (2012) Speech Commun. , vol.54 , pp. 119-133
- Do, C.T.¹ Pastor, D.² Goalic, A.³

25
- 0032935343
- Introduction to cochlear implants
- Loizou P.C. Introduction to cochlear implants. IEEE Eng. Med. Biol. Mag. 1999, 18:32-42.
- (1999) IEEE Eng. Med. Biol. Mag. , vol.18 , pp. 32-42
- Loizou, P.C.¹

26
- 47949104834
- Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system
- Hansen J.H.L., Radhakrishnan V., Arehart K.H. Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system. IEEE Trans. Audio Speech Lang. Process. 2006, 14:2049-2063.
- (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , pp. 2049-2063
- Hansen, J.H.L.¹ Radhakrishnan, V.² Arehart, K.H.³

27
- 0041591273
- A generalized subspace approach for enhancing speech corrupted by colored noise
- Hu Y., Loizou P.C. A generalized subspace approach for enhancing speech corrupted by colored noise. IEEE Trans. Speech Audio Process. 2003, 11:334-341.
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 334-341
- Hu, Y.¹ Loizou, P.C.²

28
- 81155133929
- Bayesian separation with sparsity promotion in perceptual wavelet domain for speech enhancement and hybrid speech recognition
- Shao Y., Chang C.H. Bayesian separation with sparsity promotion in perceptual wavelet domain for speech enhancement and hybrid speech recognition. IEEE Trans. Syst. Man Cybern. A: Syst. Humans 2011, 41:284-293.
- (2011) IEEE Trans. Syst. Man Cybern. A: Syst. Humans , vol.41 , pp. 284-293
- Shao, Y.¹ Chang, C.H.²

29
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Boll S.F. Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. 1979, 27:113-120.
- (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , pp. 113-120
- Boll, S.F.¹

30
- 79952624473
- The use of phase in complex spectrum subtraction for robust speech recognition
- Kleinschmidt T., Sridharan S., Mason M. The use of phase in complex spectrum subtraction for robust speech recognition. Comput. Speech Lang. 2011, 25:585-600.
- (2011) Comput. Speech Lang. , vol.25 , pp. 585-600
- Kleinschmidt, T.¹ Sridharan, S.² Mason, M.³

31
- 80052927950
- Nonlinear enhancement of noisy speech, using continuous attractor dynamics formed in recurrent neural networks
- Dehyadegary L., Seyyedsalehi S.A., Nejadgholi I. Nonlinear enhancement of noisy speech, using continuous attractor dynamics formed in recurrent neural networks. Neurocomputing 2011, 74:2716-2724.
- (2011) Neurocomputing , vol.74 , pp. 2716-2724
- Dehyadegary, L.¹ Seyyedsalehi, S.A.² Nejadgholi, I.³

32
- 33846439764
- Signal processing for in-car communication systems
- Schmidt G., Haulick T. Signal processing for in-car communication systems. Signal Process. 2006, 86:1307-1326.
- (2006) Signal Process. , vol.86 , pp. 1307-1326
- Schmidt, G.¹ Haulick, T.²

33
- 84870045418
- Directional cancellation of acoustic noise for home window applications
- Hu S., Rajamani R., Yu X. Directional cancellation of acoustic noise for home window applications. Appl. Acoust. 2013, 74:467-477.
- (2013) Appl. Acoust. , vol.74 , pp. 467-477
- Hu, S.¹ Rajamani, R.² Yu, X.³

34
- 22544432579
- A robust hybrid feedback active noise cancellation headset
- Ying S., Yu G., Kuo S.M. A robust hybrid feedback active noise cancellation headset. IEEE Trans. Speech Audio Process. 2005, 13:607-617.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , pp. 607-617
- Ying, S.¹ Yu, G.² Kuo, S.M.³

35
- 77955708406
- Active noise cancellation without secondary path identification by using an adaptive genetic algorithm
- Cheng-Yuan C., Deng-Rui C. Active noise cancellation without secondary path identification by using an adaptive genetic algorithm. IEEE Trans. Instrum. Meas. 2010, 59:2315-2327.
- (2010) IEEE Trans. Instrum. Meas. , vol.59 , pp. 2315-2327
- Cheng-Yuan, C.¹ Deng-Rui, C.²

36
- 84881048163
- Blind source extraction for robust speech recognition in multisource noisy environments
- Nesta F., Matassoni M. Blind source extraction for robust speech recognition in multisource noisy environments. Comput. Speech Lang. 2013, 27:703-725.
- (2013) Comput. Speech Lang. , vol.27 , pp. 703-725
- Nesta, F.¹ Matassoni, M.²

37
- 63449087062
- Microphone Array Signal Processing
- Benesty J., Chen J., Huang Y. Microphone Array Signal Processing 2008, Springer.
- (2008) Microphone Array Signal Processing
- Benesty, J.¹ Chen, J.² Huang, Y.³

38
- 84863642080
- Artificial neural networks as multi-networks automated test oracle
- Shahamiri S.R., Kadir W.M.N.W., Ibrahim S., Hashim S.Z.B. Artificial neural networks as multi-networks automated test oracle. Autom. Software Eng. 2012, 19:303-334.
- (2012) Autom. Software Eng. , vol.19 , pp. 303-334
- Shahamiri, S.R.¹ Kadir, W.M.N.W.² Ibrahim, S.³ Hashim, S.Z.B.⁴

39
- 29644438050
- Statistical comparisons of classifiers over multiple data sets
- Demšar J. Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 2006, 7:1-30.
- (2006) J. Mach. Learn. Res. , vol.7 , pp. 1-30
- Demšar, J.¹

40
- 15844411850
- Confidence measures for speech recognition: a survey
- Jiang H. Confidence measures for speech recognition: a survey. Speech Commun. 2005, 45:455-470.
- (2005) Speech Commun. , vol.45 , pp. 455-470
- Jiang, H.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.