메뉴 건너뛰기




Volumn 129, Issue , 2014, Pages 199-207

Real-time frequency-based noise-robust Automatic Speech Recognition using Multi-Nets Artificial Neural Networks: A multi-views multi-learners approach

Author keywords

Artificial neural network; Automatic Speech Recognition; Frequency based noise; Multiple views multiple learners; Noise robustness

Indexed keywords

NEURAL NETWORKS; SPEECH;

EID: 84893792365     PISSN: 09252312     EISSN: 18728286     Source Type: Journal    
DOI: 10.1016/j.neucom.2013.09.040     Document Type: Article
Times cited : (51)

References (40)
  • 1
    • 79952360782 scopus 로고    scopus 로고
    • Variational noise model composition through model perturbation for robust speech recognition with time-varying background noise
    • Kim W., Hansen J.H.L. Variational noise model composition through model perturbation for robust speech recognition with time-varying background noise. Speech Commun. 2011, 53:451-464.
    • (2011) Speech Commun. , vol.53 , pp. 451-464
    • Kim, W.1    Hansen, J.H.L.2
  • 2
    • 45549104630 scopus 로고    scopus 로고
    • Invited paper: automatic speech recognition: history, methods and challenges
    • O'Shaughnessy D. Invited paper: automatic speech recognition: history, methods and challenges. Pattern Recognition 2008, 41:2965-2979.
    • (2008) Pattern Recognition , vol.41 , pp. 2965-2979
    • O'Shaughnessy, D.1
  • 4
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments: a survey
    • Gong Y.F. Speech recognition in noisy environments: a survey. Speech Commun. 1995, 16:261-291.
    • (1995) Speech Commun. , vol.16 , pp. 261-291
    • Gong, Y.F.1
  • 5
    • 84855519943 scopus 로고    scopus 로고
    • Multiple-view multiple-learner semi-supervised learning
    • Sun S., Zhang Q. Multiple-view multiple-learner semi-supervised learning. Neural Process. Lett. 2011, 34:229-240.
    • (2011) Neural Process. Lett. , vol.34 , pp. 229-240
    • Sun, S.1    Zhang, Q.2
  • 6
    • 78650197593 scopus 로고    scopus 로고
    • Multiple-view multiple-learner active learning
    • Zhang Q., Sun S. Multiple-view multiple-learner active learning. Pattern Recognition 2010, 43:3113-3119.
    • (2010) Pattern Recognition , vol.43 , pp. 3113-3119
    • Zhang, Q.1    Sun, S.2
  • 7
    • 84887452388 scopus 로고    scopus 로고
    • A survey of multi-view machine learning
    • Sun S. A survey of multi-view machine learning. Neural Comput. Appl. 2013, 1-8.
    • (2013) Neural Comput. Appl. , pp. 1-8
    • Sun, S.1
  • 9
    • 77949491279 scopus 로고    scopus 로고
    • Speech recognition with artificial neural networks
    • Dede G., Sazli M.H. Speech recognition with artificial neural networks. Digital Signal Process. 2010, 20:763-768.
    • (2010) Digital Signal Process. , vol.20 , pp. 763-768
    • Dede, G.1    Sazli, M.H.2
  • 13
    • 54349099783 scopus 로고    scopus 로고
    • Effect of retroflex sounds on the recognition of Hindi voiced and unvoiced stops
    • Dev A. Effect of retroflex sounds on the recognition of Hindi voiced and unvoiced stops. AI Soc. 2009, 23:603-612.
    • (2009) AI Soc. , vol.23 , pp. 603-612
    • Dev, A.1
  • 14
    • 42449125707 scopus 로고    scopus 로고
    • Categorization of Hindi phonemes by neural networks
    • Dev A., Agrawal S.S., Choudhury D.R. Categorization of Hindi phonemes by neural networks. AI Soc. 2003, 17:375-382.
    • (2003) AI Soc. , vol.17 , pp. 375-382
    • Dev, A.1    Agrawal, S.S.2    Choudhury, D.R.3
  • 16
    • 58549096367 scopus 로고    scopus 로고
    • Nonlinear normalization of input patterns to speaker variability in speech recognition neural networks
    • Nejadgholi I., Seyyedsalehi S.A. Nonlinear normalization of input patterns to speaker variability in speech recognition neural networks. Neural Comput. Appl. 2009, 18:45-55.
    • (2009) Neural Comput. Appl. , vol.18 , pp. 45-55
    • Nejadgholi, I.1    Seyyedsalehi, S.A.2
  • 17
    • 4644317224 scopus 로고    scopus 로고
    • A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
    • Seltzer M.L., Raj B., Stern R.M. A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition. Speech Commun. 2004, 43:379-393.
    • (2004) Speech Commun. , vol.43 , pp. 379-393
    • Seltzer, M.L.1    Raj, B.2    Stern, R.M.3
  • 18
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • Cooke M., Green P., Josifovski L., Vizinho A. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Commun. 2001, 34:267-285.
    • (2001) Speech Commun. , vol.34 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 19
    • 78650691589 scopus 로고    scopus 로고
    • Reconstruction of missing features by means of multivariate Laplace distribution (MLD) for noise robust speech recognition
    • Mohammadi A., Almasganj F. Reconstruction of missing features by means of multivariate Laplace distribution (MLD) for noise robust speech recognition. Expert Syst. Appl. 2011, 38:3918-3930.
    • (2011) Expert Syst. Appl. , vol.38 , pp. 3918-3930
    • Mohammadi, A.1    Almasganj, F.2
  • 20
    • 78049527664 scopus 로고    scopus 로고
    • Sparse imputation for large vocabulary noise robust ASR
    • Gemmeke J.F., Cranen B., Remes U. Sparse imputation for large vocabulary noise robust ASR. Comput. Speech Lang. 2011, 25:462-479.
    • (2011) Comput. Speech Lang. , vol.25 , pp. 462-479
    • Gemmeke, J.F.1    Cranen, B.2    Remes, U.3
  • 21
    • 78649325568 scopus 로고    scopus 로고
    • Mask classification for missing-feature reconstruction for robust speech recognition in unknown background noise
    • Kim W., Stern R.M. Mask classification for missing-feature reconstruction for robust speech recognition in unknown background noise. Speech Commun. 2011, 53:1-11.
    • (2011) Speech Commun. , vol.53 , pp. 1-11
    • Kim, W.1    Stern, R.M.2
  • 22
    • 84893774533 scopus 로고    scopus 로고
    • Robust speech recognition based on independent vector analysis using harmonic frequency dependency
    • Jun S., Kim M., Oh M., Park H.-M. Robust speech recognition based on independent vector analysis using harmonic frequency dependency. Neural Comput. Appl. 2012, 1-7.
    • (2012) Neural Comput. Appl. , pp. 1-7
    • Jun, S.1    Kim, M.2    Oh, M.3    Park, H.-M.4
  • 23
    • 77953696646 scopus 로고    scopus 로고
    • On the recognition of cochlear implant-like spectrally reduced speech with MFCC and HMM-based ASR
    • Do C.T., Pastor D., Goalic A. On the recognition of cochlear implant-like spectrally reduced speech with MFCC and HMM-based ASR. IEEE Trans. Audio Speech Lang. Process. 2010, 18:1065-1068.
    • (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , pp. 1065-1068
    • Do, C.T.1    Pastor, D.2    Goalic, A.3
  • 24
    • 80052737228 scopus 로고    scopus 로고
    • A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech
    • Do C.T., Pastor D., Goalic A. A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech. Speech Commun. 2012, 54:119-133.
    • (2012) Speech Commun. , vol.54 , pp. 119-133
    • Do, C.T.1    Pastor, D.2    Goalic, A.3
  • 25
    • 0032935343 scopus 로고    scopus 로고
    • Introduction to cochlear implants
    • Loizou P.C. Introduction to cochlear implants. IEEE Eng. Med. Biol. Mag. 1999, 18:32-42.
    • (1999) IEEE Eng. Med. Biol. Mag. , vol.18 , pp. 32-42
    • Loizou, P.C.1
  • 26
    • 47949104834 scopus 로고    scopus 로고
    • Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system
    • Hansen J.H.L., Radhakrishnan V., Arehart K.H. Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system. IEEE Trans. Audio Speech Lang. Process. 2006, 14:2049-2063.
    • (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , pp. 2049-2063
    • Hansen, J.H.L.1    Radhakrishnan, V.2    Arehart, K.H.3
  • 27
    • 0041591273 scopus 로고    scopus 로고
    • A generalized subspace approach for enhancing speech corrupted by colored noise
    • Hu Y., Loizou P.C. A generalized subspace approach for enhancing speech corrupted by colored noise. IEEE Trans. Speech Audio Process. 2003, 11:334-341.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 334-341
    • Hu, Y.1    Loizou, P.C.2
  • 28
    • 81155133929 scopus 로고    scopus 로고
    • Bayesian separation with sparsity promotion in perceptual wavelet domain for speech enhancement and hybrid speech recognition
    • Shao Y., Chang C.H. Bayesian separation with sparsity promotion in perceptual wavelet domain for speech enhancement and hybrid speech recognition. IEEE Trans. Syst. Man Cybern. A: Syst. Humans 2011, 41:284-293.
    • (2011) IEEE Trans. Syst. Man Cybern. A: Syst. Humans , vol.41 , pp. 284-293
    • Shao, Y.1    Chang, C.H.2
  • 29
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Boll S.F. Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. 1979, 27:113-120.
    • (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , pp. 113-120
    • Boll, S.F.1
  • 30
    • 79952624473 scopus 로고    scopus 로고
    • The use of phase in complex spectrum subtraction for robust speech recognition
    • Kleinschmidt T., Sridharan S., Mason M. The use of phase in complex spectrum subtraction for robust speech recognition. Comput. Speech Lang. 2011, 25:585-600.
    • (2011) Comput. Speech Lang. , vol.25 , pp. 585-600
    • Kleinschmidt, T.1    Sridharan, S.2    Mason, M.3
  • 31
    • 80052927950 scopus 로고    scopus 로고
    • Nonlinear enhancement of noisy speech, using continuous attractor dynamics formed in recurrent neural networks
    • Dehyadegary L., Seyyedsalehi S.A., Nejadgholi I. Nonlinear enhancement of noisy speech, using continuous attractor dynamics formed in recurrent neural networks. Neurocomputing 2011, 74:2716-2724.
    • (2011) Neurocomputing , vol.74 , pp. 2716-2724
    • Dehyadegary, L.1    Seyyedsalehi, S.A.2    Nejadgholi, I.3
  • 32
    • 33846439764 scopus 로고    scopus 로고
    • Signal processing for in-car communication systems
    • Schmidt G., Haulick T. Signal processing for in-car communication systems. Signal Process. 2006, 86:1307-1326.
    • (2006) Signal Process. , vol.86 , pp. 1307-1326
    • Schmidt, G.1    Haulick, T.2
  • 33
    • 84870045418 scopus 로고    scopus 로고
    • Directional cancellation of acoustic noise for home window applications
    • Hu S., Rajamani R., Yu X. Directional cancellation of acoustic noise for home window applications. Appl. Acoust. 2013, 74:467-477.
    • (2013) Appl. Acoust. , vol.74 , pp. 467-477
    • Hu, S.1    Rajamani, R.2    Yu, X.3
  • 34
    • 22544432579 scopus 로고    scopus 로고
    • A robust hybrid feedback active noise cancellation headset
    • Ying S., Yu G., Kuo S.M. A robust hybrid feedback active noise cancellation headset. IEEE Trans. Speech Audio Process. 2005, 13:607-617.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , pp. 607-617
    • Ying, S.1    Yu, G.2    Kuo, S.M.3
  • 35
    • 77955708406 scopus 로고    scopus 로고
    • Active noise cancellation without secondary path identification by using an adaptive genetic algorithm
    • Cheng-Yuan C., Deng-Rui C. Active noise cancellation without secondary path identification by using an adaptive genetic algorithm. IEEE Trans. Instrum. Meas. 2010, 59:2315-2327.
    • (2010) IEEE Trans. Instrum. Meas. , vol.59 , pp. 2315-2327
    • Cheng-Yuan, C.1    Deng-Rui, C.2
  • 36
    • 84881048163 scopus 로고    scopus 로고
    • Blind source extraction for robust speech recognition in multisource noisy environments
    • Nesta F., Matassoni M. Blind source extraction for robust speech recognition in multisource noisy environments. Comput. Speech Lang. 2013, 27:703-725.
    • (2013) Comput. Speech Lang. , vol.27 , pp. 703-725
    • Nesta, F.1    Matassoni, M.2
  • 39
    • 29644438050 scopus 로고    scopus 로고
    • Statistical comparisons of classifiers over multiple data sets
    • Demar J. Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 2006, 7:1-30.
    • (2006) J. Mach. Learn. Res. , vol.7 , pp. 1-30
    • Demar, J.1
  • 40
    • 15844411850 scopus 로고    scopus 로고
    • Confidence measures for speech recognition: a survey
    • Jiang H. Confidence measures for speech recognition: a survey. Speech Commun. 2005, 45:455-470.
    • (2005) Speech Commun. , vol.45 , pp. 455-470
    • Jiang, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.