메뉴 건너뛰기




Volumn 21, Issue 3, 2007, Pages 562-578

A segment-based interpretation of HMM/ANN hybrids

Author keywords

[No Author keywords available]

Indexed keywords

HIDDEN MARKOV MODELS; NEURAL NETWORKS; PROBABILITY; SPEECH ANALYSIS;

EID: 33847686469     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2006.12.001     Document Type: Article
Times cited : (4)

References (29)
  • 1
    • 84942485483 scopus 로고    scopus 로고
    • Austin, S., Zavaliagkos, G., Makhoul, J., Schwartz, R., 1992. Speech recognition using segmental neural nets. In: Proceedings of ICASSP'92, vol. 1, pp. 625-628.
  • 2
    • 33847627122 scopus 로고    scopus 로고
    • Bourlard, H., Konig, Y., Morgan, N., 1994. REMAP: recursive estimation and maximization of a posteriori probabilities - application to transition-based connectionist speech recognition. ICSI Technical Report TR-94-064.
  • 3
    • 0030142722 scopus 로고    scopus 로고
    • Towards increasing speech recognition error rates
    • Bourlard H., Hermansky H., and Morgan N. Towards increasing speech recognition error rates. Speech Communication 18 (1996) 205-231
    • (1996) Speech Communication , vol.18 , pp. 205-231
    • Bourlard, H.1    Hermansky, H.2    Morgan, N.3
  • 4
    • 16344386023 scopus 로고    scopus 로고
    • Efficient computation of the frame-based extended union model and its application in speech recognition against partial temporal corruptions
    • Chan Y.-C., and Siu M. Efficient computation of the frame-based extended union model and its application in speech recognition against partial temporal corruptions. Computer Speech and Language 19 (2005) 301-319
    • (2005) Computer Speech and Language , vol.19 , pp. 301-319
    • Chan, Y.-C.1    Siu, M.2
  • 5
    • 0032639886 scopus 로고    scopus 로고
    • Clarkson, P., Moreno, P.J., 1999. On the Use of Support Vector Machines for Phonetic Classification. In: Proceedings of ICASSP'99, pp. 585-588.
  • 6
    • 0031269184 scopus 로고    scopus 로고
    • On the optimality of the simple Bayesian classifier under zero-one loss
    • Domingos P., and Pazzani M. On the optimality of the simple Bayesian classifier under zero-one loss. Machine Learning 29 (1997) 103-130
    • (1997) Machine Learning , vol.29 , pp. 103-130
    • Domingos, P.1    Pazzani, M.2
  • 7
    • 33847627558 scopus 로고    scopus 로고
    • Gales, M.J.F., Young, S.J., 1993. The Theory of Segmental Hidden Markov Models. Technical Report CUED/F-INFENG/TR133, Cambridge University Engineering Department.
  • 8
    • 0030372637 scopus 로고    scopus 로고
    • Glass, J.R., 1996. A probabilistic framework for feature-based speech recognition. In: Proceedings of ICSLP'96, pp. 2277-2280.
  • 9
    • 33847648093 scopus 로고    scopus 로고
    • Greenberg, S., Chang S., 2000. Linguistic dissection of switchboard-corpus automatic speech recognition systems. In: Proceedings of ISCA Workshop on ASR: Challenges for the New Millenium, pp. 195-202.
  • 10
    • 9644308136 scopus 로고    scopus 로고
    • Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR
    • Hagen A., and Morris A. Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR. Computer Speech and Language 19 (2005) 3-30
    • (2005) Computer Speech and Language , vol.19 , pp. 3-30
    • Hagen, A.1    Morris, A.2
  • 11
    • 33847666079 scopus 로고    scopus 로고
    • Hennebert, J., Ris, C., Bourlard, H., Renals, S., Morgan, N., 1997. Estimation of global posteriors and forward-backward training of hybrid HMM/ANN systems. In: Proceedings of Eurospeech'97, pp. 1951-1954.
  • 13
    • 0030142721 scopus 로고    scopus 로고
    • Five speculations (and a divertimento) on the themes of H. Bourlard, H. Hermansky, and N. Morgan
    • Jelinek F. Five speculations (and a divertimento) on the themes of H. Bourlard, H. Hermansky, and N. Morgan. Speech Communication 18 (1996) 242-246
    • (1996) Speech Communication , vol.18 , pp. 242-246
    • Jelinek, F.1
  • 14
    • 33847653374 scopus 로고    scopus 로고
    • Lee, S.C., Glass, J.R., 1998. Real-time probabilistic segmentation for segment-based speech recognition. In: Proceedings of ICSLP'98, pp.1803-1806.
  • 16
    • 0347576554 scopus 로고    scopus 로고
    • Leung, H.C., Hetherington, I.L., Zue, V.W., 1992. Speech recognition using stochastic segment neural networks. In: Proceedings of ICASSP'92, vol. 1, pp. 613-616.
  • 17
    • 0035412896 scopus 로고    scopus 로고
    • Union: A model for partial temporal corruption of speech
    • Ming J., and Smith F.J. Union: A model for partial temporal corruption of speech. Computer Speech and Language 15 (2001) 217-231
    • (2001) Computer Speech and Language , vol.15 , pp. 217-231
    • Ming, J.1    Smith, F.J.2
  • 18
    • 33847683907 scopus 로고    scopus 로고
    • Morgan, N., Bourlard, H., 1995. An Introduction to Hybrid HMM/Connectionist Continuous Speech Recognition. Signal Processing Magazine, May, 25-42.
  • 19
    • 85009253205 scopus 로고    scopus 로고
    • Morris, A. C., Payne, S., Bourlard, H., 2002. Low cost duration modeling for noise robust speech recognition. In: Proceedings of ICSLP 2002, pp. 1025-1028.
  • 20
  • 22
    • 0000114416 scopus 로고    scopus 로고
    • Pronunciation modeling by sharing Gaussian densities across phonetic models
    • Saraçlar M., Nock H., and Khudanpur S. Pronunciation modeling by sharing Gaussian densities across phonetic models. Computer Speech and Language 14 (2000) 137-160
    • (2000) Computer Speech and Language , vol.14 , pp. 137-160
    • Saraçlar, M.1    Nock, H.2    Khudanpur, S.3
  • 24
    • 33646050954 scopus 로고    scopus 로고
    • Tóth, L., Kocsor, A., 2005. Explicit duration modelling in HMM/ANN Hybrids. Proceedings of TSD'2005, pp. 310-317.
  • 25
    • 10444286907 scopus 로고    scopus 로고
    • Telephone speech recognition via the combination of knowledge sources in a segmental speech model
    • Tóth L., Kocsor A., and Gosztolya G. Telephone speech recognition via the combination of knowledge sources in a segmental speech model. Acta Cybernetica 16 (2004) 643-657
    • (2004) Acta Cybernetica , vol.16 , pp. 643-657
    • Tóth, L.1    Kocsor, A.2    Gosztolya, G.3
  • 26
    • 0032048095 scopus 로고    scopus 로고
    • Assessing the importance of the segmentation probability in segment-based speech recognition
    • Verhasselt J., Illina I., Martens J.-P., Gong Y., and Haton J.-P. Assessing the importance of the segmentation probability in segment-based speech recognition. Speech Communication 24 1 (1998) 51-72
    • (1998) Speech Communication , vol.24 , Issue.1 , pp. 51-72
    • Verhasselt, J.1    Illina, I.2    Martens, J.-P.3    Gong, Y.4    Haton, J.-P.5
  • 27
    • 33847662518 scopus 로고    scopus 로고
    • Vicsi, K., Tóth, L., Kocsor, A., Csirik, J., 2002. MTBA - A Hungarian Telephone Speech Database. Híradástechnika, LVII (8) (in Hungarian). http://alpha.ttt.bme.hu/speech/hdbMTBA.php.
  • 28
    • 33847628876 scopus 로고    scopus 로고
    • Young, S. et al., 1995. The HMM Toolkit (HTK) - software and manual. http://htk.eng.cam.ac.uk.
  • 29
    • 0028288775 scopus 로고    scopus 로고
    • Zavaliagkos, G., Zhao, J., Schwartz, R., Makhoul, J., 1994. A Hybrid Segmental Neural Net/Hidden Markov Model System for Continuous Speech Recognition. IEEE Trans. Speech and Audio Proc., 2(1), Part II, pp. 151-159.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.