



Volume 19, Issue 2, 2011, Pages 225-241

Analysis of MLP-based hierarchical phoneme posterior probability estimator

Author keywords

Hierarchical systems; multilayer perceptrons; posterior probabilities; Volterra series

Indexed keywords

ACOUSTIC FEATURES; CONDITIONAL PROBABILITIES; FEATURE SPACE; HIERARCHICAL ARCHITECTURES; MLP CLASSIFIERS; MULTI LAYER PERCEPTRON; MULTILAYER PERCEPTRONS; PHONEME RECOGNITION; PHONETIC CONFUSIONS; POSTERIOR PROBABILITY; TEMPORAL PATTERN; TRAINING DATA; VOLTERRA SERIES;

EID: 78049251448     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2045943     Document Type: Article
Times cited: 64

References (58)
  • 1
    • 85032751546
    • Pushing the envelope-Aside
    • N. Morgan et al., "Pushing the envelope-Aside", IEEE Signal Process. Mag., vol. 22, no. 5, pp. 81-88, Sep. 2005.
    • (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 81-88
    • Morgan, N.1
  • 2
    • 34047270914
    • Recent innovations in speech-to-text transcription at SRI-ICSI-UW
    • A. Stolcke et al., "Recent innovations in speech-to-text transcription at SRI-ICSI-UW", IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1729-1744, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.5 , pp. 1729-1744
    • Stolcke, A.1
  • 4
    • 84867209138
    • Transcribing broadcast data using MLP features
    • P. Fousek, L. Lamel, and J.-L. Gauvain, "Transcribing broadcast data using MLP features", in Proc. Interspeech, 2008, pp. 1433-1436.
    • (2008) Proc. Interspeech , pp. 1433-1436
    • Fousek, P.1    Lamel, L.2    Gauvain, J.-L.3
  • 7
    • 0001595997
    • Neural network classifiers estimate Bayesian a posteriori probabilities
    • M. Richard and R. Lippmann, "Neural network classifiers estimate Bayesian a posteriori probabilities", Neural Comput., vol. 3, pp. 461-483, 1991.
    • (1991) Neural Comput. , vol.3 , pp. 461-483
    • Richard, M.1    Lippmann, R.2
  • 9
    • 38549149272
    • Ph. D. dissertation, École Polytechnique Fédérale de Lausanne EPFL, Lausanne, Switzerland
    • H. Misra, "Multi-stream processing for noise robust speech recognition", Ph. D. dissertation, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland, 2006.
    • (2006) Multi-stream Processing for Noise Robust Speech Recognition
    • Misra, H.1
  • 11
  • 14
  • 16
    • 0032653597
    • Size matters: An empirical study of neural network training for large vocabulary continuous speech recognition
    • D. Ellis and N. Morgan, "Size matters: An empirical study of neural network training for large vocabulary continuous speech recognition", in Proc. IEEE Conf. Acoust., Speech, Signal Process. (ICASSP), 1999, vol. 2, pp. 1013-1016.
    • (1999) Proc. IEEE Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.2 , pp. 1013-1016
    • Ellis, D.1    Morgan, N.2
  • 18
    • 64849090489
    • Conditional random fields for integrating local discriminative classifiers
    • J. Morris and E. Fosler-Lussier, "Conditional random fields for integrating local discriminative classifiers", IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 3, pp. 617-628, 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.3 , pp. 617-628
    • Morris, J.1    Fosler-Lussier, E.2
  • 20
  • 21
    • 84872543023
    • Efficient BackProp
    • G. Orr and K.-R. Muller, Eds., Berlin, Germany: Springer-Verlag
    • Y. LeCun, L. Bottou, G. Orr, and K.-R. Muller, "Efficient BackProp", in Neural Networks: Tricks of the Trade, G. Orr and K.-R. Muller, Eds., ser. Lecture Notes in Computer Science, no. 1524. Berlin, Germany: Springer-Verlag, 1998, pp. 9-50.
    • (1998) Neural Networks: Tricks of the Trade, Ser. Lecture Notes in Computer Science , Issue.1524 , pp. 9-50
  • 23
    • 84921836542
    • Learning discriminative temporal patterns in speech: Development of novel TRAPS-like classifiers
    • B. Chen, S. Chang, and S. Sivadas, "Learning discriminative temporal patterns in speech: Development of novel TRAPS-like classifiers", in Proc. ICSLP, 2001, pp. 429-432.
    • (2001) Proc. ICSLP , pp. 429-432
    • Chen, B.1    Chang, S.2    Sivadas, S.3
  • 24
    • 0028516073
    • How do humans process and recognize speech?
    • J. Allen, "How do humans process and recognize speech?", IEEE Trans. Speech. Audio. Process., vol. 2, pp. 567-577, 1994.
    • (1994) IEEE Trans. Speech. Audio. Process. , vol.2 , pp. 567-577
    • Allen, J.1
  • 26
    • 0030355935
    • A new ASR approach based on independent processing and recombination of partial frequency bands
    • H. Bourlard and S. Dupont, "A new ASR approach based on independent processing and recombination of partial frequency bands", in Proc. ICSLP, 1996, pp. 422-425.
    • (1996) Proc. ICSLP , pp. 422-425
    • Bourlard, H.1    Dupont, S.2
  • 28
    • 33745213373
    • Multi-resolution RASTA filtering for tandem based ASR
    • H. Hermansky and P. Fousek, "Multi-resolution RASTA filtering for tandem based ASR", in Proc. Interspeech, 2005, pp. 361-364.
    • (2005) Proc. Interspeech , pp. 361-364
    • Hermansky, H.1    Fousek, P.2
  • 30
    • 84867222011
    • On the combination of auditory and modulation frequency channels for ASR applications
    • F. Valente and H. Hermansky, "On the combination of auditory and modulation frequency channels for ASR applications", in Proc. Interspeech, 2008, pp. 2242-2245.
    • (2008) Proc. Interspeech , pp. 2242-2245
    • Valente, F.1    Hermansky, H.2
  • 32
    • 0028392167
    • An application of recurrent nets to phone probability estimation
    • A. Robinson, "An application of recurrent nets to phone probability estimation", IEEE Trans. Neural Netw., vol. 5, no. 2, pp. 298-305, Mar. 1994.
    • (1994) IEEE Trans. Neural Netw. , vol.5 , Issue.2 , pp. 298-305
    • Robinson, A.1
  • 36
    • 33646788786
    • FMPE: Discriminatively trained features for speech recognition
    • D. Povey et al., "FMPE: Discriminatively trained features for speech recognition", in Proc. IEEE Conf. Acoust., Speech, Signal Process. (ICASSP), 2005, vol. 1, pp. 961-964.
    • (2005) Proc. IEEE Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.1 , pp. 961-964
    • Povey, D.1
  • 38
    • 34249951707
    • Discriminative semi-parametric trajectory models for speech recognition
    • K. Sim and M. Gales, "Discriminative semi-parametric trajectory models for speech recognition", Comput. Speech Lang., vol. 21, no. 4, pp. 669-687, 2007.
    • (2007) Comput. Speech Lang. , vol.21 , Issue.4 , pp. 669-687
    • Sim, K.1    Gales, M.2
  • 40
    • 0024768209
    • Speaker-independent phone recognition using hidden Markov models
    • K.-F. Lee and H.-W. Hon, "Speaker-independent phone recognition using hidden Markov models", IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 11, pp. 1641-1648, Nov. 1989.
    • (1989) IEEE Trans. Acoust., Speech, Signal Process. , vol.37 , Issue.11 , pp. 1641-1648
    • Lee, K.-F.1    Hon, H.-W.2
  • 41
    • 4544236272
    • Development of the 2003 CU-HTK conversational telephone speech transcription system
    • G. Evermann et al., "Development of the 2003 CU-HTK conversational telephone speech transcription system", in Proc. IEEE Conf. Acoust., Speech, Signal Process. (ICASSP), 2004, pp. 249-252.
    • (2004) Proc. IEEE Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 249-252
    • Evermann, G.1
  • 42
    • 4544282337
    • The CU-HTK English CTS system
    • P. C. Woodland et al., "The CU-HTK English CTS system", in Proc. Rich Transcription Workshop, 2003 [Online]. Available: http://www.itl.nist.gov/iad/mig/tests/rt/2003-spring/presentations/ctsslides. 2up. letter.pdf
    • (2003) Proc. Rich Transcription Workshop
    • Woodland, P.C.1
  • 44
    • 33745533621
    • The development of AMI system for transcription of speech in meetings
    • S. Renals and S. Bengio, Eds., Springer-Verlag
    • T. Hain et al., "The development of AMI system for transcription of speech in meetings", in Proc. Mach. Learn. Multimodal Interaction: 2nd Int. Workshop, Revised Sel. Papers, S. Renals and S. Bengio, Eds., 2005, no. 3869, pp. 344-356, Springer-Verlag.
    • (2005) Proc. Mach. Learn. Multimodal Interaction: 2nd Int. Workshop, Revised Sel. Papers , Issue.3869 , pp. 344-356
    • Hain, T.1
  • 45
    • 33745184705
    • Documentation and User Guide to UNISYN Lexicon and Post-Lexical Rules
    • Univ. of Edinburgh, Tech. Rep.
    • S. Fitt, "Documentation and User Guide to UNISYN Lexicon and Post-Lexical Rules", Center for Speech Technol. Res., Univ. of Edinburgh, 2000, Tech. Rep.
    • (2000) Center for Speech Technol. Res.
    • Fitt, S.1
  • 46
    • 78049283751
    • The ICSI Quicknet Software Package
    • The ICSI Quicknet Software Package [Online]. Available: http://www.icsi.berkeley.edu/Speech/qn.html
  • 47
    • 78049255318
    • SRILM-The SRI Language Modeling Toolkit
    • SRILM-The SRI Language Modeling Toolkit [Online]. Available: http://www.speech.sri.com/projects/srilm
  • 48
    • 70450152774
    • Juicer: A weighted finite state transducer speech decoder
    • S. Renals, S. Bengio, and J. Fiscus, Eds., Springer-Verlag
    • D. Moore et al., "Juicer: A weighted finite state transducer speech decoder", in Proc. Mach. Learn. for Multimodal Interaction: 3rd Int. Workshop, Revised Sel. Papers, S. Renals, S. Bengio, and J. Fiscus, Eds., 2006, no. 4299, pp. 285-296, Springer-Verlag.
    • (2006) Proc. Mach. Learn. for Multimodal Interaction: 3rd Int. Workshop, Revised Sel , Issue.4299 , pp. 285-296
    • Moore, D.1
  • 52
    • 0033115776
    • Volterra series analysis and synthesis of a neural network for velocity estimation
    • W. Gray and B. Nabet, "Volterra series analysis and synthesis of a neural network for velocity estimation", IEEE Trans. Syst., Man, Cybern., vol. 29, no. 2, pp. 190-197, Apr. 1999.
    • (1999) IEEE Trans. Syst., Man, Cybern. , vol.29 , Issue.2 , pp. 190-197
    • Gray, W.1    Nabet, B.2
  • 53
    • 0028312539
    • Nonlinear noise filtering and beamforming using the perceptron and its Volterra approximation
    • W. Knecht, "Nonlinear noise filtering and beamforming using the perceptron and its Volterra approximation", IEEE Trans. Speech Audio Process., vol. 2, no. 1, pp. 55-62, Jan. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.1 , pp. 55-62
    • Knecht, W.1
  • 54
    • 10944245424
    • Volterra series and neural networks to model an electronic device nonlinear behavior
    • G. Stegmayer, "Volterra series and neural networks to model an electronic device nonlinear behavior", in Proc. IEEE Conf. Neural Netw., 2004, vol. 4, pp. 2907-2910.
    • (2004) Proc. IEEE Conf. Neural Netw. , vol.4 , pp. 2907-2910
    • Stegmayer, G.1
  • 57
    • 84867199768
    • Combining evidence from a generative and a discriminative model in phoneme recognition
    • J. Pinto and H. Hermansky, "Combining evidence from a generative and a discriminative model in phoneme recognition", in Proc. Interspeech, 2008, pp. 2414-2417.
    • (2008) Proc. Interspeech , pp. 2414-2417
    • Pinto, J.1    Hermansky, H.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.