메뉴 건너뛰기




Volumn , Issue , 2009, Pages 173-187

Speechfind: Advances in rich content based spoken document retrieval

Author keywords

[No Author keywords available]

Indexed keywords


EID: 84898268902     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.4018/978-1-59904-879-6.ch017     Document Type: Chapter
Times cited : (1)

References (50)
  • 3
    • 0031177213 scopus 로고    scopus 로고
    • Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models
    • Ahadi, S. M., & Woodland, P. C. (1997). Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 11, 187-206.
    • (1997) Computer Speech and Language , vol.11 , pp. 187-206
    • Ahadi, S.M.1    Woodland, P.C.2
  • 5
    • 44949259254 scopus 로고    scopus 로고
    • A robust fusion method for multilingual spoken document retrieval systems employing tiered resources
    • Pittsburgh
    • Akbacak, M., & Hansen, J. H. L. (2006). A robust fusion method for multilingual spoken document retrieval systems employing tiered resources. In Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006, Pittsburgh (pp. 1177-1180).
    • (2006) Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006 , pp. 1177-1180
    • Akbacak, M.1    Hansen, J.H.L.2
  • 7
    • 33947113758 scopus 로고    scopus 로고
    • Advances in phone-based modeling for automatic accent classification
    • Angkititrakul, P., & Hansen, J. H. L. (2006). Advances in phone-based modeling for automatic accent classification. IEEE Trans. Audio, Speech & Language Proc., 14(2), 634-646.
    • (2006) IEEE Trans. Audio, Speech & Language Proc , vol.14 , Issue.2 , pp. 634-646
    • Angkititrakul, P.1    Hansen, J.H.L.2
  • 9
    • 0030757418 scopus 로고    scopus 로고
    • A study of temporal features and frequency characteristics in American English foreign accent
    • Arslan, L. M., & Hansen, J. H. L. (1997). A study of temporal features and frequency characteristics in American English foreign accent. The Journal of the Acoustical Society of America, 102(1), 28-40.
    • (1997) The Journal of the Acoustical Society of America , vol.102 , Issue.1 , pp. 28-40
    • Arslan, L.M.1    Hansen, J.H.L.2
  • 10
    • 0034229795 scopus 로고    scopus 로고
    • A comparative study of traditional and newly proposed features for recognition of speech under stress
    • Bou-Ghazale, S. E., & Hansen, J. H. L. (2000). A comparative study of traditional and newly proposed features for recognition of speech under stress. IEEE Transactions on Speech & Audio Processing, 8(4), 429-442.
    • (2000) IEEE Transactions on Speech & Audio Processing , vol.8 , Issue.4 , pp. 429-442
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 12
    • 85135272864 scopus 로고    scopus 로고
    • Maximum a posterior linear regression for hidden Markov model adaptation
    • Budapest
    • Chesta, C., Siohan, O., & Lee, C. H. (1999). Maximum a posterior linear regression for hidden Markov model adaptation. In Proceedings of Eurospeech-99, Budapest (pp. 203-206).
    • (1999) Proceedings of Eurospeech-99 , pp. 203-206
    • Chesta, C.1    Siohan, O.2    Lee, C.H.3
  • 13
    • 84874875877 scopus 로고    scopus 로고
    • Maximum a posterior linear regression with elliptically symmetric matrix priors
    • Chou, W. (1999). Maximum a posterior linear regression with elliptically symmetric matrix priors. In Proceedings of Eurospeech (pp. 1-4).
    • (1999) Proceedings of Eurospeech , pp. 1-4
    • Chou, W.1
  • 15
    • 85009150731 scopus 로고    scopus 로고
    • Building a test collection for speech-driven web retrieval
    • Geneva
    • Fujii, A., & Itou, K. (2003). Building a test collection for speech-driven Web retrieval. In Proceedings of Eurospeech-2003, Geneva (pp. 1153-1156).
    • (2003) Proceedings of Eurospeech-2003 , pp. 1153-1156
    • Fujii, A.1    Itou, K.2
  • 16
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Gauvain, J.-L., & Lee, C.-H. (1994). Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. on Speech and Audio Proc., 2, 291-298.
    • (1994) IEEE Trans. On Speech and Audio Proc , vol.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 17
    • 0030283741 scopus 로고    scopus 로고
    • Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
    • Hansen, J. H. L. (1996). Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition. Speech Communications, Special Issue on Speech Under Stress, 20(2), 151-170.
    • (1996) Speech Communications, Special Issue on Speech Under Stress , vol.20 , Issue.2 , pp. 151-170
    • Hansen, J.H.L.1
  • 21
    • 34047274787 scopus 로고    scopus 로고
    • Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora
    • Huang, R., & Hansen, J. H. L. (2006). Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora. IEEE Trans. Audio, Speech and Language Processing, 14(3), 907-919.
    • (2006) IEEE Trans. Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 907-919
    • Huang, R.1    Hansen, J.H.L.2
  • 25
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • Leggetter, C., & Woodland, P. (1995). Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 9, 171-185.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.1    Woodland, P.2
  • 27
    • 0036816475 scopus 로고    scopus 로고
    • Content analysis for audio classification and segmenta tion
    • Lu, L., Zhang, H., & Jiang, H. (2002). Content analysis for audio classification and segmenta tion. IEEE Trans. Speech & Audio Proc., 10(7), 504-516.
    • (2002) IEEE Trans. Speech & Audio Proc , vol.10 , Issue.7 , pp. 504-516
    • Lu, L.1    Zhang, H.2    Jiang, H.3
  • 28
    • 79951784751 scopus 로고    scopus 로고
    • Automatic summarization of broadcast news using structural features
    • Geneva
    • Maskey, S. R., & Hirschberg, J. (2003). Automatic summarization of broadcast news using structural features. In Proceedings of Eurospeech-2003, Geneva (pp. 1173-1176).
    • (2003) Proceedings of Eurospeech-2003 , pp. 1173-1176
    • Maskey, S.R.1    Hirschberg, J.2
  • 30
    • 0034857759 scopus 로고    scopus 로고
    • Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition
    • Mori, K., & Nakagawa, S. (2001). Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition. In Proceedings of the IEEE ICASSP-01: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 1, pp. 413-416).
    • (2001) Proceedings of the IEEE ICASSP-01: Inter. Conf. Acoust. Speech, Sig. Proc , vol.1 , pp. 413-416
    • Mori, K.1    Nakagawa, S.2
  • 31
    • 0035441593 scopus 로고    scopus 로고
    • Spoken language recognition-a step toward multilinguality in speech processing
    • Navratil, J. (2001). Spoken language recognition-a step toward multilinguality in speech processing. IEEE Transactions on Speech & Audio Processing, 9, 678-685.
    • (2001) IEEE Transactions on Speech & Audio Processing , vol.9 , pp. 678-685
    • Navratil, J.1
  • 36
    • 0033688848 scopus 로고    scopus 로고
    • High resolution speech feature parameterization for monophone based stressed speech recognition
    • Sarikaya, R., & Hansen, J. H. L. (2000). High resolution speech feature parameterization for monophone based stressed speech recognition. IEEE Signal Processing Letters, 7(7), 182-185.
    • (2000) IEEE Signal Processing Letters , vol.7 , Issue.7 , pp. 182-185
    • Sarikaya, R.1    Hansen, J.H.L.2
  • 38
    • 85050187568 scopus 로고    scopus 로고
    • Lattice-based search for spoken utterance retrieval
    • Boston
    • Saraclar, M., & Sproat, R. (2004). Lattice-based search for spoken utterance retrieval. In Proceedings of the HLT-NAACL 2004, Boston (pp. 129-136).
    • (2004) Proceedings of the HLT-NAACL 2004 , pp. 129-136
    • Saraclar, M.1    Sproat, R.2
  • 41
    • 0036461005 scopus 로고    scopus 로고
    • Structural maximum a posteriori linear regression for fast HMM adaptation
    • Siohan, O., Myrvoll, T. A., & Lee, C. H. (2002). Structural maximum a posteriori linear regression for fast HMM adaptation. Computer Speech and Language, 16(1), 5-24.
    • (2002) Computer Speech and Language , vol.16 , Issue.1 , pp. 5-24
    • Siohan, O.1    Myrvoll, T.A.2    Lee, C.H.3
  • 42
    • 44949221428 scopus 로고    scopus 로고
    • Analysis of lombard effect under different types and levels of background noise with application to in-set speaker ID systems
    • Pittsburgh
    • Varadarajan, V. S., & Hansen, J. H. L. (2006). Analysis of Lombard effect under different types and levels of background noise with application to in-set speaker ID systems. In Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006, Pittsburgh (pp. 937-940).
    • (2006) Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006 , pp. 937-940
    • Varadarajan, V.S.1    Hansen, J.H.L.2
  • 47
    • 85009164449 scopus 로고    scopus 로고
    • A new perspective on feature extraction for robust invehicle speech recognition
    • Geneva
    • Yapanel, U., & Hansen, J. H. L. (2003). A new perspective on feature extraction for robust invehicle speech recognition. In Proceedings of Eurospeech-03, Geneva (pp. 1281-1284).
    • (2003) Proceedings of Eurospeech-03 , pp. 1281-1284
    • Yapanel, U.1    Hansen, J.H.L.2
  • 49
    • 22544475615 scopus 로고    scopus 로고
    • Efficient audio stream segmentation via the T2 statistic based Bayesian information criterion
    • Zhou, B., & Hansen, J. H. L. (2005a). Efficient audio stream segmentation via the T2 statistic based Bayesian information criterion. IEEE Trans. Speech & Audio Proc., 13(4), 467-474.
    • (2005) IEEE Trans. Speech & Audio Proc , vol.13 , Issue.4 , pp. 467-474
    • Zhou, B.1    Hansen, J.H.L.2
  • 50
    • 22544443963 scopus 로고    scopus 로고
    • Rapid discriminative acoustic modeling based on eigenspace mapping for fast speaker adaptation
    • Zhou, B., & Hansen, J. H. L. (2005b). Rapid discriminative acoustic modeling based on Eigenspace mapping for fast speaker adaptation. IEEE Trans. Speech & Audio Proc., 13(4), 554-564.
    • (2005) IEEE Trans. Speech & Audio Proc , vol.13 , Issue.4 , pp. 554-564
    • Zhou, B.1    Hansen, J.H.L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.