SCOPUS 정보 검색 플랫폼

Handbook of Research on Digital Libraries: Design, Development, and Impact

Volumn , Issue , 2009, Pages 173-187

Speechfind: Advances in rich content based spoken document retrieval

(2) Kim, Wooil a Hansen, John H L a

a The University of Texas at Dallas (United States)

Author keywords

[No Author keywords available]

Indexed keywords

EID: 84898268902 PISSN: None EISSN: None Source Type: Book
DOI: 10.4018/978-1-59904-879-6.ch017 Document Type: Chapter

Times cited : (1)

References (50)

1
- 0342321463
- The THISL broadcast new retrieval system
- Abberley, D., Kirby, D., Renals, S., & Robinson, T. (1999). The THISL broadcast new retrieval system. In Proceedings of the ESCA ETRW Workshop Accessing Information in Spoken Audio (pp.14-19).
- (1999) Proceedings of the ESCA ETRW Workshop Accessing Information in Spoken Audio , pp. 14-19
- Abberley, D.¹ Kirby, D.² Renals, S.³ Robinson, T.⁴

2
- 84893452104
- Paper presented at ICASSP-02
- Adami, A., Kajarekar, S., & Hermansky, H. (2002). A new speaker change detection method for two-speaker segmentation. Paper presented at ICASSP-02.
- (2002) A New Speaker Change Detection Method for Two-speaker Segmentation
- Adami, A.¹ Kajarekar, S.² Hermansky, H.³

3
- 0031177213
- Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models
- Ahadi, S. M., & Woodland, P. C. (1997). Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 11, 187-206.
- (1997) Computer Speech and Language , vol.11 , pp. 187-206
- Ahadi, S.M.¹ Woodland, P.C.²

4
- 0141702085
- Environmental sniffing: Noise knowledge estimation for robust speech systems
- Hong Kong
- Akbacak, M., & Hansen, J. H. L. (2003). Environmental sniffing: Noise knowledge estimation for robust speech systems. In Proceedings of the IEEE ICASSP-2003: Inter. Conf. Acoust. Speech & Signal, Hong Kong (Vol. 2, pp. 113-116).
- (2003) Proceedings of the IEEE ICASSP-2003: Inter. Conf. Acoust. Speech & Signal , vol.2 , pp. 113-116
- Akbacak, M.¹ Hansen, J.H.L.²

5
- 44949259254
- A robust fusion method for multilingual spoken document retrieval systems employing tiered resources
- Pittsburgh
- Akbacak, M., & Hansen, J. H. L. (2006). A robust fusion method for multilingual spoken document retrieval systems employing tiered resources. In Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006, Pittsburgh (pp. 1177-1180).
- (2006) Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006 , pp. 1177-1180
- Akbacak, M.¹ Hansen, J.H.L.²

6
- 50449102573
- Environmental sniffing: Noise knowledge estimation for robust speech systems
- Akbacak, M., & Hansen, J. H. L. (2007). Environmental sniffing: Noise knowledge estimation for robust speech systems. IEEE Transactions on Audio, Speech and Language Processing, 15(2), 465-477.
- (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , Issue.2 , pp. 465-477
- Akbacak, M.¹ Hansen, J.H.L.²

7
- 33947113758
- Advances in phone-based modeling for automatic accent classification
- Angkititrakul, P., & Hansen, J. H. L. (2006). Advances in phone-based modeling for automatic accent classification. IEEE Trans. Audio, Speech & Language Proc., 14(2), 634-646.
- (2006) IEEE Trans. Audio, Speech & Language Proc , vol.14 , Issue.2 , pp. 634-646
- Angkititrakul, P.¹ Hansen, J.H.L.²

8
- 50249182472
- Discriminative in-set/out-of-set speaker recognition
- Angkititrakul, P., & Hansen, J. H. L. (2007). Discriminative in-set/out-of-set speaker recognition. IEEE Transactions on Audio, Speech and Language Processing, 15(2), 498-508.
- (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , Issue.2 , pp. 498-508
- Angkititrakul, P.¹ Hansen, J.H.L.²

9
- 0030757418
- A study of temporal features and frequency characteristics in American English foreign accent
- Arslan, L. M., & Hansen, J. H. L. (1997). A study of temporal features and frequency characteristics in American English foreign accent. The Journal of the Acoustical Society of America, 102(1), 28-40.
- (1997) The Journal of the Acoustical Society of America , vol.102 , Issue.1 , pp. 28-40
- Arslan, L.M.¹ Hansen, J.H.L.²

10
- 0034229795
- A comparative study of traditional and newly proposed features for recognition of speech under stress
- Bou-Ghazale, S. E., & Hansen, J. H. L. (2000). A comparative study of traditional and newly proposed features for recognition of speech under stress. IEEE Transactions on Speech & Audio Processing, 8(4), 429-442.
- (2000) IEEE Transactions on Speech & Audio Processing , vol.8 , Issue.4 , pp. 429-442
- Bou-Ghazale, S.E.¹ Hansen, J.H.L.²

11
- 0002595416
- Speaker, environment and channel change detection and clustering via the Bayesian information criterion
- Chen, S., & Gopalakrishnan, P. (1998). Speaker, environment and channel change detection and clustering via the Bayesian information criterion. In Proceedings of the Broadcast News Trans. & Under. Workshop.
- (1998) Proceedings of the Broadcast News Trans. & Under. Workshop
- Chen, S.¹ Gopalakrishnan, P.²

12
- 85135272864
- Maximum a posterior linear regression for hidden Markov model adaptation
- Budapest
- Chesta, C., Siohan, O., & Lee, C. H. (1999). Maximum a posterior linear regression for hidden Markov model adaptation. In Proceedings of Eurospeech-99, Budapest (pp. 203-206).
- (1999) Proceedings of Eurospeech-99 , pp. 203-206
- Chesta, C.¹ Siohan, O.² Lee, C.H.³

13
- 84874875877
- Maximum a posterior linear regression with elliptically symmetric matrix priors
- Chou, W. (1999). Maximum a posterior linear regression with elliptically symmetric matrix priors. In Proceedings of Eurospeech (pp. 1-4).
- (1999) Proceedings of Eurospeech , pp. 1-4
- Chou, W.¹

14
- 84898393614
- Paper presented at ICASSP-01, Utah
- Dharanipragada, S., & Rao, B. (2001). MVDR-based feature extraction for robust speech recognition. Paper presented at ICASSP-01, Utah.
- (2001) MVDR-based Feature Extraction for Robust Speech Recognition
- Dharanipragada, S.¹ Rao, B.²

15
- 85009150731
- Building a test collection for speech-driven web retrieval
- Geneva
- Fujii, A., & Itou, K. (2003). Building a test collection for speech-driven Web retrieval. In Proceedings of Eurospeech-2003, Geneva (pp. 1153-1156).
- (2003) Proceedings of Eurospeech-2003 , pp. 1153-1156
- Fujii, A.¹ Itou, K.²

16
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Gauvain, J.-L., & Lee, C.-H. (1994). Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. on Speech and Audio Proc., 2, 291-298.
- (1994) IEEE Trans. On Speech and Audio Proc , vol.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

17
- 0030283741
- Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
- Hansen, J. H. L. (1996). Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition. Speech Communications, Special Issue on Speech Under Stress, 20(2), 151-170.
- (1996) Speech Communications, Special Issue on Speech Under Stress , vol.20 , Issue.2 , pp. 151-170
- Hansen, J.H.L.¹

18
- 85008020310
- SpeechFind: Advances in spoken document retrieval for a national gallery of the spoken word
- Hansen, J. H. L., Huang, R., Zhou, B., Seadle, M., Deller, J. R., Jr., Gurijala, A. R., et al. (2005). SpeechFind: Advances in spoken document retrieval for a national gallery of the spoken word. IEEE Trans. on Speech and Audio Proc., 13(5), 712-730.
- (2005) IEEE Trans. On Speech and Audio Proc , vol.13 , Issue.5 , pp. 712-730
- Hansen, J.H.L.¹ Huang, R.² Zhou, B.³ Seadle, M.⁴ Deller Jr., J.R.⁵ Gurijala, A.R.⁶

19
- 85009083936
- Audio stream phrase recognition for a national gallery of the spoken word: 'One small step'
- Beijing
- Hansen, J. H. L., Zhou, B., Akbacak, M., Sarikaya, R., & Pellom, B. (2000). Audio stream phrase recognition for a national gallery of the spoken word: 'One small step'. In Proceedings of the ICSLP-2000: Inter. Conf. Spoken Lang. Proc., Beijing (Vol. 3, pp. 1089-1092).
- (2000) Proceedings of the ICSLP-2000: Inter. Conf. Spoken Lang. Proc , vol.3 , pp. 1089-1092
- Hansen, J.H.L.¹ Zhou, B.² Akbacak, M.³ Sarikaya, R.⁴ Pellom, B.⁵

20
- 0033705979
- Automatic speech summarization based on word significance and linguistic likelihood
- Hori, C., & Furui, S. (2000). Automatic speech summarization based on word significance and linguistic likelihood. In Proceedings of the IEEE ICASSP-00: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 3, pp. 1579-1582).
- (2000) Proceedings of the IEEE ICASSP-00: Inter. Conf. Acoust. Speech, Sig. Proc , vol.3 , pp. 1579-1582
- Hori, C.¹ Furui, S.²

21
- 34047274787
- Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora
- Huang, R., & Hansen, J. H. L. (2006). Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora. IEEE Trans. Audio, Speech and Language Processing, 14(3), 907-919.
- (2006) IEEE Trans. Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 907-919
- Huang, R.¹ Hansen, J.H.L.²

22
- 64149085238
- Dialect/accent classification using unrestricted audio
- Huang, R., & Hansen, J. H. L. (2007). Dialect/accent classification using unrestricted audio. IEEE Trans. on Audio Speech and Language Processing, 15(2), 453-464.
- (2007) IEEE Trans. On Audio Speech and Language Processing , vol.15 , Issue.2 , pp. 453-464
- Huang, R.¹ Hansen, J.H.L.²

23
- 84898168713
- Paper presented at the EUSIPCO-2004, 12th European Signal Processing Conference, Vienna, Austria, Paper
- Kurimo, M., Zhou, B., Huang, R., & Hansen, J. H. L. (2004). Language modeling structures in audio transcription for retrieval of historical speeches. Paper presented at the EUSIPCO-2004, 12th European Signal Processing Conference, Vienna, Austria (Paper 1530).
- (2004) Language Modeling Structures in Audio Transcription for Retrieval of Historical Speeches , pp. 1530
- Kurimo, M.¹ Zhou, B.² Huang, R.³ Hansen, J.H.L.⁴

24
- 0141496213
- Unsupervised language model adaptation for broadcast news
- Langzhou, C., Gauvain, J.-L., Lamel, L., & Adda, G. (2003). Unsupervised language model adaptation for broadcast news. In Proceedings of the IEEE ICASSP-03: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 1, pp. 220-223).
- (2003) Proceedings of the IEEE ICASSP-03: Inter. Conf. Acoust. Speech, Sig. Proc , vol.1 , pp. 220-223
- Langzhou, C.¹ Gauvain, J.-L.² Lamel, L.³ Adda, G.⁴

25
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- Leggetter, C., & Woodland, P. (1995). Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 9, 171-185.
- (1995) Computer Speech and Language , vol.9 , pp. 171-185
- Leggetter, C.¹ Woodland, P.²

26
- 84898231233
- France, ACM Multimedia
- Lu, L., & Zhang, H. (2002). Speaker change detection and tracking in real-time news broadcasting analysis. France: ACM Multimedia.
- (2002) Speaker Change Detection and Tracking in Real-time News Broadcasting Analysis
- Lu, L.¹ Zhang, H.²

27
- 0036816475
- Content analysis for audio classification and segmenta tion
- Lu, L., Zhang, H., & Jiang, H. (2002). Content analysis for audio classification and segmenta tion. IEEE Trans. Speech & Audio Proc., 10(7), 504-516.
- (2002) IEEE Trans. Speech & Audio Proc , vol.10 , Issue.7 , pp. 504-516
- Lu, L.¹ Zhang, H.² Jiang, H.³

28
- 79951784751
- Automatic summarization of broadcast news using structural features
- Geneva
- Maskey, S. R., & Hirschberg, J. (2003). Automatic summarization of broadcast news using structural features. In Proceedings of Eurospeech-2003, Geneva (pp. 1173-1176).
- (2003) Proceedings of Eurospeech-2003 , pp. 1173-1176
- Maskey, S.R.¹ Hirschberg, J.²

29
- 0141702097
- Towards domain independent speaker clustering
- Moh, Y., Nguyen, P., & Junqua, J.-C. (2003). Towards domain independent speaker clustering. In Proceedings of the IEEE ICASSP-03: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 2, pp. 85-88).
- (2003) Proceedings of the IEEE ICASSP-03: Inter. Conf. Acoust. Speech, Sig. Proc , vol.2 , pp. 85-88
- Moh, Y.¹ Nguyen, P.² Junqua, J.-C.³

30
- 0034857759
- Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition
- Mori, K., & Nakagawa, S. (2001). Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition. In Proceedings of the IEEE ICASSP-01: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 1, pp. 413-416).
- (2001) Proceedings of the IEEE ICASSP-01: Inter. Conf. Acoust. Speech, Sig. Proc , vol.1 , pp. 413-416
- Mori, K.¹ Nakagawa, S.²

31
- 0035441593
- Spoken language recognition-a step toward multilinguality in speech processing
- Navratil, J. (2001). Spoken language recognition-a step toward multilinguality in speech processing. IEEE Transactions on Speech & Audio Processing, 9, 678-685.
- (2001) IEEE Transactions on Speech & Audio Processing , vol.9 , pp. 678-685
- Navratil, J.¹

32
- 0032665630
- Experiments in topic indexing of broadcast news using neural networks
- Neukirchen, C., Willett, D., & Rigoll, G. (1999). Experiments in topic indexing of broadcast news using neural networks. In Proceedings of the IEEE ICASSP-99: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 2, pp. 1093-1096).
- (1999) Proceedings of the IEEE ICASSP-99: Inter. Conf. Acoust. Speech, Sig. Proc , vol.2 , pp. 1093-1096
- Neukirchen, C.¹ Willett, D.² Rigoll, G.³

33
- 85135155427
- A comparative study of speaker adaptation techniques
- Neumeyer, L. R., Sankar, A., & Digalakis, V. V. (1995). A comparative study of speaker adaptation techniques. In Proceedings of Eurospeech-95 (pp. 1127-1130).
- (1995) Proceedings of Eurospeech-95 , pp. 1127-1130
- Neumeyer, L.R.¹ Sankar, A.² Digalakis, V.V.³

34
- 0003411512
- (Tech. Rep.). Cambridge University
- Robertson, S. E., & Sparck Jones, K. (1997). Simple, proven approaches to text retrieval (Tech. Rep.). Cambridge University.
- (1997) Simple, Proven Approaches to Text Retrieval
- Robertson, S.E.¹ Sparck Jones, K.²

35
- 0003173603
- Okapi/Keenbow at TREC-8)
- Robertson, S. E., & Walker, S. (1999). Okapi/Keenbow at TREC-8). In Proceedings of TREC-8.
- (1999) Proceedings of TREC-8
- Robertson, S.E.¹ Walker, S.²

36
- 0033688848
- High resolution speech feature parameterization for monophone based stressed speech recognition
- Sarikaya, R., & Hansen, J. H. L. (2000). High resolution speech feature parameterization for monophone based stressed speech recognition. IEEE Signal Processing Letters, 7(7), 182-185.
- (2000) IEEE Signal Processing Letters , vol.7 , Issue.7 , pp. 182-185
- Sarikaya, R.¹ Hansen, J.H.L.²

37
- 85009271609
- Towards automatic closed captioning: Low latency real time broadcast news transcription
- Denver
- Saraclar, M., Riley, M., Bocchieri, E., & Goffin, V. (2002). Towards automatic closed captioning: Low latency real time broadcast news transcription. In Proceedings of the ICSLP-2002: Inter. Conf. Spoken Lang., Denver (pp. 1741-1744).
- (2002) Proceedings of the ICSLP-2002: Inter. Conf. Spoken Lang , pp. 1741-1744
- Saraclar, M.¹ Riley, M.² Bocchieri, E.³ Goffin, V.⁴

38
- 85050187568
- Lattice-based search for spoken utterance retrieval
- Boston
- Saraclar, M., & Sproat, R. (2004). Lattice-based search for spoken utterance retrieval. In Proceedings of the HLT-NAACL 2004, Boston (pp. 129-136).
- (2004) Proceedings of the HLT-NAACL 2004 , pp. 129-136
- Saraclar, M.¹ Sproat, R.²

39
- 0030640789
- Structural MAP speaker adaptation using hierarchical priors
- Santa Barbara, CA
- Shinoda, K., & Lee, C. H. (1997). Structural MAP speaker adaptation using hierarchical priors. In Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, Santa Barbara, CA (pp. 381-388).
- (1997) Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 381-388
- Shinoda, K.¹ Lee, C.H.²

40
- 85009102300
- Paper presented at the 22nd ACM SIGIR Conference, Berkeley, CA
- Singhal, A., & Pereira, F. (1999). Document expansion for speech retrieval. Paper presented at the 22nd ACM SIGIR Conference, Berkeley, CA.
- (1999) Document Expansion for Speech Retrieval
- Singhal, A.¹ Pereira, F.²

41
- 0036461005
- Structural maximum a posteriori linear regression for fast HMM adaptation
- Siohan, O., Myrvoll, T. A., & Lee, C. H. (2002). Structural maximum a posteriori linear regression for fast HMM adaptation. Computer Speech and Language, 16(1), 5-24.
- (2002) Computer Speech and Language , vol.16 , Issue.1 , pp. 5-24
- Siohan, O.¹ Myrvoll, T.A.² Lee, C.H.³

42
- 44949221428
- Analysis of lombard effect under different types and levels of background noise with application to in-set speaker ID systems
- Pittsburgh
- Varadarajan, V. S., & Hansen, J. H. L. (2006). Analysis of Lombard effect under different types and levels of background noise with application to in-set speaker ID systems. In Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006, Pittsburgh (pp. 937-940).
- (2006) Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006 , pp. 937-940
- Varadarajan, V.S.¹ Hansen, J.H.L.²

43
- 0032678104
- Probabilistic models for topic detection and tracking
- Walls, F., Jin, H., Sista, S., & Schwartz, R. (1999). Probabilistic models for topic detection and tracking. In Proceedings of the IEEE ICASSP-99: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 1, pp. 521-524).
- (1999) Proceedings of the IEEE ICASSP-99: Inter. Conf. Acoust. Speech, Sig. Proc , vol.1 , pp. 521-524
- Walls, F.¹ Jin, H.² Sista, S.³ Schwartz, R.⁴

44
- 0034852839
- Multi-scale-audio indexing for translingual spoken document retrieval
- Wang, H.-M., Meng, H., Schone, P., Chen, B., & Lo, W.-K. (2001). Multi-scale-audio indexing for translingual spoken document retrieval. In Proceedings of the IEEE ICASSP-01: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 1, pp. 605-608).
- (2001) Proceedings of the IEEE ICASSP-01: Inter. Conf. Acoust. Speech, Sig. Proc , vol.1 , pp. 605-608
- Wang, H.-M.¹ Meng, H.² Schone, P.³ Chen, B.⁴ Lo, W.-K.⁵

45
- 0003756969
- Morgan Kaufmann
- Witten, I. H., Moffat, A., & Bell, T. C. (1999). Managing gigabytes: Compressing and indexing documents and images. Morgan Kaufmann.
- (1999) Managing Gigabytes: Compressing and Indexing Documents and Images
- Witten, I.H.¹ Moffat, A.² Bell, T.C.³

46
- 0002615167
- Speaker adaptation: Techniques and challenges
- Keystone, CO
- Woodland, P. C. (1999). Speaker adaptation: Techniques and challenges. In Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, Keystone, CO (pp. 85-90).
- (1999) Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding , pp. 85-90
- Woodland, P.C.¹

47
- 85009164449
- A new perspective on feature extraction for robust invehicle speech recognition
- Geneva
- Yapanel, U., & Hansen, J. H. L. (2003). A new perspective on feature extraction for robust invehicle speech recognition. In Proceedings of Eurospeech-03, Geneva (pp. 1281-1284).
- (2003) Proceedings of Eurospeech-03 , pp. 1281-1284
- Yapanel, U.¹ Hansen, J.H.L.²

48
- 85009275098
- SPEECHFIND: An experimental on-line spoken document retrieval system for historical audio archives
- Denver
- Zhou, B., & Hansen, J. H. L. (2002). SPEECHFIND: An experimental on-line spoken document retrieval system for historical audio archives. In Proceedings of the ICSLP-2002: International Conferference on Spoken Language Processing, Denver (Vol. 3, pp. 1969-1972).
- (2002) Proceedings of the ICSLP-2002: International Conferference on Spoken Language Processing , vol.3 , pp. 1969-1972
- Zhou, B.¹ Hansen, J.H.L.²

49
- 22544475615
- Efficient audio stream segmentation via the T2 statistic based Bayesian information criterion
- Zhou, B., & Hansen, J. H. L. (2005a). Efficient audio stream segmentation via the T2 statistic based Bayesian information criterion. IEEE Trans. Speech & Audio Proc., 13(4), 467-474.
- (2005) IEEE Trans. Speech & Audio Proc , vol.13 , Issue.4 , pp. 467-474
- Zhou, B.¹ Hansen, J.H.L.²

50
- 22544443963
- Rapid discriminative acoustic modeling based on eigenspace mapping for fast speaker adaptation
- Zhou, B., & Hansen, J. H. L. (2005b). Rapid discriminative acoustic modeling based on Eigenspace mapping for fast speaker adaptation. IEEE Trans. Speech & Audio Proc., 13(4), 554-564.
- (2005) IEEE Trans. Speech & Audio Proc , vol.13 , Issue.4 , pp. 554-564
- Zhou, B.¹ Hansen, J.H.L.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.