Volume 13, Issue 5, 2005, Pages 712-730

SpeechFind: Advances in Spoken Document Retrieval for a National Gallery of the Spoken Word

EID: 85008020310     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.852088     Document Type: Article
Times cited: 88

References (104)
  • 1
    • 85008030749
    • [Online]. Available
    • [Online]. Available: http://www.ngsw.org
  • 2
    • 85008036539
    • [Online]. Available: (Original website) http://speechfind.utdallas.edu/
    • [Online]. Available: (Original website) http://speechfind.colorado.edu/; http://speechfind.utdallas.edu/
  • 3
    • 85009275098
    • SPEECHFIND: An experimental on-line spoken document retrieval system for historical audio archives
    • Denver, CO, Sep.
    • B. Zhou and J. H. L. Hansen, “SPEECHFIND: An experimental on-line spoken document retrieval system for historical audio archives,” in Proc. Int. Conf. Spoken Language Process., vol. 3, Denver, CO, Sep. 2002, pp. 1969–1972.
    • (2002) Proc. Int. Conf. Spoken Language Process. , vol.3 , pp. 1969-1972
    • Zhou, B.1    Hansen, J.H.L.2
  • 4
    • 85009083936
    • Audio stream phrase recognition for a National Gallery of the Spoken Word: ‘One small step’
    • Beijing, China, Oct.
    • J. H. L. Hansen, B. Zhou, M. Akbacak, R. Sarikaya, and B. Pellom, “Audio stream phrase recognition for a National Gallery of the Spoken Word: ‘One small step’,” in Proc. Int. Conf. Spoken Lang. Process., vol. 3, Beijing, China, Oct. 2000, pp. 1089–1092.
    • (2000) Proc. Int. Conf. Spoken Lang. Process. , vol.3 , pp. 1089-1092
    • Hansen, J.H.L.1    Zhou, B.2    Akbacak, M.3    Sarikaya, R.4    Pellom, B.5
  • 5
    • 84901265818
    • Engineering challenges in the creation of a National Gallery of the Spoken Word: Transcript-free search of audio archives
    • Roanoke, VA, Jun.
    • J. H. L. Hansen, J. Deller, and M. Seadle, “Engineering challenges in the creation of a National Gallery of the Spoken Word: Transcript-free search of audio archives,” in Proc. IEEE ACM Joint Conf. Digital Libraries, Roanoke, VA, Jun. 2001, pp. 235–236.
    • (2001) Proc. IEEE ACM Joint Conf. Digital Libraries , pp. 235-236
    • Hansen, J.H.L.1    Deller, J.2    Seadle, M.3
  • 8
    • 0036288688
    • A new speaker change detection method for two-speaker segmentation
    • A. Adami, S. Kajarekar, and H. Hermansky, “A new speaker change detection method for two-speaker segmentation,” in Proc. ICASSP, 2002.
    • (2002) Proc. ICASSP
    • Adami, A.1    Kajarekar, S.2    Hermansky, H.3
  • 9
    • 0037700756
    • Speaker change detection and tracking in real-time news broadcasting analysis
    • Paris, France, Dec.
    • L. Lu and H. Zhang, “Speaker change detection and tracking in real-time news broadcasting analysis,” in Proc. ACM Multimedia, Paris, France, Dec. 2002.
    • (2002) Proc. ACM Multimedia
    • Lu, L.1    Zhang, H.2
  • 10
    • 85009164449
    • A new perspective on feature extraction for robust in-vehicle speech recognition
    • Geneva, Switzerland, Sep.
    • U. Yapanel and J. H. L. Hansen, “A new perspective on feature extraction for robust in-vehicle speech recognition,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 1281–1284.
    • (2003) Proc. Eurospeech , pp. 1281-1284
    • Yapanel, U.1    Hansen, J.H.L.2
  • 11
    • 0002782496
    • Automatic segmentation, classification and clustering of broadcast news audio
    • Chantilly, VA
    • M. Siegler, U. Jain, B. Raj, and R. M. Stern, “Automatic segmentation, classification and clustering of broadcast news audio,” in Proc. DARPA Speech Recog. Workshop, Chantilly, VA, 1997, pp. 97–99.
    • (1997) Proc. DARPA Speech Recog. Workshop , pp. 97-99
    • Siegler, M.1    Jain, U.2    Raj, B.3    Stern, R.M.4
  • 12
    • 3543118757
    • Speaker, environment and channel change detection and clustering via the Bayesian information criterion
    • S. Chen and P. Gopalakrishnan, “Speaker, environment and channel change detection and clustering via the Bayesian information criterion,” in Proc. Broadcast News Trans. Under. Workshop, 1998.
    • (1998) Proc. Broadcast News Trans. Under. Workshop
    • Chen, S.1    Gopalakrishnan, P.2
  • 13
    • 0034842452
    • MVDR-based feature extraction for robust speech recognition
    • Salt Lake City, UT
    • S. Dharanipragada and B. Rao, “MVDR-based feature extraction for robust speech recognition,” in ICASSP, Salt Lake City, UT, 2001.
    • (2001) ICASSP
    • Dharanipragada, S.1    Rao, B.2
  • 14
    • 0002751623
    • Segment generation and clustering in the HTK: Broadcast news transcription system
    • Herndon, VA
    • T. Hain, S. Johnson, A. Tuerk, P. Woodland, and S. Young, “Segment generation and clustering in the HTK: Broadcast news transcription system,” in DARPA Broadcast News Workshop, Herndon, VA, 1998.
    • (1998) DARPA Broadcast News Workshop
    • Hain, T.1    Johnson, S.2    Tuerk, A.3    Woodland, P.4    Young, S.5
  • 15
    • 4544369704
    • Unsupervised audio segmentation and classification for robust spoken document retrieval
    • Montreal, QC, Canada, May
    • R. Huang and J. H. L. Hansen, “Unsupervised audio segmentation and classification for robust spoken document retrieval,” in Proc. IEEE ICASSP, vol. 1, Montreal, QC, Canada, May 2004, pp. 741–744.
    • (2004) Proc. IEEE ICASSP , vol.1 , pp. 741-744
    • Huang, R.1    Hansen, J.H.L.2
  • 16
    • 22544475615
    • Efficient audio stream segmentation via T2 statistic based Bayesian information criterion (T2-BIC)
    • Jul.
    • B. Zhou and J. H. L. Hansen, “Efficient audio stream segmentation via T2 statistic based Bayesian information criterion (T2-BIC),” IEEE Trans. Speech Audio Process., vol. 13, no. 4, Jul. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.4
    • Zhou, B.1    Hansen, J.H.L.2
  • 17
    • 0003990972
    • Managing Gigabytes: Compressing and Indexing Documents and Images
    • San Francisco, CA: Morgan Kaufmann
    • I. H. Witten, A. Moffat, and T. C. Bell, Managing Gigabytes: Compressing and Indexing Documents and Images. San Francisco, CA: Morgan Kaufmann, 1999.
    • (1999)
    • Witten, I.H.1    Moffat, A.2    Bell, T.C.3
  • 18
    • 80053431219
    • Introduction to latent semantic analysis
    • T. Landauer, P. Foltz, and D. Laham, “Introduction to latent semantic analysis,” Discourse Processes, vol. 25, pp. 259–284, 1998.
    • (1998) Discourse Processes , vol.25 , pp. 259-284
    • Landauer, T.1    Foltz, P.2    Laham, D.3
  • 19
    • 85008059184
    • N. District Court Calif
    • “N. District Court Calif.,” A&M Records, Inc. v. Napster, Inc., 99–5183, 2000.
    • (2000) A&M Records, Inc. v. Napster, Inc. , no. 99-5183
  • 20
    • 85008019005
    • 11th Circuit Court of Appeals
    • “11th Circuit Court of Appeals,” Estate of Martin Luther King v. CBS, 98–9079, 1999.
    • (1999) Estate of Martin Luther King v. CBS , no. 98-9079
  • 21
    • 84886521049
    • Copyright in the networked world: New rules for images
    • M. Seadle, “Copyright in the networked world: New rules for images,” Library Hi Tech., vol. 20, no. 2, 2002.
    • (2002) Library Hi Tech. , vol.20 , Issue.2
    • Seadle, M.1
  • 22
    • 3042593473
    • Whose rules? Intellectual property, culture, and indigenous communities
    • Mar.
    • M. Seadle, “Whose rules? Intellectual property, culture, and indigenous communities,” D-Lib Mag., vol. 8, no. 3, Mar. 2002.
    • (2002) D-Lib Mag. , vol.8 , Issue.3
    • Seadle, M.1
  • 23
    • 64349124357
    • Copyright in the networked world: Multimedia fair use
    • M. Seadle, “Copyright in the networked world: Multimedia fair use,” Library Hi Tech., vol. 19, no. 4, 2001.
    • (2001) Library Hi Tech. , vol.19 , Issue.4
    • Seadle, M.1
  • 24
    • 3042602303
    • Spoken words, unspoken meanings: A DLI2 project ethnography
    • Nov.
    • M. Seadle, “Spoken words, unspoken meanings: A DLI2 project ethnography,” D-Lib Mag., Nov. 2000.
    • (2000) D-Lib Mag.
    • Seadle, M.1
  • 25
    • 0030283741
    • Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
    • Nov.
    • J. H. L. Hansen, “Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition,” Speech Commun., Special Issue on Speech Under Stress, vol. 20, no. 2, pp. 151–170, Nov. 1996.
    • (1996) Speech Commun., Special Issue on Speech Under Stress , vol.20 , Issue.2 , pp. 151-170
    • Hansen, J.H.L.1
  • 26
    • 0033688848
    • High resolution speech feature parameterization for monophone based stressed speech recognition
    • Jul.
    • R. Sarikaya and J. H. L. Hansen, “High resolution speech feature parameterization for monophone based stressed speech recognition,” IEEE Signal Process. Lett., vol. 7, no. 7, pp. 182–185, Jul. 2000.
    • (2000) IEEE Signal Process. Lett. , vol.7 , Issue.7 , pp. 182-185
    • Sarikaya, R.1    Hansen, J.H.L.2
  • 27
    • 0034229795
    • A comparative study of traditional and newly proposed features for recognition of speech under stress
    • Jul.
    • S. E. Bou-Ghazale and J. H. L. Hansen, “A comparative study of traditional and newly proposed features for recognition of speech under stress,” IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 429–442, Jul. 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.4 , pp. 429-442
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 28
    • 0030757418
    • A study of temporal features and frequency characteristics in American English foreign accent
    • Jul.
    • L. M. Arslan and J. H. L. Hansen, “A study of temporal features and frequency characteristics in American English foreign accent,” J. Acoust. Soc. Amer., vol. 102, no. 1, pp. 28–40, Jul. 1997.
    • (1997) J. Acoust. Soc. Amer. , vol.102 , Issue.1 , pp. 28-40
    • Arslan, L.M.1    Hansen, J.H.L.2
  • 29
    • 85008062974
    • Advances in phone-based modeling for automatic accent classification
    • to be published.
    • P. Angkititrakul and J. H. L. Hansen, “Advances in phone-based modeling for automatic accent classification,” IEEE Trans. Speech Audio Proc., to be published.
    • IEEE Trans. Speech Audio Proc.
    • Angkititrakul, P.1    Hansen, J.H.L.2
  • 30
    • 85135191939
    • Talker-Independent keyword spotting for information retrieval
    • J. Foote et al., “Talker-Independent keyword spotting for information retrieval,” in Proc. Eurospeech, vol. 3, 1995, pp. 2145–2149.
    • (1995) Proc. Eurospeech , vol.3 , pp. 2145-2149
    • Foote, J.1
  • 31
    • 84892177707
    • Experiments in broadcast news transcription
    • Seattle, WA
    • P. C. Woodland et al., “Experiments in broadcast news transcription,” in Proc. IEEE ICASSP, Seattle, WA, 1998, pp. 909–912.
    • (1998) Proc. IEEE ICASSP , pp. 909-912
    • Woodland, P.C.1
  • 32
    • 85008058495
    • [Online]. Available
    • [Online]. Available: http://speechbot.research.compaq.com/
  • 33
    • 85008018727
    • [Online]. Available
    • [Online]. Available: http://www.dragonsys.com/news/pr/audiomine.html
  • 34
    • 0002494419
    • A system for interactively skimming recorded speech
    • B. Arons, “A system for interactively skimming recorded speech,” ACM Trans. Computer-Human Interaction, vol. 4, no. 1, pp. 3–38, 1997.
    • (1997) ACM Trans. Computer-Human Interaction , vol.4 , Issue.1 , pp. 3-38
    • Arons, B.1
  • 35
    • 0035278951
    • Confidence measures for large vocabulary continuous speech recognition
    • Mar.
    • F. Wessel, R. Schluter, K. Macherey, and H. Ney, “Confidence measures for large vocabulary continuous speech recognition,” IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 288–298, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 288-298
    • Wessel, F.1    Schluter, R.2    Macherey, K.3    Ney, H.4
  • 37
  • 41
    • 0036293939
    • Toward automatic corpus preparation for a German broadcast news transcription system
    • Denver, CO, May
    • W. Macherey and H. Ney, “Toward automatic corpus preparation for a German broadcast news transcription system,” in Proc. Int. Conf. Spoken Lang. Process., vol. 1, Denver, CO, May 2002, pp. 733–736.
    • (2002) Proc. Int. Conf. Spoken Lang. Process. , vol.1 , pp. 733-736
    • Macherey, W.1    Ney, H.2
  • 43
    • 85009198487
    • Morpheme-based lexical modeling for Korean broadcast news transcription
    • Geneva, Switzerland, Sep.
    • Y.-H. Park, D.-H. Ahn, and M. Chung, “Morpheme-based lexical modeling for Korean broadcast news transcription,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 1129–1132.
    • (2003) Proc. Eurospeech , pp. 1129-1132
    • Park, Y.-H.1    Ahn, D.-H.2    Chung, M.3
  • 44
    • 85009227418
    • Named entity extraction from Japanese broadcast news
    • Geneva, Switzerland, Sep.
    • A. Kobayashi, F. J. Och, and H. Ney, “Named entity extraction from Japanese broadcast news,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 1125–1128.
    • (2003) Proc. Eurospeech , pp. 1125-1128
    • Kobayashi, A.1    Och, F.J.2    Ney, H.3
  • 45
    • 0742324997
    • Sequential estimation with optimal forgetting for robust speech recognition
    • Jan.
    • M. Afify and O. Siohan, “Sequential estimation with optimal forgetting for robust speech recognition,” IEEE Trans. Speech and Audio Processing, vol. 12, no. 1, pp. 19–26, Jan. 2004.
    • (2004) IEEE Trans. Speech and Audio Processing , vol.12 , Issue.1 , pp. 19-26
    • Afify, M.1    Siohan, O.2
  • 47
    • 85009268616
    • Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval
    • Denver, CO, Sep.
    • H. Nishizaki and S. Nakagawa, “Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval,” in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, Sep. 2002, pp. 1505–1508.
    • (2002) Proc. Int. Conf. Spoken Lang. Process. , pp. 1505-1508
    • Nishizaki, H.1    Nakagawa, S.2
  • 48
    • 0036649836
    • Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese
    • Jul.
    • B. Chen, H.-M. Wang, and L.-S. Lee, “Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese,” IEEE Trans. Speech Audio Proc., vol. 10, no. 5, pp. 303–314, Jul. 2002.
    • (2002) IEEE Trans. Speech Audio Proc. , vol.10 , Issue.5 , pp. 303-314
    • Chen, B.1    Wang, H.-M.2    Lee, L.-S.3
  • 49
    • 0347968278
    • Bayesian learning of speech duration models
    • Nov.
    • J.-T. Chien and C.-H. Huang, “Bayesian learning of speech duration models,” IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 558–567, Nov. 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.6 , pp. 558-567
    • Chien, J.-T.1    Huang, C.-H.2
  • 50
    • 85009197924
    • Improved Chinese broadcast news transcription by language modeling with temporally consistent training corpora and iterative phrase extraction
    • Geneva, Switzerland, Sep.
    • P.-C. Chang, S.-P. Liao, and L.-S. Lee, “Improved Chinese broadcast news transcription by language modeling with temporally consistent training corpora and iterative phrase extraction,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 421–424.
    • (2003) Proc. Eurospeech , pp. 421-424
    • Chang, P.-C.1    Liao, S.-P.2    Lee, L.-S.3
  • 51
    • 3042825296
    • State-dependent phonetic tied mixtures with pronunciation modeling for spontaneous speech recognition
    • Jul.
    • Y. Liu and P. Fung, “State-dependent phonetic tied mixtures with pronunciation modeling for spontaneous speech recognition,” IEEE Trans. Speech Audio Process., vol. 12, no. 4, pp. 351–364, Jul. 2004.
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.4 , pp. 351-364
    • Liu, Y.1    Fung, P.2
  • 52
    • 0033693282
    • Retrieval of broadcast news speech in Mandarin Chinese collected in Taiwan using syllable-level statistical characteristics
    • Istanbul, Turkey, Jun.
    • B. Chen, H.-M. Wang, and L.-S. Lee, “Retrieval of broadcast news speech in Mandarin Chinese collected in Taiwan using syllable-level statistical characteristics,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 3, Istanbul, Turkey, Jun. 2000, pp. 1771–1774.
    • (2000) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.3 , pp. 1771-1774
    • Chen, B.1    Wang, H.-M.2    Lee, L.-S.3
  • 53
    • 0141702122
    • Audio segmentation, classification and clustering in a broadcast news task
    • Hong Kong, Apr.
    • H. Meinedo and J. Neto, “Audio segmentation, classification and clustering in a broadcast news task,” in Proc. IEEE Inter. Conf. Acoust. Speech, Signal Process., vol. 2, Hong Kong, Apr. 2003, pp. 5–8.
    • (2003) Proc. IEEE Inter. Conf. Acoust. Speech, Signal Process. , vol.2 , pp. 5-8
    • Meinedo, H.1    Neto, J.2
  • 55
    • 0034847329
    • Automatic transcription of compressed broadcast audio
    • Salt Lake City, UT, May
    • C. Barras, L. Lamel, and J.-L. Gauvain, “Automatic transcription of compressed broadcast audio,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 1, Salt Lake City, UT, May 2001, pp. 265–268.
    • (2001) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 265-268
    • Barras, C.1    Lamel, L.2    Gauvain, J.-L.3
  • 56
    • 85009150731
    • Building a test collection for speech-driven web retrieval
    • Geneva, Switzerland, Sep.
    • A. Fujii and K. Itou, “Building a test collection for speech-driven web retrieval,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 1153–1156.
    • (2003) Proc. Eurospeech , pp. 1153-1156
    • Fujii, A.1    Itou, K.2
  • 57
    • 85009275390
    • Multi-scale and multi-model integration for improved performance in Chinese spoken document retrieval
    • Denver, CO, Sep.
    • W.-K. Lo, H. M. Meng, and P. C. Ching, “Multi-scale and multi-model integration for improved performance in Chinese spoken document retrieval,” in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, Sep. 2002, pp. 1513–1516.
    • (2002) Proc. Int. Conf. Spoken Lang. Process. , pp. 1513-1516
    • Lo, W.-K.1    Meng, H.M.2    Ching, P.C.3
  • 58
    • 85009271609
    • Toward automatic closed captioning: Low latency real time broadcast news transcription
    • Denver, CO, Sep.
    • M. Saraclar, M. Riley, E. Bocchieri, and V. Go, “Toward automatic closed captioning: Low latency real time broadcast news transcription,” in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, Sep. 2002, pp. 1741–1744.
    • (2002) Proc. Int. Conf. Spoken Lang. Process. , pp. 1741-1744
    • Saraclar, M.1    Riley, M.2    Bocchieri, E.3    Go, V.4
  • 59
    • 79951784751
    • Automatic summarization of broadcast news using structural features
    • Geneva, Switzerland, Sep.
    • S. R. Maskey and J. Hirschberg, “Automatic summarization of broadcast news using structural features,” in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 1173–1176.
    • (2003) Proc. Eurospeech , pp. 1173-1176
    • Maskey, S.R.1    Hirschberg, J.2
  • 60
    • 0033705979
    • Automatic speech summarization based on word significance and linguistic likelihood
    • Istanbul, Turkey, Jun.
    • C. Hori and S. Furui, “Automatic speech summarization based on word significance and linguistic likelihood,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 3, Istanbul, Turkey, Jun. 2000, pp. 1579–1582.
    • (2000) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.3 , pp. 1579-1582
    • Hori, C.1    Furui, S.2
  • 61
    • 0032665630
    • Experiments in topic indexing of broadcast news using neural networks
    • Phoenix, AZ, Mar.
    • C. Neukirchen, D. Willett, and G. Rigoll, “Experiments in topic indexing of broadcast news using neural networks,” in Proc. Int. Conf. Acoust. Speech, Signal Process., vol. 2, Phoenix, AZ, Mar. 1999, pp. 1093–1096.
    • (1999) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.2 , pp. 1093-1096
    • Neukirchen, C.1    Willett, D.2    Rigoll, G.3
  • 63
    • 0034857759
    • Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition
    • Salt Lake City, UT, May
    • K. Mori and S. Nakagawa, “Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition,” in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., vol. 1, Salt Lake City, UT, May 2001, pp. 413-416.
    • (2001) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 413-416
    • Mori, K.1    Nakagawa, S.2
  • 66
    • 84979938858
    • Language modeling structures in audio transcription for retrieval of historical speeches
    • Vienna, Austria, Sep.
    • M. Kurimo, B. Zhou, R. Huang, and J. H. L. Hansen, “Language modeling structures in audio transcription for retrieval of historical speeches,” in Proc. 12th Eur. Signal Process. Conf, Vienna, Austria, Sep. 6–10, 2004, pp. 557–560.
    • (2004) Proc. 12th Eur. Signal Process. Conf , pp. 557-560
    • Kurimo, M.1    Zhou, B.2    Huang, R.3    Hansen, J.H.L.4
  • 68
    • 0035441593
    • Spoken language recognition-a step toward multilinguality in speech processing
    • Sep.
    • J. Navratil, “Spoken language recognition-a step toward multilinguality in speech processing,” IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 678–685, Sep. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 678-685
    • Navratil, J.1
  • 70
    • 0004026002
    • Digital Watermarking
    • San Diego, CA: Academic
    • I. J. Cox, M. L. Miller, and J. A. Bloom, Digital Watermarking. San Diego, CA: Academic, 2002.
    • (2002)
    • Cox, I.J.1    Miller, M.L.2    Bloom, J.A.3
  • 71
    • 85009183675
    • Speech watermarking by parametric embedding with an ℓ∞ fidelity criterion
    • Geneva, Switzerland, Sep.
    • A. Gurijala and J. R. Deller Jr., “Speech watermarking by parametric embedding with an ℓ∞ fidelity criterion,” in Proc. Interspeech/Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 2933–2936.
    • (2003) Proc. Interspeech/Eurospeech , pp. 2933-2936
    • Gurijala, A.1    Deller, J.R.2
  • 72
    • 84979921765
    • Discrete-Time Processing of Speech Signals
    • Second ed. Piscataway, NJ: IEEE, ch. 5.
    • J. R. Deller Jr., J. H. L. Hansen, and J. G. Proakis, Discrete-Time Processing of Speech Signals, Second ed. Piscataway, NJ: IEEE, 2000, ch. 5.
    • (2000)
    • Deller, J.R.1    Hansen, J.H.L.2    Proakis, J.G.3
  • 73
    • 4143057226
    • Speech watermarking with objective fidelity and robustness criterion
    • Pacific Grove, CA, Nov.
    • A. Gurijala and J. R. Deller Jr., “Speech watermarking with objective fidelity and robustness criterion,” in Proc. Asilomar Conf. Signals, Syst., Comput., Pacific Grove, CA, Nov. 2003.
    • (2003) Proc. Asilomar Conf. Signals, Syst., Comput.
    • Gurijala, A.1    Deller, J.R.2
  • 74
    • 85008036550
    • Speech watermarking through parametric modeling
    • submitted for publication.
    • A. Gurijala and J. R. Deller Jr., “Speech watermarking through parametric modeling,” submitted for publication.
    • Gurijala, A.1    Deller, J.R.2
  • 76
    • 0033221637
    • BEACON: An adaptive set-membership filtering technique with sparse updates
    • Nov.
    • S. Nagaraj, S. Gollamudi, S. Kapoor, and Y. F. Huang, “BEACON: An adaptive set-membership filtering technique with sparse updates,” IEEE Trans. Signal Process., vol. 47, no. 11, pp. 2928–2941, Nov. 1999.
    • (1999) IEEE Trans. Signal Process. , vol.47 , Issue.11 , pp. 2928-2941
    • Nagaraj, S.1    Gollamudi, S.2    Kapoor, S.3    Huang, Y.F.4
  • 78
    • 85009090165
    • High-level feature weighted GMM network for audio stream classification
    • Jeju Island, Korea, Oct.
    • R. Huang and J. H. L. Hansen, “High-level feature weighted GMM network for audio stream classification,” in Proc. Int. Conf. Spoken Language Process., Jeju Island, Korea, Oct. 2004.
    • (2004) Proc. Int. Conf. Spoken Language Process.
    • Huang, R.1    Hansen, J.H.L.2
  • 79
    • 0003648234
    • An Introduction to Multivariate Statistical Analysis
    • New York: Wiley
    • T. Anderson, An Introduction to Multivariate Statistical Analysis. New York: Wiley, 1958.
    • (1958)
    • Anderson, T.1
  • 80
    • 0031177213
    • Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models
    • S. M. Ahadi and P. C. Woodland, “Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models,” Comput. Speech Language, vol. 11, pp. 187–206, 1997.
    • (1997) Comput. Speech Language , vol.11 , pp. 187-206
    • Ahadi, S.M.1    Woodland, P.C.2
  • 81
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr.
    • J. L. Gauvain and C. H. Lee, “Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains,” IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291–298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.H.2
  • 82
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. Leggetter and P. Woodland, “Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models,” Comput. Speech Language, vol. 9, pp. 171–185, 1995.
    • (1995) Comput. Speech Language , vol.9 , pp. 171-185
    • Leggetter, C.1    Woodland, P.2
  • 83
    • 85135155427
    • A comparative study of speaker adaptation techniques
    • Madrid, Spain
    • L. R. Neumeyer, A. Sankar, and V. V. Digalakis, “A comparative study of speaker adaptation techniques,” in Proc. Eurospeech, Madrid, Spain, 1995, pp. 1127–1130.
    • (1995) Proc. Eurospeech , pp. 1127-1130
    • Neumeyer, L.R.1    Sankar, A.2    Digalakis, V.V.3
  • 85
    • 0036461005
    • Structural maximum a posteriori linear regression for fast HMM adaptation
    • Jan.
    • O. Siohan, T. A. Myrvoll, and C. H. Lee, “Structural maximum a posteriori linear regression for fast HMM adaptation,” Comput. Speech Language, vol. 16, no. 1, pp. 5–24, Jan. 2002.
    • (2002) Comput. Speech Language , vol.16 , Issue.1 , pp. 5-24
    • Siohan, O.1    Myrvoll, T.A.2    Lee, C.H.3
  • 86
    • 85135272864
    • Maximum a posterior linear regression for hidden Markov model adaptation
    • Budapest, Hungary
    • C. Chesta, O. Siohan, and C. H. Lee, “Maximum a posterior linear regression for hidden Markov model adaptation,” in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 203–206.
    • (1999) Proc. Eurospeech , pp. 203-206
    • Chesta, C.1    Siohan, O.2    Lee, C.H.3
  • 87
    • 84874875877
    • Maximum a posterior linear regression with elliptically symmetric matrix priors
    • Budapest, Hungary
    • W. Chou, “Maximum a posterior linear regression with elliptically symmetric matrix priors,” in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 1–4.
    • (1999) Proc. Eurospeech , pp. 1-4
    • Chou, W.1
  • 89
    • 85008060129
    • Rapid discriminative acoustic modeling based on eigenspace mapping for fast speaker adaptation
    • to be published.
    • B. Zhou and J. H. L. Hansen, “Rapid discriminative acoustic modeling based on eigenspace mapping for fast speaker adaptation,” IEEE Trans. Speech Audio Process., to be published.
    • IEEE Trans. Speech Audio Process.
    • Zhou, B.1    Hansen, J.H.L.2
  • 91
    • 0003411512
    • Simple, Proven Approaches to Text Retrieval
    • Cambridge Univ., Cambridge, U.K.
    • S. E. Robertson and K. S. Jones, “Simple, Proven Approaches to Text Retrieval,” Tech. Rep., Cambridge Univ., Cambridge, U.K., 1997.
    • (1997) Tech. Rep.
    • Robertson, S.E.1    Jones, K.S.2
  • 92
    • 85009102300
    • Document expansion for speech retrieval
    • Berkeley, CA, Aug.
    • A. Singhal and F. Pereira, “Document expansion for speech retrieval,” in Proc. 22nd ACM SIGIR Conf, Berkeley, CA, Aug. 1999.
    • (1999) Proc. 22nd ACM SIGIR Conf
    • Singhal, A.1    Pereira, F.2
  • 93
    • 0141702085
    • Environmental sniffing: Noise knowledge estimation for robust speech systems
    • Hong Kong, Apr.
    • M. Akbacak and J. H. L. Hansen, “Environmental sniffing: Noise knowledge estimation for robust speech systems,” in Proc. Int. Conf. Acoust. Speech Signal Process., vol. 2, Hong Kong, Apr. 2003, pp. 113–116.
    • (2003) Proc. Int. Conf. Acoust. Speech Signal Process. , vol.2 , pp. 113-116
    • Akbacak, M.1    Hansen, J.H.L.2
  • 94
    • 85009228811
    • ENVIRONMENTAL SNIFFING: Robust digit recognition for an in-vehicle environment
    • Geneva, Switzerland, Sep.
    • M. Akbacak and J. H. L. Hansen, “ENVIRONMENTAL SNIFFING: Robust digit recognition for an in-vehicle environment,” in Proc. INTERSPEECH/Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 2177–2180.
    • (2003) Proc. INTERSPEECH/Eurospeech , pp. 2177-2180
    • Akbacak, M.1    Hansen, J.H.L.2
  • 95
    • 0036816475
    • Content analysis for audio classification and segmentation
    • Oct.
    • L. Lu, H. Zhang, and H. Jiang, “Content analysis for audio classification and segmentation,” IEEE Trans. Speech Audio Proc., vol. 10, no. 7, pp. 504–516, Oct. 2002.
    • (2002) IEEE Trans. Speech Audio Proc. , vol.10 , Issue.7 , pp. 504-516
    • Lu, L.1    Zhang, H.2    Jiang, H.3
  • 96
    • 85050713839
    • Audio Parsing and Rapid Speaker Adaptation in Speech Recognition for Spoken Document Retrieval
    • Ph.D. dissertation, Robust Speech Processing Group, Center for Spoken Language Research, Univ. Colorado, Boulder, CO
    • B. Zhou, “Audio Parsing and Rapid Speaker Adaptation in Speech Recognition for Spoken Document Retrieval,” Ph.D. dissertation, Robust Speech Processing Group, Center for Spoken Language Research, Univ. Colorado, Boulder, CO, 2003.
    • (2003)
    • Zhou, B.1
  • 97
    • 85008052248
    • [Online]. Available
    • [Online]. Available: http://www.ukans.edu/carrie/docs/am-docs_index.html
  • 98
    • 85008062974
    • Advances in phone-based modeling for automatic accent classification
    • to be published.
    • P. Angkititrakul and J. H. L. Hansen, “Advances in phone-based modeling for automatic accent classification,” IEEE Trans. Speech Audio Proc., to be published.
    • IEEE Trans. Speech Audio Proc.
    • Angkititrakul, P.1    Hansen, J.H.L.2
  • 99
    • 0030784572
    • Stochastic trajectory modeling and sentences searching for continuous speech recognition
    • Jan.
    • Y. Gong, “Stochastic trajectory modeling and sentences searching for continuous speech recognition,” IEEE Trans. Speech Audio Proc., vol. 5, no. 1, pp. 33–44, Jan. 1997.
    • (1997) IEEE Trans. Speech Audio Proc. , vol.5 , Issue.1 , pp. 33-44
    • Gong, Y.1
  • 100
    • 85008017681
    • Discriminative in-set/out-of-set speaker recognition
    • submitted for publication.
    • P. Angkititrakul and J. H. L. Hansen, “Discriminative in-set/out-of-set speaker recognition,” IEEE Trans. Speech Audio Processing, submitted for publication.
    • IEEE Trans. Speech Audio Processing
    • Angkititrakul, P.1    Hansen, J.H.L.2
  • 101
    • 85050187568
    • Lattice-based search for spoken utterance retrieval
    • Boston, MA, May
    • M. Saraclar and R. Sproat, “Lattice-based search for spoken utterance retrieval,” in Proc. HLT-NAACL, Boston, MA, May 2004, pp. 129–136.
    • (2004) Proc. HLT-NAACL , pp. 129-136
    • Saraclar, M.1    Sproat, R.2
  • 102
    • 0027929445
    • On structuring probabilistic dependencies in stochastic language modeling
    • H. Ney, U. Essen, and R. Kneser, “On structuring probabilistic dependencies in stochastic language modeling,” Comput. Speech Language, vol. 8, pp. 1–38, 1994.
    • (1994) Comput. Speech Language , vol.8 , pp. 1-38
    • Ney, H.1    Essen, U.2    Kneser, R.3
  • 103
    • 84891308106
    • SRILM - An extensible language modeling toolkit
    • Denver, CO, Sep.
    • A. Stolcke, “SRILM - An extensible language modeling toolkit,” in Proc. Int. Conf. Spoken Language Process., Denver, CO, Sep. 2002, pp. 901–904.
    • (2002) Proc. Int. Conf. Spoken Language Process. , pp. 901-904
    • Stolcke, A.1
  • 104
    • 24144437364
    • Speech transcription and spoken document retrieval in Finnish in machine learning for multimodal interaction
    • Lecture Notes in Computer Science
    • M. Kurimo, V. Turunen, and I. Ekman, “Speech transcription and spoken document retrieval in Finnish in machine learning for multimodal interaction,” in Revised Selected Papers MLMI 2004 Workshop, vol. 3361, Lecture Notes in Computer Science, 2005, pp. 253–262.
    • (2005) Revised Selected Papers MLMI 2004 Workshop , vol.3361 , pp. 253-262
    • Kurimo, M.1    Turunen, V.2    Ekman, I.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.