



Volume 7, Issue 1, 1999, Pages 2-10

Overview of audio information retrieval

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO ACOUSTICS; INTERFACES (COMPUTER); MULTIMEDIA SYSTEMS; SPEECH RECOGNITION;

EID: 0032646977     PISSN: 09424962     EISSN: None     Source Type: Journal    
DOI: 10.1007/s005300050106     Document Type: Article
Times cited: 256

References (53)
  • 4
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner LR (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77(2):257-286
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 5
    • 0025592394
    • A hidden Markov model based keyword recognition system
    • IEEE CS Press, Piscataway, N.J.
    • Rose RC, Paul DB (1990) A hidden Markov model based keyword recognition system. In: Proc. ICASSP 90, IEEE CS Press, Piscataway, N.J., pp 129-132
    • (1990) Proc. ICASSP , vol.90 , pp. 129-132
    • Rose, R.C.1    Paul, D.B.2
  • 6
    • 85015255565
    • Training and search algorithms for an interactive wordspotting system
    • IEEE CS Press, Piscataway, N.J.
    • Wilcox LD, Bush MA (1992) Training and search algorithms for an interactive wordspotting system. In: Proc. ICASSP 92, vol. 2, IEEE CS Press, Piscataway, N.J., pp 97-100
    • (1992) Proc. ICASSP 92 , vol.2 , pp. 97-100
    • Wilcox, L.D.1    Bush, M.A.2
  • 7
    • 0842269220
    • Phonetic-based word spotter: Various configurations and application to event spotting
    • Berlin, Germany, ESCA
    • Jeanrenaud P, Ng K, Siu M, Rohlicek J, Gish H (1993) Phonetic-based word spotter: Various configurations and application to event spotting. In: Proc. Eurospeech 93, Berlin, Germany, ESCA, pp 2145-2148
    • (1993) Proc. Eurospeech 93 , pp. 2145-2148
    • Jeanrenaud, P.1    Ng, K.2    Siu, M.3    Rohlicek, J.4    Gish, H.5
  • 8
    • 0039627177
    • Techniques for information retrieval from speech messages
    • Rose RC (1991) Techniques for information retrieval from speech messages. Lincoln Lab J 4(1):45-60
    • (1991) Lincoln Lab J , vol.4 , Issue.1 , pp. 45-60
    • Rose, R.C.1
  • 9
    • 0012316245
    • Benchmark tests for the ARPA spoken language program
    • January 1995
    • Pallett D et al. (1995) Benchmark tests for the ARPA spoken language program. In: Proc. ARPA SLS Technology Workshop, January 1995
    • (1995) Proc. ARPA SLS Technology Workshop
    • Pallett, D.1
  • 10
    • 0029726004
    • Robust talker-independent audio document retrieval
    • April 1996, Atlanta, Ga. IEEE CS Press, Piscataway, N.J.
    • Jones GJF, Foote JT, Spärck Jones K, Young SJ (1996) Robust talker-independent audio document retrieval. In: Proc. ICASSP 96, volume 1, April 1996, Atlanta, Ga. IEEE CS Press, Piscataway, N.J., pp 311-314
    • (1996) Proc. ICASSP 96 , vol.1 , pp. 311-314
    • Jones, G.J.F.1    Foote, J.T.2    Spärck Jones, K.3    Young, S.J.4
  • 11
    • 0028996845
    • Reducing word error rate on conversational speech from the Switchboard corpus
    • May 1995, Detroit, Mich. IEEE CS Press, Piscataway, N.J.
    • Jeanrenaud P, Eide E, Chaudhari U, McDonough J, Ng K, Siu M, Gish H (1995) Reducing word error rate on conversational speech from the Switchboard corpus. In: Proc. ICASSP 95, May 1995, Detroit, Mich. IEEE CS Press, Piscataway, N.J., pp 53-56
    • (1995) Proc. ICASSP 95 , pp. 53-56
    • Jeanrenaud, P.1    Eide, E.2    Chaudhari, U.3    McDonough, J.4    Ng, K.5    Siu, M.6    Gish, H.7
  • 12
    • 0001374417
    • Informedia: News-on-demand - Multimedia information acquisition and retrieval
    • Maybury MT (ed) chapter 10. MIT Press, Cambridge, Mass.
    • Hauptmann A, Witbrock M (1997) Informedia: News-on-demand - multimedia information acquisition and retrieval. In: Maybury MT (ed) Intelligent Multimedia Information Retrieval, chapter 10. MIT Press, Cambridge, Mass., pp 215-240 (available on the internet: http://www.cs.cmu.edu/afs/c.s/user/alex/www/)
    • (1997) Intelligent Multimedia Information Retrieval , pp. 215-240
    • Hauptmann, A.1    Witbrock, M.2
  • 13
    • 0030676366
    • Broadcast news transcription
    • April 1997, IEEE CS Press, Piscataway, N.J.
    • Kubala F, Jin H, Matsoukas S, Nguyen L, Schwartz R (1997) Broadcast news transcription. In: Proc. ICASSP 97, volume 1, April 1997, IEEE CS Press, Piscataway, N.J., pp 203-206
    • (1997) Proc. ICASSP 97 , vol.1 , pp. 203-206
    • Kubala, F.1    Jin, H.2    Matsoukas, S.3    Nguyen, L.4    Schwartz, R.5
  • 15
    • 84964500666
    • Approaches to topic identification on the Switchboard corpus
    • Adelaide, Australia, IEEE CS Press, Piscataway, N.J.
    • McDonough J, Ng K, Jeanrenaud P, Gish H, Rohlicek JR (1994) Approaches to topic identification on the Switchboard corpus. In: Proc. ICASSP 94, volume 1, Adelaide, Australia, IEEE CS Press, Piscataway, N.J., pp 385-388
    • (1994) Proc. ICASSP 94 , vol.1 , pp. 385-388
    • McDonough, J.1    Ng, K.2    Jeanrenaud, P.3    Gish, H.4    Rohlicek, J.R.5
  • 16
    • 0028996935
    • Improved topic spotting through statistical modelling of keyword dependencies
    • May 1995, Detroit, Mich. IEEE CS Press, Piscataway, N.J.
    • Wright JH, Carey MJ, Parris ES (1995) Improved topic spotting through statistical modelling of keyword dependencies. In: Proc. ICASSP 95, May 1995, Detroit, Mich. IEEE CS Press, Piscataway, N.J., pp 313-316
    • (1995) Proc. ICASSP 95 , pp. 313-316
    • Wright, J.H.1    Carey, M.J.2    Parris, E.S.3
  • 17
    • 0029388086
    • Topic discrimination using higher-order statistical models of spotted keywords
    • Wright JH, Carey MJ, Parris ES (1995) Topic discrimination using higher-order statistical models of spotted keywords. Comput Speech Lang 9(4):381-405
    • (1995) Comput Speech Lang , vol.9 , Issue.4 , pp. 381-405
    • Wright, J.H.1    Carey, M.J.2    Parris, E.S.3
  • 18
    • 0028996903
    • Video Mail Retrieval: The effect of word spotting accuracy on precision
    • IEEE CS Press, Piscataway, N.J.
    • Jones GJF, Foote JT, Spärck Jones K, Young SJ (1995) Video Mail Retrieval: the effect of word spotting accuracy on precision. In: Proc. ICASSP 95, volume 1. IEEE CS Press, Piscataway, N.J., pp 309-312
    • (1995) Proc. ICASSP 95 , vol.1 , pp. 309-312
    • Jones, G.J.F.1    Foote, J.T.2    Spärck Jones, K.3    Young, S.J.4
  • 19
    • 0003919853
    • Speaker dependent keyword spotting for hand-held devices
    • Engineering Department, Cambridge University, Cambridge, UK
    • Knill KM, Young SJ (1994) Speaker dependent keyword spotting for hand-held devices. Technical Report 193. Engineering Department, Cambridge University, Cambridge, UK
    • (1994) Technical Report , vol.193
    • Knill, K.M.1    Young, S.J.2
  • 20
    • 0039789208
    • First experiences with a system for content based retrieval of information from speech recordings
    • August 1995
    • Schäuble P, Wechsler M (1995) First experiences with a system for content based retrieval of information from speech recordings. In: IJCAI Workshop: Intelligent Multimedia Information Retrieval, August 1995 (available on the Internet: ftp://ftp.inf.ethz.ch/pub/publications/papers/is/ir/ijcai95.ps.gz)
    • (1995) IJCAI Workshop: Intelligent Multimedia Information Retrieval
    • Schäuble, P.1    Wechsler, M.2
  • 21
    • 51849114797
    • Speech retrieval based on automatic indexing
    • Rijsbergen CJ van (ed) September 1995, University of Glasgow, Glasgow, UK
    • Wechsler M, Schäuble P (1995) Speech retrieval based on automatic indexing. In: Rijsbergen CJ van (ed) Proceedings of the MIRO Workshop, September 1995, University of Glasgow, Glasgow, UK
    • (1995) Proceedings of the MIRO Workshop
    • Wechsler, M.1    Schäuble, P.2
  • 22
    • 0002470735
    • Subword unit representations for spoken document retrieval
    • ESCA
    • Ng K, Zue V (1997) Subword unit representations for spoken document retrieval. In: Proc. Eurospeech 97. ESCA (available on the Internet: http://www.sls.lcs.mit.edu/~kng/papers/sir-eurospeech97.ps)
    • (1997) Proc. Eurospeech 97
    • Ng, K.1    Zue, V.2
  • 23
    • 85012973695
    • A fast lattice-based approach to vocabulary independent wordspotting
    • Adelaide, Australia, IEEE CS Press, Piscataway, N.J.
    • James DA, Young SJ (1994) A fast lattice-based approach to vocabulary independent wordspotting. In: Proc. ICASSP 94, volume 1, Adelaide, Australia, IEEE CS Press, Piscataway, N.J., pp 377-380
    • (1994) Proc. ICASSP 94 , vol.1 , pp. 377-380
    • James, D.A.1    Young, S.J.2
  • 25
    • 0030711143
    • Acoustic indexing for multimedia retrieval and browsing
    • April 1997, Munich, Germany, IEEE CS Press, Piscataway, N.J.
    • Young SJ, Brown MG, Foote JT, Jones GJF, Spärck Jones K (1997) Acoustic indexing for multimedia retrieval and browsing. In: Proc. ICASSP 97, volume 1, April 1997, Munich, Germany, IEEE CS Press, Piscataway, N.J., pp 199-202
    • (1997) Proc. ICASSP 97 , vol.1 , pp. 199-202
    • Young, S.J.1    Brown, M.G.2    Foote, J.T.3    Jones, G.J.F.4    Spärck Jones, K.5
  • 26
    • 0029725603
    • Keyword spotting for video soundtrack indexing
    • April 1996, Atlanta, Ga. IEEE CS Press, Piscataway, N.J.
    • Gelin P, Wellekens C (1996) Keyword spotting for video soundtrack indexing. In: Proc. ICASSP 96, volume 1, April 1996, Atlanta, Ga. IEEE CS Press, Piscataway, N.J., pp 299-302
    • (1996) Proc. ICASSP 96 , vol.1 , pp. 299-302
    • Gelin, P.1    Wellekens, C.2
  • 27
    • 0027311604
    • Application of large vocabulary continuous speech recognition to topic and speaker identification using telephone speech
    • May 1993, San Francisco, Calif. IEEE CS Press, Piscataway, N.J.
    • Gillick L, Baker J, Bridle J, et al. (1993) Application of large vocabulary continuous speech recognition to topic and speaker identification using telephone speech. In: Proc. ICASSP 93, volume 11, May 1993, San Francisco, Calif. IEEE CS Press, Piscataway, N.J., pp 471-474
    • (1993) Proc. ICASSP 93 , vol.11 , pp. 471-474
    • Gillick, L.1    Baker, J.2    Bridle, J.3
  • 28
    • 0029462573
    • Automating the creation of a digital video library
    • November 1995, San Francisco, Calif. ACM Press, New York
    • Smith MA, Christel MG (1995) Automating the creation of a digital video library. In: Proc. ACM Multimedia 95, November 1995, San Francisco, Calif. ACM Press, New York, pp 357-358
    • (1995) Proc. ACM Multimedia 95 , pp. 357-358
    • Smith, M.A.1    Christel, M.G.2
  • 30
    • 79952385877
    • Segmentation of speech using speaker identification
    • April 1994, IEEE CS Press, Piscataway, N.J.
    • Wilcox L, Chen F, Balasubramanian V (1994) Segmentation of speech using speaker identification. In: Proc. ICASSP 94, volume S1, April 1994, IEEE CS Press, Piscataway, N.J., pp 161-164
    • (1994) Proc. ICASSP 94 , vol.S1 , pp. 161-164
    • Wilcox, L.1    Chen, F.2    Balasubramanian, V.3
  • 31
    • 0003204113
    • Acoustic segmentation for audio browsers
    • July 1996, Sydney, Australia
    • Kimber D, Wilcox L (1996) Acoustic segmentation for audio browsers. In: Proc. Interface Conference, July 1996, Sydney, Australia (available on the Internet: http://www.fxpal.xerox.com/abstracts/kim96.htm)
    • (1996) Proc. Interface Conference
    • Kimber, D.1    Wilcox, L.2
  • 33
    • 0030648380
    • Speaker identification based text to audio alignment for an audio retrieval system
    • April 1997, Munich, Germany, IEEE CS Press, Piscataway, N.J.
    • Roy D, Malamud C (1997) Speaker identification based text to audio alignment for an audio retrieval system. In: Proc. ICASSP 97, April 1997, Munich, Germany, IEEE CS Press, Piscataway, N.J., pp 1099-1102
    • (1997) Proc. ICASSP 97 , pp. 1099-1102
    • Roy, D.1    Malamud, C.2
  • 34
    • 0343261941
    • Toward content-based audio indexing and retrieval and a new speaker discrimination technique
    • Rosenthal DF, Okuno HG (eds) Lawrence Erlbaum, New York
    • Wyse L, Smoliar S (1998) Toward content-based audio indexing and retrieval and a new speaker discrimination technique. In: Rosenthal DF, Okuno HG (eds) Readings In Computational Auditory Scene Analysis. Lawrence Erlbaum, New York
    • (1998) Readings in Computational Auditory Scene Analysis
    • Wyse, L.1    Smoliar, S.2
  • 35
    • 0344139635
    • Automatic film genre classification
    • November 1995, San Francisco, Calif. ACM Press, New York
    • Fischer S, Effelsberg W (1995) Automatic film genre classification. In: Proc. ACM Multimedia '95, November 1995, San Francisco, Calif. ACM Press, New York, pp 295-304
    • (1995) Proc. ACM Multimedia '95 , pp. 295-304
    • Fischer, S.1    Effelsberg, W.2
  • 36
    • 0003674953
    • Automatic audio content analysis
    • University of Mannheim, Mannheim, Germany
    • Pfeiffer S, Fischer S, Effelsberg W (1996) Automatic audio content analysis. Technical Report TR-96-008, University of Mannheim, Mannheim, Germany (available on the Internet: ftp://pi4.informatik.uni-mannheim.de/pub/techreports/1996/TR-96-008.ps.gz)
    • (1996) Technical Report TR-96-008
    • Pfeiffer, S.1    Fischer, S.2    Effelsberg, W.3
  • 37
    • 0029765670
    • Real-time discrimination of broadcast speech/music
    • May 1996, Atlanta, Ga. IEEE CS Press, Piscataway, N.J.
    • Saunders J (1996) Real-time discrimination of broadcast speech/music. In: Proc. ICASSP 96, volume 11, May 1996, Atlanta, Ga. IEEE CS Press, Piscataway, N.J., pp 993-996
    • (1996) Proc. ICASSP 96 , vol.11 , pp. 993-996
    • Saunders, J.1
  • 38
    • 0030648077
    • Construction and evaluation of a robust multifeature music/speech discriminator
    • April 1997, IEEE CS Press, Piscataway, N.J.
    • Scheirer E, Slaney M (1997) Construction and evaluation of a robust multifeature music/speech discriminator. In: Proc. ICASSP 97, volume 11, April 1997, IEEE CS Press, Piscataway, N.J., pp 1331-1334
    • (1997) Proc. ICASSP 97 , vol.11 , pp. 1331-1334
    • Scheirer, E.1    Slaney, M.2
  • 39
    • 0030364785
    • Automatic transcription of general audio data: Preliminary analyses
    • October 1996, Philadelphia, Pa.
    • Spina M, Zue V (1996) Automatic transcription of general audio data: Preliminary analyses. In: Proc. International Conference on Spoken Language Processing, October 1996, Philadelphia, Pa., pp 594-597.
    • (1996) Proc. International Conference on Spoken Language Processing , pp. 594-597
    • Spina, M.1    Zue, V.2
  • 41
    • 0002494419
    • SpeechSkimmer: A system for interactively skimming recorded speech
    • Arons B (1997) SpeechSkimmer: A system for interactively skimming recorded speech. ACM Trans Comput Hum Interaction 4(1):3-38 (available on the Internet: http://barons.www.media.mit.edu/people/barons/papers/ToCHI97.ps)
    • (1997) ACM Trans Comput Hum Interaction , vol.4 , Issue.1 , pp. 3-38
    • Arons, B.1
  • 42
    • 0010951029
    • Organizing sounds with neural nets
    • San Francisco, Calif. International Computer Music Association
    • Feiten B, Ungvary T (1991) Organizing sounds with neural nets. In: Proc. 1991 Int. Computer Music Conf., San Francisco, Calif. International Computer Music Association
    • (1991) Proc. 1991 Int. Computer Music Conf.
    • Feiten, B.1    Ungvary, T.2
  • 43
    • 0028555270
    • Automatic indexing of a sound database using self-organizing neural nets
    • 19
    • Feiten B, Günzel S (19) Automatic indexing of a sound database using self-organizing neural nets. Comput Music J 18(3):53-65
    • Comput Music J , vol.18 , Issue.3 , pp. 53-65
    • Feiten, B.1    Günzel, S.2
  • 44
    • 0030242072
    • Content-based classification, search, and retrieval of audio
    • Wold E, Blum T, Keslar D, Wheaton J (1996) Content-based classification, search, and retrieval of audio. IEEE Multimedia 3(3):27-36
    • (1996) IEEE Multimedia , vol.3 , Issue.3 , pp. 27-36
    • Wold, E.1    Blum, T.2    Keslar, D.3    Wheaton, J.4
  • 45
    • 0004678264
    • A model distance measure for talker clustering and identification
    • April 1994, Adelaide, Australia. IEEE CS Press, Piscataway, N.J.
    • Foote JT, Silverman HF (1994) A model distance measure for talker clustering and identification. In: Proc. ICASSP 94, volume S1, April 1994, Adelaide, Australia. IEEE CS Press, Piscataway, N.J., pp 317-32
    • (1994) Proc. ICASSP 94 , vol.S1 , pp. 317-332
    • Foote, J.T.1    Silverman, H.F.2
  • 46
    • 0031269043
    • Rapid speaker identification using discrete MMI feature quantisation
    • Foote JT (1998) Rapid speaker identification using discrete MMI feature quantisation. Expert Syst Appl 13(4):283-289
    • (1998) Expert Syst Appl , vol.13 , Issue.4 , pp. 283-289
    • Foote, J.T.1
  • 47
    • 57649180845
    • Content-based retrieval of music and audio
    • 19 Kuo CCJ. et al. (eds)
    • Foote JT (19 ) Content-based retrieval of music and audio. In Kuo CCJ. et al. (eds) Multimedia Storage and Archiving Systems II, Proc. SPIE, volume 3229, pp 138-147 (available on the Internet: http://svr-www.eng-cam.ac.uk/~jtf/papers/spie97-abs.html)
    • Multimedia Storage and Archiving Systems II, Proc. SPIE , vol.3229 , pp. 138-147
    • Foote, J.T.1
  • 48
    • 0029456574
    • Query by humming
    • November 1995, San Francisco, Calif. ACM Press, New York
    • Ghias A, et al. (1995) Query by humming. In: Proc. ACM Multimedia 95, November 1995, San Francisco, Calif. ACM Press, New York, pp 231-236
    • (1995) Proc. ACM Multimedia 95 , pp. 231-236
    • Ghias, A.1
  • 49
    • 0029695822
    • Towards the digital music library: Tune retrieval from acoustic input
    • McNab R, Smith L, Witten I, Henderson C, Cunningham S (1996) Towards the digital music library: Tune retrieval from acoustic input. In: Proc. Digital Libraries 96, pp 11-18 (available on the Internet: http://www.cs.waikato.ac.nz/~rjmcnab/papers/mt.ps.gz)
    • (1996) Proc. Digital Libraries 96 , pp. 11-18
    • McNab, R.1    Smith, L.2    Witten, I.3    Henderson, C.4    Cunningham, S.5
  • 51
    • 0030394830
    • Open-vocabulary speech indexing for voice and video mail retrieval
    • November 1996, Boston, Mass. ACM Press, New York
    • Brown MG, Foote JT, Jones GJF, Spärck Jones K, Young SJ (1996) Open-vocabulary speech indexing for voice and video mail retrieval. In: Proc. ACM Multimedia 96, November 1996, Boston, Mass. ACM Press, New York, pp 35-43
    • (1996) Proc. ACM Multimedia 96 , pp. 35-43
    • Brown, M.G.1    Foote, J.T.2    Jones, G.J.F.3    Spärck Jones, K.4    Young, S.J.5
  • 53
    • 84969383696
    • Article extraction and classification of TV news using image and speech processing
    • Kyoto, Japan
    • Ariki S (1996) Article extraction and classification of TV news using image and speech processing. In: International Symposium on Cooperative Database Systems for Advanced Applications (CODAS-96), Kyoto, Japan (available on the Internet: http://banjo.kuis.kyoto-u.ac.jp/~tarumi/juten/J/event/kyoto-camera/L006.ps)
    • (1996) International Symposium on Cooperative Database Systems for Advanced Applications (CODAS-96)
    • Ariki, S.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.