2006, Pages 1-285

MPEG-7 Audio and Beyond: Audio Content Indexing and Retrieval

EID: 84889435599     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1002/0470093366     Document Type: Book
Times cited: 247

References (215)
  • 1
    • 4243152700
    • Content-based Identification of Audio Material Using MPEG-7 Low Level Description
    • International Symposium Music Information Retrieval, Bloomington, IN, USA, October
    • Allamanche E., Herre J., Hellmuth O., Fröba B., Kastner T. and Cremer M. (2001) "Content-based Identification of Audio Material Using MPEG-7 Low Level Description", International Symposium Music Information Retrieval, Bloomington, IN, USA, October.
    • (2001)
    • Allamanche, E.1    Herre, J.2    Hellmuth, O.3    Fröba, B.4    Kastner, T.5    Cremer, M.6
  • 2
    • 84889309871
    • Basic Speech Sounds, their Analysis and Features
    • in Spoken Dialogues with Computers, Academic Press, London
    • Angelini B., Falavigna D., Omologo M. and De Mori R. (1998) "Basic Speech Sounds, their Analysis and Features", in Spoken Dialogues with Computers, pp. 69-121, Academic Press, London.
    • (1998) , pp. 69-121
    • Angelini, B.1    Falavigna, D.2    Omologo, M.3    De Mori, R.4
  • 3
    • 0020148958
    • Synthesis by Spectral Amplitude and 'Brightness' Matching Analyzed Musical Sounds
    • Beauchamp J. W. (1982) "Synthesis by Spectral Amplitude and 'Brightness' Matching Analyzed Musical Sounds", Journal of Audio Engineering Society, vol. 30, no. 6, pp. 396-406.
    • (1982) Journal of Audio Engineering Society , vol.30 , Issue.6 , pp. 396-406
    • Beauchamp, J.W.1
  • 4
    • 84889311325
    • A Hierarchical Approach to Automatic Musical Genre Classification
    • 6th International Conference on Digital Audio Effects (DAFX), London, UK, September
    • Burred J. J. and Lerch A. (2003) "A Hierarchical Approach to Automatic Musical Genre Classification", 6th International Conference on Digital Audio Effects (DAFX), London, UK, September.
    • (2003)
    • Burred, J.J.1    Lerch, A.2
  • 5
    • 33645801332
    • Hierarchical Automatic Audio Signal Classification
    • Burred J. J. and Lerch A. (2004) "Hierarchical Automatic Audio Signal Classification", Journal of the Audio Engineering Society, vol. 52, no. 7/8, pp. 724-739.
    • (2004) Journal of the Audio Engineering Society , vol.52 , Issue.7-8 , pp. 724-739
    • Burred, J.J.1    Lerch, A.2
  • 6
    • 84892166605
    • A Spectrally Mixed Excitation (SMX) Vocoder with Robust Parameter Determination
    • Seattle, WA, USA, May
    • Cho Y. D., Kim M. Y. and Kim S. R. (1998) "A Spectrally Mixed Excitation (SMX) Vocoder with Robust Parameter Determination", ICASSP '98, vol. 2, pp. 601-604, Seattle, WA, USA, May.
    • (1998) ICASSP '98 , vol.2 , pp. 601-604
    • Cho, Y.D.1    Kim, M.Y.2    Kim, S.R.3
  • 7
    • 0019053271
    • Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences
    • Davis S. B. and Mermelstein P. (1980) "Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357-365.
    • (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.28 , Issue.4 , pp. 357-365
    • Davis, S.B.1    Mermelstein, P.2
  • 8
    • 85009074922
    • Harmonic Tunnelling: Tracking Non-Stationary Noises during Speech
    • Eurospeech 2001, Aalborg, Denmark, September
    • Ealey D., Kelleher H. and Pearce D. (2001) "Harmonic Tunnelling: Tracking Non-Stationary Noises during Speech", Eurospeech 2001, Aalborg, Denmark, September.
    • (2001)
    • Ealey, D.1    Kelleher, H.2    Pearce, D.3
  • 9
    • 0003901864
    • Speech and Audio Signal Processing: Processing and Perception of Speech and Music
    • John Wiley & Sons, Inc., New York
    • Gold B. and Morgan N. (1999) Speech and Audio Signal Processing: Processing and Perception of Speech and Music, John Wiley & Sons, Inc., New York.
    • (1999)
    • Gold, B.1    Morgan, N.2
  • 10
    • 0018139926
    • Perceptual Effects of Spectral Modifications on Musical Timbres
    • Grey J. M. and Gordon J. W. (1978) "Perceptual Effects of Spectral Modifications on Musical Timbres", Journal of Acoustical Society of America, vol. 63, no. 5, pp. 1493-1500.
    • (1978) Journal of Acoustical Society of America , vol.63 , Issue.5 , pp. 1493-1500
    • Grey, J.M.1    Gordon, J.W.2
  • 11
    • 0003455850
    • Information Technology - Multimedia Content Description Interface - Part 4: Audio
    • ISO/IEC, FDIS 15938-4:2001(E), June
    • ISO/IEC (2001) Information Technology - Multimedia Content Description Interface - Part 4: Audio, FDIS 15938-4:2001(E), June.
    • (2001)
  • 12
    • 0032671913
    • Silence Detection for Multimedia Communication Systems
    • Jacobs S., Eleftheriadis A. and Anastassiou D. (1999) "Silence Detection for Multimedia Communication Systems", Multimedia Systems, vol. 7, no. 2, pp. 157-164.
    • (1999) Multimedia Systems , vol.7 , Issue.2 , pp. 157-164
    • Jacobs, S.1    Eleftheriadis, A.2    Anastassiou, D.3
  • 13
    • 4544361760
    • Comparison of MPEG-7 Audio Spectrum Projection Features and MFCC Applied to Speaker Recognition, Sound Classification and Audio Segmentation
    • ICASSP'2004, Montreal, Canada, May
    • Kim H.-G. and Sikora T. (2004) "Comparison of MPEG-7 Audio Spectrum Projection Features and MFCC Applied to Speaker Recognition, Sound Classification and Audio Segmentation", ICASSP'2004, Montreal, Canada, May.
    • (2004)
    • Kim, H.-G.1    Sikora, T.2
  • 14
    • 0002477067
    • Why is musical timbre so hard to understand?
    • in Structure and perception of electroacoustic sound and music, Elsevier, Amsterdam
    • Krumhansl C. L. (1989) "Why is musical timbre so hard to understand?" in Structure and perception of electroacoustic sound and music, pp. 43-53, Elsevier, Amsterdam.
    • (1989) , pp. 43-53
    • Krumhansl, C.L.1
  • 15
    • 0034293572
    • A Common Perceptual Space for Harmonic and Percussive Timbres
    • Lakatos S. (2000) "A Common Perceptual Space for Harmonic and Percussive Timbres", Perception and Psychophysics, vol. 62, no. 7, pp. 1426-1439.
    • (2000) Perception and Psychophysics , vol.62 , Issue.7 , pp. 1426-1439
    • Lakatos, S.1
  • 17
    • 0034273520
    • Content-based Audio Classification and Retrieval using the Nearest Feature Line Method
    • Li S. Z. (2000) "Content-based Audio Classification and Retrieval using the Nearest Feature Line Method", IEEE Transactions on Speech and Audio Processing, vol. 8, no. 5, pp. 619-625.
    • (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.5 , pp. 619-625
    • Li, S.Z.1
  • 18
    • 79955939942
    • Mel Frequency Cepstral Coefficients for Music Modeling
    • International Symposium on Music Information Retrieval (ISMIR), Plymouth, MA, October
    • Logan B. (2000) "Mel Frequency Cepstral Coefficients for Music Modeling", International Symposium on Music Information Retrieval (ISMIR), Plymouth, MA, October.
    • (2000)
    • Logan, B.1
  • 20
    • 0003769779
    • Introduction to MPEG-7
    • John Wiley & Sons, Ltd, Chichester
    • Manjunath B. S., Salembier P. and Sikora T. (2002) Introduction to MPEG-7, John Wiley & Sons, Ltd, Chichester.
    • (2002)
    • Manjunath, B.S.1    Salembier, P.2    Sikora, T.3
  • 21
    • 0012468695
    • Perspectives on the Contribution of Timbre to Musical Structure
    • McAdams S. (1999) "Perspectives on the Contribution of Timbre to Musical Structure", Computer Music Journal, vol. 23, no. 3, pp. 85-102.
    • (1999) Computer Music Journal , vol.23 , Issue.3 , pp. 85-102
    • McAdams, S.1
  • 22
    • 0029442124
    • Perceptual Scaling of Synthesized Musical Timbres: Common Dimensions, Specificities, and Latent Subject Classes
    • McAdams S., Winsberg S., Donnadieu S., De Soete G. and Krimphoff J. (1995) "Perceptual Scaling of Synthesized Musical Timbres: Common Dimensions, Specificities, and Latent Subject Classes", Psychological Research, vol. 58, pp. 177-192.
    • (1995) Psychological Research , vol.58 , pp. 177-192
    • McAdams, S.1    Winsberg, S.2    Donnadieu, S.3    De Soete, G.4    Krimphoff, J.5
  • 23
    • 0016113915
    • The Optimum Comb Method of Pitch Period Analysis of Continuous Digitized Speech
    • Moorer J. (1974) "The Optimum Comb Method of Pitch Period Analysis of Continuous Digitized Speech", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 22, no. 5, pp. 330-338.
    • (1974) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.22 , Issue.5 , pp. 330-338
    • Moorer, J.1
  • 24
    • 0001628038
    • Nonlinear Filtering of Multiplied and Convolved Signals
    • Oppenheim A. V., Schafer R. W. and Stockham T. G. (1968) "Nonlinear Filtering of Multiplied and Convolved Signals", IEEE Proceedings, vol. 56, no. 8, pp. 1264-1291.
    • (1968) IEEE Proceedings , vol.56 , Issue.8 , pp. 1264-1291
    • Oppenheim, A.V.1    Schafer, R.W.2    Stockham, T.G.3
  • 25
    • 33746597668
    • Salient Feature Extraction of Musical Instrument Signals
    • Thesis for the Degree of Master of Arts in Electro-Acoustic Music, Dartmouth College
    • Park T. H. (2000) "Salient Feature Extraction of Musical Instrument Signals", Thesis for the Degree of Master of Arts in Electro-Acoustic Music, Dartmouth College.
    • (2000)
    • Park, T.H.1
  • 26
    • 84889344642
    • Instrument Sound Description in the Context of MPEG-7
    • ICMC'2000 International Computer Music Conference, Berlin, Germany, August
    • Peeters G., McAdams S. and Herrera P. (2000) "Instrument Sound Description in the Context of MPEG-7", ICMC'2000 International Computer Music Conference, Berlin, Germany, August.
    • (2000)
    • Peeters, G.1    McAdams, S.2    Herrera, P.3
  • 27
    • 0003425258
    • Digital Processing of Speech Signals
    • Prentice Hall, Englewood Cliffs, NJ
    • Rabiner L. R. and Schafer R. W. (1978) Digital Processing of Speech Signals, Prentice Hall, Englewood Cliffs, NJ.
    • (1978)
    • Rabiner, L.R.1    Schafer, R.W.2
  • 28
    • 0030648077
    • Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator
    • Munich, Germany, April
    • Scheirer E. and Slaney M. (1997) "Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator", ICASSP '97, vol. 2, pp. 1331-1334, Munich, Germany, April.
    • (1997) ICASSP '97 , vol.2 , pp. 1331-1334
    • Scheirer, E.1    Slaney, M.2
  • 30
    • 85032751556
    • Multimedia Content Analysis Using Both Audio and Visual Cues
    • Wang Y., Liu Z. and Huang J.-C. (2000) "Multimedia Content Analysis Using Both Audio and Visual Cues", IEEE Signal Processing Magazine, vol. 17, no. 6, pp. 12-36.
    • (2000) IEEE Signal Processing Magazine , vol.17 , Issue.6 , pp. 12-36
    • Wang, Y.1    Liu, Z.2    Huang, J.-C.3
  • 31
    • 0030242072
    • Content-Based Classification, Search, and Retrieval of Audio
    • Wold E., Blum T., Keislar D. and Wheaton J. (1996) "Content-Based Classification, Search, and Retrieval of Audio", IEEE MultiMedia, vol. 3, no. 3, pp. 27-36.
    • (1996) IEEE MultiMedia , vol.3 , Issue.3 , pp. 27-36
    • Wold, E.1    Blum, T.2    Keislar, D.3    Wheaton, J.4
  • 32
    • 0141855132
    • Comparing MFCC and MPEG-7 Audio Features for Feature Extraction, Maximum Likelihood HMM and Entropic Prior HMM for Sports Audio Classification
    • Hong Kong, April
    • Xiong Z., Radhakrishnan R., Divakaran A. and Huang T. S. (2003) "Comparing MFCC and MPEG-7 Audio Features for Feature Extraction, Maximum Likelihood HMM and Entropic Prior HMM for Sports Audio Classification", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'03), vol. 5, pp. 628-631, Hong Kong, April.
    • (2003) IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'03) , vol.5 , pp. 628-631
    • Xiong, Z.1    Radhakrishnan, R.2    Divakaran, A.3    Huang, T.S.4
  • 34
    • 84948186412
    • Non-Negative Component Parts of Sound for Classification
    • IEEE International Symposium on Signal Processing and Information Technology, Darmstadt, Germany, December
    • Cho Y.-C., Choi S. and Bang S.-Y. (2003) "Non-Negative Component Parts of Sound for Classification", IEEE International Symposium on Signal Processing and Information Technology, Darmstadt, Germany, December.
    • (2003)
    • Cho, Y.-C.1    Choi, S.2    Bang, S.-Y.3
  • 35
    • 34249753618
    • Support Vector Networks
    • Cortes C. and Vapnik V. (1995) "Support Vector Networks", Machine Learning, vol. 20, pp. 273-297.
    • (1995) Machine Learning , vol.20 , pp. 273-297
    • Cortes, C.1    Vapnik, V.2
  • 36
    • 0004236492
    • Matrix Computations
    • Johns Hopkins University Press, Baltimore, MD
    • Golub G. H. and Van Loan C. F. (1993) Matrix Computations, Johns Hopkins University Press, Baltimore, MD.
    • (1993)
    • Golub, G.H.1    Van Loan, C.F.2
  • 37
    • 0004063090
    • Neural Networks
    • 2nd Edition, Prentice Hall, Englewood Cliffs, NJ
    • Haykin S. (1998) Neural Networks, 2nd Edition, Prentice Hall, Englewood Cliffs, NJ.
    • (1998)
    • Haykin, S.1
  • 38
    • 0032629347
    • Fast and Robust Fixed-Point algorithms for Independent Component Analysis
    • Hyvärinen A. (1999) "Fast and Robust Fixed-Point algorithms for Independent Component Analysis", IEEE Transactions on Neural Networks, vol. 10, no. 3, pp. 626-634.
    • (1999) IEEE Transactions on Neural Networks , vol.10 , Issue.3 , pp. 626-634
    • Hyvärinen, A.1
  • 39
    • 0003905759
    • Independent Component Analysis
    • John Wiley & Sons, Inc., New York
    • Hyvärinen A., Karhunen J. and Oja E. (2001) Independent Component Analysis, John Wiley & Sons, Inc., New York.
    • (2001)
    • Hyvärinen, A.1    Karhunen, J.2    Oja, E.3
  • 40
    • 0003946510
    • Principal Component Analysis
    • Springer-Verlag, Berlin
    • Jolliffe I. T. (1986) Principal Component Analysis, Springer-Verlag, Berlin.
    • (1986)
    • Jolliffe, I.T.1
  • 41
    • 4544361760
    • Comparison of MPEG-7 Audio Spectrum Projection Features and MFCC applied to Speaker Recognition, Sound Classification and Audio Segmentation
    • Proceedings IEEE ICASSP 2004, Montreal, Canada, May
    • Kim H.-G. and Sikora T. (2004a) "Comparison of MPEG-7 Audio Spectrum Projection Features and MFCC applied to Speaker Recognition, Sound Classification and Audio Segmentation", Proceedings IEEE ICASSP 2004, Montreal, Canada, May.
    • (2004)
    • Kim, H.-G.1    Sikora, T.2
  • 42
    • 84889408786
    • Audio Spectrum Projection Based on Several Basis Decomposition Algorithms Applied to General Sound Recognition and Audio Segmentation
    • Proceedings of EURASIP-EUSIPCO 2004, Vienna, Austria, September
    • Kim H.-G. and Sikora T. (2004b) "Audio Spectrum Projection Based on Several Basis Decomposition Algorithms Applied to General Sound Recognition and Audio Segmentation", Proceedings of EURASIP-EUSIPCO 2004, Vienna, Austria, September.
    • (2004)
    • Kim, H.-G.1    Sikora, T.2
  • 43
    • 84889470436
    • How Efficient Is MPEG-7 Audio for Sound Classification, Musical Instrument Identification, Speaker Recognition, and Speaker-Based Segmentation?
    • IEEE Transactions on Speech and Audio Processing, submitted
    • Kim H.-G. and Sikora T. (2004c) "How Efficient Is MPEG-7 Audio for Sound Classification, Musical Instrument Identification, Speaker Recognition, and Speaker-Based Segmentation?", IEEE Transactions on Speech and Audio Processing, submitted.
    • (2004)
    • Kim, H.-G.1    Sikora, T.2
  • 44
    • 85009168586
    • Speaker Recognition Using MPEG-7 Descriptors
    • Proceedings EUROSPEECH 2003, Geneva, Switzerland, September
    • Kim H.-G., Berdahl E., Moreau N. and Sikora T. (2003) "Speaker Recognition Using MPEG-7 Descriptors", Proceedings EUROSPEECH 2003, Geneva, Switzerland, September.
    • (2003)
    • Kim, H.-G.1    Berdahl, E.2    Moreau, N.3    Sikora, T.4
  • 45
    • 84889291047
    • How Efficient is MPEG-7 for General Sound Recognition?
    • 25th International AES Conference "Metadata for Audio", London, UK, June
    • Kim H.-G., Burred J. J. and Sikora T. (2004a) "How Efficient is MPEG-7 for General Sound Recognition?", 25th International AES Conference "Metadata for Audio", London, UK, June.
    • (2004)
    • Kim, H.-G.1    Burred, J.J.2    Sikora, T.3
  • 47
    • 0033592606
    • Learning the Parts of Objects by Non-Negative Matrix Factorization
    • Lee D. D. and Seung H. S. (1999) "Learning the Parts of Objects by Non-Negative Matrix Factorization", Nature, vol. 401, pp. 788-791.
    • (1999) Nature , vol.401 , pp. 788-791
    • Lee, D.D.1    Seung, H.S.2
  • 48
    • 84898964201
    • Algorithms for Non-Negative Matrix Factorization
    • NIPS 2001 Conference, Vancouver, Canada
    • Lee D. D. and Seung H. S. (2001) "Algorithms for Non-Negative Matrix Factorization", NIPS 2001 Conference, Vancouver, Canada.
    • (2001)
    • Lee, D.D.1    Seung, H.S.2
  • 49
    • 0003769779
    • Introduction to MPEG-7
    • John Wiley & Sons, Ltd, Chichester
    • Manjunath B. S., Salembier P. and Sikora T. (2001) Introduction to MPEG-7, John Wiley & Sons, Ltd, Chichester.
    • (2001)
    • Manjunath, B.S.1    Salembier, P.2    Sikora, T.3
  • 50
    • 0004244302
    • Fundamentals of Speech Recognition
    • Prentice Hall, Englewood Cliffs, NJ
    • Rabiner L. R. and Juang B.-H. (1993) Fundamentals of Speech Recognition, Prentice Hall, Englewood Cliffs, NJ.
    • (1993)
    • Rabiner, L.R.1    Juang, B.-H.2
  • 51
    • 0029355999
    • Speaker Identification and Verification Using Gaussian Mixture Speaker Models
    • Reynolds D. A. (1995) "Speaker Identification and Verification Using Gaussian Mixture Speaker Models", Speech Communication, pp. 91-108.
    • (1995) Speech Communication , pp. 91-108
    • Reynolds, D.A.1
  • 52
    • 84945116938
    • Non-Negative Matrix Factorization for Polyphonic Music Transcription
    • IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, USA, October
    • Smaragdis P. and Brown J. C. (2003) "Non-Negative Matrix Factorization for Polyphonic Music Transcription", IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, USA, October.
    • (2003)
    • Smaragdis, P.1    Brown, J.C.2
  • 53
    • 84889309871
    • Basic Speech Sounds, their Analysis and Features
    • in Spoken Dialogues with Computers, R. De Mori (ed.), Academic Press, London
    • Angelini B., Falavigna D., Omologo M. and De Mori R. (1998) "Basic Speech Sounds, their Analysis and Features", in Spoken Dialogues with Computers, pp. 69-121, R. De Mori (ed.), Academic Press, London.
    • (1998) , pp. 69-121
    • Angelini, B.1    Falavigna, D.2    Omologo, M.3    De Mori, R.4
  • 54
    • 84889315094
    • A System for Searching and Browsing Spoken Communications
    • HLT-NAACL 2004 Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval, Boston, MA, USA, May
    • Begeja L., Renger B., Saraclar M., Gibbon D., Liu Z. and Shahraray B. (2004) "A System for Searching and Browsing Spoken Communications", HLT-NAACL 2004 Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval, pp. 1-8, Boston, MA, USA, May.
    • (2004) , pp. 1-8
    • Begeja, L.1    Renger, B.2    Saraclar, M.3    Gibbon, D.4    Liu, Z.5    Shahraray, B.6
  • 55
  • 56
    • 0004116125
    • Implementation of the SMART Information Retrieval System
    • Computer Science Department, Cornell University, Report 85-686
    • Buckley C. (1985) "Implementation of the SMART Information Retrieval System", Computer Science Department, Cornell University, Report 85-686.
    • (1985)
    • Buckley, C.1
  • 57
    • 0004119259
    • The Sound Pattern of English
    • MIT Press, Cambridge, MA
    • Chomsky N. and Halle M. (1968) The Sound Pattern of English, MIT Press, Cambridge, MA.
    • (1968)
    • Chomsky, N.1    Halle, M.2
  • 58
    • 84889399246
    • Phonetic Searching vs. LVCSR: How to Find What You Really Want in Audio Archives
    • AVIOS 2001, San Jose, CA, USA, April
    • Clements M., Cardillo P. S. and Miller M. S. (2001) "Phonetic Searching vs. LVCSR: How to Find What You Really Want in Audio Archives", AVIOS 2001, San Jose, CA, USA, April.
    • (2001)
    • Clements, M.1    Cardillo, P.S.2    Miller, M.S.3
  • 59
    • 78650946218
    • Information Retrieval Techniques for Speech Applications
    • ACM SIGIR 2001 Workshop "Information Retrieval Techniques for Speech Applications"
    • Coden A. R., Brown E. and Srinivasan S. (2001) "Information Retrieval Techniques for Speech Applications", ACM SIGIR 2001 Workshop "Information Retrieval Techniques for Speech Applications".
    • (2001)
    • Coden, A.R.1    Brown, E.2    Srinivasan, S.3
  • 60
    • 84889316721
    • A Model for Combining Semantic and Phonetic Term Similarity for Spoken Document and Spoken Query Retrieval
    • International Computer Science Institute, Berkeley, CA, tr-99-020, December
    • Crestani F. (1999) "A Model for Combining Semantic and Phonetic Term Similarity for Spoken Document and Spoken Query Retrieval", International Computer Science Institute, Berkeley, CA, tr-99-020, December.
    • (1999)
    • Crestani, F.1
  • 61
    • 84889317644
    • Using Semantic and Phonetic Term Similarity for Spoken Document Retrieval and Spoken Query Processing
    • in Technologies for Constructing Intelligent Systems, J. G.-R. B. Bouchon-Meunier and R. R. Yager (eds) Springer-Verlag, Heidelberg, Germany
    • Crestani F. (2002) "Using Semantic and Phonetic Term Similarity for Spoken Document Retrieval and Spoken Query Processing" in Technologies for Constructing Intelligent Systems, pp. 363-376, J. G.-R. B. Bouchon-Meunier and R. R. Yager (eds) Springer-Verlag, Heidelberg, Germany.
    • (2002) , pp. 363-376
    • Crestani, F.1
  • 62
    • 0032270571
    • "Is This Document Relevant? . . . Probably": A Survey of Probabilistic Models in Information Retrieval
    • Crestani F., Lalmas M., van Rijsbergen C. J. and Campbell I. (1998) " "Is This Document Relevant? . . . Probably": A Survey of Probabilistic Models in Information Retrieval", ACM Computing Surveys, vol. 30, no. 4, pp. 528-552.
    • (1998) ACM Computing Surveys , vol.30 , Issue.4 , pp. 528-552
    • Crestani, F.1    Lalmas, M.2    Van Rijsbergen, C.J.3    Campbell, I.4
  • 63
    • 0028996879
    • Language Modelling by Variable Length Sequences: Theoretical Formulation and Evaluation of Multigrams
    • ICASSP'95, Detroit, USA
    • Deligne S. and Bimbot F. (1995) "Language Modelling by Variable Length Sequences: Theoretical Formulation and Evaluation of Multigrams", ICASSP'95, pp. 169-172, Detroit, USA.
    • (1995) , pp. 169-172
    • Deligne, S.1    Bimbot, F.2
  • 64
    • 84889269862
    • Phoneme-Level Indexing for Fast and Vocabulary-Independent Voice/Voice Retrieval
    • ESCA Tutorial and Research Workshop (ETRW), "Accessing Information in Spoken Audio", Cambridge, UK, April
    • Ferrieux A. and Peillon S. (1999) "Phoneme-Level Indexing for Fast and Vocabulary-Independent Voice/Voice Retrieval", ESCA Tutorial and Research Workshop (ETRW), "Accessing Information in Spoken Audio", Cambridge, UK, April.
    • (1999)
    • Ferrieux, A.1    Peillon, S.2
  • 65
    • 0012577933
    • The LIMSI SDR System for TREC-9
    • NIST, 9th Text Retrieval Conference (TREC 9), Gaithersburg, MD, USA, November
    • Gauvain J.-L., Lamel L., Barras C., Adda G. and de Kercadio Y. (2000) "The LIMSI SDR System for TREC-9", NIST, 9th Text Retrieval Conference (TREC 9), pp. 335-341, Gaithersburg, MD, USA, November.
    • (2000) , pp. 335-341
    • Gauvain, J.-L.1    Lamel, L.2    Barras, C.3    Adda, G.4    De Kercadio, Y.5
  • 66
    • 0023776395
    • Multi-Level Acoustic Segmentation of Continuous Speech
    • ICASSP'88, New York, USA, April
    • Glass J. and Zue V. W. (1988) "Multi-Level Acoustic Segmentation of Continuous Speech", ICASSP'88, pp. 429-432, New York, USA, April.
    • (1988) , pp. 429-432
    • Glass, J.1    Zue, V.W.2
  • 67
    • 0030372637
    • A Probabilistic Framework for Feature-based Speech Recognition
    • Philadelphia, PA, USA, October
    • Glass J., Chang J. and McCandless M. (1996) "A Probabilistic Framework for Feature-based Speech Recognition", ICSLP'96, vol. 4, pp. 2277-2280, Philadelphia, PA, USA, October.
    • (1996) ICSLP'96 , vol.4 , pp. 2277-2280
    • Glass, J.1    Chang, J.2    McCandless, M.3
  • 68
    • 0026989462
    • A System for Retrieving Speech Documents
    • ACM, SIGIR
    • Glavitsch U. and Schäuble P. (1992) "A System for Retrieving Speech Documents", ACM, SIGIR, pp. 168-176.
    • (1992) , pp. 168-176
    • Glavitsch, U.1    Schäuble, P.2
  • 69
    • 0003901864
    • Speech and Audio Signal Processing
    • John Wiley & Sons, Inc., New York
    • Gold B. and Morgan N. (1999) Speech and Audio Signal Processing, John Wiley & Sons, Inc., New York.
    • (1999)
    • Gold, B.1    Morgan, N.2
  • 70
    • 0003877861
    • Heterogeneous acoustic measurements and multiple classifiers for speech recognition
    • PhD Thesis, Massachusetts Institute of Technology (MIT), Cambridge, MA
    • Halberstadt A. K. (1998) "Heterogeneous acoustic measurements and multiple classifiers for speech recognition", PhD Thesis, Massachusetts Institute of Technology (MIT), Cambridge, MA.
    • (1998)
    • Halberstadt, A.K.1
  • 71
    • 0004185151
    • Clustering Algorithms
    • John Wiley & Sons, Inc., New York
    • Hartigan J. (1975) Clustering Algorithms, John Wiley & Sons, Inc., New York.
    • (1975)
    • Hartigan, J.1
  • 72
    • 78649307442
    • Audio Hot Spotting and Retrieval using Multiple Features
    • HLT-NAACL 2004 Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval, Boston, MA, USA, May
    • Hu Q., Goodman F., Boykin S., Fish R. and Greiff W. (2004) "Audio Hot Spotting and Retrieval using Multiple Features", HLT-NAACL 2004 Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval, pp. 13-17, Boston, MA, USA, May.
    • (2004) , pp. 13-17
    • Hu, Q.1    Goodman, F.2    Boykin, S.3    Fish, R.4    Greiff, W.5
  • 73
    • 0004671920
    • The Application of Classical Information Retrieval Techniques to Spoken Documents
    • PhD Thesis, University of Cambridge, Speech, Vision and Robotics Group, Cambridge, UK
    • James D. A. (1995) "The Application of Classical Information Retrieval Techniques to Spoken Documents", PhD Thesis, University of Cambridge, Speech, Vision and Robotics Group, Cambridge, UK.
    • (1995)
    • James, D.A.1
  • 74
    • 0003786003
    • Statistical Methods for Speech Recognition
    • MIT Press, Cambridge, MA
    • Jelinek F. (1998) Statistical Methods for Speech Recognition, MIT Press, Cambridge, MA.
    • (1998)
    • Jelinek, F.1
  • 75
    • 0002623652
    • Spoken Document Retrieval for TREC-9 at Cambridge University
    • NIST, 9th Text Retrieval Conference (TREC 9), Gaithersburg, MD, USA, November
    • Johnson S. E., Jourlin P., Spärck Jones K. and Woodland P. C. (2000) "Spoken Document Retrieval for TREC-9 at Cambridge University", NIST, 9th Text Retrieval Conference (TREC 9), pp. 117-126, Gaithersburg, MD, USA, November.
    • (2000) , pp. 117-126
    • Johnson, S.E.1    Jourlin, P.2    Spärck Jones, K.3    Woodland, P.C.4
  • 76
    • 0030379111
    • Retrieving Spoken Documents by Combining Multiple Index Sources
    • ACM SIGIR'96, Zurich, Switzerland, August
    • Jones G. J. F., Foote J. T., Spärck Jones K. and Young S. J. (1996) "Retrieving Spoken Documents by Combining Multiple Index Sources", ACM SIGIR'96, pp. 30-38, Zurich, Switzerland, August.
    • (1996) , pp. 30-38
    • Jones, G.J.F.1    Foote, J.T.2    Spärck Jones, K.3    Young, S.J.4
  • 77
    • 0023312404
    • Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer
    • Katz S. M. (1987) "Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer", IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 3, pp. 400-401.
    • (1987) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.3 , pp. 400-401
    • Katz, S.M.1
  • 78
    • 0344139642
    • Speech-based Retrieval using Semantic Co-Occurrence Filtering
    • ARPA, Human Language Technologies (HLT) Conference, Plainsboro, NJ, USA
    • Kupiec J., Kimber D. and Balasubramanian V. (1994) "Speech-based Retrieval using Semantic Co-Occurrence Filtering", ARPA, Human Language Technologies (HLT) Conference, pp. 373-377, Plainsboro, NJ, USA.
    • (1994) , pp. 373-377
    • Kupiec, J.1    Kimber, D.2    Balasubramanian, V.3
  • 79
    • 33745217037
    • Using Syllable-based Indexing Features and Language Models to Improve German Spoken Document Retrieval
    • ISCA, Eurospeech 2003, Geneva, Switzerland, September
    • Larson M. and Eickeler S. (2003) "Using Syllable-based Indexing Features and Language Models to Improve German Spoken Document Retrieval", ISCA, Eurospeech 2003, pp. 1217-1220, Geneva, Switzerland, September.
    • (2003) , pp. 1217-1220
    • Larson, M.1    Eickeler, S.2
  • 80
    • 85009112218
    • Multi-layer Subword Units for Open-Vocabulary Spoken Document Retrieval
    • ICSLP'2004, Jeju Island, Korea, October
    • Lee S. W., Tanaka K. and Itoh Y. (2004) "Multi-layer Subword Units for Open-Vocabulary Spoken Document Retrieval", ICSLP'2004, Jeju Island, Korea, October.
    • (2004)
    • Lee, S.W.1    Tanaka, K.2    Itoh, Y.3
  • 81
    • 0001116877
    • Binary Codes Capable of Correcting Deletions, Insertions and Reversals
    • Levenshtein V. I. (1966) "Binary Codes Capable of Correcting Deletions, Insertions and Reversals", Soviet Physics Doklady, vol. 10, no. 8, pp. 707-710.
    • (1966) Soviet Physics Doklady , vol.10 , Issue.8 , pp. 707-710
    • Levenshtein, V.I.1
  • 83
    • 84889344869
    • Word and Sub-word Indexing Approaches for Reducing the Effects of OOV Queries on Spoken Audio
    • Human Language Technology Conference (HLT 2002), San Diego, CA, USA, March
    • Logan B., Moreno P. J. and Deshmukh O. (2002) "Word and Sub-word Indexing Approaches for Reducing the Effects of OOV Queries on Spoken Audio", Human Language Technology Conference (HLT 2002), San Diego, CA, USA, March.
    • (2002)
    • Logan, B.1    Moreno, P.J.2    Deshmukh, O.3
  • 84
    • 85009154200 (Scopus)
    • Keyword Recognition and Extraction by Multiple-LVCSRs with 60,000 Words in Speech-driven WEB Retrieval Task
    • ICSLP'2004, Jeju Island, Korea, October
    • Matsushita M., Nishizaki H., Nakagawa S. and Utsuro T. (2004) "Keyword Recognition and Extraction by Multiple-LVCSRs with 60,000 Words in Speech-driven WEB Retrieval Task", ICSLP'2004, Jeju Island, Korea, October.
    • (2004)
    • Matsushita, M.1    Nishizaki, H.2    Nakagawa, S.3    Utsuro, T.4
  • 85
    • 84889470946 (Scopus)
    • Combination of Phone N-Grams for a MPEG-7-based Spoken Document Retrieval System
    • EUSIPCO 2004, Vienna, Austria, September
    • Moreau N., Kim H.-G. and Sikora T. (2004a) "Combination of Phone N-Grams for a MPEG-7-based Spoken Document Retrieval System", EUSIPCO 2004, Vienna, Austria, September.
    • (2004)
    • Moreau, N.1    Kim, H.-G.2    Sikora, T.3
  • 86
    • 84889420489 (Scopus)
    • Phone-based Spoken Document Retrieval in Conformance with the MPEG-7 Standard
    • 25th International AES Conference "Metadata for Audio", London, UK, June
    • Moreau N., Kim H.-G. and Sikora T. (2004b) "Phone-based Spoken Document Retrieval in Conformance with the MPEG-7 Standard", 25th International AES Conference "Metadata for Audio", London, UK, June.
    • (2004)
    • Moreau, N.1    Kim, H.-G.2    Sikora, T.3
  • 87
    • 33745186799 (Scopus)
    • Phonetic Confusion Based Document Expansion for Spoken Document Retrieval
    • ICSLP Interspeech 2004, Jeju Island, Korea, October
    • Moreau N., Kim H.-G. and Sikora T. (2004c) "Phonetic Confusion Based Document Expansion for Spoken Document Retrieval", ICSLP Interspeech 2004, Jeju Island, Korea, October.
    • (2004)
    • Moreau, N.1    Kim, H.-G.2    Sikora, T.3
  • 88
    • 84889403089 (Scopus)
    • Scoring Algorithms for Wordspotting Systems
    • HLT-NAACL 2004 Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval, Boston, MA, USA, May
    • Morris R. W., Arrowood J. A., Cardillo P. S. and Clements M. A. (2004) "Scoring Algorithms for Wordspotting Systems", HLT-NAACL 2004 Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval, pp. 18-21, Boston, MA, USA, May.
    • (2004) , pp. 18-21
    • Morris, R.W.1    Arrowood, J.A.2    Cardillo, P.S.3    Clements, M.A.4
  • 89
    • 0034274806 (Scopus)
    • Experiments in Spoken Document Retrieval Using Phoneme N-grams
    • Ng C., Wilkinson R. and Zobel J. (2000) "Experiments in Spoken Document Retrieval Using Phoneme N-grams", Speech Communication, vol. 32, no. 1, pp. 61-77.
    • (2000) Speech Communication , vol.32 , Issue.1 , pp. 61-77
    • Ng, C.1    Wilkinson, R.2    Zobel, J.3
  • 90
    • 0038576501 (Scopus)
    • Towards Robust Methods for Spoken Document Retrieval
    • Sydney, Australia, November
    • Ng K. (1998) "Towards Robust Methods for Spoken Document Retrieval", ICSLP'98, vol. 3, pp. 939-942, Sydney, Australia, November.
    • (1998) ICSLP'98 , vol.3 , pp. 939-942
    • Ng, K.1
  • 91
    • 84937320583 (Scopus)
    • Subword-based Approaches for Spoken Document Retrieval
    • PhD Thesis, Massachusetts Institute of Technology (MIT), Cambridge, MA
    • Ng K. (2000) "Subword-based Approaches for Spoken Document Retrieval", PhD Thesis, Massachusetts Institute of Technology (MIT), Cambridge, MA.
    • (2000)
    • Ng, K.1
  • 92
    • 0031636298 (Scopus)
    • Phonetic Recognition for Spoken Document Retrieval
    • ICASSP'98, Seattle, WA, USA
    • Ng K. and Zue V. (1998) "Phonetic Recognition for Spoken Document Retrieval", ICASSP'98, pp. 325-328, Seattle, WA, USA.
    • (1998) , pp. 325-328
    • Ng, K.1    Zue, V.2
  • 93
    • 0034300710 (Scopus)
    • Subword-based Approaches for Spoken Document Retrieval
    • Ng K. and Zue V. W. (2000) "Subword-based Approaches for Spoken Document Retrieval", Speech Communication, vol. 32, no. 3, pp. 157-186.
    • (2000) Speech Communication , vol.32 , Issue.3 , pp. 157-186
    • Ng, K.1    Zue, V.W.2
  • 94
    • 85017287102 (Scopus)
    • An Efficient A* Stack Decoder Algorithm for Continuous Speech Recognition with a Stochastic Language Model
    • ICASSP'92, San Francisco, USA
    • Paul D. B. (1992) "An Efficient A* Stack Decoder Algorithm for Continuous Speech Recognition with a Stochastic Language Model", ICASSP'92, pp. 25-28, San Francisco, USA.
    • (1992) , pp. 25-28
    • Paul, D.B.1
  • 95
    • 84948481845 (Scopus)
    • An Algorithm for Suffix Stripping
    • Porter M. (1980) "An Algorithm for Suffix Stripping", Program, vol. 14, no. 3, pp. 130-137.
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.1
  • 96
    • 0024610919 (Scopus)
    • A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition
    • Rabiner L. (1989) "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition", Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286.
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.1
  • 97
    • 0004244302 (Scopus)
    • Fundamentals of Speech Recognition
    • Prentice Hall, Englewood Cliffs, NJ
    • Rabiner L. and Juang B.-H. (1993) Fundamentals of Speech Recognition, Prentice Hall, Englewood Cliffs, NJ.
    • (1993)
    • Rabiner, L.1    Juang, B.-H.2
  • 98
    • 0017630891 (Scopus)
    • The probability ranking principle in IR
    • Robertson S. E. (1977) "The probability ranking principle in IR", Journal of Documentation, vol. 33, no. 4, pp. 294-304.
    • (1977) Journal of Documentation , vol.33 , Issue.4 , pp. 294-304
    • Robertson, S.E.1
  • 99
    • 0029386354 (Scopus)
    • Keyword Detection in Conversational Speech Utterances Using Hidden Markov Model Based Continuous Speech Recognition
    • Rose R. C. (1995) "Keyword Detection in Conversational Speech Utterances Using Hidden Markov Model Based Continuous Speech Recognition", Computer Speech and Language, vol. 9, no. 4, pp. 309-333.
    • (1995) Computer Speech and Language , vol.9 , Issue.4 , pp. 309-333
    • Rose, R.C.1
  • 100
    • 45549117987 (Scopus)
    • Term-Weighting Approaches in Automatic Text Retrieval
    • Salton G. and Buckley C. (1988) "Term-Weighting Approaches in Automatic Text Retrieval", Information Processing and Management, vol. 24, no. 5, pp. 513-523.
    • (1988) Information Processing and Management , vol.24 , Issue.5 , pp. 513-523
    • Salton, G.1    Buckley, C.2
  • 101
    • 0003653039 (Scopus)
    • Introduction to Modern Information Retrieval
    • McGraw-Hill, New York
    • Salton G. and McGill M. J. (1983) Introduction to Modern Information Retrieval, McGraw-Hill, New York.
    • (1983)
    • Salton, G.1    McGill, M.J.2
  • 102
    • 0033658324 (Scopus)
    • Phonetic Confusion Matrix Based Spoken Document Retrieval
    • 23rd Annual ACM Conference on Research and Development in Information Retrieval (SIGIR'00), Athens, Greece, July
    • Srinivasan S. and Petkovic D. (2000) "Phonetic Confusion Matrix Based Spoken Document Retrieval", 23rd Annual ACM Conference on Research and Development in Information Retrieval (SIGIR'00), pp. 81-87, Athens, Greece, July.
    • (2000) , pp. 81-87
    • Srinivasan, S.1    Petkovic, D.2
  • 103
    • 84889282489 (Scopus)
    • Common Evaluation Measures
    • TREC, NIST, 10th Text Retrieval Conference (TREC 2001), Gaithersburg, MD, USA, November
    • TREC (2001) "Common Evaluation Measures", NIST, 10th Text Retrieval Conference (TREC 2001), pp. A-14, Gaithersburg, MD, USA, November.
    • (2001) , pp. A-14
  • 104
    • 0004217877 (Scopus)
    • Information Retrieval
    • Butterworths, London
    • van Rijsbergen C. J. (1979) Information Retrieval, Butterworths, London.
    • (1979)
    • van Rijsbergen, C.J.1
  • 105
    • 0002565067 (Scopus)
    • Overview of the Seventh Text REtrieval Conference
    • NIST, 7th Text Retrieval Conference (TREC-7), Gaithersburg, MD, USA, November
    • Voorhees E. and Harman D. K. (1998) "Overview of the Seventh Text REtrieval Conference", NIST, 7th Text Retrieval Conference (TREC-7), pp. 1-24, Gaithersburg, MD, USA, November.
    • (1998) , pp. 1-24
    • Voorhees, E.1    Harman, D.K.2
  • 106
    • 0002748692 (Scopus)
    • Okapi at TREC-6 Automatic Ad Hoc, VLC, Routing, Filtering and QSDR
    • 6th Text Retrieval Conference (TREC-6), Gaithersburg, MD, USA, November
    • Walker S., Robertson S. E., Boughanem M., Jones G. J. F. and Spärck Jones K. (1997) "Okapi at TREC-6 Automatic Ad Hoc, VLC, Routing, Filtering and QSDR", 6th Text Retrieval Conference (TREC-6), pp. 125-136, Gaithersburg, MD, USA, November.
    • (1997) , pp. 125-136
    • Walker, S.1    Robertson, S.E.2    Boughanem, M.3    Jones, G.J.F.4    Spärck Jones, K.5
  • 107
    • 0010052837 (Scopus)
    • Spoken Document Retrieval Based on Phoneme Recognition
    • PhD Thesis, Swiss Federal Institute of Technology (ETH), Zurich
    • Wechsler M. (1998) "Spoken Document Retrieval Based on Phoneme Recognition", PhD Thesis, Swiss Federal Institute of Technology (ETH), Zurich.
    • (1998)
    • Wechsler, M.1
  • 108
    • 0032282577 (Scopus)
    • New Techniques for Open-Vocabulary Spoken Document Retrieval
    • 21st Annual ACM Conference on Research and Development in Information Retrieval (SIGIR'98), Melbourne, Australia, August
    • Wechsler M., Munteanu E. and Schäuble P. (1998) "New Techniques for Open-Vocabulary Spoken Document Retrieval", 21st Annual ACM Conference on Research and Development in Information Retrieval (SIGIR'98), pp. 20-27, Melbourne, Australia, August.
    • (1998) , pp. 20-27
    • Wechsler, M.1    Munteanu, E.2    Schäuble, P.3
  • 109
    • 33745207743 (Scopus)
    • SAMPA computer readable phonetic alphabet
    • in Handbook of Standards and Resources for Spoken Language Systems, D. Gibbon, R. Moore and R. Winski (eds), Mouton de Gruyter, Berlin and New York
    • Wells J. C. (1997) "SAMPA computer readable phonetic alphabet", in Handbook of Standards and Resources for Spoken Language Systems, D. Gibbon, R. Moore and R. Winski (eds), Mouton de Gruyter, Berlin and New York.
    • (1997)
    • Wells, J.C.1
  • 110
    • 0025517070 (Scopus)
    • Automatic Recognition of Keywords in Unconstrained Speech Using Hidden Markov Models
    • Wilpon J. G., Rabiner L. R. and Lee C.-H. (1990) "Automatic Recognition of Keywords in Unconstrained Speech Using Hidden Markov Models", IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 38, no. 11, pp. 1870-1878.
    • (1990) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.38 , Issue.11 , pp. 1870-1878
    • Wilpon, J.G.1    Rabiner, L.R.2    Lee, C.-H.3
  • 111
    • 84889299571 (Scopus)
    • Speech Recognition and Information Retrieval: Experiments in Retrieving Spoken Documents
    • DARPA Speech Recognition Workshop, Chantilly, VA, USA, February
    • Witbrock M. and Hauptmann A. G. (1997) "Speech Recognition and Information Retrieval: Experiments in Retrieving Spoken Documents", DARPA Speech Recognition Workshop, Chantilly, VA, USA, February.
    • (1997)
    • Witbrock, M.1    Hauptmann, A.G.2
  • 112
    • 85009089367 (Scopus)
    • A Hybrid Word/Phoneme-Based Approach for Improved Vocabulary-Independent Search in Spontaneous Speech
    • ICSLP'2004, Jeju Island, Korea, October
    • Yu P. and Seide F. T. B. (2004) "A Hybrid Word/Phoneme-Based Approach for Improved Vocabulary-Independent Search in Spontaneous Speech", ICSLP'2004, Jeju Island, Korea, October.
    • (2004)
    • Yu, P.1    Seide, F.T.B.2
  • 113
    • 85029488480 (Scopus)
    • Fast and Practical Approximate String Matching
    • Combinatorial Pattern Matching, Third Annual Symposium, Barcelona, Spain
    • Baeza-Yates R. (1992) "Fast and Practical Approximate String Matching", Combinatorial Pattern Matching, Third Annual Symposium, pp. 185-192, Barcelona, Spain.
    • (1992) , pp. 185-192
    • Baeza-Yates, R.1
  • 114
    • 13344269607 (Scopus)
    • Evaluation of Distance Measures for MPEG-7 Melody Contours
    • International Workshop on Multimedia Signal Processing, IEEE Signal Processing Society, Siena, Italy
    • Batke J. M., Eisenberg G., Weishaupt P. and Sikora T. (2004a) "Evaluation of Distance Measures for MPEG-7 Melody Contours", International Workshop on Multimedia Signal Processing, IEEE Signal Processing Society, Siena, Italy.
    • (2004)
    • Batke, J.M.1    Eisenberg, G.2    Weishaupt, P.3    Sikora, T.4
  • 115
    • 84886074605 (Scopus)
    • A Query by Humming System Using MPEG-7 Descriptors
    • Proceedings of the 116th AES Convention, AES, Berlin, Germany
    • Batke J. M., Eisenberg G., Weishaupt P. and Sikora T. (2004b) "A Query by Humming System Using MPEG-7 Descriptors", Proceedings of the 116th AES Convention, AES, Berlin, Germany.
    • (2004)
    • Batke, J.M.1    Eisenberg, G.2    Weishaupt, P.3    Sikora, T.4
  • 116
    • 0001835850 (Scopus)
    • Accurate Short-term Analysis of the Fundamental Frequency and the Harmonics-to-Noise Ratio of a Sampled Sound
    • IFA Proceedings 17, Institute of Phonetic Sciences of the University of Amsterdam, the Netherlands
    • Boersma P. (1993) "Accurate Short-term Analysis of the Fundamental Frequency and the Harmonics-to-Noise Ratio of a Sampled Sound", IFA Proceedings 17, Institute of Phonetic Sciences of the University of Amsterdam, the Netherlands.
    • (1993)
    • Boersma, P.1
  • 117
    • 0036031477 (Scopus)
    • Melody Retrieval on the Web
    • Proceedings of ACM/SPIE Conference on Multimedia Computing and Networking, Boston, MA, USA
    • Chai W. and Vercoe B. (2002) "Melody Retrieval on the Web", Proceedings of ACM/SPIE Conference on Multimedia Computing and Networking, Boston, MA, USA.
    • (2002)
    • Chai, W.1    Vercoe, B.2
  • 119
    • 13344261703 (Scopus)
    • BeatBank - An MPEG-7 compliant query by tapping system
    • Proceedings of the 116th AES Convention, Berlin, Germany
    • Eisenberg G., Batke J. M. and Sikora T. (2004) "BeatBank - An MPEG-7 compliant query by tapping system", Proceedings of the 116th AES Convention, Berlin, Germany.
    • (2004)
    • Eisenberg, G.1    Batke, J.M.2    Sikora, T.3
  • 120
    • 0033677009 (Scopus)
    • A Robust Predominant-f0 Estimation Method for Real-time Detection of Melody and Bass Lines in CD Recordings
    • Proceedings of ICASSP, Tokyo, Japan
    • Goto M. (2000) "A Robust Predominant-f0 Estimation Method for Real-time Detection of Melody and Bass Lines in CD Recordings", Proceedings of ICASSP, pp. 757-760, Tokyo, Japan.
    • (2000) , pp. 757-760
    • Goto, M.1
  • 121
    • 0034848863 (Scopus)
    • A Predominant-f0 Estimation Method for CD Recordings: Map Estimation Using EM Algorithm for Adaptive Tone Models
    • Proceedings of ICASSP, pp. V-3365-3368, Tokyo, Japan
    • Goto M. (2001) "A Predominant-f0 Estimation Method for CD Recordings: Map Estimation Using EM Algorithm for Adaptive Tone Models", Proceedings of ICASSP, pp. V-3365-3368, Tokyo, Japan.
    • (2001)
    • Goto, M.1
  • 122
    • 11844270131 (Scopus)
    • Techniques for the Automated Analysis of Musical Audio
    • PhD Thesis, University of Cambridge, Cambridge, UK
    • Hainsworth S. W. (2003) "Techniques for the Automated Analysis of Musical Audio", PhD Thesis, University of Cambridge, Cambridge, UK.
    • (2003)
    • Hainsworth, S.W.1
  • 123
    • 84889440508 (Scopus)
    • An Audio Front-End for Query-by-Humming Systems
    • 2nd Annual International Symposium on Music Information Retrieval, ISMIR, Bloomington, IN, USA
    • Haus G. and Pollastri E. (2001) "An Audio Front-End for Query-by-Humming Systems", 2nd Annual International Symposium on Music Information Retrieval, ISMIR, Bloomington, IN, USA.
    • (2001)
    • Haus, G.1    Pollastri, E.2
  • 124
    • 84889456437 (Scopus)
    • GUIDO/MIR - An experimental musical information retrieval system based on Guido music notation
    • Proceedings of the Second Annual International Symposium on Music Information Retrieval, Bloomington, IN, USA
    • Hoos H. H., Renz K. and Görg M. (2001) "GUIDO/MIR - An experimental musical information retrieval system based on Guido music notation", Proceedings of the Second Annual International Symposium on Music Information Retrieval, Bloomington, IN, USA.
    • (2001)
    • Hoos, H.H.1    Renz, K.2    Görg, M.3
  • 125
    • 0003455850 (Scopus)
    • Information Technology - Multimedia Content Description Interface - Part 4: Audio
    • ISO, 15938-4:2001(E)
    • ISO (2001a) Information Technology - Multimedia Content Description Interface - Part 4: Audio, 15938-4:2001(E).
    • (2001)
  • 126
    • 0012179370 (Scopus)
    • Information Technology - Multimedia Content Description Interface - Part 5: Multimedia Description Schemes
    • ISO, 15938-5:2001(E)
    • ISO (2001b) Information Technology - Multimedia Content Description Interface - Part 5: Multimedia Description Schemes, 15938-5:2001(E).
    • (2001)
  • 127
    • 0005008397 (Scopus)
    • Analysis of a Contour-based Representation for Melody
    • Proceedings of the International Symposium on Music Information Retrieval, Boston, MA, USA
    • Kim Y. E., Chai W., Garcia R. and Vercoe B. (2000) "Analysis of a Contour-based Representation for Melody", Proceedings of the International Symposium on Music Information Retrieval, Boston, MA, USA.
    • (2000)
    • Kim, Y.E.1    Chai, W.2    Garcia, R.3    Vercoe, B.4
  • 128
    • 84889324368 (Scopus)
    • Means of Integrating Audio Content Analysis Algorithms
    • 110th Audio Engineering Society Convention, Amsterdam, the Netherlands
    • Klapuri A. (2001) "Means of Integrating Audio Content Analysis Algorithms", 110th Audio Engineering Society Convention, Amsterdam, the Netherlands.
    • (2001)
    • Klapuri, A.1
  • 129
    • 33748519104 (Scopus)
    • Signal Processing Methods for the Automatic Transcription of Music
    • PhD Thesis, Tampere University of Technology, Tampere, Finland
    • Klapuri A. (2004) "Signal Processing Methods for the Automatic Transcription of Music", PhD Thesis, Tampere University of Technology, Tampere, Finland.
    • (2004)
    • Klapuri, A.1
  • 130
    • 84948666520 (Scopus)
    • Efficient Calculation of a Physiologically-motivated Representation for Sound
    • IEEE International Conference on Digital Signal Processing, Santorini, Greece
    • Klapuri A. P. and Astola J. T. (2002) "Efficient Calculation of a Physiologically-motivated Representation for Sound", IEEE International Conference on Digital Signal Processing, Santorini, Greece.
    • (2002)
    • Klapuri, A.P.1    Astola, J.T.2
  • 131
    • 0003769779 (Scopus)
    • Introduction to MPEG-7
    • 1st Edition, John Wiley & Sons, Ltd, Chichester
    • Manjunath B. S., Salembier P. and Sikora T. (eds) (2002) Introduction to MPEG-7, 1st Edition, John Wiley & Sons, Ltd, Chichester.
    • (2002)
    • Manjunath, B.S.1    Salembier, P.2    Sikora, T.3
  • 132
    • 0037728485 (Scopus)
    • Signal Processing for Melody Transcription
    • Proceedings of the 19th Australasian Computer Science Conference, Waikato, New Zealand
    • McNab R. J., Smith L. A. and Witten I. H. (1996a) "Signal Processing for Melody Transcription", Proceedings of the 19th Australasian Computer Science Conference, Waikato, New Zealand.
    • (1996)
    • McNab, R.J.1    Smith, L.A.2    Witten, I.H.3
  • 133
    • 0029695822 (Scopus)
    • Towards the Digital Music Library: Tune retrieval from acoustic input
    • Proceedings of the first ACM International Conference on Digital Libraries, Bethesda, MD, USA
    • McNab R. J., Smith L. A., Witten I. H., Henderson C. L. and Cunningham S. J. (1996b) "Towards the Digital Music Library: Tune retrieval from acoustic input", Proceedings of the first ACM International Conference on Digital Libraries, pp. 11-18, Bethesda, MD, USA.
    • (1996) , pp. 11-18
    • McNab, R.J.1    Smith, L.A.2    Witten, I.H.3    Henderson, C.L.4    Cunningham, S.J.5
  • 134
    • 0025740746 (Scopus)
    • Virtual Pitch and Phase Sensitivity of a Computer Model of the Auditory Periphery. I: Pitch identification
    • Meddis R. and Hewitt M. J. (1991) "Virtual Pitch and Phase Sensitivity of a Computer Model of the Auditory Periphery. I: Pitch identification", Journal of the Acoustical Society of America, vol. 89, no. 6, pp. 2866-2882.
    • (1991) Journal of the Acoustical Society of America , vol.89 , Issue.6 , pp. 2866-2882
    • Meddis, R.1    Hewitt, M.J.2
  • 135
    • 84889359896 (Scopus)
    • Die Ganze Musik im Internet
    • Musicline, QBH system provided by phononet GmbH
    • Musicline (n.d.) "Die Ganze Musik im Internet", QBH system provided by phononet GmbH.
  • 136
    • 84889317725 (Scopus)
    • Musipedia, the open music encyclopedia
    • Musipedia
    • Musipedia (2004) "Musipedia, the open music encyclopedia", www.musipedia.org.
    • (2004)
  • 137
    • 84889386344 (Scopus)
    • Information technology - Multimedia content description interface - Part 4: Audio, AMENDMENT 1: Audio extensions
    • N57, Audio Group Text of ISO/IEC 15938-4:2002/FDAM 1
    • N57 (2003) Information technology - Multimedia content description interface - Part 4: Audio, AMENDMENT 1: Audio extensions, Audio Group Text of ISO/IEC 15938-4:2002/FDAM 1.
    • (2003)
  • 139
    • 0031972902 (Scopus)
    • Tempo and Beat Analysis of Acoustic Musical Signals
    • Scheirer E. D. (1998) "Tempo and Beat Analysis of Acoustic Musical Signals", Journal of the Acoustical Society of America, vol. 103, no. 1, pp. 588-601.
    • (1998) Journal of the Acoustical Society of America , vol.103 , Issue.1 , pp. 588-601
    • Scheirer, E.D.1
  • 140
    • 84889269903 (Scopus)
    • Pitch Detection of the Singing Voice in Musical Audio
    • Proceedings of the 114th AES Convention, Amsterdam, the Netherlands
    • Shandilya S. and Rao P. (2003) "Pitch Detection of the Singing Voice in Musical Audio", Proceedings of the 114th AES Convention, Amsterdam, the Netherlands.
    • (2003)
    • Shandilya, S.1    Rao, P.2
  • 141
    • 4744373951 (Scopus)
    • Music Information Retrieval Technology
    • PhD Thesis, Royal Melbourne Institute of Technology, Melbourne, Australia
    • Uitdenbogerd A. L. (2002) "Music Information Retrieval Technology", PhD Thesis, Royal Melbourne Institute of Technology, Melbourne, Australia.
    • (2002)
    • Uitdenbogerd, A.L.1
  • 142
    • 0033279561 (Scopus)
    • Matching Techniques for Large Music Databases
    • Proceedings of the ACM Multimedia Conference (ed. D. Bulterman, K. Jeffay and H. J. Zhang), Orlando, Florida
    • Uitdenbogerd A. L. and Zobel J. (1999) "Matching Techniques for Large Music Databases", Proceedings of the ACM Multimedia Conference (ed. D. Bulterman, K. Jeffay and H. J. Zhang), pp. 57-66, Orlando, Florida.
    • (1999) , pp. 57-66
    • Uitdenbogerd, A.L.1    Zobel, J.2
  • 143
    • 13344250717 (Scopus)
    • Music Ranking Techniques Evaluated
    • Proceedings of the Australasian Computer Science Conference (ed. M. Oudshoorn), Melbourne, Australia
    • Uitdenbogerd A. L. and Zobel J. (2002) "Music Ranking Techniques Evaluated", Proceedings of the Australasian Computer Science Conference (ed. M. Oudshoorn), pp. 275-283, Melbourne, Australia.
    • (2002) , pp. 275-283
    • Uitdenbogerd, A.L.1    Zobel, J.2
  • 144
    • 84866006845 (Scopus)
    • A Probabilistic Model for the Transcription of Single-voice Melodies
    • Finnish Signal Processing Symposium, FINSIG, Tampere University of Technology, Tampere, Finland
    • Viitaniemi T., Klapuri A. and Eronen A. (2003) "A Probabilistic Model for the Transcription of Single-voice Melodies", Finnish Signal Processing Symposium, FINSIG, Tampere University of Technology, Tampere, Finland.
    • (2003)
    • Viitaniemi, T.1    Klapuri, A.2    Eronen, A.3
  • 145
    • 84855721130 (Scopus)
    • Wikipedia, the free encyclopedia
    • Wikipedia
    • Wikipedia (2001) "Wikipedia, the free encyclopedia", http://en.wikipedia.org.
    • (2001)
  • 146
    • 4243152700 (Scopus)
    • Content-based Identification of Audio Material Using MPEG-7 Low Level Description
    • International Symposium on Music Information Retrieval, Bloomington, IN, USA, October
    • Allamanche E., Herre J., Hellmuth O., Fröba B., Kasten T. and Cremer M. (2001) "Content-based Identification of Audio Material Using MPEG-7 Low Level Description", International Symposium on Music Information Retrieval, Bloomington, IN, USA, October.
    • (2001)
    • Allamanche, E.1    Herre, J.2    Hellmuth, O.3    Fröba, B.4    Kasten, T.5    Cremer, M.6
  • 147
    • 0005540823 (Scopus)
    • Modern Information Retrieval
    • Addison-Wesley, Reading, MA
    • Baeza-Yates R. and Ribeiro-Neto B. (1999) Modern Information Retrieval, Addison-Wesley, Reading, MA.
    • (1999)
    • Baeza-Yates, R.1    Ribeiro-Neto, B.2
  • 148
    • 29344471330 (Scopus)
    • Automatic Song Identification in Noisy Broadcast Audio
    • International Conference on Signal and Image Processing (SIP 2002), Kauai, HI, USA, August
    • Batlle E., Masip J. and Guaus E. (2002) "Automatic Song Identification in Noisy Broadcast Audio", International Conference on Signal and Image Processing (SIP 2002), Kauai, HI, USA, August.
    • (2002)
    • Batlle, E.1    Masip, J.2    Guaus, E.3
  • 149
    • 4243471699 (Scopus)
    • Method and Article of Manufacture for Content-Based Analysis, Storage, Retrieval and Segmentation of Audio Information
    • US Patent 5,918,223
    • Blum T., Keislar D., Wheaton J. and Wold E. (1999) "Method and Article of Manufacture for Content-Based Analysis, Storage, Retrieval and Segmentation of Audio Information", US Patent 5,918,223.
    • (1999)
    • Blum, T.1    Keislar, D.2    Wheaton, J.3    Wold, E.4
  • 150
    • 17444446371 (Scopus)
    • Extracting Noise-Robust Features from Audio Data
    • ICASSP 2002, Orlando, FL, USA, May
    • Burges C., Platt J. and Jana S. (2002) "Extracting Noise-Robust Features from Audio Data", ICASSP 2002, Orlando, FL, USA, May.
    • (2002)
    • Burges, C.1    Platt, J.2    Jana, S.3
  • 151
    • 84889321947 (Scopus)
    • Statistical Significance in Song-Spotting in Audio
    • International Symposium on Music Information Retrieval (MUSIC IR 2001), Bloomington, IN, USA, October
    • Cano P., Kaltenbrunner M., Mayor O. and Batlle E. (2001) "Statistical Significance in Song-Spotting in Audio", International Symposium on Music Information Retrieval (MUSIC IR 2001), Bloomington, IN, USA, October.
    • (2001)
    • Cano, P.1    Kaltenbrunner, M.2    Mayor, O.3    Batlle, E.4
  • 152
    • 84942244978 (Scopus)
    • A Review of Algorithms for Audio Fingerprinting
    • International Workshop on Multimedia Signal Processing (MMSP 2002), St Thomas, Virgin Islands, December
    • Cano P., Batlle E., Kalker T. and Haitsma J. (2002a) "A Review of Algorithms for Audio Fingerprinting", International Workshop on Multimedia Signal Processing (MMSP 2002), St Thomas, Virgin Islands, December.
    • (2002)
    • Cano, P.1    Batlle, E.2    Kalker, T.3    Haitsma, J.4
  • 153
    • 84889413586 (Scopus)
    • Robust Sound Modeling for Song Detection in Broadcast Audio
    • AES 112th International Convention, Munich, Germany, May
    • Cano P., Batlle E., Mayer H. and Neuschmied H. (2002b) "Robust Sound Modeling for Song Detection in Broadcast Audio", AES 112th International Convention, Munich, Germany, May.
    • (2002)
    • Cano, P.1    Batlle, E.2    Mayer, H.3    Neuschmied, H.4
  • 154
    • 84889297121 (Scopus)
    • Audio Fingerprinting: Concepts and Applications
    • International Conference on Fuzzy Systems Knowledge Discovery (FSKD'02), Singapore, November
    • Cano P., Gómez E., Batlle E., Gomes L. and Bonnet M. (2002c) "Audio Fingerprinting: Concepts and Applications", International Conference on Fuzzy Systems Knowledge Discovery (FSKD'02), Singapore, November.
    • (2002)
    • Cano, P.1    Gómez, E.2    Batlle, E.3    Gomes, L.4    Bonnet, M.5
  • 157
    • 84889342431 (Scopus)
    • Mixed Watermarking-Fingerprinting Approach for Integrity Verification of Audio Recordings
    • International Telecommunications Symposium (ITS 2002), Natal, Brazil, September
    • Gómez E., Cano P., Gomes L., Batlle E. and Bonnet M. (2002) "Mixed Watermarking-Fingerprinting Approach for Integrity Verification of Audio Recordings", International Telecommunications Symposium (ITS 2002), Natal, Brazil, September.
    • (2002)
    • Gómez, E.1    Cano, P.2    Gomes, L.3    Batlle, E.4    Bonnet, M.5
  • 158
    • 33845940056 (Scopus)
    • A Highly Robust Audio Fingerprinting System
    • 3rd International Conference on Music Information Retrieval (ISMIR2002), Paris, France, October
    • Haitsma J. and Kalker T. (2002) "A Highly Robust Audio Fingerprinting System", 3rd International Conference on Music Information Retrieval (ISMIR2002), Paris, France, October.
    • (2002)
    • Haitsma, J.1    Kalker, T.2
  • 159
    • 84942246936 (Scopus)
    • Scalable Robust Audio Fingerprinting Using MPEG-7 Content Description
    • IEEE Workshop on Multimedia Signal Processing (MMSP 2002), Virgin Islands, December
    • Herre J., Hellmuth O. and Cremer M. (2002) "Scalable Robust Audio Fingerprinting Using MPEG-7 Content Description", IEEE Workshop on Multimedia Signal Processing (MMSP 2002), Virgin Islands, December.
    • (2002)
    • Herre, J.1    Hellmuth, O.2    Cremer, M.3
  • 160
    • 84889446449 (Scopus)
    • Applications and Challenges for Audio Fingerprinting
    • 111th AES Convention, New York, USA, December
    • Kalker T. (2001) "Applications and Challenges for Audio Fingerprinting", 111th AES Convention, New York, USA, December.
    • (2001)
    • Kalker, T.1
  • 161
    • 84889465844 (Scopus)
    • Signal Recognition System and Method
    • US Patent 5,210,820
    • Kenyon S. (1999) "Signal Recognition System and Method", US Patent 5,210,820.
    • (1999)
    • Kenyon, S.1
  • 162
    • 0034849207 (Scopus)
    • Very Quick Audio Searching: Introducing Global Pruning to the Time-Series Active Search
    • Salt Lake City, UT, USA, May
    • Kimura A., Kashino K., Kurozumi T. and Murase H. (2001) "Very Quick Audio Searching: Introducing Global Pruning to the Time-Series Active Search", ICASSP'01, vol. 3, pp. 1429-1432, Salt Lake City, UT, USA, May.
    • (2001) ICASSP'01 , vol.3 , pp. 1429-1432
    • Kimura, A.1    Kashino, K.2    Kurozumi, T.3    Murase, H.4
  • 163
    • 84873545221 (Scopus)
    • Identification of Highly Distorted Audio Material for Querying Large Scale Databases
    • 112th AES International Convention, Munich, Germany, May
    • Kurth F., Ribbrock A. and Clausen M. (2002) "Identification of Highly Distorted Audio Material for Querying Large Scale Databases", 112th AES International Convention, Munich, Germany, May.
    • (2002)
    • Kurth, F.1    Ribbrock, A.2    Clausen, M.3
  • 165
    • 0025489558 (Scopus)
    • Detecting and Logging Advertisements Using its Sound
    • Lourens J. G. (1990) "Detecting and Logging Advertisements Using its Sound", IEEE Transactions on Broadcasting, vol. 36, no. 3, pp. 231-233.
    • (1990) IEEE Transactions on Broadcasting , vol.36 , Issue.3 , pp. 231-233
    • Lourens, J.G.1
  • 166
    • 20444444996
    • A Perceptual Audio Hashing Algorithm: A Tool for Robust Audio Identification and Information Hiding
    • 4th Workshop on Information Hiding, Pittsburgh, PA, USA, April
    • Mihcak M. K. and Venkatesan R. (2001) "A Perceptual Audio Hashing Algorithm: A Tool for Robust Audio Identification and Information Hiding", 4th Workshop on Information Hiding, Pittsburgh, PA, USA, April.
    • (2001)
    • Mihcak, M.K.1    Venkatesan, R.2
  • 167
    • 0035099428
    • A New Approach to the Automatic Recognition of Musical Recordings
    • Papaodysseus C., Roussopoulos G., Fragoulis D. and Alexiou C. (2001) "A New Approach to the Automatic Recognition of Musical Recordings", Journal of the AES, vol. 49, no. 1/2, pp. 23-35.
    • (2001) Journal of the AES , vol.49 , Issue.1-2 , pp. 23-35
    • Papaodysseus, C.1    Roussopoulos, G.2    Fragoulis, D.3    Alexiou, C.4
  • 168
    • 4744354885
    • Request for Information on Audio Fingerprinting Technologies
    • RIAA/IFPI, available at
    • RIAA/IFPI (2001) "Request for Information on Audio Fingerprinting Technologies", available at http://www.ifpi.org/site-content/press/20010615.html.
    • (2001)
  • 169
    • 0034478682
    • Short-term Sound Stream Characterization for Reliable, Real-Time Occurrence Monitoring of Given Sound-Prints
    • 10th IEEE Mediterranean Electrotechnical Conference (MELECON 2000), Cyprus, May
    • Richly G., Varga L., Kovács F. and Hosszú G. (2000) "Short-term Sound Stream Characterization for Reliable, Real-Time Occurrence Monitoring of Given Sound-Prints", 10th IEEE Mediterranean Electrotechnical Conference (MELECON 2000), pp. 29-31, Cyprus, May.
    • (2000) , pp. 29-31
    • Richly, G.1    Varga, L.2    Kovács, F.3    Hosszú, G.4
  • 170
    • 0030681105
    • Transform-Based Indexing of Audio Data for Multimedia Databases
    • IEEE International Conference on Multimedia Computing and Systems (ICMCS '97), Ottawa, Canada, June
    • Subramanya S., Simha R., Narahari B. and Youssef A. (1997) "Transform-Based Indexing of Audio Data for Multimedia Databases", IEEE International Conference on Multimedia Computing and Systems (ICMCS '97), pp. 211-218, Ottawa, Canada, June.
    • (1997) , pp. 211-218
    • Subramanya, S.1    Simha, R.2    Narahari, B.3    Youssef, A.4
  • 171
    • 85143189691
    • Modulation Frequency Features for Audio Fingerprinting
    • ICASSP 2002, Orlando, FL, USA, May
    • Sukittanon S. and Atlas L. (2002) "Modulation Frequency Features for Audio Fingerprinting", ICASSP 2002, Orlando, FL, USA, May.
    • (2002)
    • Sukittanon, S.1    Atlas, L.2
  • 172
    • 0004172718
    • Pattern Recognition
    • Academic Press, San Diego, CA
    • Theodoridis S. and Koutroumbas K. (1998) Pattern Recognition, Academic Press, San Diego, CA.
    • (1998)
    • Theodoridis, S.1    Koutroumbas, K.2
  • 173
    • 84894907010
    • Semantic Video Retrieval Using Audio Analysis
    • Proceedings CIVR 2002, London, UK, July
    • Bakker E. M. and Lew M. S. (2002) "Semantic Video Retrieval Using Audio Analysis", Proceedings CIVR 2002, pp. 271-277, London, UK, July.
    • (2002) , pp. 271-277
    • Bakker, E.M.1    Lew, M.S.2
  • 174
    • 0031233424
    • Speaker Recognition: A Tutorial
    • Campbell J. P. (1997) "Speaker Recognition: A Tutorial", Proceedings of the IEEE, vol. 85, no. 9, pp. 1437-1462.
    • (1997) Proceedings of the IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.P.1
  • 175
    • 0002595416
    • Speaker Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion
    • DARPA Broadcast News Transcription and Understanding Workshop 1998, Lansdowne, VA, USA, February
    • Chen S. and Gopalakrishnan P. (1998) "Speaker Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion", DARPA Broadcast News Transcription and Understanding Workshop 1998, Lansdowne, VA, USA, February.
    • (1998)
    • Chen, S.1    Gopalakrishnan, P.2
  • 176
    • 6344242294
    • Detection of Soccer Goal Shots Using Joint Multimedia Features and Classification Rules
    • Proceedings of the Fourth International Workshop on Multimedia Data Mining (MDM/KDD2003), Washington, DC, USA, August
    • Chen S.-C., Shyu M.-L., Zhang C., Luo L. and Chen M. (2003) "Detection of Soccer Goal Shots Using Joint Multimedia Features and Classification Rules", Proceedings of the Fourth International Workshop on Multimedia Data Mining (MDM/KDD2003), pp. 36-44, Washington, DC, USA, August.
    • (2003) , pp. 36-44
    • Chen, S.-C.1    Shyu, M.-L.2    Zhang, C.3    Luo, L.4    Chen, M.5
  • 177
    • 85009212151
    • A Sequential Metric-Based Audio Segmentation Method via the Bayesian Information Criterion
    • Proceedings EUROSPEECH 2003, Geneva, Switzerland, September
    • Cheng S.-S. and Wang H.-M. (2003) "A Sequential Metric-Based Audio Segmentation Method via the Bayesian Information Criterion", Proceedings EUROSPEECH 2003, Geneva, Switzerland, September.
    • (2003)
    • Cheng, S.-S.1    Wang, H.-M.2
  • 178
    • 84948186412
    • Non-Negative Component Parts of Sound for Classification
    • IEEE International Symposium on Signal Processing and Information Technology, Darmstadt, Germany, December
    • Cho Y.-C., Choi S. and Bang S.-Y. (2003) "Non-Negative Component Parts of Sound for Classification", IEEE International Symposium on Signal Processing and Information Technology, Darmstadt, Germany, December.
    • (2003)
    • Cho, Y.-C.1    Choi, S.2    Bang, S.-Y.3
  • 179
    • 0030381663
    • Unsupervised Speaker Segmentation in Telephone Conversations
    • Proceedings, Nineteenth Convention of Electrical and Electronics Engineers, Israel
    • Cohen A. and Lapidus V. (1996) "Unsupervised Speaker Segmentation in Telephone Conversations", Proceedings, Nineteenth Convention of Electrical and Electronics Engineers, Israel, pp. 102-105.
    • (1996) , pp. 102-105
    • Cohen, A.1    Lapidus, V.2
  • 180
    • 0035500783
    • Speech Enhancement for Non-Stationary Environments
    • Cohen I. and Berdugo B. (2001) "Speech Enhancement for Non-Stationary Environments", Signal Processing, vol. 81, pp. 2403-2418.
    • (2001) Signal Processing , vol.81 , pp. 2403-2418
    • Cohen, I.1    Berdugo, B.2
  • 181
    • 0034273195
    • DISTBIC: A Speaker-Based Segmentation for Audio Data Indexing
    • Delacourt P. and Wellekens C. J. (2000) "DISTBIC: A Speaker-Based Segmentation for Audio Data Indexing", Speech Communication, vol. 32, pp. 111-126.
    • (2000) Speech Communication , vol.32 , pp. 111-126
    • Delacourt, P.1    Wellekens, C.J.2
  • 182
    • 0003578015
    • Cluster Analysis
    • 3rd Edition, Oxford University Press, New York
    • Everitt B. S. (1993) Cluster Analysis, 3rd Edition, Oxford University Press, New York.
    • (1993)
    • Everitt, B.S.1
  • 184
    • 0000808717
    • Partitioning and Transcription of Broadcast News Data
    • Proceedings of ICSLP 1998, Sydney, Australia, November
    • Gauvain J. L., Lamel L. and Adda G. (1998) "Partitioning and Transcription of Broadcast News Data", Proceedings of ICSLP 1998, Sydney, Australia, November.
    • (1998)
    • Gauvain, J.L.1    Lamel, L.2    Adda, G.3
  • 186
    • 0026400244
    • Segregation of Speaker for Speech Recognition and Speaker Identification
    • Proceedings of ICASSP, Toronto, Canada, May
    • Gish H., Siu M.-H. and Rohlicek R. (1991) "Segregation of Speaker for Speech Recognition and Speaker Identification", Proceedings of ICASSP, pp. 873-876, Toronto, Canada, May.
    • (1991) , pp. 873-876
    • Gish, H.1    Siu, M.-H.2    Rohlicek, R.3
  • 187
    • 0025041264
    • Perceptual Linear Predictive (PLP) Analysis of Speech
    • Hermansky H. (1990) "Perceptual Linear Predictive (PLP) Analysis of Speech", Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752.
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 190
    • 0033692969
    • Strategies for Automatic Segmentation of Audio Data
    • Proceedings ICASSP 2000, Istanbul, Turkey, June
    • Kemp T., Schmidt M., Westphal M. and Waibel A. (2000) "Strategies for Automatic Segmentation of Audio Data", Proceedings ICASSP 2000, Istanbul, Turkey, June.
    • (2000)
    • Kemp, T.1    Schmidt, M.2    Westphal, M.3    Waibel, A.4
  • 191
    • 8844234947
    • Automatic Segmentation of Speakers in Broadcast Audio Material
    • IS&T/SPIE's Electronic Imaging 2004, San Jose, CA, USA, January
    • Kim H.-G. and Sikora T. (2004a) "Automatic Segmentation of Speakers in Broadcast Audio Material", IS&T/SPIE's Electronic Imaging 2004, San Jose, CA, USA, January.
    • (2004)
    • Kim, H.-G.1    Sikora, T.2
  • 192
    • 4544361760
    • Comparison of MPEG-7 Audio Spectrum Projection Features and MFCC Applied to Speaker Recognition, Sound Classification and Audio Segmentation
    • Proceedings ICASSP 2004, Montreal, Canada, May
    • Kim H.-G. and Sikora T. (2004b) "Comparison of MPEG-7 Audio Spectrum Projection Features and MFCC Applied to Speaker Recognition, Sound Classification and Audio Segmentation", Proceedings ICASSP 2004, Montreal, Canada, May.
    • (2004)
    • Kim, H.-G.1    Sikora, T.2
  • 193
    • 85009101164
    • Speech Enhancement based on Smoothing of Spectral Noise Floor
    • Proceedings INTERSPEECH 2004 - ICSLP, Jeju Island, South Korea, October
    • Kim H.-G. and Sikora T. (2004c) "Speech Enhancement based on Smoothing of Spectral Noise Floor", Proceedings INTERSPEECH 2004 - ICSLP, Jeju Island, South Korea, October.
    • (2004)
    • Kim, H.-G.1    Sikora, T.2
  • 195
    • 84889446323
    • Speaker Change Detection and Tracking in Real-time News Broadcasting Analysis
    • Proceedings 9th ACM International Conference on Multimedia, 2001, Ottawa, Canada, October
    • Lu L. and Zhang H.-J. (2001) "Speaker Change Detection and Tracking in Real-time News Broadcasting Analysis", Proceedings 9th ACM International Conference on Multimedia, 2001, pp. 203-211, Ottawa, Canada, October.
    • (2001) , pp. 203-211
    • Lu, L.1    Zhang, H.-J.2
  • 196
    • 84889394943
    • A Robust Audio Classification and Segmentation Method
    • Proceedings 10th ACM International Conference on Multimedia, 2002, Juan les Pins, France, December
    • Lu L., Jiang H. and Zhang H.-J. (2002) "A Robust Audio Classification and Segmentation Method", Proceedings 10th ACM International Conference on Multimedia, 2002, Juan les Pins, France, December.
    • (2002)
    • Lu, L.1    Jiang, H.2    Zhang, H.-J.3
  • 197
    • 0032667465
    • Tracking Speech-presence Uncertainty to Improve Speech Enhancement in Non-stationary Noise Environments
    • Phoenix, AZ, USA, March
    • Malah D., Cox R. and Accardi A. (1999) "Tracking Speech-presence Uncertainty to Improve Speech Enhancement in Non-stationary Noise Environments", Proceedings ICASSP 1999, vol. 2, pp. 789-792, Phoenix, AZ, USA, March.
    • (1999) Proceedings ICASSP 1999 , vol.2 , pp. 789-792
    • Malah, D.1    Cox, R.2    Accardi, A.3
  • 198
    • 33745186799
    • Phonetic Confusion Based Document Expansion for Spoken Document Retrieval
    • ICSLP Interspeech 2004, Jeju Island, Korea, October
    • Moreau N., Kim H.-G. and Sikora T. (2004) "Phonetic Confusion Based Document Expansion for Spoken Document Retrieval", ICSLP Interspeech 2004, Jeju Island, Korea, October.
    • (2004)
    • Moreau, N.1    Kim, H.-G.2    Sikora, T.3
  • 199
    • 0003425258
    • Digital Processing of Speech Signals
    • Prentice Hall (Signal Processing Series), Englewood Cliffs, NJ
    • Rabiner L. R. and Schafer R. W. (1978) Digital Processing of Speech Signals, Prentice Hall (Signal Processing Series), Englewood Cliffs, NJ.
    • (1978)
    • Rabiner, L.R.1    Schafer, R.W.2
  • 200
    • 85128386923
    • Blind Clustering of Speech Utterances Based on Speaker and Language Characteristics
    • Proceedings ICASSP 1998, Seattle, WA, USA, May
    • Reynolds D. A., Singer E., Carlson B. A., McLaughlin J. J., O'Leary G. C. and Zissman M. A. (1998) "Blind Clustering of Speech Utterances Based on Speaker and Language Characteristics", Proceedings ICASSP 1998, Seattle, WA, USA, May.
    • (1998)
    • Reynolds, D.A.1    Singer, E.2    Carlson, B.A.3    McLaughlin, J.J.4    O'Leary, G.C.5    Zissman, M.A.6
  • 201
    • 84889346572
    • Automatic Segmentation, Classification and Clustering of Broadcast News Audio
    • Proceedings of Speech Recognition Workshop, Chantilly, VA, USA, February
    • Siegler M. A., Jain U., Raj B. and Stern R. M. (1997) "Automatic Segmentation, Classification and Clustering of Broadcast News Audio", Proceedings of Speech Recognition Workshop, Chantilly, VA, USA, February.
    • (1997)
    • Siegler, M.A.1    Jain, U.2    Raj, B.3    Stern, R.M.4
  • 202
    • 85009265801
    • An Unsupervised, Sequential Learning Algorithm for the Segmentation of Speech Waveforms with Multiple Speakers
    • Proceedings ICASSP 1992, vol. 2, San Francisco, USA, March
    • Siu M.-H., Yu G. and Gish H. (1992) "An Unsupervised, Sequential Learning Algorithm for the Segmentation of Speech Waveforms with Multiple Speakers", Proceedings ICASSP 1992, vol. 2, pp. 189-192, San Francisco, USA, March.
    • (1992) , pp. 189-192
    • Siu, M.-H.1    Yu, G.2    Gish, H.3
  • 203
    • 84889324982
    • Speaker Tracking and Detection with Multiple Speakers
    • Seattle, WA, USA, May
    • Solomonoff A., Mielke A., Schmidt M. and Gish H. (1998) "Speaker Tracking and Detection with Multiple Speakers", Proceedings ICASSP 1998, vol. 2, pp. 757-760, Seattle, WA, USA, May.
    • (1998) Proceedings ICASSP 1998 , vol.2 , pp. 757-760
    • Solomonoff, A.1    Mielke, A.2    Schmidt, M.3    Gish, H.4
  • 204
    • 0037521928
    • Speaker Tracking and Detection with Multiple Speakers
    • Proceedings EUROSPEECH 1999, Budapest, Hungary, September
    • Sonmez K., Heck L. and Weintraub M. (1999) "Speaker Tracking and Detection with Multiple Speakers", Proceedings EUROSPEECH 1999, Budapest, Hungary, September.
    • (1999)
    • Sonmez, K.1    Heck, L.2    Weintraub, M.3
  • 205
    • 0033279679
    • Towards Robust Features for Classifying Audio in the CueVideo System
    • Proceedings 7th ACM International Conference on Multimedia, Ottawa, Canada, October
    • Srinivasan S., Petkovic D. and Ponceleon D. (1999) "Towards Robust Features for Classifying Audio in the CueVideo System", Proceedings 7th ACM International Conference on Multimedia, pp. 393-400, Ottawa, Canada, October.
    • (1999) , pp. 393-400
    • Srinivasan, S.1    Petkovic, D.2    Ponceleon, D.3
  • 206
    • 0027252184
    • Speech Segmentation and Clustering Based on Speaker Features
    • Minneapolis, USA, April
    • Sugiyama M., Murakami J. and Watanabe H. (1993) "Speech Segmentation and Clustering Based on Speaker Features", Proceedings ICASSP 1993, vol. 2, pp. 395-398, Minneapolis, USA, April.
    • (1993) Proceedings ICASSP 1993 , vol.2 , pp. 395-398
    • Sugiyama, M.1    Murakami, J.2    Watanabe, H.3
  • 207
    • 0003775661
    • Improved Speaker Segmentation and Segments Clustering Using the Bayesian Information Criterion
    • Proceedings EUROSPEECH 1999, Budapest, Hungary, September
    • Tritschler A. and Gopinath R. (1999) "Improved Speaker Segmentation and Segments Clustering Using the Bayesian Information Criterion", Proceedings EUROSPEECH 1999, Budapest, Hungary, September.
    • (1999)
    • Tritschler, A.1    Gopinath, R.2
  • 208
    • 11244258944
    • Sports Highlight Detection from Keyword Sequences Using HMM
    • Proceedings ICME 2004, Taipei, China, June
    • Wang J., Xu C., Chng E. S. and Tian Q. (2004) "Sports Highlight Detection from Keyword Sequences Using HMM", Proceedings ICME 2004, Taipei, China, June.
    • (2004)
    • Wang, J.1    Xu, C.2    Chng, E.S.3    Tian, Q.4
  • 209
    • 85032751556
    • Multimedia Content Analysis Using Audio and Visual Information
    • Wang Y., Liu Z. and Huang J. (2000) "Multimedia Content Analysis Using Audio and Visual Information", IEEE Signal Processing Magazine (invited paper), vol. 17, no. 6, pp. 12-36.
    • (2000) IEEE Signal Processing Magazine (invited paper) , vol.17 , Issue.6 , pp. 12-36
    • Wang, Y.1    Liu, Z.2    Huang, J.3
  • 210
    • 79952385877
    • Segmentation of Speech Using Speaker Identification
    • Proceedings ICASSP 1994, Adelaide, Australia, April
    • Wilcox L., Chen F., Kimber D. and Balasubramanian V. (1994) "Segmentation of Speech Using Speaker Identification", Proceedings ICASSP 1994, Adelaide, Australia, April.
    • (1994)
    • Wilcox, L.1    Chen, F.2    Kimber, D.3    Balasubramanian, V.4
  • 211
    • 84892177707
    • Experiments in Broadcast News Transcription
    • Proceedings ICASSP 1998, Seattle, WA, USA, May
    • Woodland P. C., Hain T., Johnson S., Niesler T., Tuerk A. and Young S. (1998) "Experiments in Broadcast News Transcription", Proceedings ICASSP 1998, Seattle, WA, USA, May.
    • (1998)
    • Woodland, P.C.1    Hain, T.2    Johnson, S.3    Niesler, T.4    Tuerk, A.5    Young, S.6
  • 212
    • 61949211336
    • UBM-Based Real-Time Speaker Segmentation for Broadcasting News
    • ICME 2003, Hong Kong, April
    • Wu T., Lu L., Chen K. and Zhang H.-J. (2003) "UBM-Based Real-Time Speaker Segmentation for Broadcasting News", ICME 2003, vol. 2, pp. 721-724, Hong Kong, April.
    • (2003) , vol.2 , pp. 721-724
    • Wu, T.1    Lu, L.2    Chen, K.3    Zhang, H.-J.4
  • 213
    • 0141743478
    • Audio Events Detection Based Highlights Extraction from Baseball, Golf and Soccer Games in a Unified Framework
    • Hong Kong, April
    • Xiong Z., Radhakrishnan R., Divakaran A. and Huang T. S. (2003) "Audio Events Detection Based Highlights Extraction from Baseball, Golf and Soccer Games in a Unified Framework", Proceedings ICASSP 2003, vol. 5, pp. 632-635, Hong Kong, April.
    • (2003) Proceedings ICASSP 2003 , vol.5 , pp. 632-635
    • Xiong, Z.1    Radhakrishnan, R.2    Divakaran, A.3    Huang, T.S.4
  • 214
    • 85009160774
    • An Improved Model-Based Speaker Segmentation System
    • Proceedings EUROSPEECH 2003, Geneva, Switzerland, September
    • Yu P., Seide F., Ma C. and Chang E. (2003) "An Improved Model-Based Speaker Segmentation System", Proceedings EUROSPEECH 2003, Geneva, Switzerland, September.
    • (2003)
    • Yu, P.1    Seide, F.2    Ma, C.3    Chang, E.4
  • 215
    • 85009089453
    • Unsupervised Audio Stream Segmentation and Clustering via the Bayesian Information Criterion
    • Proceedings ICSLP 2000, Beijing, China, October
    • Zhou B. W. and Hansen J. H. L. (2000) "Unsupervised Audio Stream Segmentation and Clustering via the Bayesian Information Criterion", Proceedings ICSLP 2000, Beijing, China, October.
    • (2000)
    • Zhou, B.W.1    Hansen, J.H.L.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.