Volume 42, Issue 4, 2014, Pages 637-661

A survey of tagging techniques for music, speech and environmental sound

Author keywords

Automatic tagging; Environmental sound tagging; Manual tagging; Music tagging; Semi automatic tagging; Sound tagging; Speech recognition

Indexed keywords

SURVEYS;

EID: 84920251124     PISSN: 02692821     EISSN: 15737462     Source Type: Journal    
DOI: 10.1007/s10462-012-9362-y     Document Type: Article
Times cited : (20)

References (81)
  • 4
    • 68149121920
    • An efficient code for environmental sound classification
    • Arora R, Lutfi RA (2009) An efficient code for environmental sound classification. J Acoust Soc Am 126: 7
    • (2009) J Acoust Soc Am , vol.126 , pp. 7
    • Arora, R.1    Lutfi, R.A.2
  • 5
    • 77955555508
    • Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring
    • Bardeli R, Wolff D, Kurth F, Koch M, Tauchert KH, Frommolt KH (2010) Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring. Pattern Recognit Lett 31(12): 1524–1534
    • (2010) Pattern Recognit Lett , vol.31 , Issue.12 , pp. 1524-1534
    • Bardeli, R.1    Wolff, D.2    Kurth, F.3    Koch, M.4    Tauchert, K.H.5    Frommolt, K.H.6
  • 7
    • 80051647410
    • Automatic tagging of audio: the state-of-the-art
    • Bertin-Mahieux T, Eck D, Mandel M (2011) Automatic tagging of audio: the state-of-the-art. In: Machine audition: principles, algorithms and systems. IGI Global, pp 334–352
    • (2011) Machine audition: principles, algorithms and systems , pp. 334-352
    • Bertin-Mahieux, T.1    Eck, D.2    Mandel, M.3
  • 8
    • 77955227370
    • Bridging the gap between tagging and querying vocabularies: analyses and applications for enhancing multimedia IR
    • Bischoff K, Firan CS, Nejdl W, Paiu R (2010) Bridging the gap between tagging and querying vocabularies: analyses and applications for enhancing multimedia IR. Web semantics: science, services and agents on the world wide web
    • (2010) Web semantics: science, services and agents on the world wide web
    • Bischoff, K.1    Firan, C.S.2    Nejdl, W.3    Paiu, R.4
  • 9
    • 72249096328
    • Automated sound recording and analysis techniques for bird surveys and conservation
    • Brandes ST (2008) Automated sound recording and analysis techniques for bird surveys and conservation. Bird Conserv Int (SupplementS1) 18: S163–S173
    • (2008) Bird Conserv Int (SupplementS1) , vol.18 , pp. S163-S173
    • Brandes, S.T.1
  • 10
    • 33750315192
    • Using image processing to detect and classify narrow-band cricket and frog calls
    • Brandes T, Naskrecki P, Figueroa H (2006) Using image processing to detect and classify narrow-band cricket and frog calls. J Acoust Soc Am 120: 2950–2957
    • (2006) J Acoust Soc Am , vol.120 , pp. 2950-2957
    • Brandes, T.1    Naskrecki, P.2    Figueroa, H.3
  • 11
    • 77951191695
    • Audio classification of bird species: a statistical manifold approach
    • Briggs F, Raich R, Fern XZ (6–9 Dec 2009) Audio classification of bird species: a statistical manifold approach. Paper presented at the data mining, 2009. ICDM ’09. Ninth IEEE international conference on
    • (2009) Data mining, 2009. ICDM ’09. Ninth IEEE international conference on
    • Briggs, F.1    Raich, R.2    Fern, X.Z.3
  • 16
    • 33750282566
    • Semi-automatic classification of bird vocalizations using spectral peak tracks
    • Chen Z, Maher RC (2006) Semi-automatic classification of bird vocalizations using spectral peak tracks. J Acoust Soc Am 120(5): 2974–2984
    • (2006) J Acoust Soc Am , vol.120 , Issue.5 , pp. 2974-2984
    • Chen, Z.1    Maher, R.C.2
  • 17
    • 78049393889
    • A call-independent and automatic acoustic system for the individual recognition of animals: a novel model using four passerines
    • Cheng J, Sun Y, Ji L (2010) A call-independent and automatic acoustic system for the individual recognition of animals: a novel model using four passerines. Pattern Recognit 43(11): 3846–3852
    • (2010) Pattern Recognit , vol.43 , Issue.11 , pp. 3846-3852
    • Cheng, J.1    Sun, Y.2    Ji, L.3
  • 20
    • 0042830801
    • Comparison of techniques for environmental sound recognition
    • Cowling M, Sitte R (2003) Comparison of techniques for environmental sound recognition. Pattern Recognit Lett 24(15): 2895–2907
    • (2003) Pattern Recognit Lett , vol.24 , Issue.15 , pp. 2895-2907
    • Cowling, M.1    Sitte, R.2
  • 21
    • 58349089673
    • Classification of audio signals using SVM and RBFNN
    • Dhanalakshmi P, Palanivel S, Ramalingam V (2009) Classification of audio signals using SVM and RBFNN. Expert Syst Appl 36(3 Part 2): 6069–6075
    • (2009) Expert Syst Appl , vol.36 , pp. 6069-6075
    • Dhanalakshmi, P.1    Palanivel, S.2    Ramalingam, V.3
  • 22
    • 84863279149
    • Acoustic component detection for automatic species recognition in environmental monitoring
    • Duan S, Towsey M, Zhang J, Truskinger A, Wimmer J, Roe P (6–9 Dec 2011) Acoustic component detection for automatic species recognition in environmental monitoring. Paper presented at the intelligent sensors, sensor networks and information processing (ISSNIP), 2011 seventh international conference on
    • (2011) Intelligent sensors, sensor networks and information processing (ISSNIP), 2011 seventh international conference on
    • Duan, S.1    Towsey, M.2    Zhang, J.3    Truskinger, A.4    Wimmer, J.5    Roe, P.6
  • 23
    • 0034270644
    • Audio-visual speech modeling for continuous speech recognition
    • Dupont S, Luettin J (2000) Audio-visual speech modeling for continuous speech recognition. IEEE Trans Multimed 2(3): 141–151
    • (2000) IEEE Trans Multimed , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 26
    • 38049069776
    • Fifty years of progress in speech and speaker recognition
    • Furui S (2004) Fifty years of progress in speech and speaker recognition. Acoust Soc Am J 116(4): 2497–2498
    • (2004) Acoust Soc Am J , vol.116 , Issue.4 , pp. 2497-2498
    • Furui, S.1
  • 28
    • 85016708928
    • Content-based classification and retrieval of wild animal sounds using feature selection algorithm
    • Gunasekaran S, Revathy K (2010) Content-based classification and retrieval of wild animal sounds using feature selection algorithm. Paper presented at the machine learning and computing (ICMLC), 2010 second international conference on
    • (2010) Machine learning and computing (ICMLC), 2010 second international conference on
    • Gunasekaran, S.1    Revathy, K.2
  • 31
    • 56349111817
    • Frog classification using machine learning techniques
    • Huang C-J, Yang Y-J, Yang D-X, Chen Y-J (2009) Frog classification using machine learning techniques. Expert Syst Appl 36(2 Part 2): 3737–3743
    • (2009) Expert Syst Appl , vol.36 , pp. 3737-3743
    • Huang, C.-J.1    Yang, Y.-J.2    Yang, D.-X.3    Chen, Y.-J.4
  • 32
    • 84873445412
    • MoodSwings: a collaborative game for music mood label
    • Kim YE, Schmidt E, Emelle L (2008) MoodSwings: a collaborative game for music mood label. ISMIR ’08: 231–236
    • (2008) ISMIR ’08 , pp. 231-236
    • Kim, Y.E.1    Schmidt, E.2    Emelle, L.3
  • 33
    • 9444222712
    • Processing of Musical Data Employing Rough Sets and Artificial Neural Networks
    • Springer, Berlin
    • Kostek B, Szczuko P, Zwan P (2004) Processing of Musical Data Employing Rough Sets and Artificial Neural Networks. In: Tsumoto S, Slowinski R (eds) Rough sets and current trends in computing. Springer, Berlin 3066: pp 539–548
    • (2004) Rough sets and current trends in computing , vol.3066 , pp. 539-548
    • Kostek, B.1    Szczuko, P.2    Zwan, P.3
  • 35
    • 4544345188
    • Bird classification algorithms: theory and experimental results
    • Kwan C, Mei G, Zhao X, Ren Z, Xu R, Stanford V et al (17–21 May 2004) Bird classification algorithms: theory and experimental results. Paper presented at the IEEE international conference on acoustics, speech, and signal processing, 2004. Proceedings (ICASSP ’04)
    • (2004) IEEE international conference on acoustics, speech, and signal processing, 2004. Proceedings (ICASSP ’04)
    • Kwan, C.1    Mei, G.2    Zhao, X.3    Ren, Z.4    Xu, R.5    Stanford, V.6
  • 36
    • 77950819438
    • A syllable-level probabilistic framework for bird species identification
    • Lakshminarayanan B, Raich R, Fern X (13–15 Dec 2009) A syllable-level probabilistic framework for bird species identification. Paper presented at the machine learning and applications, 2009. ICMLA ’09. International conference on
    • (2009) Machine learning and applications, 2009. ICMLA ’09. International conference on
    • Lakshminarayanan, B.1    Raich, R.2    Fern, X.3
  • 37
    • 84920252097
    • Monitoring the environment through acoustics using smartphone-based sensors and 3G networking
    • Lau A, Mason R, Pham B, Richards M, Roe P, Zhang J (11–14 June 2008) Monitoring the environment through acoustics using smartphone-based sensors and 3G networking. Paper presented at the proceedings of the second international workshop on wireless sensor network deployments (WiDeploy08); 4th IEEE international conference on distributed computing in sensor systems, DCOSS 2008, Greece
    • (2008) 4th IEEE international conference on distributed computing in sensor systems
  • 38
    • 84873647698
    • Evaluation of algorithms using games: the case of music tagging
    • Law E, West K, Mandel M, Bay M, Downie JS (2009) Evaluation of algorithms using games: the case of music tagging. Evaluation, pp 387–392
    • (2009) Evaluation , pp. 387-392
    • Law, E.1    West, K.2    Mandel, M.3    Bay, M.4    Downie, J.S.5
  • 39
    • 63049114780
    • Music information retrieval using social tags and audio
    • Levy M, Sandler M (2009) Music information retrieval using social tags and audio. Multimed IEEE Trans 11(3): 383–395
    • (2009) Multimed IEEE Trans , vol.11 , Issue.3 , pp. 383-395
    • Levy, M.1    Sandler, M.2
  • 40
    • 72949103915
    • On the suitability of state-of-the-art music information retrieval methods for analyzing, categorizing and accessing non-Western and ethnic music collections
    • Lidy T, Silla CN Jr, Cornelis O, Gouyon F, Rauber A, Kaestner CAA et al (2010) On the suitability of state-of-the-art music information retrieval methods for analyzing, categorizing and accessing non-Western and ethnic music collections. Signal Process 90(4): 1032–1048
    • (2010) Signal Process , vol.90 , Issue.4 , pp. 1032-1048
    • Lidy, T.1    Silla, C.N.2    Cornelis, O.3    Gouyon, F.4    Rauber, A.5    Kaestner, C.A.A.6
  • 46
    • 77949270178
    • Social tagging in recommender systems: a survey of the state-of-the-art and possible extensions
    • Milicevic A, Nanopoulos A, Ivanovic M (2010) Social tagging in recommender systems: a survey of the state-of-the-art and possible extensions. Artif Intell Rev 33(3): 187–209
    • (2010) Artif Intell Rev , vol.33 , Issue.3 , pp. 187-209
    • Milicevic, A.1    Nanopoulos, A.2    Ivanovic, M.3
  • 47
    • 85016707935
    • Improving auto-tagging by modeling semantic co-occurrences
    • Miotto R, Barrington L, Lanckriet G (2010) Improving auto-tagging by modeling semantic co-occurrences. Paper presented at the international society of music information retrieval conference, Utrecht
    • (2010) International society of music information retrieval conference, Utrecht
    • Miotto, R.1    Barrington, L.2    Lanckriet, G.3
  • 48
    • 84920260591
    • Discrimination and retrieval of animal sounds
    • Mitrovic D, Zeppelzauer M, Breiteneder C (2006) Discrimination and retrieval of animal sounds
    • (2006)
    • Mitrovic, D.1    Zeppelzauer, M.2    Breiteneder, C.3
  • 49
    • 85016678785
    • On feature selection in environmental sound recognition
    • Mitrovic D, Zeppelzauer M, Eidenberger H (2009) On feature selection in environmental sound recognition. Paper presented at the ELMAR, 2009. ELMAR ’09. International symposium
    • (2009) ELMAR, 2009. ELMAR ’09. International symposium
    • Mitrovic, D.1    Zeppelzauer, M.2    Eidenberger, H.3
  • 54
    • 77954634120
    • Tag integrated multi-label music style classification with hypergraph
    • Wang F, Wang X, Shao B, Li T, Ogihara M (2009) Tag integrated multi-label music style classification with hypergraph. In: Proceedings of the 10th international society for music information retrieval conference, pp 363–368
    • (2009) Proceedings of the 10th international society for music information retrieval conference , pp. 363-368
    • Wang, F.1    Wang, X.2    Shao, B.3    Li, T.4    Ogihara, M.5
  • 55
    • 84885708772
    • Advanced data mining techniques, 1st edn. Springer, p 138
    • Olson DL, Delen D (2008) Advanced data mining techniques, 1st edn. Springer, p 138, ISBN 3540769161
    • (2008) ISBN 3540769161
    • Olson, D.L.1    Delen, D.2
  • 56
    • 33846152329
    • Music retrieval: a tutorial and review
    • Orio N (2006) Music retrieval: a tutorial and review. Found Trends Inf Retr 1(1): 1–96
    • (2006) Found Trends Inf Retr , vol.1 , Issue.1 , pp. 1-96
    • Orio, N.1
  • 66
    • 32044455069
    • Classification of acoustic events using SVM-based clustering schemes
    • Temko A, Nadeu C (2006) Classification of acoustic events using SVM-based clustering schemes. Pattern Recognit 39(4): 682–694
    • (2006) Pattern Recognit , vol.39 , Issue.4 , pp. 682-694
    • Temko, A.1    Nadeu, C.2
  • 67
    • 68749105565
    • Acoustic event detection in meeting-room environments
    • Temko A, Nadeu C (2009) Acoustic event detection in meeting-room environments. Pattern Recognit Lett 30(14): 1281–1288
    • (2009) Pattern Recognit Lett , vol.30 , Issue.14 , pp. 1281-1288
    • Temko, A.1    Nadeu, C.2
  • 68
    • 70349705475
    • Lightweight acoustic classification for cane-toad monitoring
    • Thanh D, Bulusu N, Wen H (2008) Lightweight acoustic classification for cane-toad monitoring. Paper presented at the signals, systems and computers, 42nd Asilomar conference on signal processing
    • (2008) Signals, systems and computers, 42nd Asilomar conference on signal processing
    • Thanh, D.1    Bulusu, N.2    Wen, H.3
  • 71
    • 84920254863
    • Large scale participatory acoustic sensor data analysis: tools and reputation models to enhance effectiveness
    • Truskinger AM, Yang H, Wimmer J, Zhang J, Williamson I, Roe P (2011) Large scale participatory acoustic sensor data analysis: tools and reputation models to enhance effectiveness
  • 73
    • 0036648502
    • Musical genre classification of audio signals
    • Tzanetakis G, Cook P (2002) Musical genre classification of audio signals. Speech Audio Process IEEE Trans 10(5): 293–302
    • (2002) Speech Audio Process IEEE Trans , vol.10 , Issue.5 , pp. 293-302
    • Tzanetakis, G.1    Cook, P.2
  • 74
    • 33748893216
    • Environmental sounds recognition system using the speech recognition system techniques
    • Uribe OA, Meana HMP, Miyatake MN (7–9 Sep 2005) Environmental sounds recognition system using the speech recognition system techniques. Paper presented at the electrical and electronics engineering, 2005 2nd international conference on
    • (2005) Electrical and electronics engineering, 2005 2nd international conference on
    • Uribe, O.A.1    Meana, H.M.P.2    Miyatake, M.N.3
  • 75
    • 85016693299
    • Data mining applied to acoustic bird species recognition
    • Vilches E, Escobar IA, Vallejo EE, Taylor CE (2006) Data mining applied to acoustic bird species recognition. Paper presented at the pattern recognition, 2006. ICPR 2006. 18th international conference on
    • (2006) Pattern recognition, 2006. ICPR 2006. 18th international conference on
    • Vilches, E.1    Escobar, I.A.2    Vallejo, E.E.3    Taylor, C.E.4
  • 76
    • 80051605016
    • Audio recognition in the wild: static and dynamic classification on a real-world database of animal vocalizations
    • Weninger F, Schuller B (2011) Audio recognition in the wild: static and dynamic classification on a real-world database of animal vocalizations. Paper presented at the acoustics, speech and signal processing (ICASSP), 2011 IEEE international conference on
    • (2011) Acoustics, speech and signal processing (ICASSP), 2011 IEEE international conference on
    • Weninger, F.1    Schuller, B.2
  • 78
    • 0030242072
    • Content-based classification, search, and retrieval of audio
    • Wold E, Blum T, Keislar D, Wheaten J (1996) Content-based classification, search, and retrieval of audio. Multimed IEEE 3(3): 27–36
    • (1996) Multimed IEEE , vol.3 , Issue.3 , pp. 27-36
    • Wold, E.1    Blum, T.2    Keislar, D.3    Wheaten, J.4
  • 79
    • 85016695147
    • Using reputation management in participatory sensing for data classification
    • Yang H, Zhang J, Roe P (2011) Using reputation management in participatory sensing for data classification. Paper presented at the proceedings of 2nd international conference on ambient systems, networks and technologies
    • (2011) Proceedings of 2nd international conference on ambient systems, networks and technologies
    • Yang, H.1    Zhang, J.2    Roe, P.3
  • 80
    • 35148837275
    • Comparison of pattern recognition techniques for the classification of impact acoustic emissions
    • Yella S, Gupta NK, Dougherty MS (2007) Comparison of pattern recognition techniques for the classification of impact acoustic emissions. Transp Res Part C Emerg Technol 15(6): 345–360
    • (2007) Transp Res Part C Emerg Technol , vol.15 , Issue.6 , pp. 345-360
    • Yella, S.1    Gupta, N.K.2    Dougherty, M.S.3
  • 81
    • 85016696328
    • Eco-environmental sound classification based on matching pursuit and support vector machine
    • Yong L, Ying L (25–26 Dec 2010) Eco-environmental sound classification based on matching pursuit and support vector machine. Paper presented at the information engineering and computer science (ICIECS), 2010 2nd international conference on
    • (2010) Information engineering and computer science (ICIECS), 2010 2nd international conference on
    • Yong, L.1    Ying, L.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.