-
1
-
-
0032646977
-
Overview of audio information retrieval
-
J. Foote, “Overview of audio information retrieval,” Multimedia Systems, vol. 7, no. 1, pp. 2-10, 1999.
-
(1999)
Multimedia Systems
, vol.7
, Issue.1
, pp. 2-10
-
-
Foote, J.1
-
2
-
-
82255178542
-
-
Wiley-Blackwell, Oxford, UK
-
D. Wang and G. J. Brown, Computational Auditory Scene Analysis: Principles, Algorithms and Applications, Wiley-Blackwell, Oxford, UK, 2006.
-
(2006)
Computational Auditory Scene Analysis: Principles, Algorithms and Applications
-
-
Wang, D.1
Brown, G.J.2
-
3
-
-
85023704013
-
A qualitative study investigating automated identification of living macrolepidoptera using the Digital Automated Identification SYstem (DAISY)
-
A. T. Watson, M. A. O’Neill, and I. J. Kitching, “A qualitative study investigating automated identification of living macrolepidoptera using the Digital Automated Identification SYstem (DAISY),” Systematics and Biodiversity, vol. 1, no. 3, pp. 287-300, 2003.
-
(2003)
Systematics and Biodiversity
, vol.1
, Issue.3
, pp. 287-300
-
-
Watson, A.T.1
O’Neill, M.A.2
Kitching, I.J.3
-
4
-
-
1942423755
-
Automated species identification: Why not?
-
K. J. Gaston and M. A. O’Neill, “Automated species identification: why not?” Philosophical Transactions of the Royal Society B, vol. 359, no. 1444, pp. 655-667, 2004.
-
(2004)
Philosophical Transactions of the Royal Society B
, vol.359
, Issue.1444
, pp. 655-667
-
-
Gaston, K.J.1
O’Neill, M.A.2
-
5
-
-
27844543811
-
Automatic recognition of animal vocalizations using averaged MFCC and linear discriminant analysis
-
C.-H. Lee, C.-H. Chou, C.-C. Han, and R.-Z. Huang, “Automatic recognition of animal vocalizations using averaged MFCC and linear discriminant analysis,” Pattern Recognition Letters, vol. 27, no. 2, pp. 93-101, 2006.
-
(2006)
Pattern Recognition Letters
, vol.27
, Issue.2
, pp. 93-101
-
-
Lee, C.-H.1
Chou, C.-H.2
Han, C.-C.3
Huang, R.-Z.4
-
6
-
-
41849096779
-
Audio events detection in public transport vehicles
-
Toronto, Canada
-
J.-L. Rouas, J. Louradour, and S. Ambellouis, “Audio events detection in public transport vehicles,” in Proceedings of IEEE Intelligent Transportation System Conference, Toronto, Canada, 2006.
-
Proceedings of IEEE Intelligent Transportation System Conference
, pp. 2006
-
-
Rouas, J.-L.1
Louradour, J.2
Ambellouis, S.3
-
7
-
-
0036648502
-
Musical genre classification of audio signals
-
G. Tzanetakis and P. Cook, “Musical genre classification of audio signals,” IEEE Transactions on Speech and Audio Processing, vol. 10, no. 5, pp. 293-302, 2002.
-
(2002)
IEEE Transactions on Speech and Audio Processing
, vol.10
, Issue.5
, pp. 293-302
-
-
Tzanetakis, G.1
Cook, P.2
-
8
-
-
33947285571
-
Content analysis for acoustic environment classification in mobile robots
-
Arlington, Va, USA
-
S. Chu, S. Narayanan, and C.-C. Jay Kuo, “Content analysis for acoustic environment classification in mobile robots,” in Proceedings of the AAAI Fall Symposium, Aurally Informed Performance: Integrating Machine Listening and Auditory Presentation in Robotic Systems, Arlington, Va, USA, 2006.
-
(2006)
Proceedings of the AAAI Fall Symposium, Aurally Informed Performance: Integrating Machine Listening and Auditory Presentation in Robotic Systems
-
-
Chu, S.1
Narayanan, S.2
Jay Kuo, C.-C.3
-
9
-
-
0035364397
-
MPEG-7 sound-recognition tools
-
M. Casey, “MPEG-7 sound-recognition tools,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 11, no. 6, pp. 737-747, 2001.
-
(2001)
IEEE Transactions on Circuits and Systems for Video Technology
, vol.11
, Issue.6
, pp. 737-747
-
-
Casey, M.1
-
10
-
-
4544361760
-
Comparison of MPEG-7 audio spectrum projection features and MFCC applied to speaker recognition, sound classification and audio segmentation
-
Montreal, Canada, May
-
H. G. Kim and T. Sikora, “Comparison of MPEG-7 audio spectrum projection features and MFCC applied to speaker recognition, sound classification and audio segmentation,” in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ’04),Montreal, Canada, May 2004.
-
(2004)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ’04)
-
-
Kim, H.G.1
Sikora, T.2
-
11
-
-
0030242072
-
Content-based classification, search, and retrieval of audio
-
E. Wold, T. Blum, D. Keislar, and J. Wheaton, “Content-based classification, search, and retrieval of audio,” IEEEMultimedia, vol. 3, no. 3, pp. 27-36, 1996.
-
(1996)
Ieeemultimedia
, vol.3
, Issue.3
, pp. 27-36
-
-
Wold, E.1
Blum, T.2
Keislar, D.3
Wheaton, J.4
-
12
-
-
0037506426
-
Content-based classification and retrieval of audio
-
Architectures, and Implementations VIII, Proceedings of SPIE, San Diego, Calif, USA, July
-
T. Zhang and C.-C. J. Kuo, “Content-based classification and retrieval of audio,” in Proceedings of the 43rd Annual Conference on Advanced Signal Processing Algorithms, Architectures, and Implementations VIII, Proceedings of SPIE, San Diego, Calif, USA, July 1998.
-
(1998)
Proceedings of the 43Rd Annual Conference on Advanced Signal Processing Algorithms
-
-
Zhang, T.1
Kuo, C.-C.J.2
-
13
-
-
52049100783
-
Audio signal feature extraction and classification using local discriminant bases
-
K. Umapathy, S. Krishnan, and R. K. Rao, “Audio signal feature extraction and classification using local discriminant bases,” IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 4, pp. 1236-1246, 2007.
-
(2007)
IEEE Transactions on Audio, Speech and Language Processing
, vol.15
, Issue.4
, pp. 1236-1246
-
-
Umapathy, K.1
Krishnan, S.2
Rao, R.K.3
-
14
-
-
49549085544
-
Temporal feature integration for music genre classification
-
A. Meng, P. Ahrendt, J. Larsen, and L. K. Hansen, “Temporal feature integration for music genre classification,” IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 5, pp. 1654-1664, 2007.
-
(2007)
IEEE Transactions on Audio, Speech and Language Processing
, vol.15
, Issue.5
, pp. 1654-1664
-
-
Meng, A.1
Ahrendt, P.2
Larsen, J.3
Hansen, L.K.4
-
15
-
-
70350482320
-
Temporal integration for audio classification with application to musical instrument classification
-
C. Joder, S. Essid, and G. Richard, “Temporal integration for audio classification with application to musical instrument classification,” IEEE Transactions on Audio, Speech and Language Processing, vol. 17, no. 1, pp. 174-186, 2009.
-
(2009)
IEEE Transactions on Audio, Speech and Language Processing
, vol.17
, Issue.1
, pp. 174-186
-
-
Joder, C.1
Essid, S.2
Richard, G.3
-
16
-
-
0024610919
-
Tutorial on hidden Markov models and selected applications in speech recognition
-
L. R. Rabiner, “Tutorial on hidden Markov models and selected applications in speech recognition,” Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, 1989.
-
(1989)
Proceedings of the IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
18
-
-
47649094799
-
Perceptually motivated wavelet packet transformfor bioacoustic signal enhancement
-
Y. Ren, M. T. Johnson, and J. Tao, “Perceptually motivated wavelet packet transformfor bioacoustic signal enhancement,” Journal of the Acoustical Society of America, vol. 124, no. 1, pp. 316-327, 2008.
-
(2008)
Journal of the Acoustical Society of America
, vol.124
, Issue.1
, pp. 316-327
-
-
Ren, Y.1
Johnson, M.T.2
Tao, J.3
-
19
-
-
1542680533
-
A practical guide to wavelet analysis
-
C. Torrence and G. P. Compo, “A practical guide to wavelet analysis,” Bulletin of the American Meteorological Society, vol. 79, no. 1, pp. 61-78, 1998.
-
(1998)
Bulletin of the American Meteorological Society
, vol.79
, Issue.1
, pp. 61-78
-
-
Torrence, C.1
Compo, G.P.2
-
20
-
-
44949087900
-
A wavelet-based parameterization for speech/music segmentation
-
Pittsburg, Pa, USA, September
-
E. Didiot, I. Illina, O. Mella, D. Fohr, and J.-P. Haton, “A wavelet-based parameterization for speech/music segmentation,” in Proceedings of the European Conference on Speech Communication and Technology (Interspeech ’06), Pittsburg, Pa, USA, September 2006.
-
(2006)
Proceedings of the European Conference on Speech Communication and Technology (Interspeech ’06)
-
-
Didiot, E.1
Illina, I.2
Mella, O.3
Fohr, D.4
Haton, J.-P.5
-
22
-
-
0032626552
-
Best wavelet-packet bases for audio coding using perceptual and rate-distortion criteria
-
Phoenix, Ariz, USA, March
-
M. Erne, G. Moschytz, and C. Faller, “Best wavelet-packet bases for audio coding using perceptual and rate-distortion criteria,” in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ’99), Phoenix, Ariz, USA, March 1999.
-
(1999)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ’99)
-
-
Erne, M.1
Moschytz, G.2
Faller, C.3
-
23
-
-
10044279084
-
Perceptual criterion fragile audio watermarking using adaptive wavelet packets
-
Cambridge, UK, August
-
X. Quan and H. Zhang, “Perceptual criterion fragile audio watermarking using adaptive wavelet packets,” in Proceedings of the International Conference on Pattern Recognition (ICPR ’04), Cambridge, UK, August 2004.
-
Proceedings of the International Conference on Pattern Recognition (ICPR ’04)
, pp. 2004
-
-
Quan, X.1
Zhang, H.2
-
24
-
-
48349143474
-
Waveprint: Efficient wavelet-based audio fingerprinting
-
S. Baluja and M. Covell, “Waveprint: efficient wavelet-based audio fingerprinting,” Pattern Recognition, vol. 41, no. 11, pp. 3467-3480, 2008.
-
(2008)
Pattern Recognition
, vol.41
, Issue.11
, pp. 3467-3480
-
-
Baluja, S.1
Covell, M.2
-
25
-
-
0000665811
-
Critical bands
-
J. V. Tobias, Ed, Academic Press, New York, NY, USA
-
B. Scharf, “Critical bands,” in Foundations of Modern Auditory Theory, J. V. Tobias, Ed., vol. 1, pp. 157-202, Academic Press, New York, NY, USA, 1970.
-
(1970)
Foundations of Modern Auditory Theory
, vol.1
, pp. 157-202
-
-
Scharf, B.1
-
26
-
-
0004194950
-
-
Academic Press, New York, NY, USA, 3rd edition
-
W. A. Yost, Fundamentals of Hearing, Academic Press, New York, NY, USA, 3rd edition, 1994.
-
(1994)
Fundamentals of Hearing
-
-
Yost, W.A.1
-
27
-
-
85004122351
-
-
Torch Machine Learning Library, http://www.torch.ch.
-
-
-
-
28
-
-
34547645414
-
The bag-offrames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music
-
J.-J. Aucouturier, B. Defreville, and F. Pachet, “The bag-offrames approach to audio pattern recognition: a sufficient model for urban soundscapes but not for polyphonic music,” Journal of the Acoustical Society of America, vol. 122, no. 2, pp. 881-891, 2007.
-
(2007)
Journal of the Acoustical Society of America
, vol.122
, Issue.2
, pp. 881-891
-
-
Aucouturier, J.-J.1
Defreville, B.2
Pachet, F.3
-
29
-
-
85087232259
-
Frame level noise classification in mobile environments
-
Phoenix, Ariz, USA, March
-
K. E. Maleh, A. Samouelian, and P. Kabal, “Frame level noise classification in mobile environments,” in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ’09), Phoenix, Ariz, USA, March 2009.
-
(2009)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ’09)
-
-
Maleh, K.E.1
Samouelian, A.2
Kabal, P.3
-
30
-
-
34547136108
-
Audio analysis using the discrete wavelet transform
-
Skiathos, Greece, September
-
G. Tzanetakis, G. Essl, and P. Cook, “Audio analysis using the discrete wavelet transform,” in Proceedings of the WSES International Conference on Acoustics and Music: Theory Applications, Skiathos, Greece, September 2001.
-
(2001)
Proceedings of the WSES International Conference on Acoustics and Music: Theory Applications
-
-
Tzanetakis, G.1
Essl, G.2
Cook, P.3
-
31
-
-
20444491279
-
Automatic classification of speech and music using neural networks
-
Washington, DC, USA, November
-
M. K. S. Khan, W. G. Al-Khatib, and M. Moinuddin, “Automatic classification of speech and music using neural networks,” in Proceedings of the 2nd International Workshop on Multimedia Databases, Washington, DC, USA, November 2004.
-
(2004)
Proceedings of the 2Nd International Workshop on Multimedia Databases
-
-
Khan, M.K.S.1
Al-Khatib, W.G.2
Moinuddin, M.3
-
33
-
-
0002537922
-
Algorithm 808: ARFIT— a Matlab package for the estimation of parameters and eigenmodes of multivariate autoregressive models
-
T. Schneider and A. Neumaier, “Algorithm 808: ARFIT— a Matlab package for the estimation of parameters and eigenmodes of multivariate autoregressive models,” ACM Transactions on Mathematical Software, vol. 27, no. 1, pp. 58-65, 2001.
-
(2001)
ACM Transactions on Mathematical Software
, vol.27
, Issue.1
, pp. 58-65
-
-
Schneider, T.1
Neumaier, A.2
-
34
-
-
0032762471
-
A statistical model-based voice activity detection
-
J. Sohn, N. S. Kim, and W. Sung, “A statistical model-based voice activity detection,” IEEE Signal Processing Letters, vol. 6, no. 1, pp. 1-3, 1999.
-
(1999)
IEEE Signal Processing Letters
, vol.6
, Issue.1
, pp. 1-3
-
-
Sohn, J.1
Kim, N.S.2
Sung, W.3
-
35
-
-
0003957032
-
-
San Francisco, Calif, USA, 2nd edition
-
I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufmann, San Francisco, Calif, USA, 2nd edition, 2005.
-
(2005)
Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufmann
-
-
Witten, I.H.1
Frank, E.2
|