-
1
-
-
84863730848
-
Scream and gunshot detection in noisy environments
-
Poznan, Poland Sep. 3-7
-
L. Gerosa, G. Valenzise, F. Antonacci, M. Tagliasacchi, and A. Sarti, "Scream and gunshot detection in noisy environments," in Proc. 15th Eur. Signal Process. Conf., Poznan, Poland, Sep. 3-7, 2007.
-
(2007)
Proc. 15th Eur. Signal Process. Conf.
-
-
Gerosa, L.1
Valenzise, G.2
Antonacci, F.3
Tagliasacchi, M.4
Sarti, A.5
-
2
-
-
80051605016
-
Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations
-
F. Weninger and B. Schuller, "Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2011, pp. 337-340.
-
(2011)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
, pp. 337-340
-
-
Weninger, F.1
Schuller, B.2
-
3
-
-
68149163531
-
Environmental sound recognition with time-frequency audio features
-
Aug
-
S. Chu, S. Narayanan, and C. Kuo, "Environmental sound recognition with time-frequency audio features," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 6, pp. 1142-1158, Aug. 2009.
-
(2009)
IEEE Trans. Audio, Speech, Lang. Process
, vol.17
, Issue.6
, pp. 1142-1158
-
-
Chu, S.1
Narayanan, S.2
Kuo, C.3
-
4
-
-
85008548582
-
Time-frequency matrix feature extraction and classification of environmental audio signals
-
Sep.
-
B. Ghoraani and S. Krishnan, "Time-frequency matrix feature extraction and classification of environmental audio signals," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2197-2209, Sep. 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process
, vol.19
, Issue.7
, pp. 2197-2209
-
-
Ghoraani, B.1
Krishnan, S.2
-
5
-
-
85032753469
-
Machine hearing: An emerging field
-
Sep.
-
R. Lyon, "Machine hearing: An emerging field," IEEE Signal Process. Mag., vol. 27, no. 5, pp. 131-139, Sep. 2010.
-
(2010)
IEEE Signal Process. Mag
, vol.27
, Issue.5
, pp. 131-139
-
-
Lyon, R.1
-
6
-
-
78650982481
-
Spectrogram image feature for sound event classification in mismatched conditions
-
J. Dennis, H. Tran, and H. Li, "Spectrogram image feature for sound event classification in mismatched conditions," IEEE Signal Process. Lett., vol. 18, no. 2, pp. 130-133, 2011.
-
(2011)
IEEE Signal Process. Lett
, vol.18
, Issue.2
, pp. 130-133
-
-
Dennis, J.1
Tran, H.2
Li, H.3
-
8
-
-
85032752225
-
Missing-feature approaches in speech recognition
-
DOI 10.1109/MSP.2005.1511828
-
B. Raj and R. Stern, "Missing-feature approaches in speech recognition," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 101-116, Sep. 2005. (Pubitemid 41488524)
-
(2005)
IEEE Signal Processing Magazine
, vol.22
, Issue.5
, pp. 101-116
-
-
Raj, B.1
Stern, R.M.2
-
9
-
-
84865804537
-
Image representation of the subband power distribution for robust sound classification
-
Aug
-
J. Dennis, H. Tran, and H. Li, "Image representation of the subband power distribution for robust sound classification," in Proc. 12th Annu. Conf. Int. Speech Commun. Assoc., Aug. 2011, pp. 2437-2440.
-
(2011)
Proc. 12th Annu. Conf. Int. Speech Commun. Assoc.
, pp. 2437-2440
-
-
Dennis, J.1
Tran, H.2
Li, H.3
-
10
-
-
0042830801
-
Comparison of techniques for environmental sound recognition
-
DOI 10.1016/S0167-8655(03)00147-8
-
M. Cowling and R. Sitte, "Comparison of techniques for environmental sound recognition," Pattern Recognit. Lett., vol. 24, no. 15, pp. 2895-2907, 2003. (Pubitemid 37027809)
-
(2003)
Pattern Recognition Letters
, vol.24
, Issue.15
, pp. 2895-2907
-
-
Cowling, M.1
Sitte, R.2
-
12
-
-
34347345718
-
Parametric representations of bird sounds for automatic species recognition
-
Nov
-
P. Somervuo, A. Harma, and S. Fagerlund, "Parametric representations of bird sounds for automatic species recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 6, pp. 2252-2263, Nov. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.6
, pp. 2252-2263
-
-
Somervuo, P.1
Harma, A.2
Fagerlund, S.3
-
13
-
-
76949107820
-
Sound indexing using morphological description
-
Mar.
-
G. Peeters and E. Deruty, "Sound indexing using morphological description," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 675-687, Mar. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process
, vol.18
, Issue.3
, pp. 675-687
-
-
Peeters, G.1
Deruty, E.2
-
14
-
-
79957687384
-
Sound event recognition with probabilistic distance SVMs
-
Aug.
-
H. Tran and H. Li, "Sound event recognition with probabilistic distance SVMs," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 6, pp. 1556-1568, Aug. 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process
, vol.19
, Issue.6
, pp. 1556-1568
-
-
Tran, H.1
Li, H.2
-
15
-
-
14244272507
-
Methods for capturing spectro-temporal modulations in automatic speech recognition
-
M. Kleinschmidt, "Methods for capturing spectro-temporal modulations in automatic speech recognition," Acta Acustica united with Acustica, vol. 88, no. 3, pp. 416-422, 2002. (Pubitemid 34732124)
-
(2002)
Acta Acustica united with Acustica
, vol.88
, Issue.3
, pp. 416-422
-
-
Kleinschmidt, M.1
-
16
-
-
0033709098
-
Tandem connectionist feature extraction for conventional HMM systems
-
H. Hermansky, D. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP '00), 2000, vol. 3, pp. 1635-1638.
-
(2000)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP '00)
, vol.3
, pp. 1635-1638
-
-
Hermansky, H.1
Ellis, D.2
Sharma, S.3
-
19
-
-
70349205535
-
Audio classification from time-frequency texture
-
G. Yu and J. Slotine, "Audio classification from time-frequency texture," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP '09), 2009, pp. 1677-1680.
-
(2009)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP '09)
, pp. 1677-1680
-
-
Yu, G.1
Slotine, J.2
-
20
-
-
84863744672
-
Gradient-based musical feature extraction based on scale-invariant feature transform
-
T. Matsui, M. Goto, J. Vert, and Y. Uchiyama, "Gradient-based musical feature extraction based on scale-invariant feature transform," in Proc. 19th Eur. Signal Process. Conf., 2011, pp. 724-728.
-
(2011)
Proc. 19th Eur. Signal Process. Conf
, pp. 724-728
-
-
Matsui, T.1
Goto, M.2
Vert, J.3
Uchiyama, Y.4
-
21
-
-
0035342414
-
Robust automatic speech recognition with missing and unreliable acoustic data
-
DOI 10.1016/S0167-6393(00)00034-0, PII S0167639300000340
-
M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, no. 3, pp. 267-285, 2001. (Pubitemid 32284867)
-
(2001)
Speech Communication
, vol.34
, Issue.3
, pp. 267-285
-
-
Cooke, M.1
Green, P.2
Josifovski, L.3
Vizinho, A.4
-
22
-
-
4644317224
-
A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
-
M. Seltzer, B. Raj, and R. Stern, "A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition," Speech Commun., vol. 43, no. 4, pp. 379-393, 2004.
-
(2004)
Speech Commun
, vol.43
, Issue.4
, pp. 379-393
-
-
Seltzer, M.1
Raj, B.2
Stern, R.3
-
23
-
-
0141624530
-
An efficient auditory filterbank based on the gammatone function
-
R. Patterson, I. Nimmo-Smith, J. Holdsworth, and P. Rice, "An efficient auditory filterbank based on the gammatone function," APU Rep. 2341, 1988.
-
(1988)
APU Rep
, Rep. 2341
-
-
Patterson, R.1
Nimmo-Smith, I.2
Holdsworth, J.3
Rice, P.4
-
24
-
-
0003913694
-
An efficient implementation of the Patterson-Holdsworth auditory filter bank
-
Tech. Rep.
-
M. Slaney, "An efficient implementation of the Patterson-Holdsworth auditory filter bank," Apple Computer, 1993, Tech. Rep.
-
(1993)
Apple Computer
-
-
Slaney, M.1
-
25
-
-
0003626435
-
-
Upper Saddle River NJ Prentice-Hall ISBN 0-201-18075-8
-
R. Gonzalez and R. Woods, Digital Image Processing. Upper Saddle River, NJ: Prentice-Hall, 2002, ISBN 0-201-18075-8.
-
(2002)
Digital Image Processing
-
-
Gonzalez, R.1
Woods, R.2
-
26
-
-
3042535216
-
Distinctive image features from scale-invariant keypoints
-
D. Lowe, "Distinctive image features from scale-invariant keypoints," Int. J. Comput. Vis., vol. 60, no. 2, pp. 91-110, 2004.
-
(2004)
Int. J. Comput. Vis
, vol.60
, Issue.2
, pp. 91-110
-
-
Lowe, D.1
-
27
-
-
0018455310
-
Suppression of acoustic noise in speech using spectral subtraction
-
S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979. (Pubitemid 9467471)
-
(1979)
IEEE Trans Acoust Speech Signal Process
, vol.ASSP-27
, Issue.2
, pp. 113-120
-
-
Boll, S.F.1
-
28
-
-
0024753593
-
Speech recognition using noise-adaptive prototypes
-
DOI 10.1109/29.35387
-
A. Nádas, D. Nahamoo, and M. Picheny, "Speech recognition using noise-adaptive prototypes," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 10, pp. 1495-1503, Oct. 1989. (Pubitemid 20617876)
-
(1989)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.37
, Issue.10
, pp. 1495-1503
-
-
Nádas, A.1
Nahamoo, D.2
Picheny, M.A.3
-
29
-
-
51949090223
-
In defense of nearest-neighbor based image classification
-
O. Boiman, E. Shechtman, and M. Irani, "In defense of nearest-neighbor based image classification," in Proc. IEEE Comput. Vision, Pattern Recognit. (CVPR), 2008, pp. 1-8.
-
(2008)
Proc. IEEE Comput. Vision, Pattern Recognit. (CVPR)
, pp. 1-8
-
-
Boiman, O.1
Shechtman, E.2
Irani, M.3
-
30
-
-
84856621489
-
Hellinger distance decision trees are robust and skew-insensitive
-
D. A. Cieslak, T. R. Hoens, N. V. Chawla, and W. P. Kegelmeyer, "Hellinger distance decision trees are robust and skew-insensitive," Data Min. Knowl. Discov., pp. 136-158, 2012.
-
(2012)
Data Min. Knowl. Discov
, pp. 136-158
-
-
Cieslak, D.A.1
Hoens, T.R.2
Chawla, N.V.3
Kegelmeyer, W.P.4
-
31
-
-
78049391669
-
Acoustical sound database in real environments for sound scene understanding and hands-free speech recognition
-
S. Nakamura, K. Hiyane, F. Asano, T. Nishiura, and T. Yamada, "Acoustical sound database in real environments for sound scene understanding and hands-free speech recognition," in Proc. ICLRE, 2000, pp. 965-968.
-
(2000)
Proc. ICLRE
, pp. 965-968
-
-
Nakamura, S.1
Hiyane, K.2
Asano, F.3
Nishiura, T.4
Yamada, T.5
-
32
-
-
33745185408
-
-
version 1.1 ETSI STQ Aurora DSR Working Group Tech. Rep. ES 202 212
-
A. Sorin and T. Ramabadran, "Extended advanced front end algorithm description, version 1.1," ETSI STQ Aurora DSR Working Group, Tech. Rep. ES 202 212, 2003.
-
(2003)
Extended Advanced Front End Algorithm Description
, Tech. Rep. ES 202 212
-
-
Sorin, A.1
Ramabadran, T.2
-
33
-
-
0003822743
-
-
S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book.
-
The HTK Book
-
-
Young, S.1
Evermann, G.2
Kershaw, D.3
Moore, G.4
Odell, J.5
Ollason, D.6
Valtchev, V.7
Woodland, P.8
-
34
-
-
0027623210
-
Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
-
A. Varga and H. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Commun., vol. 12, no. 3, pp. 247-251, 1993.
-
(1993)
Speech Commun
, vol.12
, Issue.3
, pp. 247-251
-
-
Varga, A.1
Steeneken, H.2