-
2
-
-
68149163531
-
Environmental sound recognition with time-frequency audio features
-
Selina Chu, Shrikanth Narayanan, and CC Jay Kuo, "Environmental sound recognition with time-frequency audio features, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 6, pp. 1142-1158, 2009.
-
(2009)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.17
, Issue.6
, pp. 1142-1158
-
-
Chu, S.1
Narayanan, S.2
Jay Kuo, C.C.3
-
3
-
-
44249121489
-
Audio keywords generation for sports video analysis
-
Min Xu, Changsheng Xu, Lingyu Duan, Jesse S Jin, and Suhuai Luo, "Audio keywords generation for sports video analysis, " ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 4, no. 2, pp. 11, 2008.
-
(2008)
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)
, vol.4
, Issue.2
, pp. 11
-
-
Xu, M.1
Xu, C.2
Duan, L.3
Jin, J.S.4
Luo, S.5
-
4
-
-
84887056523
-
Context-dependent sound event detection
-
Toni Heittola, Annamaria Mesaros, Antti Eronen, and Tuomas Virtanen, "Context-dependent sound event detection, " EURASIP Journal on Audio, Speech, and Music Processing, vol. 2013, no. 1, pp. 1-13, 2013.
-
(2013)
EURASIP Journal on Audio, Speech, and Music Processing
, vol.2013
, Issue.1
, pp. 1-13
-
-
Heittola, T.1
Mesaros, A.2
Eronen, A.3
Virtanen, T.4
-
5
-
-
79959754926
-
Acoustic event detection in real life recordings
-
Annamaria Mesaros, Toni Heittola, Antti Eronen, and Tuomas Virtanen, "Acoustic event detection in real life recordings, " in 18th European Signal Processing Conference, 2010, pp. 1267-1271.
-
(2010)
18th European Signal Processing Conference
, pp. 1267-1271
-
-
Mesaros, A.1
Heittola, T.2
Eronen, A.3
Virtanen, T.4
-
6
-
-
84890450206
-
Supervised model training for overlapping sound events based on unsupervised source separation
-
Toni Heittola, Annamaria Mesaros, Tuomas Virtanen, and Moncef Gabbouj, "Supervised model training for overlapping sound events based on unsupervised source separation, " in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2013, pp. 8677-8681.
-
(2013)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 8677-8681
-
-
Heittola, T.1
Mesaros, A.2
Virtanen, T.3
Gabbouj, M.4
-
7
-
-
84865687106
-
NMF-based environmental sound source separation using time-variant gain features
-
Satoshi Innami and Hiroyuki Kasai, "NMF-based environmental sound source separation using time-variant gain features, " Computers & Mathematics with Applications, vol. 64, no. 5, pp. 1333-1342, 2012.
-
(2012)
Computers & Mathematics with Applications
, vol.64
, Issue.5
, pp. 1333-1342
-
-
Innami, S.1
Kasai, H.2
-
8
-
-
84880541040
-
Realtime detection of overlapping sound events with non-negative matrix factorization
-
Springer
-
Arnaud Dessein, Arshia Cont, and Guillaume Lemaitre, "Realtime detection of overlapping sound events with non-negative matrix factorization, " in Matrix Information Geometry, pp. 341-371. Springer, 2013.
-
(2013)
Matrix Information Geometry
, pp. 341-371
-
-
Dessein, A.1
Cont, A.2
Lemaitre, G.3
-
10
-
-
84946032854
-
Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations
-
Annamaria Mesaros, Toni Heittola, Onur Dikmen, and Tuomas Virtanen, "Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations, " in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015, pp. 606-618.
-
(2015)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 606-618
-
-
Mesaros, A.1
Heittola, T.2
Dikmen, O.3
Virtanen, T.4
-
11
-
-
84876157594
-
Overlapping sound event recognition using local spectrogram features and the generalised hough transform
-
Jonathan Dennis, Huy Dat Tran, and Eng Siong Chng, "Overlapping sound event recognition using local spectrogram features and the generalised hough transform, " Pattern Recognition Letters, vol. 34, no. 9, pp. 1085-1093, 2013.
-
(2013)
Pattern Recognition Letters
, vol.34
, Issue.9
, pp. 1085-1093
-
-
Dennis, J.1
Dat Tran, H.2
Siong Chng, E.3
-
12
-
-
84951103511
-
Polyphonic sound event detection using multi label deep neural networks
-
Emre Cakir, Toni Heittola, Heikki Huttunen, and Tuomas Virtanen, "Polyphonic sound event detection using multi label deep neural networks, " in IEEE International Joint Conference on Neural Networks (IJCNN), 2015.
-
(2015)
IEEE International Joint Conference on Neural Networks (IJCNN)
-
-
Cakir, E.1
Heittola, T.2
Huttunen, H.3
Virtanen, T.4
-
13
-
-
0031573117
-
Long short-term memory
-
Sepp Hochreiter and Jürgen Schmidhuber, "Long short-term memory, " Neural computation, vol. 9, no. 8, pp. 1735-1780, 1997.
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
14
-
-
27744588611
-
Framewise phoneme classification with bidirectional LSTM and other neural network architectures
-
Alex Graves and Jürgen Schmidhuber, "Framewise phoneme classification with bidirectional LSTM and other neural network architectures, " Neural Networks, vol. 18, no. 5, pp. 602-610, 2005.
-
(2005)
Neural Networks
, vol.18
, Issue.5
, pp. 602-610
-
-
Graves, A.1
Schmidhuber, J.2
-
15
-
-
84890543083
-
Speech recognition with deep recurrent neural networks
-
Alex Graves, Abdel-rahman Mohamed, and Geoffrey Hinton, "Speech recognition with deep recurrent neural networks, " in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2013, pp. 6645-6649.
-
(2013)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 6645-6649
-
-
Graves, A.1
Mohamed, A.2
Hinton, G.3
-
16
-
-
84863771261
-
Universal onset detection with bidirectional long short-term memory neural networks
-
Florian Eyben, Sebastian Böck, Björn Schuller, and Alex Graves, "Universal onset detection with bidirectional long short-term memory neural networks., " in International Society for Music Information Retrieval Conference (ISMIR), 2010, pp. 589-594.
-
(2010)
International Society for Music Information Retrieval Conference (ISMIR)
, pp. 589-594
-
-
Eyben, F.1
Böck, S.2
Schuller, B.3
Graves, A.4
-
18
-
-
0031268931
-
Bidirectional recurrent neural networks
-
Mike Schuster and Kuldip K Paliwal, "Bidirectional recurrent neural networks, " IEEE Transactions on Signal Processing, vol. 45, no. 11, pp. 2673-2681, 1997.
-
(1997)
IEEE Transactions on Signal Processing
, vol.45
, Issue.11
, pp. 2673-2681
-
-
Schuster, M.1
Paliwal, K.K.2
-
19
-
-
0028392483
-
Learning long-term dependencies with gradient descent is difficult
-
Yoshua Bengio, Patrice Simard, and Paolo Frasconi, "Learning long-term dependencies with gradient descent is difficult, " IEEE Transactions on Neural Networks, vol. 5, no. 2, pp. 157-166, 1994.
-
(1994)
IEEE Transactions on Neural Networks
, vol.5
, Issue.2
, pp. 157-166
-
-
Bengio, Y.1
Simard, P.2
Frasconi, P.3
-
20
-
-
64849110608
-
A novel connectionist system for unconstrained handwriting recognition
-
Alex Graves, Marcus Liwicki, Santiago Fernández, Roman Bertolami, Horst Bunke, and Jürgen Schmidhuber, "A novel connectionist system for unconstrained handwriting recognition, " IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 5, pp. 855-868, 2009.
-
(2009)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.31
, Issue.5
, pp. 855-868
-
-
Graves, A.1
Liwicki, M.2
Fernández, S.3
Bertolami, R.4
Bunke, H.5
Schmidhuber, J.6
-
21
-
-
0024753593
-
Speech recognition using noise-adaptive prototypes
-
Arthur Nádas, David Nahamoo, Michael Picheny, et al., "Speech recognition using noise-adaptive prototypes, " IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 37, no. 10, pp. 1495-1503, 1989.
-
(1989)
IEEE Transactions on Acoustics, Speech and Signal Processing
, vol.37
, Issue.10
, pp. 1495-1503
-
-
Nádas, A.1
Nahamoo, D.2
Picheny, M.3
-
24
-
-
84863806337
-
Audio context recognition using audio event histograms
-
Toni Heittola, Annamaria Mesaros, Antti Eronen, and Tuomas Virtanen, "Audio context recognition using audio event histograms, " in Proc. of the 18th European Signal Processing Conference (EUSIPCO), 2010, pp. 1272-1276.
-
(2010)
Proc. of the 18th European Signal Processing Conference (EUSIPCO)
, pp. 1272-1276
-
-
Heittola, T.1
Mesaros, A.2
Eronen, A.3
Virtanen, T.4
-
25
-
-
0025503558
-
Backpropagation through time: What it does and how to do it
-
Paul J Werbos, "Backpropagation through time: what it does and how to do it, " Proceedings of the IEEE, vol. 78, no. 10, pp. 1550-1560, 1990.
-
(1990)
Proceedings of the IEEE
, vol.78
, Issue.10
, pp. 1550-1560
-
-
Werbos, P.J.1
-
26
-
-
84893343292
-
Lecture 6. 5-rmsprop: Divide the gradient by a running average of its recent magnitude
-
Tijmen Tieleman and Geoffrey Hinton, "Lecture 6. 5-rmsprop: Divide the gradient by a running average of its recent magnitude, " COURSERA: Neural Networks for Machine Learning, vol. 4, 2012.
-
(2012)
COURSERA: Neural Networks for Machine Learning
, vol.4
-
-
Tieleman, T.1
Hinton, G.2
-
27
-
-
84930639546
-
Introducing currennt: The Munich opensource CUDA recurrent neural network toolkit
-
Felix Weninger, "Introducing currennt: The Munich opensource CUDA recurrent neural network toolkit, " Journal of Machine Learning Research, vol. 16, pp. 547-551, 2015.
-
(2015)
Journal of Machine Learning Research
, vol.16
, pp. 547-551
-
-
Weninger, F.1
|