-
1
-
-
78651563436
-
Bidirectional LSTM networks for context-sensitive keyword detection in a cognitive virtual agent framework
-
M. Wöllmer, F. Eyben, A. Graves, B. Schuller, and G. Rigoll, "Bidirectional LSTM networks for context-sensitive keyword detection in a cognitive virtual agent framework," Cognitive Computation, vol. 2, no. 3, pp. 180-190, 2010.
-
(2010)
Cognitive Computation
, vol.2
, Issue.3
, pp. 180-190
-
-
Wöllmer, M.1
Eyben, F.2
Graves, A.3
Schuller, B.4
Rigoll, G.5
-
2
-
-
54349106040
-
Switching linear dynamic systems for noise robust speech recognition
-
B. Mesot and D. Barber, "Switching linear dynamic systems for noise robust speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 6, pp. 1850-1858, 2007.
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, Issue.6
, pp. 1850-1858
-
-
Mesot, B.1
Barber, D.2
-
3
-
-
67650135931
-
Recognition of noisy speech: A comparative survey of robust model architecture and feature enhancement
-
ID 942617
-
B. Schuller, M. Wöllmer, T. Moosmayr, and G. Rigoll, "Recognition of noisy speech: A comparative survey of robust model architecture and feature enhancement," Journal on Audio, Speech, and Music Processing, 2009, ID 942617.
-
(2009)
Journal on Audio, Speech, and Music Processing
-
-
Schuller, B.1
Wöllmer, M.2
Moosmayr, T.3
Rigoll, G.4
-
4
-
-
34547522358
-
An acoustic model based on Kullback-Leibler divergence for posterior features
-
G. Aradilla, J. Vepa, and H. Bourlard, "An acoustic model based on Kullback-Leibler divergence for posterior features," in Proc. of ICASSP, Honolulu, HI, 2007, pp. 657-660.
-
Proc. of ICASSP, Honolulu, HI, 2007
, pp. 657-660
-
-
Aradilla, G.1
Vepa, J.2
Bourlard, H.3
-
5
-
-
51449103447
-
Optimizing bottle-neck features for LVCSR
-
F. Grezl and P. Fousek, "Optimizing bottle-neck features for LVCSR," in Proc. of ICASSP, Las Vegas, NV, 2008, pp. 4729-4732.
-
Proc. of ICASSP, Las Vegas, NV, 2008
, pp. 4729-4732
-
-
Grezl, F.1
Fousek, P.2
-
6
-
-
78049359820
-
Spoken term detection with connectionist temporal classification - A novel hybrid CTC-DBN approach
-
M. Wöllmer, F. Eyben, B. Schuller, and G. Rigoll, "Spoken term detection with connectionist temporal classification - a novel hybrid CTC-DBN approach," in Proc. of ICASSP, Dallas, Texas, 2010, pp. 5274-5277.
-
Proc. of ICASSP, Dallas, Texas, 2010
, pp. 5274-5277
-
-
Wöllmer, M.1
Eyben, F.2
Schuller, B.3
Rigoll, G.4
-
7
-
-
0041914606
-
Gradient flow in recurrent nets: The difficulty of learning long-term dependencies
-
S. C. Kremer and J. F. Kolen, Eds. IEEE Press
-
S. Hochreiter, Y. Bengio, P. Frasconi, and J. Schmidhuber, "Gradient flow in recurrent nets: the difficulty of learning long-term dependencies," in A Field Guide to Dynamical Recurrent Neural Networks, S. C. Kremer and J. F. Kolen, Eds. IEEE Press, 2001.
-
(2001)
A Field Guide to Dynamical Recurrent Neural Networks
-
-
Hochreiter, S.1
Bengio, Y.2
Frasconi, P.3
Schmidhuber, J.4
-
8
-
-
33745213373
-
Multi-resolution RASTA filtering for TANDEM-based ASR
-
H. Hermansky and P. Fousek, "Multi-resolution RASTA filtering for TANDEM-based ASR," in Proc. of European Conf. on Speech Communication and Technology, Lisbon, Portugal, 2008, pp. 361-364.
-
Proc. of European Conf. on Speech Communication and Technology, Lisbon, Portugal, 2008
, pp. 361-364
-
-
Hermansky, H.1
Fousek, P.2
-
9
-
-
0031573117
-
Long short-term memory
-
S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
10
-
-
27744588611
-
Framewise phoneme classification with bidirectional LSTM and other neural network architectures
-
A. Graves and J. Schmidhuber, "Framewise phoneme classification with bidirectional LSTM and other neural network architectures," Neural Networks, vol. 18, no. 5-6, pp. 602-610, 2005.
-
(2005)
Neural Networks
, vol.18
, Issue.5-6
, pp. 602-610
-
-
Graves, A.1
Schmidhuber, J.2
-
11
-
-
33749251046
-
Bidirectional LSTM networks for improved phoneme classification and recognition
-
A. Graves, S. Fernandez, and J. Schmidhuber, "Bidirectional LSTM networks for improved phoneme classification and recognition," in Proc. of ICANN, Warsaw, Poland, 2005, pp. 602-610.
-
Proc. of ICANN, Warsaw, Poland, 2005
, pp. 602-610
-
-
Graves, A.1
Fernandez, S.2
Schmidhuber, J.3
-
12
-
-
70349203870
-
Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks
-
M. Wöllmer, F. Eyben, J. Keshet, A. Graves, B. Schuller, and G. Rigoll, "Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks," in Proc. of ICASSP, Taipei, Taiwan, 2009.
-
Proc. of ICASSP, Taipei, Taiwan, 2009
-
-
Wöllmer, M.1
Eyben, F.2
Keshet, J.3
Graves, A.4
Schuller, B.5
Rigoll, G.6
-
13
-
-
38149014113
-
An application of recurrent neural networks to discriminative keyword spotting
-
S. Fernandez, A. Graves, and J. Schmidhuber, "An application of recurrent neural networks to discriminative keyword spotting," in Proc. of ICANN, Porto, Portugal, 2007, pp. 220-229.
-
Proc. of ICANN, Porto, Portugal, 2007
, pp. 220-229
-
-
Fernandez, S.1
Graves, A.2
Schmidhuber, J.3
-
14
-
-
79959821052
-
Recognition of spontaneous conversational speech using long short-term memory phoneme predictions
-
M. Wöllmer, F. Eyben, B. Schuller, and G. Rigoll, "Recognition of spontaneous conversational speech using long short-term memory phoneme predictions," in Proc. of Interspeech, Makuhari, Japan, 2010, pp. 1946-1949.
-
Proc. of Interspeech, Makuhari, Japan, 2010
, pp. 1946-1949
-
-
Wöllmer, M.1
Eyben, F.2
Schuller, B.3
Rigoll, G.4
-
15
-
-
70349199112
-
COSINE - A corpus of multi-party conversational speech in noisy environments
-
A. Stupakov, E. Hanusa, J. Bilmes, and D. Fox, "COSINE - a corpus of multi-party conversational speech in noisy environments," in Proc. of ICASSP, Taipei, Taiwan, 2009.
-
Proc. of ICASSP, Taipei, Taiwan, 2009
-
-
Stupakov, A.1
Hanusa, E.2
Bilmes, J.3
Fox, D.4
-
16
-
-
78650977476
-
OpenSMILE - The Munich versatile and fast open-source audio feature extractor
-
F. Eyben, M. Wöllmer, and B. Schuller, "openSMILE - the Munich versatile and fast open-source audio feature extractor," in Proc. of ACM Multimedia, Firenze, Italy, 2010.
-
Proc. of ACM Multimedia, Firenze, Italy, 2010
-
-
Eyben, F.1
Wöllmer, M.2
Schuller, B.3
-
17
-
-
77956721304
-
Combining long short-term memory and dynamic bayesian networks for incremental emotion-sensitive artificial listening
-
M. Wöllmer, B. Schuller, F. Eyben, and G. Rigoll, "Combining long short-term memory and dynamic bayesian networks for incremental emotion-sensitive artificial listening," IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 5, pp. 867-881, 2010.
-
(2010)
IEEE Journal of Selected Topics in Signal Processing
, vol.4
, Issue.5
, pp. 867-881
-
-
Wöllmer, M.1
Schuller, B.2
Eyben, F.3
Rigoll, G.4
|