-
1
-
-
84858961864
-
A novel bottleneck-BLSTM frontend for feature-level context modeling in conversational speech recognition
-
Waikoloa, Big Island, Hawaii
-
M. Wöllmer, B. Schuller, and G. Rigoll, "A novel Bottleneck-BLSTM frontend for feature-level context modeling in conversational speech recognition," in Proc. of ASRU, Waikoloa, Big Island, Hawaii, 2011, pp. 36-41.
-
(2011)
Proc. of ASRU
, pp. 36-41
-
-
Wöllmer, M.1
Schuller, B.2
Rigoll, G.3
-
2
-
-
79959845286
-
The CHiME corpus: A resource and a challenge for computational hearing in multisource environments
-
Makuhari, Japan
-
H. Christensen, J. Barker, N. Ma, and P. Green, "The CHiME corpus: a resource and a challenge for Computational Hearing in Multisource Environments," in Proc. of Interspeech, Makuhari, Japan, 2010, pp. 1918-1921.
-
(2010)
Proc. of Interspeech
, pp. 1918-1921
-
-
Christensen, H.1
Barker, J.2
Ma, N.3
Green, P.4
-
3
-
-
84857258863
-
The munich 2011 CHiME challenge contribution: Nmf-BLSTM speech enhancement and recognition for reverberated multisource environments
-
Florence, Italy
-
F. Weninger, J. Geiger, M. Wöllmer, B. Schuller, and G. Rigoll, "The Munich 2011 CHiME Challenge Contribution: NMF-BLSTM Speech Enhancement and Recognition for Reverberated Multisource Environments," in Proc. of CHiME Workshop, Florence, Italy, 2011, pp. 24-29.
-
(2011)
Proc. of CHiME Workshop
, pp. 24-29
-
-
Weninger, F.1
Geiger, J.2
Wöllmer, M.3
Schuller, B.4
Rigoll, G.5
-
4
-
-
77950116181
-
Factorial scaled hidden Markov model for polyphonic audio representation and source separation
-
Mohonk, NY, United States
-
A. Ozerov, C. Févotte, and M. Charbit, "Factorial scaled hidden Markov model for polyphonic audio representation and source separation," in Proc. of WASPAA, Mohonk, NY, United States, 2009, pp. 121-124.
-
(2009)
Proc. of WASPAA
, pp. 121-124
-
-
Ozerov, A.1
Févotte, C.2
Charbit, M.3
-
5
-
-
80051618211
-
OpenBliSSART: Design and evaluation of a research toolkit for blind source separation in audio recognition tasks
-
Prague, Czech Republic
-
F. Weninger, A. Lehmann, and B. Schuller, "openBliSSART: Design and Evaluation of a Research Toolkit for Blind Source Separation in Audio Recognition Tasks," in Proc. of ICASSP, Prague, Czech Republic, 2011, pp. 1625-1628.
-
(2011)
Proc. of ICASSP
, pp. 1625-1628
-
-
Weninger, F.1
Lehmann, A.2
Schuller, B.3
-
6
-
-
27744588611
-
Framewise phoneme classification with bidirectional LSTM and other neural network architectures
-
A. Graves and J. Schmidhuber, "Framewise phoneme classification with bidirectional LSTM and other neural network architectures," Neural Networks, vol. 18, no. 5-6, pp. 602-610, 2005.
-
(2005)
Neural Networks
, vol.18
, Issue.5-6
, pp. 602-610
-
-
Graves, A.1
Schmidhuber, J.2
-
7
-
-
79959404069
-
The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments
-
A. Stupakov, E. Hanusa, D. Vijaywargi, D. Fox, and J. Bilmes, "The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments," Computer Speech and Language, vol. 26, no. 1, pp. 52-66, 2011.
-
(2011)
Computer Speech and Language
, vol.26
, Issue.1
, pp. 52-66
-
-
Stupakov, A.1
Hanusa, E.2
Vijaywargi, D.3
Fox, D.4
Bilmes, J.5
-
8
-
-
51449106187
-
-
Columbus, OH, USA: Department of Psychology, Ohio State University (Distributor)
-
M. A. Pitt, L. Dilley, K. Johnson, S. Kiesling, W. Raymond, E. Hume, and E. Fosler-Lussier, Buckeye Corpus of Conversational Speech (2nd release). Columbus, OH, USA: Department of Psychology, Ohio State University (Distributor), 2007, [www.buckeyecorpus.osu.edu].
-
(2007)
Buckeye Corpus of Conversational Speech (2nd Release)
-
-
Pitt, M.A.1
Dilley, L.2
Johnson, K.3
Kiesling, S.4
Raymond, W.5
Hume, E.6
Fosler-Lussier, E.7
-
9
-
-
80051621128
-
Localization of non-linguistic events in spontaneous speech by non-negative matrix factorization and long short-term memory
-
Prague, Czech Republic
-
F. Weninger, B. Schuller, M. Wöllmer, and G. Rigoll, "Localization of non-linguistic events in spontaneous speech by non-negative matrix factorization and long short-term memory," in Proc. of ICASSP, Prague, Czech Republic, 2011, pp. 5840-5843.
-
(2011)
Proc. of ICASSP
, pp. 5840-5843
-
-
Weninger, F.1
Schuller, B.2
Wöllmer, M.3
Rigoll, G.4
-
10
-
-
44949110218
-
Single-channel speech separation using sparse non-negative matrix factorization
-
Pittsburgh, PA, USA
-
M. N. Schmidt and R. K. Olsson, "Single-channel speech separation using sparse non-negative matrix factorization," in Proc. of Interspeech, Pittsburgh, PA, USA, 2006.
-
(2006)
Proc. of Interspeech
-
-
Schmidt, M.N.1
Olsson, R.K.2
-
11
-
-
33744975847
-
Performance measurement in blind audio source separation
-
E. Vincent, R. Gribonval, and C. Févotte, "Performance measurement in blind audio source separation," IEEE Transactions on Audio, Speech and Language Processing, vol. 14, no. 4, pp. 1462-1469, 2006.
-
(2006)
IEEE Transactions on Audio, Speech and Language Processing
, vol.14
, Issue.4
, pp. 1462-1469
-
-
Vincent, E.1
Gribonval, R.2
Févotte, C.3
|