SCOPUS 정보 검색 플랫폼

13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012

Volumn 1, Issue , 2012, Pages 302-305

Combining Bottleneck-BLSTM and semi-supervised sparse NMF for recognition of conversational speech in highly instationary noise

(3) Weninger, Felix a Wöllmer, Martin a Schuller, Björn a

a TECHNICAL UNIVERSITY OF MUNICH (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC RECOGNITION; CONVERSATIONAL SPEECH; EVALUATION PROTOCOL; INSTATIONARY NOISE; LONG SHORT-TERM MEMORY; SPARSE NON-NEGATIVE MATRIX FACTORIZATIONS; SPEAKER INDEPENDENTS; SPONTANEOUS SPEECH;

FACTORIZATION; RECURRENT NEURAL NETWORKS; SPEECH ENHANCEMENT;

SPEECH RECOGNITION;

EID: 84878390904 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (6)

References (11)

1
- 84858961864
- A novel bottleneck-BLSTM frontend for feature-level context modeling in conversational speech recognition
- Waikoloa, Big Island, Hawaii
- M. Wöllmer, B. Schuller, and G. Rigoll, "A novel Bottleneck-BLSTM frontend for feature-level context modeling in conversational speech recognition," in Proc. of ASRU, Waikoloa, Big Island, Hawaii, 2011, pp. 36-41.
- (2011) Proc. of ASRU , pp. 36-41
- Wöllmer, M.¹ Schuller, B.² Rigoll, G.³

2
- 79959845286
- The CHiME corpus: A resource and a challenge for computational hearing in multisource environments
- Makuhari, Japan
- H. Christensen, J. Barker, N. Ma, and P. Green, "The CHiME corpus: a resource and a challenge for Computational Hearing in Multisource Environments," in Proc. of Interspeech, Makuhari, Japan, 2010, pp. 1918-1921.
- (2010) Proc. of Interspeech , pp. 1918-1921
- Christensen, H.¹ Barker, J.² Ma, N.³ Green, P.⁴

3
- 84857258863
- The munich 2011 CHiME challenge contribution: Nmf-BLSTM speech enhancement and recognition for reverberated multisource environments
- Florence, Italy
- F. Weninger, J. Geiger, M. Wöllmer, B. Schuller, and G. Rigoll, "The Munich 2011 CHiME Challenge Contribution: NMF-BLSTM Speech Enhancement and Recognition for Reverberated Multisource Environments," in Proc. of CHiME Workshop, Florence, Italy, 2011, pp. 24-29.
- (2011) Proc. of CHiME Workshop , pp. 24-29
- Weninger, F.¹ Geiger, J.² Wöllmer, M.³ Schuller, B.⁴ Rigoll, G.⁵

4
- 77950116181
- Factorial scaled hidden Markov model for polyphonic audio representation and source separation
- Mohonk, NY, United States
- A. Ozerov, C. Févotte, and M. Charbit, "Factorial scaled hidden Markov model for polyphonic audio representation and source separation," in Proc. of WASPAA, Mohonk, NY, United States, 2009, pp. 121-124.
- (2009) Proc. of WASPAA , pp. 121-124
- Ozerov, A.¹ Févotte, C.² Charbit, M.³

5
- 80051618211
- OpenBliSSART: Design and evaluation of a research toolkit for blind source separation in audio recognition tasks
- Prague, Czech Republic
- F. Weninger, A. Lehmann, and B. Schuller, "openBliSSART: Design and Evaluation of a Research Toolkit for Blind Source Separation in Audio Recognition Tasks," in Proc. of ICASSP, Prague, Czech Republic, 2011, pp. 1625-1628.
- (2011) Proc. of ICASSP , pp. 1625-1628
- Weninger, F.¹ Lehmann, A.² Schuller, B.³

6
- 27744588611
- Framewise phoneme classification with bidirectional LSTM and other neural network architectures
- A. Graves and J. Schmidhuber, "Framewise phoneme classification with bidirectional LSTM and other neural network architectures," Neural Networks, vol. 18, no. 5-6, pp. 602-610, 2005.
- (2005) Neural Networks , vol.18 , Issue.5-6 , pp. 602-610
- Graves, A.¹ Schmidhuber, J.²

7
- 79959404069
- The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments
- A. Stupakov, E. Hanusa, D. Vijaywargi, D. Fox, and J. Bilmes, "The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments," Computer Speech and Language, vol. 26, no. 1, pp. 52-66, 2011.
- (2011) Computer Speech and Language , vol.26 , Issue.1 , pp. 52-66
- Stupakov, A.¹ Hanusa, E.² Vijaywargi, D.³ Fox, D.⁴ Bilmes, J.⁵

8
- 51449106187
- Columbus, OH, USA: Department of Psychology, Ohio State University (Distributor)
- M. A. Pitt, L. Dilley, K. Johnson, S. Kiesling, W. Raymond, E. Hume, and E. Fosler-Lussier, Buckeye Corpus of Conversational Speech (2nd release). Columbus, OH, USA: Department of Psychology, Ohio State University (Distributor), 2007, [www.buckeyecorpus.osu.edu].
- (2007) Buckeye Corpus of Conversational Speech (2nd Release)
- Pitt, M.A.¹ Dilley, L.² Johnson, K.³ Kiesling, S.⁴ Raymond, W.⁵ Hume, E.⁶ Fosler-Lussier, E.⁷

9
- 80051621128
- Localization of non-linguistic events in spontaneous speech by non-negative matrix factorization and long short-term memory
- Prague, Czech Republic
- F. Weninger, B. Schuller, M. Wöllmer, and G. Rigoll, "Localization of non-linguistic events in spontaneous speech by non-negative matrix factorization and long short-term memory," in Proc. of ICASSP, Prague, Czech Republic, 2011, pp. 5840-5843.
- (2011) Proc. of ICASSP , pp. 5840-5843
- Weninger, F.¹ Schuller, B.² Wöllmer, M.³ Rigoll, G.⁴

10
- 44949110218
- Single-channel speech separation using sparse non-negative matrix factorization
- Pittsburgh, PA, USA
- M. N. Schmidt and R. K. Olsson, "Single-channel speech separation using sparse non-negative matrix factorization," in Proc. of Interspeech, Pittsburgh, PA, USA, 2006.
- (2006) Proc. of Interspeech
- Schmidt, M.N.¹ Olsson, R.K.²

11
- 33744975847
- Performance measurement in blind audio source separation
- E. Vincent, R. Gribonval, and C. Févotte, "Performance measurement in blind audio source separation," IEEE Transactions on Audio, Speech and Language Processing, vol. 14, no. 4, pp. 1462-1469, 2006.
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.4 , pp. 1462-1469
- Vincent, E.¹ Gribonval, R.² Févotte, C.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.