Volume 1, 2012, Pages 302-305

Combining Bottleneck-BLSTM and semi-supervised sparse NMF for recognition of conversational speech in highly instationary noise

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC RECOGNITION; CONVERSATIONAL SPEECH; EVALUATION PROTOCOL; INSTATIONARY NOISE; LONG SHORT-TERM MEMORY; SPARSE NON-NEGATIVE MATRIX FACTORIZATIONS; SPEAKER INDEPENDENTS; SPONTANEOUS SPEECH

EID: 84878390904     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 6

References (11)
  • 1
    • 84858961864
    • A novel bottleneck-BLSTM frontend for feature-level context modeling in conversational speech recognition
    • Waikoloa, Big Island, Hawaii
    • M. Wöllmer, B. Schuller, and G. Rigoll, "A novel Bottleneck-BLSTM frontend for feature-level context modeling in conversational speech recognition," in Proc. of ASRU, Waikoloa, Big Island, Hawaii, 2011, pp. 36-41.
    • (2011) Proc. of ASRU , pp. 36-41
    • Wöllmer, M.1    Schuller, B.2    Rigoll, G.3
  • 2
    • 79959845286
    • The CHiME corpus: A resource and a challenge for computational hearing in multisource environments
    • Makuhari, Japan
    • H. Christensen, J. Barker, N. Ma, and P. Green, "The CHiME corpus: a resource and a challenge for Computational Hearing in Multisource Environments," in Proc. of Interspeech, Makuhari, Japan, 2010, pp. 1918-1921.
    • (2010) Proc. of Interspeech , pp. 1918-1921
    • Christensen, H.1    Barker, J.2    Ma, N.3    Green, P.4
  • 3
    • 84857258863
    • The Munich 2011 CHiME challenge contribution: NMF-BLSTM speech enhancement and recognition for reverberated multisource environments
    • Florence, Italy
    • F. Weninger, J. Geiger, M. Wöllmer, B. Schuller, and G. Rigoll, "The Munich 2011 CHiME Challenge Contribution: NMF-BLSTM Speech Enhancement and Recognition for Reverberated Multisource Environments," in Proc. of CHiME Workshop, Florence, Italy, 2011, pp. 24-29.
    • (2011) Proc. of CHiME Workshop , pp. 24-29
    • Weninger, F.1    Geiger, J.2    Wöllmer, M.3    Schuller, B.4    Rigoll, G.5
  • 4
    • 77950116181
    • Factorial scaled hidden Markov model for polyphonic audio representation and source separation
    • Mohonk, NY, United States
    • A. Ozerov, C. Févotte, and M. Charbit, "Factorial scaled hidden Markov model for polyphonic audio representation and source separation," in Proc. of WASPAA, Mohonk, NY, United States, 2009, pp. 121-124.
    • (2009) Proc. of WASPAA , pp. 121-124
    • Ozerov, A.1    Févotte, C.2    Charbit, M.3
  • 5
    • 80051618211
    • OpenBliSSART: Design and evaluation of a research toolkit for blind source separation in audio recognition tasks
    • Prague, Czech Republic
    • F. Weninger, A. Lehmann, and B. Schuller, "openBliSSART: Design and Evaluation of a Research Toolkit for Blind Source Separation in Audio Recognition Tasks," in Proc. of ICASSP, Prague, Czech Republic, 2011, pp. 1625-1628.
    • (2011) Proc. of ICASSP , pp. 1625-1628
    • Weninger, F.1    Lehmann, A.2    Schuller, B.3
  • 6
    • 27744588611
    • Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    • A. Graves and J. Schmidhuber, "Framewise phoneme classification with bidirectional LSTM and other neural network architectures," Neural Networks, vol. 18, no. 5-6, pp. 602-610, 2005.
    • (2005) Neural Networks , vol.18 , Issue.5-6 , pp. 602-610
    • Graves, A.1    Schmidhuber, J.2
  • 7
    • 79959404069
    • The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments
    • A. Stupakov, E. Hanusa, D. Vijaywargi, D. Fox, and J. Bilmes, "The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments," Computer Speech and Language, vol. 26, no. 1, pp. 52-66, 2011.
    • (2011) Computer Speech and Language , vol.26 , Issue.1 , pp. 52-66
    • Stupakov, A.1    Hanusa, E.2    Vijaywargi, D.3    Fox, D.4    Bilmes, J.5
  • 9
    • 80051621128
    • Localization of non-linguistic events in spontaneous speech by non-negative matrix factorization and long short-term memory
    • Prague, Czech Republic
    • F. Weninger, B. Schuller, M. Wöllmer, and G. Rigoll, "Localization of non-linguistic events in spontaneous speech by non-negative matrix factorization and long short-term memory," in Proc. of ICASSP, Prague, Czech Republic, 2011, pp. 5840-5843.
    • (2011) Proc. of ICASSP , pp. 5840-5843
    • Weninger, F.1    Schuller, B.2    Wöllmer, M.3    Rigoll, G.4
  • 10
    • 44949110218
    • Single-channel speech separation using sparse non-negative matrix factorization
    • Pittsburgh, PA, USA
    • M. N. Schmidt and R. K. Olsson, "Single-channel speech separation using sparse non-negative matrix factorization," in Proc. of Interspeech, Pittsburgh, PA, USA, 2006.
    • (2006) Proc. of Interspeech
    • Schmidt, M.N.1    Olsson, R.K.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.