메뉴 건너뛰기




Volumn , Issue , 2011, Pages 1089-1092

A study on speaker normalized MLP features in LVCSR

Author keywords

CMLLR; Dempster Shafer; GMM HMM; LVCSR; MLP; SAT; VTLN

Indexed keywords

CMLLR; DEMPSTER-SHAFER; GMM-HMM; LVCSR; MLP; SAT; VTLN;

EID: 84865742011     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (11)

References (17)
  • 1
    • 0029764708 scopus 로고    scopus 로고
    • Speaker normalization on conversational telephone speech
    • S.Wegmann et al., "Speaker normalization on conversational telephone speech," in ICASSP, vol. 1, 1996, pp. 339-341.
    • (1996) ICASSP , vol.1 , pp. 339-341
    • Wegmann, S.1
  • 2
    • 85009174854 scopus 로고    scopus 로고
    • Vocal tract normalization as linear transformation of MFCC
    • M. Pitz and H. Ney, "Vocal tract normalization as linear transformation of MFCC," in Eurospeech, 2003, pp. 1445-1448.
    • (2003) Eurospeech , pp. 1445-1448
    • Pitz, M.1    Ney, H.2
  • 3
    • 56149105775 scopus 로고    scopus 로고
    • The RWTH 2007 TC-STAR evaluation system for European English and Spanish
    • J. Lööf et al., "The RWTH 2007 TC-STAR evaluation system for European English and Spanish," in Interspeech, 2007, pp. 2145- 2148.
    • (2007) Interspeech , pp. 2145-2148
    • Lööf, J.1
  • 4
    • 79959848126 scopus 로고    scopus 로고
    • A comparative large scale study of MLP features for Mandarin ASR
    • F. Valente et al., "A comparative large scale study of MLP features for Mandarin ASR," in Interspeech, 2010, pp. 2630-2633.
    • (2010) Interspeech , pp. 2630-2633
    • Valente, F.1
  • 5
    • 84867209104 scopus 로고    scopus 로고
    • Recent improvements of the RWTH GALE Mandarin LVCSR system
    • C. Plahl et al., "Recent improvements of the RWTH GALE Mandarin LVCSR system," in Interspeech, 2008, pp. 2426-2429.
    • (2008) Interspeech , pp. 2426-2429
    • Plahl, C.1
  • 6
    • 79959839253 scopus 로고    scopus 로고
    • The RWTH 2009 QUAERO ASR evaluation system for English and German
    • M. Nußbaum-Thom et al., "The RWTH 2009 QUAERO ASR evaluation system for English and German," in Interspeech, 2010, pp. 1517-1520.
    • (2010) Interspeech , pp. 1517-1520
    • Nußbaum-Thom, M.1
  • 7
    • 79959812202 scopus 로고    scopus 로고
    • Analysis of gender normalization using MLP and VTLN features
    • T. Schaaf and F. Metze, "Analysis of Gender Normalization Using MLP and VTLN Features," in Interspeech, 2010, pp. 306-309.
    • (2010) Interspeech , pp. 306-309
    • Schaaf, T.1    Metze, F.2
  • 8
    • 85009097225 scopus 로고    scopus 로고
    • On using MLP features in LVCSR
    • Q. Zhu et al., "On using MLP features in LVCSR," in Interspeech, 2004, pp. 921-924.
    • (2004) Interspeech , pp. 921-924
    • Zhu, Q.1
  • 9
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • H. Hermansky et al., "Tandem connectionist feature extraction for conventional HMM systems," in ICASSP, vol. 3, 2000, pp. 1635- 1638.
    • (2000) ICASSP , vol.3 , pp. 1635-1638
    • Hermansky, H.1
  • 10
    • 79959828316 scopus 로고    scopus 로고
    • The 2010 CMU GALE speech-to-text system
    • F. Metze et al., "The 2010 CMU GALE Speech-to-Text System," in Interspeech, 2010, pp. 1501-1504.
    • (2010) Interspeech , pp. 1501-1504
    • Metze, F.1
  • 12
    • 0141676589 scopus 로고    scopus 로고
    • New entropy based combination rules in HMM/ANN multi-stream ASR
    • H. Misra et al., "New entropy based combination rules in HMM/ANN multi-stream ASR," in ICASSP, vol. 2, 2003, pp. 741-744.
    • (2003) ICASSP , vol.2 , pp. 741-744
    • Misra, H.1
  • 13
    • 73649085443 scopus 로고    scopus 로고
    • Multi-stream speech recognition based on Dempster- Shafer combination rule
    • Mar.
    • F. Valente, "Multi-stream speech recognition based on Dempster- Shafer combination rule," Speech Communication, vol. 52, no. 3, pp. 213-222, Mar. 2010.
    • (2010) Speech Communication , vol.52 , Issue.3 , pp. 213-222
    • Valente, F.1
  • 14
    • 84867209138 scopus 로고    scopus 로고
    • Transcribing broadcast data using MLP features
    • P. Fousek et al., "Transcribing broadcast data using MLP features," in Interspeech, 2008, pp. 1433-1436.
    • (2008) Interspeech , pp. 1433-1436
    • Fousek, P.1
  • 15
    • 51449103447 scopus 로고    scopus 로고
    • Optimizing bottle-neck features for LVCSR
    • F. Grézl and P. Fousek, "Optimizing bottle-neck features for LVCSR," in ICASSP, 2008, pp. 4729-4732.
    • (2008) ICASSP , pp. 4729-4732
    • Grézl, F.1    Fousek, P.2
  • 16
    • 33745213373 scopus 로고    scopus 로고
    • Multi-resolution RASTA filtering for TANDEM-based ASR
    • H. Hermansky and P. Fousek, "Multi-resolution RASTA filtering for TANDEM-based ASR," in Interspeech, 2005, pp. 361-364.
    • (2005) Interspeech , pp. 361-364
    • Hermansky, H.1    Fousek, P.2
  • 17
    • 0036753897 scopus 로고    scopus 로고
    • Speaker adaptive modeling by vocal tract normalization
    • Sep.
    • L. Welling et al., "Speaker adaptive modeling by vocal tract normalization," IEEE Trans. on Speech and Audio Processing, vol. 10, no. 6, pp. 415-426, Sep. 2002.
    • (2002) IEEE Trans. on Speech and Audio Processing , vol.10 , Issue.6 , pp. 415-426
    • Welling, L.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.