Volume 22, Issue 8, 2015, Pages 1006-1010

Can we automatically transform speech recorded on common consumer devices in real-world environments into professional production quality speech? - A dataset, insights, and challenges

Author keywords

Automatic production; speech enhancement

Indexed keywords

AUDIO ACOUSTICS; AUDIO RECORDINGS; MOTION PICTURES; PROFESSIONAL ASPECTS; SPEECH ENHANCEMENT; SPEECH INTELLIGIBILITY; STUDIOS

EID: 84919935005     PISSN: 10709908     EISSN: None     Source Type: Journal    
DOI: 10.1109/LSP.2014.2379648     Document Type: Article
Times cited: 113

References (25)
  • 4
    • 84866036566
    • Mastering Audio
    • 2nd ed. New York, NY, USA: Focal
    • B. Katz, Mastering Audio: The Art and the Science, 2nd ed. New York, NY, USA: Focal, 2007.
    • (2007) The Art and the Science
    • Katz, B.1
  • 5
    • 0021645331
    • Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
    • Dec.
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. 32, no. 6, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process , vol.32 , Issue.6
    • Ephraim, Y.1    Malah, D.2
  • 7
    • 84878420060
    • Speech enhancement by online non-negative spectrogram decomposition in non-stationary noise environments
    • Sep.
    • Z. Duan, G. J. Mysore, and P. Smaragdis, "Speech enhancement by online non-negative spectrogram decomposition in non-stationary noise environments," in Proc. Interspeech, Sep. 2012.
    • (2012) Proc. Interspeech
    • Duan, Z.1    Mysore, G.J.2    Smaragdis, P.3
  • 12
    • 85017319264
    • Bandwidth expansion of speech based on vector quantization of the mel frequency cepstral coefficients
    • Jun.
    • N. Enbom and B. Kleijn, "Bandwidth expansion of speech based on vector quantization of the mel frequency cepstral coefficients," in Proc. IEEE Workshop on Speech Coding, Jun. 1999.
    • (1999) Proc. IEEE Workshop on Speech Coding
    • Enbom, N.1    Kleijn, B.2
  • 14
    • 33745105930
    • Adaptive digital audio effects (a-dafx). A new class of sound transformations
    • Sep.
    • V. Verfaille, U. Zölzer, and D. Arfib, "Adaptive digital audio effects (a-dafx). A new class of sound transformations," IEEE Trans. Audio, Speech Lang. Process., vol. 14, no. 5, pp. 1817-1831, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech Lang. Process , vol.14 , Issue.5 , pp. 1817-1831
    • Verfaille, V.1    Zölzer, U.2    Arfib, D.3
  • 15
    • 84887107527
    • Parameter automation in a dynamic range compressor
    • Oct.
    • D. Giannoulis, M. Massberg, and J. D. Reiss, "Parameter automation in a dynamic range compressor," J. Audio Eng. Soc., vol. 61, no. 10, Oct. 2013.
    • (2013) J. Audio Eng. Soc. , vol.61 , Issue.10
    • Giannoulis, D.1    Massberg, M.2    Reiss, J.D.3
  • 16
  • 19
    • 4544279104
    • The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Sep.
    • H.-G. Hirsch and D. Pearce, "The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ISCA Workshop ASR2000, Sep. 2000.
    • (2000) Proc. ISCA Workshop ASR2000
    • Hirsch, H.-G.1    Pearce, D.2
  • 21
    • 84865991945
    • Digital dynamic range compressor design-a tutorial and analysis
    • Jun.
    • D. Giannoulis, M. Massberg, and J. D. Reiss, "Digital dynamic range compressor design-a tutorial and analysis," J. Audio Eng. Soc., vol. 60, no. 6, Jun. 2012.
    • (2012) J. Audio Eng. Soc. , vol.60 , Issue.6
    • Giannoulis, D.1    Massberg, M.2    Reiss, J.D.3
  • 22
    • 0032762471
    • A statistical model-based voice activity detection
    • Jan.
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, Jan. 1999.
    • (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 23
    • 84906264722
    • Speaker and noise independent voice activity detection
    • Aug.
    • F. G. Germain, D. Sun, and G. J. Mysore, "Speaker and noise independent voice activity detection," in Proc. Interspeech, Aug. 2013.
    • (2013) Proc. Interspeech
    • Germain, F.G.1    Sun, D.2    Mysore, G.J.3
  • 24
    • 44149106061
    • Evaluation of objective quality measures for speech enhancement
    • Jan.
    • Y. Hu and P. C. Loizou, "Evaluation of objective quality measures for speech enhancement," IEEE Trans. Audio, Speech Lang. Process., vol. 16, no. 1, pp. 229-238, Jan. 2008.
    • (2008) IEEE Trans. Audio, Speech Lang. Process , vol.16 , Issue.1 , pp. 229-238
    • Hu, Y.1    Loizou, P.C.2
  • 25


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.