메뉴 건너뛰기




Volumn 26, Issue 1, 2012, Pages 52-66

The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments

Author keywords

Microphone arrays; Multi microphone; Multi party corpora; Noise robust speech recognition; Portable recording; Speech recognition

Indexed keywords

ACOUSTIC NOISE; AUDIO RECORDINGS; HUMAN COMPUTER INTERACTION; MICROPHONES; SPEECH; TRANSCRIPTION;

EID: 79959404069     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2010.12.003     Document Type: Article
Times cited : (30)

References (36)
  • 2
    • 4444257069 scopus 로고    scopus 로고
    • Praat, a system for doing phonetics by computer
    • P. Boersma Praat, a system for doing phonetics by computer Glot International 5 9/10 2001 341 345
    • (2001) Glot International , vol.5 , Issue.9-10 , pp. 341-345
    • Boersma, P.1
  • 3
    • 79960067669 scopus 로고    scopus 로고
    • C. M. University. cmudict0.7a
    • C. M. University, 2008. cmudict0.7a, https://cmusphinx.svn.sourceforge. net/svnroot/cmusphinx/trunk/cmudict/cmudict0. 7a.
    • (2008)
  • 7
    • 79960039131 scopus 로고    scopus 로고
    • FLAC - Free Lossless Audio Codec, v1.1
    • FLAC - Free Lossless Audio Codec, v1.1. http://flac.sourceforge.net/.
  • 9
    • 85016587886 scopus 로고
    • SWITCHBOARD: Telephone speech corpus for research and development
    • J. Godfrey, E. Holliman, and J. McDaniel SWITCHBOARD: telephone speech corpus for research and development ICASSP, vol. 1 1992 517 520
    • (1992) ICASSP, Vol. 1 , pp. 517-520
    • Godfrey, J.1    Holliman, E.2    McDaniel, J.3
  • 10
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments: A survey
    • Y. Gong Speech recognition in noisy environments: a survey Speech Communication 16 3 1995 261 291
    • (1995) Speech Communication , vol.16 , Issue.3 , pp. 261-291
    • Gong, Y.1
  • 12
    • 34547540831 scopus 로고    scopus 로고
    • An auditory neural feature extraction method for robust speech recognition
    • W. Guo, L. Zhang, and B. Xia An auditory neural feature extraction method for robust speech recognition ICASSP 2007
    • (2007) ICASSP
    • Guo, W.1    Zhang, L.2    Xia, B.3
  • 13
    • 0000259871 scopus 로고    scopus 로고
    • Models and selection criteria for regression and classification
    • D. Heckerman, and C. Meek Models and selection criteria for regression and classification UAI 1997
    • (1997) UAI
    • Heckerman, D.1    Meek, C.2
  • 14
    • 34447092407 scopus 로고    scopus 로고
    • Subjective comparison and evaluation of speech enhancement algorithms
    • DOI 10.1016/j.specom.2006.12.006, PII S0167639306001920
    • Y. Hu, and P. Loizou Subjective evaluation and comparison of speech enhancement algorithms Speech Communication 49 2007 588 601 (Pubitemid 47031352)
    • (2007) Speech Communication , vol.49 , Issue.7-8 , pp. 588-601
    • Hu, Y.1    Loizou, P.C.2
  • 15
    • 44949151536 scopus 로고    scopus 로고
    • An improved mel-Wiener filter for mel-LPC based speech recognition
    • M. Islam, H. Matsumoto, and K. Yamamoto An improved mel-Wiener filter for mel-LPC based speech recognition Interspeech-ICSLP 2006
    • (2006) Interspeech-ICSLP
    • Islam, M.1    Matsumoto, H.2    Yamamoto, K.3
  • 16
    • 0025680225 scopus 로고
    • NTIMIT: A phonetically balanced, continuous speech, telephone bandwidth speech database
    • C. Jankowski NTIMIT: a phonetically balanced, continuous speech, telephone bandwidth speech database ICASSP 1990
    • (1990) ICASSP
    • Jankowski, C.1
  • 18
    • 85187956132 scopus 로고    scopus 로고
    • The Lombard effect: A reflex to better communicate with others in noise
    • J.-C. Junqua, S. Fincke, and K. Field The Lombard effect: a reflex to better communicate with others in noise ICASSP 1999 2083 2086
    • (1999) ICASSP , pp. 2083-2086
    • Junqua, J.-C.1    Fincke, S.2    Field, K.3
  • 21
    • 51449098446 scopus 로고    scopus 로고
    • Cepstral domain feature compensation based on diagonal approximation
    • W. Lim, C. Han, J. Shin, and N. Kim Cepstral domain feature compensation based on diagonal approximation ICASSP 2008
    • (2008) ICASSP
    • Lim, W.1    Han, C.2    Shin, J.3    Kim, N.4
  • 25
    • 85009193359 scopus 로고    scopus 로고
    • Speech in noisy environments (SPINE) adds new dimension to speech recognition R&D
    • A. Schmidt-Nielsen, T. Crystal, and E. Marsh Speech in noisy environments (SPINE) adds new dimension to speech recognition R&D HLT 2002
    • (2002) HLT
    • Schmidt-Nielsen, A.1    Crystal, T.2    Marsh, E.3
  • 27
    • 33745828208 scopus 로고    scopus 로고
    • Spontaneous speech: How people really talk and why engineers should care
    • E. Shriberg Spontaneous speech: how people really talk and why engineers should care EUROSPEECH 2005
    • (2005) EUROSPEECH
    • Shriberg, E.1
  • 29
    • 70349199112 scopus 로고    scopus 로고
    • COSINE - A corpus of multi-party conversational speech in noisy environments
    • A. Stupakov, E. Hanusa, J. Bilmes, and D. Fox COSINE - A corpus of multi-party conversational speech in noisy environments ICASSP 2009
    • (2009) ICASSP
    • Stupakov, A.1    Hanusa, E.2    Bilmes, J.3    Fox, D.4
  • 30
    • 85026956548 scopus 로고    scopus 로고
    • Virtual evidence for training speech recognizers using partially labeled data
    • A. Subramanya, and J. Bilmes Virtual evidence for training speech recognizers using partially labeled data HLT 2007
    • (2007) HLT
    • Subramanya, A.1    Bilmes, J.2
  • 31
    • 84867197731 scopus 로고    scopus 로고
    • Applications of virtual-evidence based speech recognizer training
    • A. Subramanya, and J. Bilmes Applications of virtual-evidence based speech recognizer training Interspeech 2008
    • (2008) Interspeech
    • Subramanya, A.1    Bilmes, J.2
  • 32
    • 44849086817 scopus 로고    scopus 로고
    • Uncertainty in training large vocabulary speech recognizers
    • A. Subramanya, C. Bartels, J. Bilmes, and P. Nguyen Uncertainty in training large vocabulary speech recognizers ASRU 2007
    • (2007) ASRU
    • Subramanya, A.1    Bartels, C.2    Bilmes, J.3    Nguyen, P.4
  • 35
    • 51449089990 scopus 로고    scopus 로고
    • A minimum-mean-square-error noise reduction algorithm on mel-frequency cepstra for robust speech recognition
    • D. Yu, L. Deng, J. Droppo, J. Wu, Y. Gong, and A. Acero A minimum-mean-square-error noise reduction algorithm on mel-frequency cepstra for robust speech recognition ICASSP 2008
    • (2008) ICASSP
    • Yu, D.1    Deng, L.2    Droppo, J.3    Wu, J.4    Gong, Y.5    Acero, A.6
  • 36
    • 33947692806 scopus 로고    scopus 로고
    • Joint segmentation and classification of dialog acts in multiparty meetings
    • M. Zimmermann, A. Stolcke, and E. Shriberg Joint segmentation and classification of dialog acts in multiparty meetings ICASSP 2006
    • (2006) ICASSP
    • Zimmermann, M.1    Stolcke, A.2    Shriberg, E.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.