메뉴 건너뛰기




Volumn 2005, Issue 4, 2005, Pages 487-497

A computationally efficient mel-filter bank VAD algorithm for distributed speech recognition systems

Author keywords

Distributed speech recognition; Telecommunication systems; Voice activity detection

Indexed keywords

DATA COMMUNICATION SYSTEMS; DATABASE SYSTEMS; LINGUISTICS; NETWORK PROTOCOLS; SIGNAL TO NOISE RATIO; SPEECH CODING; SPEECH RECOGNITION; TELECOMMUNICATION SYSTEMS;

EID: 20844456665     PISSN: 11108657     EISSN: None     Source Type: Journal    
DOI: 10.1155/ASP.2005.487     Document Type: Conference Paper
Times cited : (30)

References (24)
  • 2
    • 84904318363 scopus 로고    scopus 로고
    • Coding of speech at 8 kbit/s using conjugate structure algebraiccode-excited linear-prediction (CS-ACELP) Annex B: A silence compression scheme
    • ITU, "Coding of speech at 8 kbit/s using conjugate structure algebraiccode-excited linear-prediction (CS-ACELP) Annex B: A silence compression scheme," ITU Recommendation G.729, 1996.
    • (1996) ITU Recommendation G.729
  • 3
    • 0005703888 scopus 로고    scopus 로고
    • Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s. Annex A: Silence compression scheme
    • ITU, "Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s. Annex A: Silence compression scheme," ITU Recommendation G.723.1, 1996.
    • (1996) ITU Recommendation G.723.1
  • 5
    • 85009085054 scopus 로고    scopus 로고
    • A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm
    • Aalborg, Denmark, September
    • B. Kotnik, Z. Kacic, and B. Horvat, "A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm," in Proc. 7th European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01), pp. 197-200, Aalborg, Denmark, September 2001.
    • (2001) Proc. 7th European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01) , pp. 197-200
    • Kotnik, B.1    Kacic, Z.2    Horvat, B.3
  • 7
    • 0031103254 scopus 로고    scopus 로고
    • Variable-rate CELP based on subband flatness
    • S. McClellan and J. D. Gibson, "Variable-rate CELP based on subband flatness," IEEE Trans. Speech Audio Processing, vol. 5, no. 2, pp. 120-130, 1997.
    • (1997) IEEE Trans. Speech Audio Processing , vol.5 , Issue.2 , pp. 120-130
    • McClellan, S.1    Gibson, J.D.2
  • 9
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection" IEEE Signal Processing Lett., vol. 6, no. 1, pp. 1-3, 1999.
    • (1999) IEEE Signal Processing Lett. , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 11
    • 20844456677 scopus 로고    scopus 로고
    • Speech processing, transmission and quality aspects (STQ), distributed speech recognition, advanced front-end feature extraction algorithm, compression algorithm
    • ETSI, "Speech processing, transmission and quality aspects (STQ), distributed speech recognition, advanced front-end feature extraction algorithm, compression algorithm," ES 202 050 v1.1.1, 2002.
    • (2002) ES 202 050 V1.1.1
  • 12
    • 0038669544 scopus 로고    scopus 로고
    • The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions
    • Paris, France, September
    • H.-G. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions," in Proc. Automatic Speech Recognition: Challenges for the Next Millennium (ISCA ITRW ASR '00), pp. 181-188, Paris, France, September 2000.
    • (2000) Proc. Automatic Speech Recognition: Challenges for the Next Millennium (ISCA ITRW ASR '00) , pp. 181-188
    • Hirsch, H.-G.1    Pearce, D.2
  • 14
    • 0141811244 scopus 로고    scopus 로고
    • Enabling new speech driven services for mobile devices: An overview of the ETSI standards activities for distributed speech recognition front-ends
    • San Jose, Calif, USA, May
    • D. Pearce, "Enabling new speech driven services for mobile devices: an overview of the ETSI standards activities for distributed speech recognition front-ends," in Proc. Applied Voice Input/Output Society Conference (AVIOS '00), San Jose, Calif, USA, May 2000.
    • (2000) Proc. Applied Voice Input/Output Society Conference (AVIOS '00)
    • Pearce, D.1
  • 17
    • 20844436983 scopus 로고    scopus 로고
    • Speech processing, transmission and quality aspects (STQ), distributed speech recognition, front-end feature extraction algorithm, compression algorithm
    • ETSI, "Speech processing, transmission and quality aspects (STQ), distributed speech recognition, front-end feature extraction algorithm, compression algorithm," ES 201 108 v1.1.1, 2000.
    • (2000) ES 201 108 V1.1.1
  • 18
    • 0010535712 scopus 로고    scopus 로고
    • Transmission performance characteristics of pulse code modulation channels
    • ITU, "Transmission performance characteristics of pulse code modulation channels," ITU Recommendation G.712, 1996.
    • (1996) ITU Recommendation G.712
  • 23
    • 0037939793 scopus 로고    scopus 로고
    • Efficient noise robust feature extraction algorithms for distributed speech recognition (DSR) systems
    • B. Kotnik, D. Vlaj, and B. Horvat, "Efficient noise robust feature extraction algorithms for distributed speech recognition (DSR) systems," International Journal of Speech Technology, vol. 6, no. 3, pp. 205-219, 2003.
    • (2003) International Journal of Speech Technology , vol.6 , Issue.3 , pp. 205-219
    • Kotnik, B.1    Vlaj, D.2    Horvat, B.3
  • 24
    • 85009278999 scopus 로고    scopus 로고
    • Robust MFCC feature extraction algorithm using efficient additive and convolutional noise reduction procedures
    • Denver, Colo, USA, September
    • B. Kotnik, D. Vlaj, Z. Kačič, and B. Horvat, "Robust MFCC feature extraction algorithm using efficient additive and convolutional noise reduction procedures," in Proc. International Conf. on Spoken Language Processing (ICSLP '02), pp. 445-448, Denver, Colo, USA, September 2002.
    • (2002) Proc. International Conf. on Spoken Language Processing (ICSLP '02) , pp. 445-448
    • Kotnik, B.1    Vlaj, D.2    Kačič, Z.3    Horvat, B.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.