SCOPUS 정보 검색 플랫폼

Eurasip Journal on Applied Signal Processing

Volumn 2005, Issue 4, 2005, Pages 487-497

A computationally efficient mel-filter bank VAD algorithm for distributed speech recognition systems

(4) Vlaj, Damjan a Kotnik, Bojan a Horvat, Bogomir a Kačič, Zdravko a

a UNIVERSITY OF MARIBOR (Slovenia)

Author keywords

Distributed speech recognition; Telecommunication systems; Voice activity detection

Indexed keywords

DATA COMMUNICATION SYSTEMS; DATABASE SYSTEMS; LINGUISTICS; NETWORK PROTOCOLS; SIGNAL TO NOISE RATIO; SPEECH CODING; SPEECH RECOGNITION; TELECOMMUNICATION SYSTEMS;

AUTOMATIC SPEECH RECOGNITION (ASR); DISTRIBUTED SPEECH RECOGNITION; MEL-FILTER BANKS (MFB); VOICE ACTIVITY DETECTION (VAD);

ALGORITHMS;

EID: 20844456665 PISSN: 11108657 EISSN: None Source Type: Journal
DOI: 10.1155/ASP.2005.487 Document Type: Conference Paper

Times cited : (30)

References (24)

1
- 20844448463
- Dealing with noisy speech and channel distortions
- chapter 5, Kluwer Academic Publishers, Norwell, Mass, USA
- J. C. Junqua and J. P. Haton, "Dealing with noisy speech and channel distortions," in Robustness in Automatic Speech Recognition: Fundamentals and Applications, chapter 5, pp. 155-189, Kluwer Academic Publishers, Norwell, Mass, USA, 1996.
- (1996) Robustness in Automatic Speech Recognition: Fundamentals and Applications , pp. 155-189
- Junqua, J.C.¹ Haton, J.P.²

2
- 84904318363
- Coding of speech at 8 kbit/s using conjugate structure algebraiccode-excited linear-prediction (CS-ACELP) Annex B: A silence compression scheme
- ITU, "Coding of speech at 8 kbit/s using conjugate structure algebraiccode-excited linear-prediction (CS-ACELP) Annex B: A silence compression scheme," ITU Recommendation G.729, 1996.
- (1996) ITU Recommendation G.729

3
- 0005703888
- Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s. Annex A: Silence compression scheme
- ITU, "Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s. Annex A: Silence compression scheme," ITU Recommendation G.723.1, 1996.
- (1996) ITU Recommendation G.723.1

4
- 0024934078
- The voice activity detector for the Pan-European digital cellular mobile telephone service
- Glasgow, UK, May
- D. K. Freeman, G. Cosier, C. B. Southcott, and I. Boyd, "The voice activity detector for the Pan-European digital cellular mobile telephone service," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP '89), vol. 1, pp. 369-372, Glasgow, UK, May 1989.
- (1989) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP '89) , vol.1 , pp. 369-372
- Freeman, D.K.¹ Cosier, G.² Southcott, C.B.³ Boyd, I.⁴

5
- 85009085054
- A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm
- Aalborg, Denmark, September
- B. Kotnik, Z. Kacic, and B. Horvat, "A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm," in Proc. 7th European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01), pp. 197-200, Aalborg, Denmark, September 2001.
- (2001) Proc. 7th European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01) , pp. 197-200
- Kotnik, B.¹ Kacic, Z.² Horvat, B.³

6
- 4544305658
- A voice activity detector based on cepstral analysis
- Berlin, Germany, September
- J. Haigh and J. S. Mason, "A voice activity detector based on cepstral analysis," in Proc. 3rd European Conference on Speech Communication and Technology (ISCA EUROSPEECH '93), pp. 1103-1106, Berlin, Germany, September 1993.
- (1993) Proc. 3rd European Conference on Speech Communication and Technology (ISCA EUROSPEECH '93) , pp. 1103-1106
- Haigh, J.¹ Mason, J.S.²

7
- 0031103254
- Variable-rate CELP based on subband flatness
- S. McClellan and J. D. Gibson, "Variable-rate CELP based on subband flatness," IEEE Trans. Speech Audio Processing, vol. 5, no. 2, pp. 120-130, 1997.
- (1997) IEEE Trans. Speech Audio Processing , vol.5 , Issue.2 , pp. 120-130
- McClellan, S.¹ Gibson, J.D.²

8
- 85009078216
- Entropy based voice activity detection in very noisy conditions
- Aalborg, Denmark, September
- P. Renevey and A. Drygajlo, "Entropy based voice activity detection in very noisy conditions," in Proc. European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01), pp. 1887-1890, Aalborg, Denmark, September 2001.
- (2001) Proc. European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01) , pp. 1887-1890
- Renevey, P.¹ Drygajlo, A.²

9
- 0032762471
- A statistical model-based voice activity detection
- J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection" IEEE Signal Processing Lett., vol. 6, no. 1, pp. 1-3, 1999.
- (1999) IEEE Signal Processing Lett. , vol.6 , Issue.1 , pp. 1-3
- Sohn, J.¹ Kim, N.S.² Sung, W.³

10
- 84870971026
- Voice activity detection in noisy environments
- Aalborg, Denmark, September
- J. Stadermann, V. Stahl, and G. Rose, "Voice activity detection in noisy environments," in Proc. European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01), pp. 1851-1854, Aalborg, Denmark, September 2001.
- (2001) Proc. European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01) , pp. 1851-1854
- Stadermann, J.¹ Stahl, V.² Rose, G.³

11
- 20844456677
- Speech processing, transmission and quality aspects (STQ), distributed speech recognition, advanced front-end feature extraction algorithm, compression algorithm
- ETSI, "Speech processing, transmission and quality aspects (STQ), distributed speech recognition, advanced front-end feature extraction algorithm, compression algorithm," ES 202 050 v1.1.1, 2002.
- (2002) ES 202 050 V1.1.1

12
- 0038669544
- The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions
- Paris, France, September
- H.-G. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions," in Proc. Automatic Speech Recognition: Challenges for the Next Millennium (ISCA ITRW ASR '00), pp. 181-188, Paris, France, September 2000.
- (2000) Proc. Automatic Speech Recognition: Challenges for the Next Millennium (ISCA ITRW ASR '00) , pp. 181-188
- Hirsch, H.-G.¹ Pearce, D.²

13
- 0033677004
- Robust speech recognition over IP networks
- Istanbul, Turkey, June
- B. Milner and S. Semnani, "Robust speech recognition over IP networks," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP '00), vol. 3, pp. 1791-1794, Istanbul, Turkey, June 2000.
- (2000) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP '00) , vol.3 , pp. 1791-1794
- Milner, B.¹ Semnani, S.²

14
- 0141811244
- Enabling new speech driven services for mobile devices: An overview of the ETSI standards activities for distributed speech recognition front-ends
- San Jose, Calif, USA, May
- D. Pearce, "Enabling new speech driven services for mobile devices: an overview of the ETSI standards activities for distributed speech recognition front-ends," in Proc. Applied Voice Input/Output Society Conference (AVIOS '00), San Jose, Calif, USA, May 2000.
- (2000) Proc. Applied Voice Input/Output Society Conference (AVIOS '00)
- Pearce, D.¹

15
- 12444343661
- The design of mobile multimodal communication device-personal navigator
- Bratislava, Slovakia, July
- B. Kotnik, T. Rotovnik, Z. Kačič, B. Horvat, and I. Kramberger, "The design of mobile multimodal communication device-personal navigator," in Proc. International Conference on Trends in Communications (EUROCON '01), vol. 2, pp. 337-340, Bratislava, Slovakia, July 2001.
- (2001) Proc. International Conference on Trends in Communications (EUROCON '01) , vol.2 , pp. 337-340
- Kotnik, B.¹ Rotovnik, T.² Kačič, Z.³ Horvat, B.⁴ Kramberger, I.⁵

16
- 0033676788
- The study on distributed speech recognition system
- Istanbul, Turkey, June
- W. Zhang, L. He, Y. Chow, R. Yang, and Y. Su, "The study on distributed speech recognition system," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP '00), vol. 3, pp. 1431-1434, Istanbul, Turkey, June 2000.
- (2000) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP '00) , vol.3 , pp. 1431-1434
- Zhang, W.¹ He, L.² Chow, Y.³ Yang, R.⁴ Su, Y.⁵

17
- 20844436983
- Speech processing, transmission and quality aspects (STQ), distributed speech recognition, front-end feature extraction algorithm, compression algorithm
- ETSI, "Speech processing, transmission and quality aspects (STQ), distributed speech recognition, front-end feature extraction algorithm, compression algorithm," ES 201 108 v1.1.1, 2000.
- (2000) ES 201 108 V1.1.1

18
- 0010535712
- Transmission performance characteristics of pulse code modulation channels
- ITU, "Transmission performance characteristics of pulse code modulation channels," ITU Recommendation G.712, 1996.
- (1996) ITU Recommendation G.712

19
- 84913604217
- TSGSM03.50, 3.4.0
- ETSI-SMG, "European digital cellular telecommunication system (phase 1) - transmission planning aspects for the speech service in GSM PLMN system," TSGSM03.50, 3.4.0, 1994.
- (1994) European Digital Cellular Telecommunication System (Phase 1) - Transmission Planning Aspects for the Speech Service in GSM PLMN System

20
- 0003571977
- Microsoft Corporation, Redmond, Wash, USA
- S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book (Version 3.0), Microsoft Corporation, Redmond, Wash, USA, 2000.
- (2000) The HTK Book (Version 3.0)
- Young, S.¹ Kershaw, D.² Odell, J.³ Ollason, D.⁴ Valtchev, V.⁵ Woodland, P.⁶

21
- 85009152845
- Recognition performance of the siemens front-end with and without frame dropping on the aurora 2 database
- Aalborg, Denmark, September
- B. Andrassy, D. Vlaj, and C. Beaugeant, "Recognition performance of the siemens front-end with and without frame dropping on the aurora 2 database," in Proc. 7th European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01), pp. 193-196, Aalborg, Denmark, September 2001.
- (2001) Proc. 7th European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01) , pp. 193-196
- Andrassy, B.¹ Vlaj, D.² Beaugeant, C.³

22
- 70249086510
- Robust ASR front-end using spectral-based and discriminant features: Experiments on the Aurora tasks
- Aalborg, Denmark, September
- C. Benitez, L. Burget, B. Chen, et al., "Robust ASR front-end using spectral-based and discriminant features: experiments on the Aurora tasks," in Proc. 7th European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01), pp. 429-432, Aalborg, Denmark, September 2001.
- (2001) Proc. 7th European Conference on Speech Communication and Technology (ISCA EUROSPEECH '01) , pp. 429-432
- Benitez, C.¹ Burget, L.² Chen, B.³

23
- 0037939793
- Efficient noise robust feature extraction algorithms for distributed speech recognition (DSR) systems
- B. Kotnik, D. Vlaj, and B. Horvat, "Efficient noise robust feature extraction algorithms for distributed speech recognition (DSR) systems," International Journal of Speech Technology, vol. 6, no. 3, pp. 205-219, 2003.
- (2003) International Journal of Speech Technology , vol.6 , Issue.3 , pp. 205-219
- Kotnik, B.¹ Vlaj, D.² Horvat, B.³

24
- 85009278999
- Robust MFCC feature extraction algorithm using efficient additive and convolutional noise reduction procedures
- Denver, Colo, USA, September
- B. Kotnik, D. Vlaj, Z. Kačič, and B. Horvat, "Robust MFCC feature extraction algorithm using efficient additive and convolutional noise reduction procedures," in Proc. International Conf. on Spoken Language Processing (ICSLP '02), pp. 445-448, Denver, Colo, USA, September 2002.
- (2002) Proc. International Conf. on Spoken Language Processing (ICSLP '02) , pp. 445-448
- Kotnik, B.¹ Vlaj, D.² Kačič, Z.³ Horvat, B.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.