SCOPUS Information Search Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 16, Issue 4, 2008, Pages 859-873

Histogram-based quantization for robust and/or distributed speech recognition

(2) Wan, Chia Yu ᵃ; Lee, Lin Shan ᵃ

ᵃ NATIONAL TAIWAN UNIVERSITY (Taiwan)

Author keywords

Error compensation; Robustness; Speech recognition; Vector quantization (VQ)

Indexed keywords

CONVENTIONAL APPROACHES; DISTRIBUTED SPEECH RECOGNITION; ENVIRONMENTAL NOISE; ERROR CONCEALMENTS; FEATURE TRANSFORMATIONS; JOINT UNCERTAINTIES; ORDER STATISTICS; PARTITION CELLS; PERFORMANCE IMPROVEMENTS; QUANTIZATION DISTORTIONS; QUANTIZATION ERRORS; RECOGNITION ACCURACIES; ROBUST SPEECH RECOGNITION; ROBUSTNESS; SPEECH FEATURES; TESTING ENVIRONMENTS; TRANSMISSION ERRORS;

DECODING; ERROR COMPENSATION; ERROR DETECTION; SPEECH ANALYSIS; VECTOR QUANTIZATION;

SPEECH RECOGNITION;

EID: 64849105676 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2008.920891 Document Type: Article

Times cited: (16)

References (35)

1
- 0002630527
- Quantization of cepstral parameters for speech recognition over the world wide web
- Jan
- V. Digalakis, L. Neumeyer, and M. Perakakis, "Quantization of cepstral parameters for speech recognition over the world wide web," IEEE Select. Areas Commun., vol. 17, no. 1, pp. 82-90, Jan. 1999.
- (1999) IEEE Select. Areas Commun , vol.17 , Issue.1 , pp. 82-90
- Digalakis, V.¹ Neumeyer, L.² Perakakis, M.³

2
- 85009080589
- Scalable distributed speech recognition using multi-frame GMM-based block quantization
- CD-ROM
- K. K. Paliwal and S. So, "Scalable distributed speech recognition using multi-frame GMM-based block quantization," in Proc. ICSLP, 2004, CD-ROM.
- (2004) Proc. ICSLP
- Paliwal, K.K.¹ So, S.²

3
- 4544351496
- Extended cluster information vector quantization (ECI-VQ) for robust classification
- May
- J. A. Arrowood and M. Clements, "Extended cluster information vector quantization (ECI-VQ) for robust classification," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., May 2004, pp. 889-892.
- (2004) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 889-892
- Arrowood, J.A.¹ Clements, M.²

4
- 64849107842
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Extended Advanced Front-End Feature Extraction Algorithm; Compression Algorithms; Back-End Speech Reconstruction Algorithm, Nov. 2003, ETSI Std. ES 202 212 V1.1.1 Rec.

5
- 0141702076
- Low bit-rate feature vector compression using transform coding and non-uniform bit allocation
- Apr
- B. Milner and X. Shao, "Low bit-rate feature vector compression using transform coding and non-uniform bit allocation," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., Apr. 2003, pp. 129-132.
- (2003) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 129-132
- Milner, B.¹ Shao, X.²

6
- 0034841726
- An efficient and scalable 2D-DCT based feature coding scheme for remote speech recognition
- Q. Zhu and A. Alwan, "An efficient and scalable 2D-DCT based feature coding scheme for remote speech recognition," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., 2001, pp. 113-116.
- (2001) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 113-116
- Zhu, Q.¹ Alwan, A.²

7
- 4544321132
- Efficient and robust distributed speech recognition (DSR) over wireless fading channels: 2D-DCT compression, iterative bit allocation, short BCH code and interleaving
- W.-H. Hsu and L.-S. Lee, "Efficient and robust distributed speech recognition (DSR) over wireless fading channels: 2D-DCT compression, iterative bit allocation, short BCH code and interleaving," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., 2004, pp. 69-72.
- (2004) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 69-72
- Hsu, W.-H.¹ Lee, L.-S.²

8
- 84962800155
- Histogram based normalization in the acoustic feature space
- S. Molau, M. Pitz, and H. Ney, "Histogram based normalization in the acoustic feature space," in Proc. ASRU, 2001, pp. 21-24.
- (2001) Proc. ASRU , pp. 21-24
- Molau, S.¹ Pitz, M.² Ney, H.³

9
- 18744371585
- Histogram equalization of speech representation for robust speech recognition
- May
- A. de la Torre, A. M. Peinado, J. C. Segura, J. L. Perez-Cordoba, M. C. Benitez, and A. J. Rubio, "Histogram equalization of speech representation for robust speech recognition," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 355-366, May 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.3 , pp. 355-366
- de la Torre, A.¹ Peinado, A.M.² Segura, J.C.³ Perez-Cordoba, J.L.⁴ Benitez, M.C.⁵ Rubio, A.J.⁶

10
- 2442477604
- Gaussianization
- S. Chen and R. Gopinath, "Gaussianization," Proc. Neural Inf. Process. Syst., pp. 423-429, 2000.
- (2000) Proc. Neural Inf. Process. Syst , pp. 423-429
- Chen, S.¹ Gopinath, R.²

11
- 56149094127
- Robust feature vector compression algorithm for distributed speech recognition
- I. Kiss and P. Kapanen, "Robust feature vector compression algorithm for distributed speech recognition," in Proc. Eurospeech, 1999, pp. 2183-2186.
- (1999) Proc. Eurospeech , pp. 2183-2186
- Kiss, I.¹ Kapanen, P.²

12
- 0036291376
- Uncertainty decoding with SPLICE for noise robust speech recognition
- J. Droppo, A. Acero, and L. Deng, "Uncertainty decoding with SPLICE for noise robust speech recognition," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., 2002, pp. 57-60.
- (2002) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 57-60
- Droppo, J.¹ Acero, A.² Deng, L.³

13
- 85009067687
- Using observation uncertainty in HMM decoding
- J. A. Arrowood and M. A. Clements, "Using observation uncertainty in HMM decoding," in Proc. ICSLP, 2002, pp. 1561-1564.
- (2002) Proc. ICSLP , pp. 1561-1564
- Arrowood, J.A.¹ Clements, M.A.²

14
- 33745202806
- Joint uncertainty decoding for noise robust speech recognition
- H. Liao and M. J. F. Gales, "Joint uncertainty decoding for noise robust speech recognition," in Proc. Eurospeech, 2005, pp. 3129-3132.
- (2005) Proc. Eurospeech , pp. 3129-3132
- Liao, H.¹ Gales, M.J.F.²

15
- 33744969526
- Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition
- Jan
- N. B. Yoma, C. Molina, J. Silva, and C. Busso, "Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition," IEEE Trans. Speech Audio Process., vol. 14, no. 1, pp. 246-255, Jan. 2006.
- (2006) IEEE Trans. Speech Audio Process , vol.14 , Issue.1 , pp. 246-255
- Yoma, N.B.¹ Molina, C.² Silva, J.³ Busso, C.⁴

16
- 0036880137
- Graceful degradation of speech recognition performance over packet-erasure networks
- Nov
- C. Boulis, M. Ostendorf, E. A. Riskin, and S. Otterson, "Graceful degradation of speech recognition performance over packet-erasure networks," IEEE Trans. Speech Audio Process., vol. 10, no. 8, pp. 580-590, Nov. 2002.
- (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.8 , pp. 580-590
- Boulis, C.¹ Ostendorf, M.² Riskin, E.A.³ Otterson, S.⁴

17
- 19944385270
- Efficient MMSE-based channel error mitigation techniques application to distributed speech recognition over wireless channels
- Jan
- A. M. Peinado, V. Sanchez, J. L. Perez-Cordoba, and A. J. Rubio, "Efficient MMSE-based channel error mitigation techniques application to distributed speech recognition over wireless channels," IEEE Trans. Wireless Commun., vol. 4, no. 1, pp. 14-19, Jan. 2005.
- (2005) IEEE Trans. Wireless Commun , vol.4 , Issue.1 , pp. 14-19
- Peinado, A.M.¹ Sanchez, V.² Perez-Cordoba, J.L.³ Rubio, A.J.⁴

18
- 0036880073
- Low-bitrate distributed speech recognition for packet-based and wireless communication
- Nov
- A. Bernard and A. Alwan, "Low-bitrate distributed speech recognition for packet-based and wireless communication," IEEE Trans. Speech Audio Process., vol. 10, no. 8, pp. 570-579, Nov. 2002.
- (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.8 , pp. 570-579
- Bernard, A.¹ Alwan, A.²

19
- 4544282401
- Soft decoding strategies for distributed speech recognition over IP networks
- May
- A. Cardenal-Lopez, L. Docio-Fernandez, and C. Garcia-Mateo, "Soft decoding strategies for distributed speech recognition over IP networks," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., May 2004, pp. 49-52.
- (2004) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 49-52
- Cardenal-Lopez, A.¹ Docio-Fernandez, L.² Garcia-Mateo, C.³

20
- 33745192800
- A unified probabilistic approach to error concealment for distributed speech recognition
- Sep
- V. Ion and R. Haeb-Umbach, "A unified probabilistic approach to error concealment for distributed speech recognition," in Proc. Interspeech, Sep. 2005, pp. 2853-2856.
- (2005) Proc. Interspeech , pp. 2853-2856
- Ion, V.¹ Haeb-Umbach, R.²

21
- 4544353461
- A subvector based error concealment algorithm for speech recognition over mobile networks
- May
- Z.-H. Tan, P. Dalsgaard, and B. Lindberg, "A subvector based error concealment algorithm for speech recognition over mobile networks," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., May 2004, pp. 57-60.
- (2004) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 57-60
- Tan, Z.-H.¹ Dalsgaard, P.² Lindberg, B.³

22
- 0038669544
- The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions
- Sep
- H. G. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions," in Proc. ISCA ITRW ASR2000, Sep. 2000, pp. 181-188.
- (2000) Proc. ISCA ITRW ASR2000 , pp. 181-188
- Hirsch, H.G.¹ Pearce, D.²

23
- 33745188919
- Histogram-based quantization (HQ) for robust and scalable distributed speech recognition
- Sep
- C.-Y. Wan and L.-S. Lee, "Histogram-based quantization (HQ) for robust and scalable distributed speech recognition," in Proc. Interspeech, Sep. 2005, pp. 957-960.
- (2005) Proc. Interspeech , pp. 957-960
- Wan, C.-Y.¹ Lee, L.-S.²

24
- 0020102027
- Least squares quantization in PCM
- Mar
- S. P. Lloyd, "Least squares quantization in PCM," IEEE Trans. Inf. Theory, vol. 28, no. 2, pp. 129-137, Mar. 1982.
- (1982) IEEE Trans. Inf. Theory , vol.28 , Issue.2 , pp. 129-137
- Lloyd, S.P.¹

25
- 84937350296
- Quantizing for minimum distortion
- Mar
- J. Max, "Quantizing for minimum distortion," IEEE Trans. Inf. Theory, vol. 6, no. 1, pp. 7-12, Mar. 1960.
- (1960) IEEE Trans. Inf. Theory , vol.6 , Issue.1 , pp. 7-12
- Max, J.¹

26
- 0018918171
- An algorithm for vector quantizer design
- Jan
- Y. Linde, A. Buzo, and R. Gray, "An algorithm for vector quantizer design," IEEE Trans. Commun., vol. 28, no. 1, pp. 84-95, Jan. 1980.
- (1980) IEEE Trans. Commun , vol.28 , Issue.1 , pp. 84-95
- Linde, Y.¹ Buzo, A.² Gray, R.³

27
- 85009129589
- Quantile-based histogram equalization for noise robust speech recognition
- F. Hilger and H. Ney, "Quantile-based histogram equalization for noise robust speech recognition," in Proc. Eurospeech, 2001, pp. 1135-1138.
- (2001) Proc. Eurospeech , pp. 1135-1138
- Hilger, F.¹ Ney, H.²

28
- 33947669949
- Joint uncertainty decoding (JUD) with histogram-based quantization (HQ) for robust and/or distributed speech recognition
- May
- C.-Y. Wan and L.-S. Lee, "Joint uncertainty decoding (JUD) with histogram-based quantization (HQ) for robust and/or distributed speech recognition," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., May 2006, pp. 125-128.
- (2006) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 125-128
- Wan, C.-Y.¹ Lee, L.-S.²

29
- 34547522655
- Three-stage error concealment for distributed speech recognition (DSR) with histogram-based quantization (HQ) under noisy environment
- Apr
- C.-Y. Wan, Y. Chen, and L.-S. Lee, "Three-stage error concealment for distributed speech recognition (DSR) with histogram-based quantization (HQ) under noisy environment," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., Apr. 2007, pp. 877-880.
- (2007) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 877-880
- Wan, C.-Y.¹ Chen, Y.² Lee, L.-S.³

30
- 33744996004
- Robust speech recognition over mobile and IP networks in burst-like packet loss
- Jan
- B. Milner and A. James, "Robust speech recognition over mobile and IP networks in burst-like packet loss," IEEE Trans. Speech Audio Process., vol. 14, no. 1, pp. 223-231, Jan. 2006.
- (2006) IEEE Trans. Speech Audio Process , vol.14 , Issue.1 , pp. 223-231
- Milner, B.¹ James, A.²

31
- 64849114398
- Subjective Performance Assessment of Telephone-Band and Wideband Digital Codecs, Annex D: Modified IRS Send and Receive Characteristics, Feb. 1996, ITU-T Std. ITU-T Rec. P.830.

32
- 0037766930
- Receiver design and simulation analysis of GPRS physical layer
- M.S. thesis, National Taiwan Univ., Taipei
- J.-H. Chen, "Receiver design and simulation analysis of GPRS physical layer," M.S. thesis, National Taiwan Univ., Taipei, 2001.
- (2001)
- Chen, J.-H.¹

33
- 42549139762
- MVA processing of speech features
- Jan
- C.-P. Chen and J. A. Bilmes, "MVA processing of speech features," IEEE Trans. Speech Audio Process., vol. 15, no. 1, pp. 257-270, Jan. 2007.
- (2007) IEEE Trans. Speech Audio Process , vol.15 , Issue.1 , pp. 257-270
- Chen, C.-P.¹ Bilmes, J.A.²

34
- 34047247200
- Optimization of temporal filters for constructing robust features in speech recognition
- May
- J.-W. Hung and L.-S. Lee, "Optimization of temporal filters for constructing robust features in speech recognition," IEEE Trans. Speech Audio Process., vol. 14, no. 3, pp. 808-832, May 2006.
- (2006) IEEE Trans. Speech Audio Process , vol.14 , Issue.3 , pp. 808-832
- Hung, J.-W.¹ Lee, L.-S.²

35
- 84946730259
- Trap-tandem: Data-driven extraction of temporal features from speech
- H. Hermansky, "Trap-tandem: Data-driven extraction of temporal features from speech," in Proc. ASRU, 2003, pp. 255-260.
- (2003) Proc. ASRU , pp. 255-260
- Hermansky, H.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.