Volume 16, Issue 4, 2008, Pages 859-873

Histogram-based quantization for robust and/or distributed speech recognition

Author keywords

Error compensation; Robustness; Speech recognition; Vector quantization (VQ)

Indexed keywords

CONVENTIONAL APPROACHES; DISTRIBUTED SPEECH RECOGNITION; ENVIRONMENTAL NOISE; ERROR CONCEALMENTS; FEATURE TRANSFORMATIONS; JOINT UNCERTAINTIES; ORDER STATISTICS; PARTITION CELLS; PERFORMANCE IMPROVEMENTS; QUANTIZATION DISTORTIONS; QUANTIZATION ERRORS; RECOGNITION ACCURACIES; ROBUST SPEECH RECOGNITION; ROBUSTNESS; SPEECH FEATURES; TESTING ENVIRONMENTS; TRANSMISSION ERRORS;

EID: 64849105676     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.920891     Document Type: Article
Times cited: 16

References (35)
  • 1
    • 0002630527
    • Quantization of cepstral parameters for speech recognition over the world wide web
    • Jan
    • V. Digalakis, L. Neumeyer, and M. Perakakis, "Quantization of cepstral parameters for speech recognition over the world wide web," IEEE J. Select. Areas Commun., vol. 17, no. 1, pp. 82-90, Jan. 1999.
    • (1999) IEEE J. Select. Areas Commun , vol.17 , Issue.1 , pp. 82-90
    • Digalakis, V.1    Neumeyer, L.2    Perakakis, M.3
  • 2
    • 85009080589
    • Scalable distributed speech recognition using multi-frame GMM-based block quantization
    • CD-ROM
    • K. K. Paliwal and S. So, "Scalable distributed speech recognition using multi-frame GMM-based block quantization," in Proc. ICSLP, 2004, CD-ROM.
    • (2004) Proc. ICSLP
    • Paliwal, K.K.1    So, S.2
  • 3
    • 4544351496
    • Extended cluster information vector quantization (ECI-VQ) for robust classification
    • May
    • J. A. Arrowood and M. Clements, "Extended cluster information vector quantization (ECI-VQ) for robust classification," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., May 2004, pp. 889-892.
    • (2004) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 889-892
    • Arrowood, J.A.1    Clements, M.2
  • 4
    • 64849107842
    • Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Extended Advanced Front-End Feature Extraction Algorithm; Compression Algorithms; Back-End Speech Reconstruction Algorithm, Nov. 2003, ETSI Std. ES 202 212 V1.1.1 Rec.
  • 5
    • 0141702076
    • Low bit-rate feature vector compression using transform coding and non-uniform bit allocation
    • Apr
    • B. Milner and X. Shao, "Low bit-rate feature vector compression using transform coding and non-uniform bit allocation," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., Apr. 2003, pp. 129-132.
    • (2003) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 129-132
    • Milner, B.1    Shao, X.2
  • 6
    • 0034841726
    • An efficient and scalable 2D-DCT based feature coding scheme for remote speech recognition
    • Q. Zhu and A. Alwan, "An efficient and scalable 2D-DCT based feature coding scheme for remote speech recognition," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., 2001, pp. 113-116.
    • (2001) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 113-116
    • Zhu, Q.1    Alwan, A.2
  • 7
    • 4544321132
    • Efficient and robust distributed speech recognition (DSR) over wireless fading channels: 2D-DCT compression, iterative bit allocation, short BCH code and interleaving
    • W.-H. Hsu and L.-S. Lee, "Efficient and robust distributed speech recognition (DSR) over wireless fading channels: 2D-DCT compression, iterative bit allocation, short BCH code and interleaving," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., 2004, pp. 69-72.
    • (2004) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 69-72
    • Hsu, W.-H.1    Lee, L.-S.2
  • 8
    • 84962800155
    • Histogram based normalization in the acoustic feature space
    • S. Molau, M. Pitz, and H. Ney, "Histogram based normalization in the acoustic feature space," in Proc. ASRU, 2001, pp. 21-24.
    • (2001) Proc. ASRU , pp. 21-24
    • Molau, S.1    Pitz, M.2    Ney, H.3
  • 11
    • 56149094127
    • Robust feature vector compression algorithm for distributed speech recognition
    • I. Kiss and P. Kapanen, "Robust feature vector compression algorithm for distributed speech recognition," in Proc. Eurospeech, 1999, pp. 2183-2186.
    • (1999) Proc. Eurospeech , pp. 2183-2186
    • Kiss, I.1    Kapanen, P.2
  • 13
    • 85009067687
    • Using observation uncertainty in HMM decoding
    • J. A. Arrowood and M. A. Clements, "Using observation uncertainty in HMM decoding," in Proc. ICSLP, 2002, pp. 1561-1564.
    • (2002) Proc. ICSLP , pp. 1561-1564
    • Arrowood, J.A.1    Clements, M.A.2
  • 14
    • 33745202806
    • Joint uncertainty decoding for noise robust speech recognition
    • H. Liao and M. J. F. Gales, "Joint uncertainty decoding for noise robust speech recognition," in Proc. Eurospeech, 2005, pp. 3129-3132.
    • (2005) Proc. Eurospeech , pp. 3129-3132
    • Liao, H.1    Gales, M.J.F.2
  • 15
    • 33744969526
    • Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition
    • Jan
    • N. B. Yoma, C. Molina, J. Silva, and C. Busso, "Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition," IEEE Trans. Speech Audio Process., vol. 14, no. 1, pp. 246-255, Jan. 2006.
    • (2006) IEEE Trans. Speech Audio Process , vol.14 , Issue.1 , pp. 246-255
    • Yoma, N.B.1    Molina, C.2    Silva, J.3    Busso, C.4
  • 16
    • 0036880137
    • Graceful degradation of speech recognition performance over packet-erasure networks
    • Nov
    • C. Boulis, M. Ostendorf, E. A. Riskin, and S. Otterson, "Graceful degradation of speech recognition performance over packet-erasure networks," IEEE Trans. Speech Audio Process., vol. 10, no. 8, pp. 580-590, Nov. 2002.
    • (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.8 , pp. 580-590
    • Boulis, C.1    Ostendorf, M.2    Riskin, E.A.3    Otterson, S.4
  • 17
    • 19944385270
    • Efficient MMSE-based channel error mitigation techniques application to distributed speech recognition over wireless channels
    • Jan
    • A. M. Peinado, V. Sanchez, J. L. Perez-Cordoba, and A. J. Rubio, "Efficient MMSE-based channel error mitigation techniques application to distributed speech recognition over wireless channels," IEEE Trans. Wireless Commun., vol. 4, no. 1, pp. 14-19, Jan. 2005.
    • (2005) IEEE Trans. Wireless Commun , vol.4 , Issue.1 , pp. 14-19
    • Peinado, A.M.1    Sanchez, V.2    Perez-Cordoba, J.L.3    Rubio, A.J.4
  • 18
    • 0036880073
    • Low-bitrate distributed speech recognition for packet-based and wireless communication
    • Nov
    • A. Bernard and A. Alwan, "Low-bitrate distributed speech recognition for packet-based and wireless communication," IEEE Trans. Speech, Audio Process., vol. 10, no. 8, pp. 570-579, Nov. 2002.
    • (2002) IEEE Trans. Speech, Audio Process , vol.10 , Issue.8 , pp. 570-579
    • Bernard, A.1    Alwan, A.2
  • 20
    • 33745192800
    • A unified probabilistic approach to error concealment for distributed speech recognition
    • Sep
    • V. Ion and R. Haeb-Umbach, "A unified probabilistic approach to error concealment for distributed speech recognition," in Proc. Interspeech, Sep. 2005, pp. 2853-2856.
    • (2005) Proc. Interspeech , pp. 2853-2856
    • Ion, V.1    Haeb-Umbach, R.2
  • 22
    • 0038669544
    • The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions
    • Sep
    • H. G. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions," in Proc. ISCA ITRW ASR2000, Sep. 2000, pp. 181-188.
    • (2000) Proc. ISCA ITRW ASR2000 , pp. 181-188
    • Hirsch, H.G.1    Pearce, D.2
  • 23
    • 33745188919
    • Histogram-based quantization (HQ) for robust and scalable distributed speech recognition
    • Sep
    • C.-Y. Wan and L.-S. Lee, "Histogram-based quantization (HQ) for robust and scalable distributed speech recognition," in Proc. Interspeech, Sep. 2005, pp. 957-960.
    • (2005) Proc. Interspeech , pp. 957-960
    • Wan, C.-Y.1    Lee, L.-S.2
  • 24
    • 0020102027
    • Least squares quantization in PCM
    • Mar
    • S. P. Lloyd, "Least squares quantization in PCM," IEEE Trans. Inf. Theory, vol. 28, no. 2, pp. 129-137, Mar. 1982.
    • (1982) IEEE Trans. Inf. Theory , vol.28 , Issue.2 , pp. 129-137
    • Lloyd, S.P.1
  • 25
    • 84937350296
    • Quantizing for minimum distortion
    • Mar
    • J. Max, "Quantizing for minimum distortion," IEEE Trans. Inf. Theory, vol. 6, no. 1, pp. 7-12, Mar. 1960.
    • (1960) IEEE Trans. Inf. Theory , vol.6 , Issue.1 , pp. 7-12
    • Max, J.1
  • 26
    • 0018918171
    • An algorithm for vector quantizer design
    • Jan
    • Y. Linde, A. Buzo, and R. Gray, "An algorithm for vector quantizer design," IEEE Trans. Commun., vol. 28, no. 1, pp. 84-95, Jan. 1980.
    • (1980) IEEE Trans. Commun , vol.28 , Issue.1 , pp. 84-95
    • Linde, Y.1    Buzo, A.2    Gray, R.3
  • 27
    • 85009129589
    • Quantile-based histogram equalization for noise robust speech recognition
    • F. Hilger and H. Ney, "Quantile-based histogram equalization for noise robust speech recognition," in Proc. Eurospeech, 2001, pp. 1135-1138.
    • (2001) Proc. Eurospeech , pp. 1135-1138
    • Hilger, F.1    Ney, H.2
  • 28
    • 33947669949
    • Joint uncertainty decoding (JUD) with histogram-based quantization (HQ) for robust and/or distributed speech recognition
    • May
    • C.-Y. Wan and L.-S. Lee, "Joint uncertainty decoding (JUD) with histogram-based quantization (HQ) for robust and/or distributed speech recognition," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., May 2006, pp. 125-128.
    • (2006) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 125-128
    • Wan, C.-Y.1    Lee, L.-S.2
  • 29
    • 34547522655
    • Three-stage error concealment for distributed speech recognition (DSR) with histogram-based quantization (HQ) under noisy environment
    • Apr
    • C.-Y. Wan, Y. Chen, and L.-S. Lee, "Three-stage error concealment for distributed speech recognition (DSR) with histogram-based quantization (HQ) under noisy environment," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., Apr. 2007, pp. 877-880.
    • (2007) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process , pp. 877-880
    • Wan, C.-Y.1    Chen, Y.2    Lee, L.-S.3
  • 30
    • 33744996004
    • Robust speech recognition over mobile and IP networks in burst-like packet loss
    • Jan
    • B. Milner and A. James, "Robust speech recognition over mobile and IP networks in burst-like packet loss," IEEE Trans. Speech Audio Process., vol. 14, no. 1, pp. 223-231, Jan. 2006.
    • (2006) IEEE Trans. Speech Audio Process , vol.14 , Issue.1 , pp. 223-231
    • Milner, B.1    James, A.2
  • 31
    • 64849114398
    • Subjective Performance Assessment of Telephone-Band and Wideband Digital Codecs, Annex D: Modified IRS Send and Receive Characteristics, Feb. 1996, ITU-T Std. ITU-T Rec. P.830.
  • 32
    • 0037766930
    • Receiver design and simulation analysis of GPRS physical layer
    • M.S. thesis, National Taiwan Univ., Taipei
    • J.-H. Chen, "Receiver design and simulation analysis of GPRS physical layer," M.S. thesis, National Taiwan Univ., Taipei, 2001.
    • (2001)
    • Chen, J.-H.1
  • 33
    • 42549139762
    • MVA processing of speech features
    • Jan
    • C.-P. Chen and J. A. Bilmes, "MVA processing of speech features," IEEE Trans. Speech Audio Process., vol. 15, no. 1, pp. 257-270, Jan. 2007.
    • (2007) IEEE Trans. Speech Audio Process , vol.15 , Issue.1 , pp. 257-270
    • Chen, C.-P.1    Bilmes, J.A.2
  • 34
    • 34047247200
    • Optimization of temporal filters for constructing robust features in speech recognition
    • May
    • J.-W. Hung and L.-S. Lee, "Optimization of temporal filters for constructing robust features in speech recognition," IEEE Trans. Speech Audio Process., vol. 14, no. 3, pp. 808-832, May 2006.
    • (2006) IEEE Trans. Speech Audio Process , vol.14 , Issue.3 , pp. 808-832
    • Hung, J.-W.1    Lee, L.-S.2
  • 35
    • 84946730259
    • Trap-tandem: Data-driven extraction of temporal features from speech
    • H. Hermansky, "Trap-tandem: Data-driven extraction of temporal features from speech," in Proc. ASRU, 2003, pp. 255-260.
    • (2003) Proc. ASRU , pp. 255-260
    • Hermansky, H.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.