Volume 48, Issue 11, 2006, Pages 1402-1421

Towards improving the robustness of distributed speech recognition in packet loss

Author keywords

Distributed speech recognition; Interleaving; MAP reconstruction; Packet loss; Weighted Viterbi decoding

Indexed keywords

COMMUNICATION CHANNELS (INFORMATION THEORY); DATA REDUCTION; DECODING; ROBUSTNESS (CONTROL SYSTEMS); VECTORS;

EID: 33750368383     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2006.07.005     Document Type: Article
Times cited : (5)

References (42)
  • 1
    • 33750336250
    • Andrews, K., Heegard, C., Kozen, D., 1997. A theory of interleavers. Technical report 97-1634, Computer Science Department, Cornell University, June, 1997.
  • 2
    • 4544367814
    • Arizmendi, I., Rose, R.C., 2004. A distributed framework for enterprise level speech recognition services. In: Proc. ICASSP 2004, Montreal, Canada.
  • 4
    • 0036880073
    • Bernard, A., Alwan, A., 2002. Low-bitrate distributed speech recognition for packet-based and wireless communication. IEEE Trans. Speech Audio Process. 10 (November).
  • 5
    • 85009250664
    • Bernard, A., Alwan, A., 2002b. Channel noise robustness for low-bitrate remote speech recognition. In: Proc. ICSLP 2002.
  • 6
    • 33750311357
    • Bolot, J., Crepin, H., 1995. Analysis and control of audio packet loss over packet-switched networks. In: Proc. of NOSSDAV, 1995.
  • 7
  • 8
    • 33750298220
    • Cardenal-López, A., Docío-Fernández, L., García-Mateo, C., 2004. Soft decoding strategies for distributed speech recognition over IP networks. In: Proc. ICASSP 2004, Montreal, Canada.
  • 9
    • 33750302996
    • Chesterfield, J., Chakravorty, R., Crowcroft, J., Rodriguez, P., Banerjee, S., 2004. Experiences with multimedia streaming over 2.5G and 3G networks. In: Proc. Workshop on Broadband Wireless Multimedia 2004, San Jose, United States.
  • 10
    • 0035342414
    • Cooke, M., Green, P., Josifovski, L., Vizinho, A., 2001. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Comm. 34, 267-285.
  • 11
    • 33750292964
    • Cuny, R., Lakaniemi, A., 2003. VoIP in 3G networks: An end-to-end quality of service analysis. Nokia Whitepaper, 2003.
  • 12
    • 85009228769
    • Endo, T., Kuroiwa, S., Nakamura, S., 2003. Missing feature theory applied to robust speech recognition on IP networks. In: Proc. Eurospeech, 2003.
  • 13
    • 33750302480
    • Ericsson, 2000. Aurora document no. AU/266/00: Recognition with WI007 Compression and Transmission over GSM Channel. Ericsson, December 2000.
  • 14
    • 33750372493
    • ETSI, 2000. ETSI document. STQ - DSR - front-end feature extraction algorithm; compression algorithm. Technical Report ES 201 108, ETSI, 2000.
  • 15
    • 33750368434
    • ETSI, 2002. ETSI document. STQ - DSR - advanced front-end feature algorithm; compression algorithm. Technical Report ES 202 050, ETSI, 2002.
  • 16
    • 33750335709
    • ETSI, 2003. ETSI document. STQ - DSR - extended advanced front-end feature algorithm; compression algorithms; back-end speech reconstruction algorithm. Technical Report ES 202 212, ETSI, 2003.
  • 17
    • 0022667694
    • Furui, S., 1986. Speaker independent isolated word recognition using dynamic features of speech spectrum. IEEE Trans. ASSP 34 (1).
  • 18
    • 33750306754
    • Gómez, A.M., Peinado, A.M., Sánchez, V., Milner, B.P., 2004. Statistical-based reconstruction methods for speech recognition in IP networks. In: Proc. Robust 2004, Norwich, United Kingdom.
  • 20
    • 33750343109
    • Hirsch, H.G., Pearce, D., 2000. The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proc. ISCA ITRW ASR2000, September, 2000.
  • 21
    • 4544323318
    • James, A.B., Milner, B.P., 2004a. An analysis of interleavers for robust speech recognition in burst-like packet loss. In: Proc. ICASSP 2004, Montreal, Canada.
  • 22
    • 33750342576
    • James, A.B., Milner, B.P., 2004b. Interleaving and estimation of lost vectors for robust speech recognition in burst-like packet loss. In: Proc. EUSIPCO 2004, Vienna, Austria.
  • 23
    • 33646820291
    • James, A.B., Milner, B.P., 2005. Soft decoding of temporal derivatives for robust distributed speech recognition in packet loss. In: Proc. ICASSP 2005, Philadelphia, United States.
  • 24
    • 0242526814
    • Ji, P., Benyuan, L., Towsley, D., Kurose, J., 2004. Modelling frame-level errors in GSM wireless channels. In: Proc. Internet Performance Sympos. (IPS 2002), January 2004, Vol. 55 (1-2), pp. 165-181.
  • 25
    • 85009204334
    • Milner, B.P., James, A.B., 2003. Analysis and compensation of packet loss in distributed speech recognition using interleaving. In: Proc. Eurospeech 2003.
  • 26
    • 85009097092
    • Milner, B.P., James, A.B., 2004. An analysis of packet loss models for distributed speech recognition. In: Proc. ICSLP 2004, Jeju island, Korea.
  • 27
    • 33750300605
    • Mutter, A., Necker, A.C., Lück, S., 2004. IP-packet service time distributions in UMTS radio access networks. In: Proc. EUNICE 2004.
  • 28
    • 4544240901
    • Nour-Eldin, A.H., Tolba, H., O'Shaughnessy, D., 2004. Automatic recognition of bluetooth speech in 802.11 interference and the effectiveness of insertion-based compensation techniques. In: Proc. ICASSP 2004, Montreal, Canada.
  • 29
    • 33750314046
    • Pearce, D., 2000. An overview of the ETSI standards activities for distributed speech recognition front-ends. In: Proc. AVIOS 2000.
  • 30
    • 33750337624
    • Pearce, D., 2004. Robustness to Transmission Channel - the DSR Approach. In: Proc. Robust 2004, Norwich, United Kingdom.
  • 31
    • 0242721421
    • Peinado, A.M., Sánchez, V., Pérez-Córdoba, J.L., Torre, A., 2003. HMM-based channel error mitigation and its application to distributed speech recognition. Speech Comm. 41, 549-561.
  • 32
    • 33750303483
    • Raj, B., 2000. Reconstruction of incomplete spectrograms for robust speech recognition. Ph.D. thesis, Carnegie Mellon University, 2000.
  • 33
    • 4644336054
    • Raj, B., Seltzer, M.L., Stern, R., 2004. Reconstruction of missing features for robust speech recognition. Speech Comm. 43, 275-296.
  • 34
    • 0014781954
    • Ramsey, R.L., 1970. The realization of optimum interleavers. IEEE Trans. Inform. Theory IT-16 (3), 772-781.
  • 35
    • 0028996854
    • Robinson, T., Fransen, J., Pye, D., Foote, J., Renals, S., 1995. WSJCAM0: A British English speech corpus for large vocabulary continuous speech recognition. In: Proc. ICASSP 1995.
  • 36
    • 33750325215
    • Schulzrinne, H., Casner, S., Frederick, R., Jacobson, V., 2003. RTP: A transport protocol for real-time applications. IETF RFC 3550, July 2003.
  • 37
    • 4544353461
    • Tan, Z., Dalsgaard, P., Lindberg, B., 2004. A subvector-based error concealment algorithm for speech recognition over mobile networks. In: Proc. ICASSP 2004, Montreal, Canada.
  • 40
    • 33750354439
    • Xie, Q., 2003. RTP Payload Format for European telecommunications standards institute (ETSI) European standard ES 201 108 distributed speech recognition encoding. RFC 3557, IETF, 2003.
  • 41
    • 33750353917
    • Xie, Q., Pearce, D., 2005. RTP payload formats for European telecommunications standards Institute (ETSI) European standard ES 202 050, ES 202 211, and ES 202 212 distributed speech recognition encoding. RFC 4060, IETF, 2005.
  • 42
    • 0032677669
    • Yajnik, M., Moon, S., Kurose, J., Towsley, D., 1999. Measurement and Modelling of the Temporal Dependence in Packet Loss. In: Proc. INFOCOM 1999.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.