SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Speech Communication

Volumn 48, Issue 11, 2006, Pages 1402-1421

Towards improving the robustness of distributed speech recognition in packet loss

(2) James, Alastair a Milner, Ben a

a UNIVERSITY OF EAST ANGLIA (United Kingdom)

Author keywords

Distributed speech recognition; Interleaving; MAP reconstruction; Packet loss; Weighted Viterbi decoding

Indexed keywords

COMMUNICATION CHANNELS (INFORMATION THEORY); DATA REDUCTION; DECODING; ROBUSTNESS (CONTROL SYSTEMS); VECTORS;

DATA CHANNELS; DISTRIBUTED SPEECH RECOGNITION (DSR); INTERLEAVING; MAP RECONSTRUCTION; PACKET LOSS; WEIGHTED-VITERBI DECODING;

SPEECH RECOGNITION;

EID: 33750368383 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2006.07.005 Document Type: Article

Times cited : (5)

References (42)

1
- 33750336250
- Andrews, K., Heegard, C., Kozen, D., 1997. A theory of interleavers. Technical report 97-1634, Computer Science Department, Cornell University, June, 1997.

2
- 4544367814
- Arizmendi, I., Rose, R.C., 2004. A distributed framework for enterprise level speech recognition services. In: Proc. ICASSP 2004, Montreal, Canada.

3
- 84891584375
- IEEE Press
- Basagni S., Conti M., Giordano S., and Stojmenović I. Mobile Ad-Hoc Networking (2004), IEEE Press
- (2004) Mobile Ad-Hoc Networking
- Basagni, S.¹ Conti, M.² Giordano, S.³ Stojmenović, I.⁴

4
- 0036880073
- Low-bitrate distributed speech recognition for packet-based and wireless communication
- Bernard A., and Alwan A. Low-bitrate distributed speech recognition for packet-based and wireless communication. IEEE Trans. Speech Audio Process. 10 November (2002)
- (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.November
- Bernard, A.¹ Alwan, A.²

5
- 85009250664
- Bernard, A., Alwan, A., 2002b. Channel noise robustness for low-bitrate remote speech recognition. In: Proc. ICSLP 2002.

6
- 33750311357
- Bolot, J., Crepin, H., 1995. Analysis and control of audio packet loss over packet-switched networks. In: Proc. of NOSSDAV, 1995.

7
- 0036880137
- Graceful degradation of speech recognition performance over packet erasure networks
- Boulis C., Ostendorf M., Riskin E.A., and Otterson S. Graceful degradation of speech recognition performance over packet erasure networks. IEEE. Trans. Speech Audio Process. 10 November (2002)
- (2002) IEEE. Trans. Speech Audio Process. , vol.10 , Issue.November
- Boulis, C.¹ Ostendorf, M.² Riskin, E.A.³ Otterson, S.⁴

8
- 33750298220
- Cardenal-López, A., Doci{dotless}́o-Fernández, L., Garci{dotless}́a-Mateo, C., 2004. Soft decoding strategies for distributed speech recognition over IP networks. In: Proc. ICASSP 2004, Montreal, Canada.

9
- 33750302996
- Chesterfield, J., Chakravorty, R., Crowcroft, J., Rodriguez, P., Banerjee, S., 2004. Experiences with multimedia streaming over 2.5G and 3G networks. In: Proc. Workshop on Broadband Wireless Multimedia 2004, San Jose, United States.

10
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- Cooke M., Green P., Josifovski L., and Vizinho A. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Comm. 34 (2001) 267-285
- (2001) Speech Comm. , vol.34 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

11
- 33750292964
- Cuny, R., Lakaniemi, A., 2003. VoIP in 3G networks: An end-to-end quality of service analysis. Nokia Whitepaper, 2003.

12
- 85009228769
- Endo, T., Kuroiwa, S., Nakamura, S. 2003. Missing feature theory applied to robust speech recognition on IP networks. In: Proc. Eurospeech, 2003.

13
- 33750302480
- Ericsson, 2000. Aurora document no. AU/266/00: Recognition with WI007 Compression and Transmission over GSM Channel. Ericsson, December 2000.

14
- 33750372493
- ETSI, 2000. ETSI document. STQ - DSR - front-end feature extraction algorithm; compression algorithm. Technical Report ES 201 108, ETSI, 2000.

15
- 33750368434
- ETSI, 2002. ETSI document. STQ - DSR - advanced front-end feature algorithm; compression algorithm. Technical Report ES 202 050, ETSI, 2002.

16
- 33750335709
- ETSI, 2003. ETSI document. STQ - DSR - extended advanced front-end feature algorithm; compression algorithms; back-end speech reconstruction algorithm. Technical Report ES 202 050, ETSI, 2003.

17
- 0022667694
- Speaker independent isolated word recognition using dynamic features of speech spectrum
- Furui S. Speaker independent isolated word recognition using dynamic features of speech spectrum. IEEE Trans. ASSP 34 1 (1986)
- (1986) IEEE Trans. ASSP , vol.34 , Issue.1
- Furui, S.¹

18
- 33750306754
- Gómez, A.M., Peinado, A.M., Sánchez, V., Milner, B.P., 2004. Statistical-based reconstruction methods for speech recognition in IP networks. In: Proc. Robust 2004, Norwich, United Kingdom.

19
- 0003953023
- Addison-Wesley
- Halsall F. Data Communications, Computer Networks and Open Systems. fourth ed. (1995), Addison-Wesley
- (1995) Data Communications, Computer Networks and Open Systems. fourth ed.
- Halsall, F.¹

20
- 33750343109
- Hirsch, H.G., Pearce, D., 2000. The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proc. ISCA ITRW ASR2000, September, 2000.

21
- 4544323318
- James, A.B., Milner, B.P., 2004a. An analysis of interleavers for robust speech recognition in burst-like packet loss. In: Proc. ICASSP 2004, Montreal, Canada.

22
- 33750342576
- James, A.B., Milner, B.P., 2004b. Interleaving and estimation of lost vectors for robust speech recognition in burst-like packet loss. In: Proc. EUSIPCO 2004, Vienna, Austria.

23
- 33646820291
- James, A.B., Milner, B.P., 2005. Soft decoding of temporal derivatives for robust distributed speech recognition in packet loss. In: Proc. ICASSP 2005, Philadelphia, United States.

24
- 0242526814
- Ji, P., Benyuan, L., Towsley, D., Kurose, J., 2004. Modelling frame-level errors in GSM wireless channels. In: Proc. Internet Performance Sympos. (IPS 2002), January 2004, Vol. 55 (1-2), pp. 165-181.

25
- 85009204334
- Milner, B.P., James, A.B., 2003. Analysis and compensation of packet loss in distributed speech recognition using interleaving. In: Proc. Eurospeech 2003.

26
- 85009097092
- Milner, B.P., James, A.B., 2004. An analysis of packet loss models for distributed speech recognition. In: Proc. ICSLP 2004, Jeju island, Korea.

27
- 33750300605
- Mutter, A., Necker, A.C., Lück, S., 2004. IP-packet service time distributions in UMTS radio access networks. In: Proc. EUNICE 2004.

28
- 4544240901
- Nour-Eldin, A.H., Tolba, H., O'Shaughnessy, D., 2004. Automatic recognition of bluetooth speech in 802.11 interference and the effectiveness of insertion-based compensation techniques. In: Proc. ICASSP 2004, Montreal, Canada.

29
- 33750314046
- Pearce, D., 2000. An overview of the ETSI standards activities for distributed speech recognition front-ends. In: Proc. AVIOS 2000.

30
- 33750337624
- Pearce, D., 2004. Robustness to Transmission Channel - the DSR Approach. In: Proc. Robust 2004, Norwich, United Kingdom.

31
- 0242721421
- HMM-based channel error mitigation and its application to distributed speech recognition
- Peinado A.M., Sánchez V., Pérez-Córdoba J.L., and Torre A. HMM-based channel error mitigation and its application to distributed speech recognition. Speech Comm. 21 (2003) 549-561
- (2003) Speech Comm. , vol.21 , pp. 549-561
- Peinado, A.M.¹ Sánchez, V.² Pérez-Córdoba, J.L.³ Torre, A.⁴

32
- 33750303483
- Raj, B., 2000. Reconstruction of incomplete spectrograms for robust speech recognition. Ph.D. thesis, Carnegie Mellon University, 2000.

33
- 4644336054
- Reconstruction of missing features for robust speech recognition
- Raj B., Seltzer M.L., and Stern R. Reconstruction of missing features for robust speech recognition. Speech Comm. 43 (2004) 275-296
- (2004) Speech Comm. , vol.43 , pp. 275-296
- Raj, B.¹ Seltzer, M.L.² Stern, R.³

34
- 0014781954
- The realization of optimum interleavers
- Ramsey R.L. The realization of optimum interleavers. IEEE Trans. Inform. Theory IT-16 3 (1970) 772-781
- (1970) IEEE Trans. Inform. Theory , vol.IT-16 , Issue.3 , pp. 772-781
- Ramsey, R.L.¹

35
- 0028996854
- Robinson, T., Fransen, J., Pye, D., Foote, J., Renals, S. 1995. WSJCAM0: A British English speech corpus for large vocabulary continuous speech recognition. In: Proc. ICASSP 1995.

36
- 33750325215
- Schulzrinne, H., Casner, S., Frederick, R., Jacobson,V., 2003. RTP: A transport protocol for real-time applications. IETF RFC 3550, July 2003.

37
- 4544353461
- Tan, Z., Dalsgaard, P., Lindberg, B., 2004. A subvector-based error concealment algorithm for speech recognition over mobile networks. In: Proc. ICASSP 2004, Montreal, Canada.

38
- 84889779628
- John-Wiley
- Vaseghi S.V. Advanced Digital Signal Processing and Noise Reduction (2000), John-Wiley
- (2000) Advanced Digital Signal Processing and Noise Reduction
- Vaseghi, S.V.¹

39
- 3242892382
- Wiley
- Wesolowski K. Mobile Communication Systems (2002), Wiley
- (2002) Mobile Communication Systems
- Wesolowski, K.¹

40
- 33750354439
- Xie, Q., 2003. RTP Payload Format for European telecommunications standards institute (ETSI) European standard ES 201 108 distributed speech recognition encoding. RFC 3557, IETF, 2003.

41
- 33750353917
- Xie, Q., Pearce, D., 2005. RTP payload formats for European telecommunications standards Institute (ETSI) European standard ES 202 050, ES 202 211, and ES 202 212 distributed speech recognition encoding. RFC 4060, IETF, 2005.

42
- 0032677669
- Yajnik, M., Moon, S., Kurose, J., Towsley, D. 1999. Measurement and Modelling of the Temporal Dependence in Packet Loss. In: Proc. INFOCOMM 1999.

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.