SCOPUS 정보 검색 플랫폼

IEEE Transactions on Signal Processing

Volumn 52, Issue 6, 2004, Pages 1783-1790

Constrained optimization for audio-to-visual conversion

(2) Choi, Kyoung Ho a Hwang, Jenq Neng b

a Electronics and Telecommunications Research Institute (ETRI) (South Korea)

b University of Washington (United States)

Author keywords

Audio to visual conversion; HMM; HMMI; Talking heads

Indexed keywords

ALGORITHMS; COMPUTER SIMULATION; CONSTRAINT THEORY; IMAGE ANALYSIS; LAGRANGE MULTIPLIERS; MARKOV PROCESSES; NEURAL NETWORKS; OPTIMIZATION; PARAMETER ESTIMATION; PROBABILITY DISTRIBUTIONS; VECTOR QUANTIZATION;

AUDIO TO VISUAL CONVERSION; CONSTRAINED OPTIMIZATION; HIDDEN MARKOV MODELS; TALKING HEADS;

SPEECH PROCESSING;

EID: 2942596586 PISSN: 1053587X EISSN: None Source Type: Journal
DOI: 10.1109/TSP.2004.827153 Document Type: Article

Times cited : (8)

References (25)

1
- 2942629743
- ISO/IEC JTC1/SC29/WG11 N2501, Nov.
- ISO/IEC FDIS 14 496-1 Systems, ISO/IEC JTC1/SC29/WG11 N2501, Nov. 1998.
- (1998) ISO/IEC FDIS 14 496-1 Systems

2
- 0038669820
- ISO/IEC JTC1/SC29/WG11 N2502, Nov.
- ISO/IEC FDIS 14 496-2 Visual, ISO/IEC JTC1/SC29/WG11 N2502, Nov. 1998.
- (1998) ISO/IEC FDIS 14 496-2 Visual

3
- 0033689047
- Design of a virtual human presenter
- T. Noma, L. Zhao, and N. I. Bradler, "Design of a virtual human presenter," IEEE Comput. Graphics Applicat., vol. 20, no. 4, pp. 79-85, 2000.
- (2000) IEEE Comput. Graphics Applicat. , vol.20 , Issue.4 , pp. 79-85
- Noma, T.¹ Zhao, L.² Bradler, N.I.³

4
- 0032683588
- SeamlessDesign: A face-to-face collaborative virtual/augmented environment for rapid prototyping of geometrically constrained 3-D objects
- K. Kiyokawa, H. Takemura, and N. Yokoya, "SeamlessDesign: a face-to-face collaborative virtual/augmented environment for rapid prototyping of geometrically constrained 3-D objects," in Proc. IEEE Int. Conf. Multimedia Comput. Syst., vol. 2, 1999, pp. 447-453.
- (1999) Proc. IEEE Int. Conf. Multimedia Comput. Syst. , vol.2 , pp. 447-453
- Kiyokawa, K.¹ Takemura, H.² Yokoya, N.³

5
- 0001519981
- Implementation of a virtual chat room for multimedia communications
- Y.-J. Chang, C.-C. Chen, J.-C. Chou, and Y.-C. Chen, "Implementation of a virtual chat room for multimedia communications," in Proc. IEEE 3rd Workshop Multimedia Signal Process., 1999, pp. 599-604.
- Proc. IEEE 3rd Workshop Multimedia Signal Process., 1999 , pp. 599-604
- Chang, Y.-J.¹ Chen, C.-C.² Chou, J.-C.³ Chen, Y.-C.⁴

6
- 0032641881
- Video avatar: Embedded video for collaborative virtual environment
- S. Yura, T. Usaka, and K. Sakamura, "Video avatar: embedded video for collaborative virtual environment," in Proc. IEEE Int. Conf. Multimedia Comput. Syst., vol. 2, 1999, pp. 433-438.
- (1999) Proc. IEEE Int. Conf. Multimedia Comput. Syst. , vol.2 , pp. 433-438
- Yura, S.¹ Usaka, T.² Sakamura, K.³

7
- 0026156861
- A media conversion from speech to facial image for intelligent man-machine interface
- May
- S. Morishima and H. Harashima, "A media conversion from speech to facial image for intelligent man-machine interface," IEEE J. Select. Areas Commun., vol. 9, pp. 594-600, May 1991.
- (1991) IEEE J. Select. Areas Commun. , vol.9 , pp. 594-600
- Morishima, S.¹ Harashima, H.²

8
- 0029270677
- Converting speech into lip movement: A multimedia telephone for hard of hearing people
- Jan.
- F. Lavagetto, "Converting speech into lip movement: a multimedia telephone for hard of hearing people," IEEE Trans. Rehab. Eng., vol. 3, pp. 90-102, Jan. 1995.
- (1995) IEEE Trans. Rehab. Eng. , vol.3 , pp. 90-102
- Lavagetto, F.¹

9
- 0031257449
- Time-delay neural networks for estimating lip movements from speech analysis: A useful tool in audio-video synchronization
- May
- ____, "Time-delay neural networks for estimating lip movements from speech analysis: a useful tool in audio-video synchronization," IEEE Trans. Circuits Syst. Video Technol., vol. 7, pp. 786-800, May 1997.
- (1997) IEEE Trans. Circuits Syst. Video Technol. , vol.7 , pp. 786-800
- Lavagetto, F.¹

10
- 0031997085
- Audio-to-visual conversion for multimedia communication
- Jan.
- R. R. Rao, T. Chen, and R. M. Mersereau, "Audio-to-visual conversion for multimedia communication," IEEE Trans. Ind. Electron., vol. 45, pp. 15-22, Jan. 1998.
- (1998) IEEE Trans. Ind. Electron. , vol.45 , pp. 15-22
- Rao, R.R.¹ Chen, T.² Mersereau, R.M.³

11
- 0035251712
- Speech-to-lip movement synthesis by maximizing audio-visual joint probability based on the EM algorithm
- S.Satoshi Nakamura and E.Eli Yamamoto, "Speech-to-lip movement synthesis by maximizing audio-visual joint probability based on the EM algorithm," J. VLSI Signal Process., vol. 27, pp. 119-126, 2001.
- (2001) J. VLSI Signal Process. , vol.27 , pp. 119-126
- Nakamura, S.S.¹ Yamamoto, E.E.²

12
- 2942629454
- Speech-to-lip movement synthesis maximizing audio-visual joint probability based on EM algorithm
- S. Nakamura, E. Yamamoto, and K.Kiyohiro Shikano, "Speech-to-lip movement synthesis maximizing audio-visual joint probability based on EM algorithm," in Proc. IEEE 2nd Workshop Multimedia Signal Process., 1998, pp. 53-58.
- Proc. IEEE 2nd Workshop Multimedia Signal Process., 1998 , pp. 53-58
- Nakamura, S.¹ Yamamoto, E.² Shikano, K.K.³

13
- 0003429272
- Boston, MA: Charles River Media
- B. Fleming and D. Dobbs, Animating Facial Features and Expressions. Boston, MA: Charles River Media, 1999, pp. 53-78.
- (1999) Animating Facial Features and Expressions , pp. 53-78
- Fleming, B.¹ Dobbs, D.²

14
- 0028996864
- Noisy speech recognition using robust inversion of hidden Markov models
- S. Y. Moon and J. N. Hwang, "Noisy speech recognition using robust inversion of hidden Markov models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1995, pp. 145-148.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1995 , pp. 145-148
- Moon, S.Y.¹ Hwang, J.N.²

15
- 0031100269
- Robust speech recognition based on joint model and feature space optimization of hidden Markov models
- Mar.
- ____, "Robust speech recognition based on joint model and feature space optimization of hidden Markov models," IEEE Trans. Neural Networks, vol. 8, pp. 194-204, Mar. 1997.
- (1997) IEEE Trans. Neural Networks , vol.8 , pp. 194-204
- Moon, S.Y.¹ Hwang, J.N.²

16
- 0028317510
- A projection-based likelihood measure for speech recognition in noise
- Jan.
- B. A. Carlson and M. A. Clements, "A projection-based likelihood measure for speech recognition in noise," IEEE Trans. Speech Audio Processing, vol. 2, pp. 97-102, Jan. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 97-102
- Carlson, B.A.¹ Clements, M.A.²

17
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993, pp. 321-389.
- (1993) Fundamentals of Speech Recognition , pp. 321-389
- Rabiner, L.R.¹ Juang, B.H.²

18
- 0003896318
- Newe York: Academic
- D. P. Bertsekas, Constrained Optimization and Lagrange Multiplier Methods, Newe York: Academic, 1982.
- (1982) Constrained Optimization and Lagrange Multiplier Methods
- Bertsekas, D.P.¹

19
- 0034792570
- Speech-driven cartoon animation with emotions
- Y. Li, F. Yu, Y.-Q. Xu, E. Chang, and H.-Y. Shum, "Speech-driven cartoon animation with emotions," in Proc. ACM Multimedia; 9th ACM Int. Multimedia Conf., Ottawa, ON, Canada, Sept. 30th-Oct. 5th 2001.
- Proc. ACM Multimedia; 9th ACM Int. Multimedia Conf., Ottawa, ON, Canada, Sept. 30th-Oct. 5th 2001
- Li, Y.¹ Yu, F.² Xu, Y.-Q.³ Chang, E.⁴ Shum, H.-Y.⁵

20
- 0030353343
- Recognizing emotion in speech
- F. Dellaert, T. Polzin, and A. Waibel, "Recognizing emotion in speech," in Proc. IEEE Int. Conf. Spoken Language, vol. 3, 1996, pp. 1970-1973.
- (1996) Proc. IEEE Int. Conf. Spoken Language , vol.3 , pp. 1970-1973
- Dellaert, F.¹ Polzin, T.² Waibel, A.³

21
- 85032751766
- Emotion recognition in human-computer interaction
- R. Cowie, E. Douglas-Cowie, N. Tsapatsoulis, G. Votsis, S. Kollias, W. Fellenz, and J. G. Taylor, "Emotion recognition in human-computer interaction," IEEE Signal Processing Mag., vol. 18, no. 1, pp. 32-80, 2001.
- (2001) IEEE Signal Processing Mag. , vol.18 , Issue.1 , pp. 32-80
- Cowie, R.¹ Douglas-Cowie, E.² Tsapatsoulis, N.³ Votsis, G.⁴ Kollias, S.⁵ Fellenz, W.⁶ Taylor, J.G.⁷

22
- 0034512820
- Emotional expressions in audiovisual human computer interaction
- L. S. Chen, and T. S. Huang, "Emotional expressions in audiovisual human computer interaction," in Proc. IEEE Int. Conf. Multimedia Expo, vol. 1, 2000, pp. 423-426.
- (2000) Proc. IEEE Int. Conf. Multimedia Expo , vol.1 , pp. 423-426
- Chen, L.S.¹ Huang, T.S.²

23
- 2942630206
- [Online] http://htk.eng.cam.ac.uk

24
- 0036447903
- Creating 3D speech-driven talking heads: A probabilistic approach
- K. H. Choi and J.-N.Jenq-Neng Hwang, "Creating 3D speech-driven talking heads: a probabilistic approach," in Proc. IEEE Int. Conf. Image Processing, 2002, pp. 984-987.
- Proc. IEEE Int. Conf. Image Processing, 2002 , pp. 984-987
- Choi, K.H.¹ Hwang, J.-N.²

25
- 0038370173
- A probabilistic network for facial feature verification
- K. H. Choi, J. J. Yoo, T. H. Hwang, J. H. Park, and J. H. Lee, "A probabilistic network for facial feature verification," ETRI J., vol. 25, no. 2, pp. 140-143, 2003.
- (2003) ETRI J. , vol.25 , Issue.2 , pp. 140-143
- Choi, K.H.¹ Yoo, J.J.² Hwang, T.H.³ Park, J.H.⁴ Lee, J.H.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.