SCOPUS 정보 검색 플랫폼

IEEE Transactions on Speech and Audio Processing

Volumn 11, Issue 4, 2003, Pages 321-333

Prosodic and accentual information for automatic speech recognition

(2) Milone, Diego H a Rubio, Antonio J b

a UNIVERSIDAD NACIONAL DEL LITORAL (Argentina)

b Investigadora del Instituto de Investigación de Estudios de las Mujeres y de Género UGR (Spain)

Author keywords

Accentuation; Continuous speech recognition; Language models; Prosody

Indexed keywords

MARKOV PROCESSES; SPEECH PROCESSING; STATISTICAL METHODS;

ACCENTUATION;

CONTINUOUS SPEECH RECOGNITION;

EID: 0042093525 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2003.814368 Document Type: Article

Times cited : (13)

References (68)

1
- 22944489801
- Madrid: Espasa Calpe
- L. E. Alarcos, Gramática de la Lengua Española. Madrid: Espasa Calpe, 1999, pp. 52-68.
- (1999) Gramática de la Lengua Española , pp. 52-68
- Alarcos, L.E.¹

2
- 0030165438
- Language accent classification in American English
- L. M. Arslan and I. H. L. Hansen, "Language accent classification in American English," Speech Commun., vol. 18, pp. 353-367, 1996.
- (1996) Speech Commun. , vol.18 , pp. 353-367
- Arslan, L.M.¹ Hansen, I.H.L.²

3
- 85009136277
- Selective prosodic post-processing for improving recognition of French telephone numbers
- K. Bartkova and D. Jouvet, "Selective prosodic post-processing for improving recognition of French telephone numbers," in Proc. 7th Eur. Conf. Speech Communication and Technology, vol. 1, 1999, pp. 267-270.
- (1999) Proc. 7th Eur. Conf. Speech Communication and Technology , vol.1 , pp. 267-270
- Bartkova, K.¹ Jouvet, D.²

4
- 0042394427
- Tempo and its change in spontaneous speech
- A. Batliner, A. Kießling, R. Kompe, H. Niemann, and E. Nöth, "Tempo and its change in spontaneous speech," in Proc. 5th Eur. Conf. Speech Commun. Technology, vol. 2, 1997, pp. 763-766.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technology , vol.2 , pp. 763-766
- Batliner, A.¹ Kießling, A.² Kompe, R.³ Niemann, H.⁴ Nöth, E.⁵

5
- 0042394428
- A bilingual text-to-speech system in Spanish and Catalan
- A. Bonafonte, I. Esquerra, A. Febrer, and F. Vallverdu, "A bilingual text-to-speech system in Spanish and Catalan," in Proc. 5th Eur. Conf. Speech Commun. Technol., vol. 5, 1997, pp. 2455-2458.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol. , vol.5 , pp. 2455-2458
- Bonafonte, A.¹ Esquerra, I.² Febrer, A.³ Vallverdu, F.⁴

6
- 0042394423
- The role of prosody in infants' native-language discrimination abilities: The case of two phonologically close languages
- L. Bosch and N. Gallés, "The role of prosody in infants' native-language discrimination abilities: the case of two phonologically close languages," in Proc. 5th Eur. Conf. Speech Commun. Technol., vol. 1, 1997, pp. 231-234.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol. , vol.1 , pp. 231-234
- Bosch, L.¹ Gallés, N.²

7
- 4244089078
- A comparative study of HMM-based approaches for the automatic recognition of perceptually relevant aspects of spontaneous German speech melody
- C. Brindöpke, G. A. Fink, and F. Kummert, "A comparative study of HMM-based approaches for the automatic recognition of perceptually relevant aspects of spontaneous German speech melody," in Proc. 7th Eur. Conf. Speech Commun. Technol., vol. 2, 1999, pp. 699-702.
- (1999) Proc. 7th Eur. Conf. Speech Commun. Technol. , vol.2 , pp. 699-702
- Brindöpke, C.¹ Fink, G.A.² Kummert, F.³

8
- 4243850589
- A HMM-based recognition system for perceptive relevant pitch movements of spontaneous German speech
- Prosody and Emotion 6
- C. Brindöpke, G. A. Fink, F. Kummert, and G. Sagerer, "A HMM-based recognition system for perceptive relevant pitch movements of spontaneous German speech," in 5th Int. Conf. Spoken Language Processing, 1998, Prosody and Emotion 6.
- (1998) 5th Int. Conf. Spoken Language Processing
- Brindöpke, C.¹ Fink, G.A.² Kummert, F.³ Sagerer, G.⁴

9
- 0030149810
- Robust parametric modeling of durations in hidden Markov models
- D. Busdhtein, "Robust parametric modeling of durations in hidden Markov models," IEEE Trans. Speech Audio Processing, vol. 4, no. 3, 1996.
- (1996) IEEE Trans. Speech Audio Processing , vol.4 , Issue.3
- Busdhtein, D.¹

10
- 79951864468
- A computational memory and processing model for prosody
- Prosody and Emotion 2
- J. E. Cahn, "A computational memory and processing model for prosody," in Proc. 5th Int. Conf. Spoken Language Processing, 1998, Prosody and Emotion 2.
- (1998) Proc. 5th Int. Conf. Spoken Language Processing
- Cahn, J.E.¹

11
- 0042895383
- A statistical study of pitch target points in five languages
- Prosody and Emotion 5
- E. Campione and J. Véronis, "A statistical study of pitch target points in five languages," in Proc. 5th Int. Conf. Spoken Language Processing, 1998, Prosody and Emotion 5.
- (1998) Proc. 5th Int. Conf. Spoken Language Processing
- Campione, E.¹ Véronis, J.²

12
- 0001848274
- Development of a Spanish corpora for the speech research
- September 26-28, CEC DGXIII, ESCA and ESPRIT PROJECT 2589 "SAM"
- F. Casacuberta, R. García, J. Llisterri, C. Nadeu, J. M. Prado, and A. Rubio, "Development of a Spanish corpora for the speech research," in Proc. Workshop on International Co-Operation and Standardization of Speech Databases and Speech I/O Assessment Methods, September 26-28, 1991, CEC DGXIII, ESCA and ESPRIT PROJECT 2589 "SAM".
- (1991) Proc. Workshop on International Co-Operation and Standardization of Speech Databases and Speech I/O Assessment Methods
- Casacuberta, F.¹ García, R.² Llisterri, J.³ Nadeu, C.⁴ Prado, J.M.⁵ Rubio, A.⁶

13
- 0040465429
- Testing the meaning of four Dutch pitch accent types
- J. Caspers, "Testing the meaning of four Dutch pitch accent types," in Proc. 5th Eur. Conf. Speech Commun. Technol., vol. 2, 1997, pp. 863-866.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol. , vol.2 , pp. 863-866
- Caspers, J.¹

14
- 0032073761
- An RNN-based prosodic information synthesizer for Mandarin text-to-speech
- S.-H. Chen, S.-H. Hwang, and Y.-R. Wang, "An RNN-based prosodic information synthesizer for Mandarin text-to-speech," IEEE Trans. Speech Audio Processing, vol. 6, no. 3, 1998.
- (1998) IEEE Trans. Speech Audio Processing , vol.6 , Issue.3
- Chen, S.-H.¹ Hwang, S.-H.² Wang, Y.-R.³

15
- 0030143016
- On jointly learning the parameters in a character synchronous integrated speech and language model
- T.-H. Chiang, Y.-C. Lin, and K.-Y. Su, "On jointly learning the parameters in a character synchronous integrated speech and language model," IEEE Trans. Speech Audio Processing, vol. 4, no. 3, 1996.
- (1996) IEEE Trans. Speech Audio Processing , vol.4 , Issue.3
- Chiang, T.-H.¹ Lin, Y.-C.² Su, K.-Y.³

16
- 0030120916
- Frameworks for recognition of Mandarin syllables with tones using sub-syllabic units
- L. Chih-Heng, W. Chien-Hsing, T. Pei-Yih, and W. Hsin-Min, "Frameworks for recognition of Mandarin syllables with tones using sub-syllabic units," Speech Commun., vol. 18, pp. 175-190, 1996.
- (1996) Speech Commun. , vol.18 , pp. 175-190
- Chih-Heng, L.¹ Chien-Hsing, W.² Pei-Yih, T.³ Hsin-Min, W.⁴

17
- 79951898003
- Improvements in speech understanding accuracy through the integration of hierarchical linguistic, prosodic, and phonological constraints in the Jupiter domain
- G. Chung and S. Seneff, "Improvements in speech understanding accuracy through the integration of hierarchical linguistic, prosodic, and phonological constraints in the Jupiter domain," in Proc. 5th Int. Conf. Spoken Language Processing, Spoken Language Understanding Systems, vol. 1, 1998.
- (1998) Proc. 5th Int. Conf. Spoken Language Processing, Spoken Language Understanding Systems , vol.1
- Chung, G.¹ Seneff, S.²

18
- 0003424145
- Englewood Cliffs, NJ: Prentice-Hall
- J. R. Deller, J. G. Proakis, and J. H. Hansen, Discrete-Time Processing of Speech Signals. Englewood Cliffs, NJ: Prentice-Hall, 1987.
- (1987) Discrete-Time Processing of Speech Signals
- Deller, J.R.¹ Proakis, J.G.² Hansen, J.H.³

19
- 0042895382
- Departement de Filologia Espanyola, Facultad de Filosofia i Lletres, Universitat Autònoma de Barcelona
- A. J. M. Garrido, Modelización de Patrones Melódicos del Español para la Síntesis y el Reconocimiento Del habla: Departement de Filologia Espanyola, Facultad de Filosofia i Lletres, Universitat Autònoma de Barcelona, 1991.
- (1991) Modelización de Patrones Melódicos del Español para la Síntesis y el Reconocimiento Del Habla
- Garrido, A.J.M.¹

20
- 0033709101
- Detection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition
- K. Hirose and K. Iwano, "Detection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition," in Proc. IEEE 25th Int. Conf. Acoustics, Speech, Signal Processing, vol. 3, 2000, pp. 1763-1766.
- (2000) Proc. IEEE 25th Int. Conf. Acoustics, Speech, Signal Processing , vol.3 , pp. 1763-1766
- Hirose, K.¹ Iwano, K.²

21
- 0041392274
- The prosody of broad and narrow focus in English: Two experiments
- S. Hoskins, "The prosody of broad and narrow focus in English: Two experiments," in Proc. 5th Eur. Conf. Speech Commun. Technol., vol. 2, 1997, pp. 791-794.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol. , vol.2 , pp. 791-794
- Hoskins, S.¹

22
- 0031631064
- The use of accent-specific pronunciation dictionaries in acoustic model training
- J. J. Humphries and P. C. Woodland, "The use of accent-specific pronunciation dictionaries in acoustic model training," in Proc. IEEE 23rd Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, 1998, pp. 317-320.
- (1998) Proc. IEEE 23rd Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 317-320
- Humphries, J.J.¹ Woodland, P.C.²

23
- 0032785782
- Modeling long distance dependence in language: Topic mixtures versus dynamic cache models
- R. M. Iyer and M. Ostendorf, "Modeling long distance dependence in language: Topic mixtures versus dynamic cache models," IEEE Trans. Speech Audio Processing, vol. 7, no. 1, 1999.
- (1999) IEEE Trans. Speech Audio Processing , vol.7 , Issue.1
- Iyer, R.M.¹ Ostendorf, M.²

24
- 0003786003
- Cambridge, MA: MIT Press
- F. Jelinek, Statistical Methods for Speech Recognition. Cambridge, MA: MIT Press, 1999.
- (1999) Statistical Methods for Speech Recognition
- Jelinek, F.¹

25
- 0003770709
- Norwell, MA: Kluwer
- J. C. Junqua and J. P. Haton, Robustness In Automatic Speech Recognition: Fundamentals and Applications. Norwell, MA: Kluwer, 1996.
- (1996) Robustness in Automatic Speech Recognition: Fundamentals and Applications
- Junqua, J.C.¹ Haton, J.P.²

26
- 0003410791
- New York: Springer-Verlag
- T. Kohonen, The Self-Organizing Map. New York: Springer-Verlag, 1995.
- (1995) The Self-Organizing Map
- Kohonen, T.¹

27
- 0031191419
- The contribution of intonation, segmental durations, and spectral features to the perception of a spontaneous and a read speaking style
- G. P. M. Laan, "The contribution of intonation, segmental durations, and spectral features to the perception of a spontaneous and a read speaking style," Speech Commun., vol. 22, pp. 43-65, 1997.
- (1997) Speech Commun. , vol.22 , pp. 43-65
- Laan, G.P.M.¹

28
- 0041392272
- Dynamic beam-search strategy using prosodic-syntactic information
- S.-W. Lee and K. Hirose, "Dynamic beam-search strategy using prosodic-syntactic information," in Workshop on Automatic Speech Recognition and Understanding, 1999, pp. 189-192.
- (1999) Workshop on Automatic Speech Recognition and Understanding , pp. 189-192
- Lee, S.-W.¹ Hirose, K.²

29
- 0032677483
- Cantonese syllable recognition using neural networks
- T. Lee and C. Ching, "Cantonese syllable recognition using neural networks," IEEE Trans. Speech Audio Processing, vol. 7, no. 4, 1999.
- (1999) IEEE Trans. Speech Audio Processing , vol.7 , Issue.4
- Lee, T.¹ Ching, C.²

30
- 0042394425
- Giving prosody a meaning
- C. Lieske, J. Bos, M. Emele, B. Gambäck, and C. J. Rupp, "Giving prosody a meaning," in Proc. 5th Eur. Conf. Speech Commun. Technol., vol. 3, 1997, pp. 1431-1434.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol. , vol.3 , pp. 1431-1434
- Lieske, C.¹ Bos, J.² Emele, M.³ Gambäck, B.⁴ Rupp, C.J.⁵

31
- 0020180460
- Maximum likelihood estimation for multivariate stochastic observations of Markov chains
- L. A. Liporace, "Maximum likelihood estimation for multivariate stochastic observations of Markov chains," IEEE Trans. Inform. Theory, vol. IT-28, no. 5, 1982.
- (1982) IEEE Trans. Inform. Theory , vol.IT-28 , Issue.5
- Liporace, L.A.¹

32
- 0031187171
- Speech recognition by machines and humans
- R. P. Lippmann, "Speech recognition by machines and humans," Speech Commun., vol. 22, pp. 1-15, 1997.
- (1997) Speech Commun. , vol.22 , pp. 1-15
- Lippmann, R.P.¹

33
- 85083433846
- Improvement on connected numbers recognition using prosodie information
- Prosody and Emotion 2
- E. López, J. Caminero, I. Cortázar, and L. Hernández, "Improvement on connected numbers recognition using prosodie information," in Proc. 5th Eur. Conf. Speech Commun. Technol., 1998, Prosody and Emotion 2.
- (1998) Proc. 5th Eur. Conf. Speech Commun. Technol.
- López, E.¹ Caminero, J.² Cortázar, I.³ Hernández, L.⁴

34
- 26744462948
- Automatic corpus-based training of rules for prosodic generation in text-to-speech
- E. López-Gonzalo, J. M. Rodríguez-García, L. Hernández-Gómez, and J. M. Villar, "Automatic corpus-based training of rules for prosodic generation in text-to-speech," in Proc. 5th Eur. Conf. Speech Commun. Technol., vol. 5, 1997, pp. 2515-2518.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol. , vol.5 , pp. 2515-2518
- López-Gonzalo, E.¹ Rodríguez-García, J.M.² Hernández-Gómez, L.³ Villar, J.M.⁴

35
- 0030204644
- Speaker attribution of successive utterances: The role of discontinuities in voice characteristics and prosody
- V. Lublinskaja and C. Sappok, "Speaker attribution of successive utterances: The role of discontinuities in voice characteristics and prosody," Speech Commun., vol. 19, pp. 145-159, 1996.
- (1996) Speech Commun. , vol.19 , pp. 145-159
- Lublinskaja, V.¹ Sappok, C.²

36
- 0003612091
- London, U.K.: Ellis Horwood
- D. Michie, D. J. Spiegelhalter, and C. C. Taylor, Machine Learning, Neural and Statistical Classification. London, U.K.: Ellis Horwood, 1994.
- (1994) Machine Learning, Neural and Statistical Classification
- Michie, D.¹ Spiegelhalter, D.J.² Taylor, C.C.³

37
- 0041893246
- Self-Organizing neural tree networks
- D. H. Milone, J. C. Sáez, G. Simón, and H. L. Rufiner, "Self-Organizing neural tree networks," in Proc. 20th Annu. Int. Conf. IEEE Engineering in Medicine and Biology Society, vol. 20, 1998.
- (1998) Proc. 20th Annu. Int. Conf. IEEE Engineering in Medicine and Biology Society , vol.20
- Milone, D.H.¹ Sáez, J.C.² Simón, G.³ Rufiner, H.L.⁴

38
- 0041392279
- Árboles de redes neuronales autoorganizativas
- _, "Árboles de redes neuronales autoorganizativas," Revista Mexicana de Ingeniería Biomédica, vol. 29, no. 4, 1998.
- (1998) Revista Mexicana de Ingeniería Biomédica , vol.29 , Issue.4

39
- 84901425599
- Evolutionary algorithm for speech segmentation
- May
- D. H. Milone, J. J. Merelo, and H. L. Rufiner, "Evolutionary algorithm for speech segmentation," in Proc. of 2002 IEEE World Congress on Evolutionary Computation, May 2002.
- (2002) Proc. of 2002 IEEE World Congress on Evolutionary Computation
- Milone, D.H.¹ Merelo, J.J.² Rufiner, H.L.³

40
- 85009244099
- Suprasegmental duration modeling with elastic contraints in automatic speech recognition
- Hidden Markov Model Techniques 3
- L. Molloy and S. Isard, "Suprasegmental duration modeling with elastic contraints in automatic speech recognition," in Proc. 5th Int. Conf. Spoken Language Processing, 1998, Hidden Markov Model Techniques 3.
- (1998) Proc. 5th Int. Conf. Spoken Language Processing
- Molloy, L.¹ Isard, S.²

41
- 0014055288
- Cepstrum pitch determination
- A. M. Noll, "Cepstrum pitch determination," J. Acoust. Soc. Amer., vol. 41, pp. 293-309, 1967.
- (1967) J. Acoust. Soc. Amer. , vol.41 , pp. 293-309
- Noll, A.M.¹

42
- 0034274273
- Verb-mobil: The use of prosody in the linguistic components of a speech understanding system
- E. Nöth, A. Batliner, A. Kießling, R. Kompe, and H. Niemann, "Verb-mobil: The use of prosody in the linguistic components of a speech understanding system," IEEE Trans. Speech Audio Processing, vol. 8, no. 5, 2000.
- (2000) IEEE Trans. Speech Audio Processing , vol.8 , Issue.5
- Nöth, E.¹ Batliner, A.² Kießling, A.³ Kompe, R.⁴ Niemann, H.⁵

43
- 0031074261
- Prosody generation for German CTS/TTS systems (from theoretical intonation patterns to practical realization)
- O. Gábor and N. Géza, "Prosody generation for German CTS/TTS systems (from theoretical intonation patterns to practical realization)," Speech Commun., vol. 21, pp. 37-60, 1997.
- (1997) Speech Commun. , vol.21 , pp. 37-60
- Gábor, O.¹ Géza, N.²

44
- 0041392276
- Prosodic structure and phonetic processing:'A cross-linguistic study
- C. Pallier, A. Cutler, and N. Sebastián-Gallés, "Prosodic structure and phonetic processing:'A cross-linguistic study," in Proc. 5th Eur. Conf. Speech Commun. Technol., vol. 4, 1997, pp. 2131-2134.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol. , vol.4 , pp. 2131-2134
- Pallier, C.¹ Cutler, A.² Sebastián-Gallés, N.³

45
- 0030205397
- Modeling of phone duration (using the TIMIT database) and its potential benefit for ASR
- L. C. W. Pols, X. Wang, and L. F. M. Bosch, "Modeling of phone duration (using the TIMIT database) and its potential benefit for ASR," Speech Commun., vol. 19, pp. 161-176, 1996.
- (1996) Speech Commun. , vol.19 , pp. 161-176
- Pols, L.C.W.¹ Wang, X.² Bosch, L.F.M.³

46
- 0031071430
- Toward a prominence-based synthesis system
- T. Portele and B. Heuft, "Toward a prominence-based synthesis system," Speech Commun., vol. 21, pp. 61-72, 1997.
- (1997) Speech Commun. , vol.21 , pp. 61-72
- Portele, T.¹ Heuft, B.²

47
- 0032089995
- A study of n-gram and decision tree letter language modeling methods
- G. Potamianos and F. Jelinek, "A study of n-gram and decision tree letter language modeling methods," Speech Commun., vol. 24, pp. 171-192, 1998.
- (1998) Speech Commun. , vol.24 , pp. 171-192
- Potamianos, G.¹ Jelinek, F.²

48
- 0032795155
- Classification of Thai tone sequences in syllable-segmented speech using the analysis-by-synthesis method
- S. Potisuk, M. P. Harper, and J. Gandour, "Classification of Thai tone sequences in syllable-segmented speech using the analysis-by-synthesis method," IEEE Trans. Speech Audio Processing, vol. 7, no. 1, 1999.
- (1999) IEEE Trans. Speech Audio Processing , vol.7 , Issue.1
- Potisuk, S.¹ Harper, M.P.² Gandour, J.³

49
- 0040361816
- Madrid, Spain: Editorial Credos
- A. Quilis, Tratado de Fonología y Fonética Españolas. Madrid, Spain: Editorial Credos, 1993.
- (1993) Tratado de Fonología y Fonética Españolas
- Quilis, A.¹

50
- 84938440015
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and B. Gold, Theory and Application of Digital Signal Processing. Englewood Cliffs, NJ: Prentice-Hall, 1975.
- (1975) Theory and Application of Digital Signal Processing
- Rabiner, L.R.¹ Gold, B.²

51
- 0022594196
- An introduction to hidden Markov models
- Jan.
- L. R. Rabiner and B. H. Juang, "An introduction to hidden Markov models," IEEE ASSP Mag., Jan. 1986.
- (1986) IEEE ASSP Mag.
- Rabiner, L.R.¹ Juang, B.H.²

52
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- _, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition

53
- 0029754565
- 0 patterns
- 0 patterns," Speech Commun., vol. 18, pp. 21-46, 1996.
- (1996) Speech Commun. , vol.18 , pp. 21-46
- Rajendran, S.¹ Yenanarayana, B.²

54
- 0032665603
- A dynamical system model for generating fundamental frequency for speech synthesis
- N. K. Ross and M. Ostendorf, "A dynamical system model for generating fundamental frequency for speech synthesis," IEEE Trans. Speech Audio Processing, vol. 7, no. 3, 1999.
- (1999) IEEE Trans. Speech Audio Processing , vol.7 , Issue.3
- Ross, N.K.¹ Ostendorf, M.²

55
- 0042895379
- Is syntactic structure prosodically retrievable?
- M. Rossi, "Is syntactic structure prosodically retrievable?," in Proc. 5th Eur. Conf. Speech Commun. Technol., 1997.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol.
- Rossi, M.¹

56
- 85135139722
- A lognormal tied mixture model of pitch for prosody based speaker recognition
- M. K. Sönrnez, L. Heck, M. Weintraub, and E. Shriberg, "A lognormal tied mixture model of pitch for prosody based speaker recognition," in Proc. of 5th Eur. Conf. Speech Commun. Technol., vol. 3, 1997, pp. 1391-1394.
- (1997) Proc. of 5th Eur. Conf. Speech Commun. Technol. , vol.3 , pp. 1391-1394
- Sönrnez, M.K.¹ Heck, L.² Weintraub, M.³ Shriberg, E.⁴

57
- 79959823252
- Modeling the prosody of hidden events for improved word recognition
- A. Stolcke, E. Shriberg, D. Hakkani-Tür, and G. Tür, "Modeling the prosody of hidden events for improved word recognition," in Proc. 7th Eur. Conf. Speech Commun. Technol., vol. 1, 1999, pp. 311-314.
- (1999) Proc. 7th Eur. Conf. Speech Commun. Technol. , vol.1 , pp. 311-314
- Stolcke, A.¹ Shriberg, E.² Hakkani-Tür, D.³ Tür, G.⁴

58
- 0031185913
- Prosodic and lexical indications of discourse structure in human-machine interactions
- M. Swerts and M. Ostendorf, "Prosodic and lexical indications of discourse structure in human-machine interactions," Speech Commun., vol. 22, pp. 25-41, 1997.
- (1997) Speech Commun. , vol.22 , pp. 25-41
- Swerts, M.¹ Ostendorf, M.²

59
- 0030170432
- Acoustic parameters for place of articulation identification and classification of Spanish unvoiced stops
- M. I. Torres and P. Iparraguirre, "Acoustic parameters for place of articulation identification and classification of Spanish unvoiced stops," Speech Commun., vol. 18, pp. 369-379, 1996.
- (1996) Speech Commun. , vol.18 , pp. 369-379
- Torres, M.I.¹ Iparraguirre, P.²

60
- 0033096914
- Acoustic characteristics of lexical stress in continuous telephone speech
- Van Kuijk and L. Boves, "Acoustic characteristics of lexical stress in continuous telephone speech," Speech Commun., vol. 27, pp. 95-111, 1999.
- (1999) Speech Commun. , vol.27 , pp. 95-111
- Kuijk, V.¹ Boves, L.²

61
- 85135169269
- Prosodic modeling in text-to-speech synthesis
- J. P. H. Van Santen, "Prosodic modeling in text-to-speech synthesis," in Proc. 5th Eur. Conf. Speech Commun. Technol., 1997.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol.
- Van Santen, J.P.H.¹

62
- 85037162672
- Improving the phonetic annotation by means of prosodic phrasing
- H. Vereecken, A. Vorstermans, J. P. Martens, and B. Van Coile, "Improving the phonetic annotation by means of prosodic phrasing," in Proc. 5th Eur. Conf. Speech Commun. Technol., vol. 1, 1997, pp. 179-182.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol. , vol.1 , pp. 179-182
- Vereecken, H.¹ Vorstermans, A.² Martens, J.P.³ Van Coile, B.⁴

63
- 0032296808
- A stochastic model of intonation for text-to-speech synthesis
- J. Véronis, P. Di Cristo, F. Courtois, and C. Chaumette, "A stochastic model of intonation for text-to-speech synthesis," Speech Commun., vol. 26, pp. 233-244, 1998.
- (1998) Speech Commun. , vol.26 , pp. 233-244
- Véronis, J.¹ Di Cristo, P.² Courtois, F.³ Chaumette, C.⁴

64
- 85128385898
- A study of tones and tempo in continuous Mandarin digit strings and their application in telephone quality speech recognition
- Prosody and Emotion 2
- C. Wang and S. Seneff, "A study of tones and tempo in continuous Mandarin digit strings and their application in telephone quality speech recognition," in Proc. 5th Int. Conf. Spoken Language Processing, 1998, Prosody and Emotion 2.
- (1998) Proc. 5th Int. Conf. Spoken Language Processing
- Wang, C.¹ Seneff, S.²

65
- 0000079804
- Integrating multiple knowledge sources for word hypotheses graph interpretation
- V. Warnke, F. Gallwitz, A. Batliner, J. Buckow, R. Huber, E. Nöth, and A. Höthker, "Integrating multiple knowledge sources for word hypotheses graph interpretation," in Proc, 7th Eur. Conf. Speech Commun. Technol., vol. 1, 1999, pp. 235-238.
- (1999) Proc. 7th Eur. Conf. Speech Commun. Technol. , vol.1 , pp. 235-238
- Warnke, V.¹ Gallwitz, F.² Batliner, A.³ Buckow, J.⁴ Huber, R.⁵ Nöth, E.⁶ Höthker, A.⁷

66
- 0024634603
- Phoneme recognition using time-delay neural networks
- A. Waibel, T. Hanazawa, G. Hinton, K. Shikano, and L. Lang, "Phoneme recognition using time-delay neural networks," IEEE Trans. Acoust., Speech, Signal Processing, vol. 37, no. 3, 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Processing , vol.37 , Issue.3
- Waibel, A.¹ Hanazawa, T.² Hinton, G.³ Shikano, K.⁴ Lang, L.⁵

67
- 84892186467
- Incorporating information from syllable-length time scales into automatic speech recognition
- S.-L. Wu, B. E. D. Kingsbury, N. Morgan, and S. Greenberg, "Incorporating information from syllable-length time scales into automatic speech recognition," in Proc. IEEE 23rd Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, 1998, pp. 721-724.
- (1998) Proc. IEEE 23rd Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 721-724
- Wu, S.-L.¹ Kingsbury, B.E.D.² Morgan, N.³ Greenberg, S.⁴

68
- 0030181237
- Register as a variable in prosodic analysis: The case of the English negative
- M. Yaeger-Dror, "Register as a variable in prosodic analysis: The case of the English negative," Speech Commun., vol. 19, pp. 39-60, 1996.
- (1996) Speech Commun. , vol.19 , pp. 39-60
- Yaeger-Dror, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.