-
1
-
-
0012884503
-
Can we tell apart intonation from prosody (if we look at accents and boundaries)?
-
Athens
-
Batliner, A., Kießling, A., Kompe, R., Niemann, H., Nöth, E., 1997. Can we tell apart intonation from prosody (if we look at accents and boundaries)? In: Proc. ESCA Intonation Workshop, Athens, pp. 39-42.
-
(1997)
Proc. ESCA Intonation Workshop
, pp. 39-42
-
-
Batliner, A.1
Kießling, A.2
Kompe, R.3
Niemann, H.4
Nöth, E.5
-
3
-
-
0003597966
-
Guidelines for ToBI labelling
-
Ohio State University
-
Beckman, M.E., Elam, G.A., 1994. Guidelines for ToBI labelling. Technical report, Ohio State University. Available from <http://www.ling.ohio-state.edu/research/phonetics/E_ToBI/singer_tobi.html>.
-
(1994)
Technical Report
-
-
Beckman, M.E.1
Elam, G.A.2
-
6
-
-
21844474476
-
Acoustic differentiation of ip and IP boundary levels: Comparison of L- and L-L% in the Switchboard corpus
-
Nara, Japan
-
Chavarria, S., Yoon, T., Cole, J., Hasegawa-Johnson, M., 2004. Acoustic differentiation of ip and IP boundary levels: Comparison of L- and L-L% in the Switchboard corpus. In: Proc. SpeechProsody, Nara, Japan.
-
(2004)
Proc. SpeechProsody
-
-
Chavarria, S.1
Yoon, T.2
Cole, J.3
Hasegawa-Johnson, M.4
-
9
-
-
85009211730
-
Prosody dependent speech recognition with explicit duration modeling at intonational phrase boundaries
-
Geneva
-
Chen, K., Borys, S., Hasegawa-Johnson, M., 2003a. Prosody dependent speech recognition with explicit duration modeling at intonational phrase boundaries. In: Proc. EUROSPEECH, Geneva, pp. 393-396.
-
(2003)
Proc. EUROSPEECH
, pp. 393-396
-
-
Chen, K.1
Borys, S.2
Hasegawa-Johnson, M.3
-
10
-
-
21844474221
-
An intonational phrase boundary and pitch accent dependent speech recognizer
-
Orlando, FL
-
Chen, K., Hasegawa-Johnson, M., Kim, S.-S., 2003b. An intonational phrase boundary and pitch accent dependent speech recognizer. In: Internat. Conf. on Syst., Cybernet., Intell. (SCI). Orlando, FL.
-
(2003)
Internat. Conf. on Syst., Cybernet., Intell. (SCI)
-
-
Chen, K.1
Hasegawa-Johnson, M.2
Kim, S.-S.3
-
11
-
-
4544275067
-
An automatic prosody labeling system using ANN-based syntactic prosodic model and GMM-based acoustic prosodic model
-
Chen, K., Hasegawa-Johnson, M., Cohen, A., 2004a. An automatic prosody labeling system using ANN-based syntactic prosodic model and GMM-based acoustic prosodic model. In: Proc. ICASSP.
-
(2004)
Proc. ICASSP
-
-
Chen, K.1
Hasegawa-Johnson, M.2
Cohen, A.3
-
12
-
-
21844456475
-
A maximum likelihood prosody recognizer
-
Nara, Japan
-
Chen, K., Hasegawa-Johnson, M., Cohen, A., Cole, J., 2004b. A maximum likelihood prosody recognizer. In: Proc. SpeechProsody, Nara, Japan.
-
(2004)
Proc. SpeechProsody
-
-
Chen, K.1
Hasegawa-Johnson, M.2
Cohen, A.3
Cole, J.4
-
13
-
-
21844459928
-
Prosody dependent speech recognition on radio news
-
in press
-
Chen, K., Hasegawa-Johnson, M., Cohen, A., Borys, S., Kim, S.-S., Cole, J., Choi, J.-Y., in press. Prosody dependent speech recognition on radio news. IEEE Trans. Speech Audio Process.
-
IEEE Trans. Speech Audio Process
-
-
Chen, K.1
Hasegawa-Johnson, M.2
Cohen, A.3
Borys, S.4
Kim, S.-S.5
Cole, J.6
Choi, J.-Y.7
-
16
-
-
21844447535
-
The effect of accent on the acoustic cues to stop voicing in radio news speech
-
Cole, J., Choi, H., Kim, H., Hasegawa-Johnson, M., 2003. The effect of accent on the acoustic cues to stop voicing in radio news speech. In: Internat. Conf. Phonet. Sci.
-
(2003)
Internat. Conf. Phonet. Sci.
-
-
Cole, J.1
Choi, H.2
Kim, H.3
Hasegawa-Johnson, M.4
-
17
-
-
0023921973
-
Segmental durations in connected-speech signals: Current results
-
T.H. Crystal, and A.S. House Segmental durations in connected-speech signals: Current results J. Acoust. Soc. Amer. 83 1988 1553 1573
-
(1988)
J. Acoust. Soc. Amer.
, vol.83
, pp. 1553-1573
-
-
Crystal, T.H.1
House, A.S.2
-
18
-
-
21844469157
-
The supraglottal articulation of prominence in English: Linguistic stress as localized hyperarticulation
-
K. DeJong The supraglottal articulation of prominence in English: Linguistic stress as localized hyperarticulation J. Acoust. Soc. Amer. 89 1 1995 369 382
-
(1995)
J. Acoust. Soc. Amer.
, vol.89
, Issue.1
, pp. 369-382
-
-
Dejong, K.1
-
19
-
-
0030268342
-
Glottalization of word-initial vowels as a function of prosodic structure
-
L. Dilley, S. Shattuck-Hufnagel, and M. Ostendorf Glottalization of word-initial vowels as a function of prosodic structure J. Phonet. 24 1996 423 444
-
(1996)
J. Phonet.
, vol.24
, pp. 423-444
-
-
Dilley, L.1
Shattuck-Hufnagel, S.2
Ostendorf, M.3
-
20
-
-
0002585974
-
Variable duration models for speech
-
J. Ferguson Princeton University Press Princeton, NJ
-
J.D. Ferguson Variable duration models for speech J. Ferguson Proc. Symp. Applic. Hidden Markov Models to Text and Speech 1980 Princeton University Press Princeton, NJ 143 179
-
(1980)
Proc. Symp. Applic. Hidden Markov Models to Text and Speech
, pp. 143-179
-
-
Ferguson, J.D.1
-
21
-
-
0141702354
-
A prosody-based approach to end-of-utterance detection that does not require speech recognition
-
Ferrer, L., Shriberg, E., Stolcke, A., 2003. A prosody-based approach to end-of-utterance detection that does not require speech recognition. In: Proc. ICASSP. pp. 608-611.
-
(2003)
Proc. ICASSP
, pp. 608-611
-
-
Ferrer, L.1
Shriberg, E.2
Stolcke, A.3
-
22
-
-
0031009252
-
Articulatory strengthening at edges of prosodic domains
-
C. Fougeron, and P.A. Keating Articulatory strengthening at edges of prosodic domains J. Acoust. Soc. Amer. 101 6 1997 3728 3740
-
(1997)
J. Acoust. Soc. Amer.
, vol.101
, Issue.6
, pp. 3728-3740
-
-
Fougeron, C.1
Keating, P.A.2
-
23
-
-
85011187169
-
Analysis of voice fundamental frequency contours for declarative sentence of Japanese
-
H. Fujisaki, and K. Hirose Analysis of voice fundamental frequency contours for declarative sentence of Japanese J. Acoust. Soc. Jpn. 5 4 1984 233 242
-
(1984)
J. Acoust. Soc. Jpn.
, vol.5
, Issue.4
, pp. 233-242
-
-
Fujisaki, H.1
Hirose, K.2
-
24
-
-
85016587886
-
SWITCHBOARD: Telephone speech corpus for research and development
-
Godfrey, J., Holliman, E., McDaniel, J., 1992. SWITCHBOARD: telephone speech corpus for research and development. In: Proc. ICASSP. pp. 517-520.
-
(1992)
Proc. ICASSP
, pp. 517-520
-
-
Godfrey, J.1
Holliman, E.2
McDaniel, J.3
-
27
-
-
21844462996
-
Speech recognition models of the interdependence among syntax, prosody, and segmental acoustics
-
Hasegawa-Johnson, M., Cole, J., Shih, C., Chen, K., Cohen, A., Chavarria, S., Kim, H., Yoon, T., Borys, S., Choi, J.-Y., 2004. Speech recognition models of the interdependence among syntax, prosody, and segmental acoustics. In: HLT/NAACL Workshop on Linguist. Other Higher-Level Knowledge Speech Process.
-
(2004)
HLT/NAACL Workshop on Linguist. Other Higher-level Knowledge Speech Process.
-
-
Hasegawa-Johnson, M.1
Cole, J.2
Shih, C.3
Chen, K.4
Cohen, A.5
Chavarria, S.6
Kim, H.7
Yoon, T.8
Borys, S.9
Choi, J.-Y.10
-
28
-
-
0037795510
-
Automatic extraction of F0 control rules using statistical analysis
-
J.P.H. van Santen R.W. Sproat J.P. Olive J. Hirschberg Springer-Verlag New York
-
T. Hirai, N. Iwahashi, N. Higuchi, and Y. Sagisaka Automatic extraction of F0 control rules using statistical analysis J.P.H. van Santen R.W. Sproat J.P. Olive J. Hirschberg Progress in Speech Synthesis 1997 Springer-Verlag New York 333 346
-
(1997)
Progress in Speech Synthesis
, pp. 333-346
-
-
Hirai, T.1
Iwahashi, N.2
Higuchi, N.3
Sagisaka, Y.4
-
30
-
-
0002838041
-
Consonant types, vowel quality and tone
-
Fromkin, V. (Ed.)
-
Hombert, J., 1978. Consonant types, vowel quality and tone. In: Fromkin, V. (Ed.), Tone: A Linguistic Survey. pp. 77-112.
-
(1978)
Tone: A Linguistic Survey
, pp. 77-112
-
-
Hombert, J.1
-
31
-
-
0032203256
-
Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method
-
S. Katagiri, B.-H. Juang, and C.-H. Lee Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method Proc. IEEE 86 11 1998 2345 2373
-
(1998)
Proc. IEEE
, vol.86
, Issue.11
, pp. 2345-2373
-
-
Katagiri, S.1
Juang, B.-H.2
Lee, C.-H.3
-
32
-
-
0015196653
-
Effects of stress contrasts on certain articulatory parameters
-
Kent, and Netsell Effects of stress contrasts on certain articulatory parameters Phonetica. 24 1971 23 44
-
(1971)
Phonetica.
, vol.24
, pp. 23-44
-
-
Kent1
Netsell2
-
33
-
-
0032584970
-
Time-delay recurrent neural network for temporal correlations and prediction
-
S.-S. Kim Time-delay recurrent neural network for temporal correlations and prediction Neurocomputing 20 1998 253 263
-
(1998)
Neurocomputing
, vol.20
, pp. 253-263
-
-
Kim, S.-S.1
-
34
-
-
21844462286
-
The effect of accent on acoustic cues to stop voicing and place of articulation in radio news speech
-
Nara, Japan
-
Kim, H., Cole, J., Choi, H., Hasegawa-Johnson, M., 2004a. The effect of accent on acoustic cues to stop voicing and place of articulation in radio news speech. In: Proc. SpeechProsody, Nara, Japan.
-
(2004)
Proc. SpeechProsody
-
-
Kim, H.1
Cole, J.2
Choi, H.3
Hasegawa-Johnson, M.4
-
35
-
-
3142765506
-
Automatic recognition of pitch movements using multi-layer perceptron and time-delay recursive neural network
-
S.-S. Kim, M. Hasegawa-Johnson, and K. Chen Automatic recognition of pitch movements using multi-layer perceptron and time-delay recursive neural network IEEE Signal Process. Lett. 11 7 2004 645 648
-
(2004)
IEEE Signal Process. Lett.
, vol.11
, Issue.7
, pp. 645-648
-
-
Kim, S.-S.1
Hasegawa-Johnson, M.2
Chen, K.3
-
36
-
-
0016952322
-
Linguistic uses of segmental duration in English: Acoustic and perceptual evidence
-
D.H. Klatt Linguistic uses of segmental duration in English: Acoustic and perceptual evidence J. Acoust. Soc. Amer. 59 5 1976 1208 1221
-
(1976)
J. Acoust. Soc. Amer.
, vol.59
, Issue.5
, pp. 1208-1221
-
-
Klatt, D.H.1
-
38
-
-
0024768209
-
Speaker-independent phone recognition using hidden Markov models
-
K.-F. Lee, and H.-W. Hon Speaker-independent phone recognition using hidden Markov models IEEE Trans. Acoust., Speech, Signal Process. 37 11 1989 1641 1648 November
-
(1989)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.37
, Issue.11
, pp. 1641-1648
-
-
Lee, K.-F.1
Hon, H.-W.2
-
39
-
-
85009223733
-
Automatic disfluency identification in conversational speech using multiple knowledge sources
-
Liu, Y., Shriberg, E., Stolcke, A., 2003. Automatic disfluency identification in conversational speech using multiple knowledge sources. In: Proc. EUROSPEECH.
-
(2003)
Proc. EUROSPEECH
-
-
Liu, Y.1
Shriberg, E.2
Stolcke, A.3
-
40
-
-
0346707540
-
Approximately independent factors of speech using non-linear symplectic transformation
-
M.K. Omar, and M. Hasegawa-Johnson Approximately independent factors of speech using non-linear symplectic transformation IEEE Trans. Speech and Audio Process. 11 6 2003 660 671
-
(2003)
IEEE Trans. Speech and Audio Process.
, vol.11
, Issue.6
, pp. 660-671
-
-
Omar, M.K.1
Hasegawa-Johnson, M.2
-
43
-
-
33947647913
-
A prosodically labeled database of spontaneous speech
-
Red Bank, NJ
-
Ostendorf, M., Shafran, I., Shattuck-Hufnagel, S., Carmichael, L., Byrne, W., 2002. A prosodically labeled database of spontaneous speech. In: Proc. ISCA Tutorial Res. Workshop on Prosody in Speech Recognition Understand., Red Bank, NJ.
-
(2002)
Proc. ISCA Tutorial Res. Workshop on Prosody in Speech Recognition Understand.
-
-
Ostendorf, M.1
Shafran, I.2
Shattuck-Hufnagel, S.3
Carmichael, L.4
Byrne, W.5
-
45
-
-
0026323673
-
The use of prosody in syntactic disambiguation
-
P. Price, M. Ostendorf, S. Shattuck-Hufnagel, and C. Fong The use of prosody in syntactic disambiguation J. Acoust. Soc. Amer. 90 6 1991 2956 2970 Dec.
-
(1991)
J. Acoust. Soc. Amer.
, vol.90
, Issue.6
, pp. 2956-2970
-
-
Price, P.1
Ostendorf, M.2
Shattuck-Hufnagel, S.3
Fong, C.4
-
46
-
-
21844460435
-
Speaker-independent automatic detection of pitch accent
-
Nara, Japan
-
Ren, Y., Kim, S.-S., Hasegawa-Johnson, M., Cole, J., 2004. Speaker-independent automatic detection of pitch accent. In: Proc. SpeechProsody, Nara, Japan.
-
(2004)
Proc. SpeechProsody
-
-
Ren, Y.1
Kim, S.-S.2
Hasegawa-Johnson, M.3
Cole, J.4
-
47
-
-
34848820349
-
Direct modeling of prosody: An overview of applications in automatic speech processing
-
Shriberg, E., Stolcke, A., 2004. Direct modeling of prosody: An overview of applications in automatic speech processing. In: Proc. SpeechProsody.
-
(2004)
Proc. SpeechProsody
-
-
Shriberg, E.1
Stolcke, A.2
-
48
-
-
85119213703
-
ToBI: A standard for labeling English prosody
-
Silverman, K., Beckman, M., Pitrelli, J., Ostendorf, M., Wightman, C., Price, P., Pierrehumbert, J., Hirschberg, J., 1992. ToBI: A standard for labeling English prosody. In: Proc. Internat. Conf. Spoken Language Process.
-
(1992)
Proc. Internat. Conf. Spoken Language Process.
-
-
Silverman, K.1
Beckman, M.2
Pitrelli, J.3
Ostendorf, M.4
Wightman, C.5
Price, P.6
Pierrehumbert, J.7
Hirschberg, J.8
-
50
-
-
85128436986
-
Modeling dynamic prosodic variation for speaker verification
-
Sönmez, K., Shriberg, E., Heck, L., Weintraub, M., 1998. Modeling dynamic prosodic variation for speaker verification. In: Proc. Internat. Conf. Spoken Language Process., pp. 3189-3192.
-
(1998)
Proc. Internat. Conf. Spoken Language Process.
, pp. 3189-3192
-
-
Sönmez, K.1
Shriberg, E.2
Heck, L.3
Weintraub, M.4
-
51
-
-
79959823252
-
Modeling the prosody of hidden events for improved word recognition
-
Stolcke, A., Shriberg, E., Hakkani-Tür, D., Tür, G., 1999. Modeling the prosody of hidden events for improved word recognition. In: Proc. EUROSPEECH, pp. 307-310.
-
(1999)
Proc. EUROSPEECH
, pp. 307-310
-
-
Stolcke, A.1
Shriberg, E.2
Hakkani-Tür, D.3
Tür, G.4
-
52
-
-
0034008810
-
Analysis and synthesis of intonation using the Tilt model
-
P. Taylor Analysis and synthesis of intonation using the Tilt model J. Acoust. Soc. Amer. 107 3 2000 1697 1714
-
(2000)
J. Acoust. Soc. Amer.
, vol.107
, Issue.3
, pp. 1697-1714
-
-
Taylor, P.1
-
53
-
-
0033096914
-
Acoustic characteristics of lexical stress in continuous telephone speech
-
D. van Kuijk, and L. Boves Acoustic characteristics of lexical stress in continuous telephone speech Speech Comm. 27 1999 95 111
-
(1999)
Speech Comm.
, vol.27
, pp. 95-111
-
-
Van Kuijk, D.1
Boves, L.2
-
54
-
-
0141703284
-
Prosodic knowledge sources for automatic speech recognition
-
Vergyri, D., Stolcke, A., Gadde, V.R., Ferrer, L., Shriberg, E., 2003. Prosodic knowledge sources for automatic speech recognition. In: Proc. ICASSP.
-
(2003)
Proc. ICASSP
-
-
Vergyri, D.1
Stolcke, A.2
Gadde, V.R.3
Ferrer, L.4
Shriberg, E.5
-
55
-
-
0024634603
-
Phoneme recognition using time-delay neural networks
-
A. Waibel, T. Hanazawa, G. Hinton, K. Shikano, and K.J. Lang Phoneme recognition using time-delay neural networks IEEE Trans. Acoust., Speech, Signal Process. 37 1989 328 339
-
(1989)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.37
, pp. 328-339
-
-
Waibel, A.1
Hanazawa, T.2
Hinton, G.3
Shikano, K.4
Lang, K.J.5
-
58
-
-
85009080837
-
Intertranscriber reliability of prosodic labeling on telephone conversation using ToBI
-
Yoon, T., Chavarria, S., Cole, J., Hasegawa-Johnson, M., 2004. Intertranscriber reliability of prosodic labeling on telephone conversation using ToBI. In: Proc. Internat. Conf. Spoken Language Process.
-
(2004)
Proc. Internat. Conf. Spoken Language Process.
-
-
Yoon, T.1
Chavarria, S.2
Cole, J.3
Hasegawa-Johnson, M.4
-
59
-
-
0003822743
-
-
Cambridge University Engineering Department Cambridge, UK
-
S. Young, G. Evermann, T. Hain, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland The HTK Book 2002 Cambridge University Engineering Department Cambridge, UK
-
(2002)
The HTK Book
-
-
Young, S.1
Evermann, G.2
Hain, T.3
Kershaw, D.4
Moore, G.5
Odell, J.6
Ollason, D.7
Povey, D.8
Valtchev, V.9
Woodland, P.10
-
60
-
-
0025477640
-
Speech database development at MIT: TIMIT and beyond
-
V. Zue, S. Seneff, and J. Glass Speech database development at MIT: TIMIT and beyond Speech Comm. 9 1990 351 356
-
(1990)
Speech Comm.
, vol.9
, pp. 351-356
-
-
Zue, V.1
Seneff, S.2
Glass, J.3