-
1
-
-
85119213703
-
ToBI: A standard scheme for labeling prosody
-
K. Silverman, M. Beckman, J. Pitrelli, M. Ostendorf, C.Wightman, P. Price, J. Pierrehumbert, and J. Hirschberg, "ToBI: A standard scheme for labeling prosody," in Proc. Int. Conf. Spoken Lang. Process., 1992, pp. 867-869.
-
(1992)
Proc. Int. Conf. Spoken Lang. Process
, pp. 867-869
-
-
Silverman, K.1
Beckman, M.2
Pitrelli, J.3
Ostendorf, M.4
Wightman, C.5
Price, P.6
Pierrehumbert, J.7
Hirschberg, J.8
-
2
-
-
0003665661
-
-
D. Hirst and A. D. Cristo, Eds., Cambridge, U.K.: Cambridge Univ. Press
-
D. Hirst and A. D. Cristo, Eds., Intonation Systems: A Survey of Twenty Languages. Cambridge, U.K.: Cambridge Univ. Press, 1998.
-
(1998)
Intonation Systems: A Survey of Twenty Languages
-
-
-
3
-
-
33646805961
-
IViE-A comparative transcription system for intonational variation in English
-
E. Grabe, F. Nolan, and K. Farrar, "IViE-A comparative transcription system for intonational variation in English," in Proc. Int. Conf. Spoken Lang. Process., 1998, pp. 1259-1262.
-
(1998)
Proc. Int. Conf. Spoken Lang. Process
, pp. 1259-1262
-
-
Grabe, E.1
Nolan, F.2
Farrar, K.3
-
4
-
-
60849083145
-
Automatic prosodic event detection using acoustic, lexical, and syntactic evidence
-
Jan.
-
S. Ananthakrishnan and S. Narayanan, "Automatic prosodic event detection using acoustic, lexical, and syntactic evidence," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.1, pp. 216-228, Jan. 2008.
-
(2008)
IEEE Trans. Audio, Speech, Lang. Process
, vol.16
, Issue.1
, pp. 216-228
-
-
Ananthakrishnan, S.1
Narayanan, S.2
-
5
-
-
34547540499
-
Prosody models for conversational speech recognition
-
M. Ostendorf, I. Shafran, and R. Bates, "Prosody models for conversational speech recognition," in Proc. 2nd Plenary Meeting Symp. Prosody and Speech Process., 2003, pp. 147-154.
-
(2003)
Proc. 2nd Plenary Meeting Symp. Prosody and Speech Process
, pp. 147-154
-
-
Ostendorf, M.1
Shafran, I.2
Bates, R.3
-
6
-
-
85009102907
-
Lexical stress modeling for improved speech recognition of spontaneous telephone speech in the JUPITER domain
-
C. Wang and S. Seneff, "Lexical stress modeling for improved speech recognition of spontaneous telephone speech in the JUPITER domain," in Proc. 7th Eur. Conf. Speech Commun. Technol., 2001, pp. 2761-2764.
-
(2001)
Proc. 7th Eur. Conf. Speech Commun. Technol.
, pp. 2761-2764
-
-
Wang, C.1
Seneff, S.2
-
7
-
-
34547496179
-
Speech recognition models of the interdependence among syntax, prosody and segmental acoustics
-
M. Hasegawa-Johnson, J. Cole, C. Shih, K. Chen, A. Cohen, S. Chavarria, H. Kim, T. Yoon, S. Borys, and J.-Y. Choi, "Speech recognition models of the interdependence among syntax, prosody and segmental acoustics," in Proc. HLT/NAACL, 2004, pp. 56-63.
-
(2004)
Proc. HLT/NAACL
, pp. 56-63
-
-
Hasegawa-Johnson, M.1
Cole, J.2
Shih, C.3
Chen, K.4
Cohen, A.5
Chavarria, S.6
Kim, H.7
Yoon, T.8
Borys, S.9
Choi, J.-Y.10
-
8
-
-
33744970676
-
Prosody dependent speech recognition on radio news corpus of American English
-
Jan.
-
K. Chen, M. Hasegawa-Johnson, A. Cohen, S. Borys, S.-S. Kim, J. Cole, and J.-Y. Choi, "Prosody dependent speech recognition on radio news corpus of American English," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.1, pp. 232-245, Jan. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.1
, pp. 232-245
-
-
Chen, K.1
Hasegawa-Johnson, M.2
Cohen, A.3
Borys, S.4
Kim, S.-S.5
Cole, J.6
Choi, J.-Y.7
-
9
-
-
0004115604
-
The Boston University Radio News Corpus
-
Mar.
-
M. Ostendorf, P. Price, and S. Shattuck-Hufnagel, "The Boston University Radio News Corpus," Boston Univ., Boston, MA, Tech. Rep. ECS-95-1001, Mar. 1995.
-
(1995)
Boston Univ., Boston, MA, Tech. Rep. ECS-95-1001
-
-
Ostendorf, M.1
Price, P.2
Shattuck-Hufnagel, S.3
-
10
-
-
34547525606
-
Improved speech recognition using acoustic and lexical correlates of pitch accent in a N-best rescoring framework
-
S. Ananthakrishnan and S. Narayanan, "Improved speech recognition using acoustic and lexical correlates of pitch accent in a N-best rescoring framework," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2007, pp. 873-876.
-
(2007)
Proc. Int. Conf. Acoust., Speech, Signal Process
, pp. 873-876
-
-
Ananthakrishnan, S.1
Narayanan, S.2
-
12
-
-
0035156005
-
Automatic ToBI prediction and alignment to speed manual labeling of prosody
-
A. Syrdal, J. Hirschberg, J. McGory, and M. Beckman, "Automatic ToBI prediction and alignment to speed manual labeling of prosody," Speech Commun., vol.33, pp. 135-151, 2001.
-
(2001)
Speech Commun.
, vol.33
, pp. 135-151
-
-
Syrdal, A.1
Hirschberg, J.2
McGory, J.3
Beckman, M.4
-
13
-
-
78149415050
-
Discourse structure in spoken language: Studies on speech corpora
-
Mar.
-
C. Nakatani, J. Hirschberg, and B. Grosz, "Discourse structure in spoken language: Studies on speech corpora," in Proc. AAAI Spring Symp. Empirical Methods in Discourse Interpretation and Generation, Mar. 1995, pp. 106-112.
-
(1995)
Proc. AAAI Spring Symp. Empirical Methods in Discourse Interpretation and Generation
, pp. 106-112
-
-
Nakatani, C.1
Hirschberg, J.2
Grosz, B.3
-
14
-
-
67449109656
-
The design and implementation of the TRAINS-96 system: A prototype mixed-initiative planning assistant
-
Oct.
-
G. Ferguson, J. Allen, B. Miller, and E. Ringger, "The design and implementation of the TRAINS-96 system: A prototype mixed-initiative planning assistant," Univ. of Rochester, Rochester, Tech. Rep. TN96-5, Oct. 1996.
-
(1996)
Univ. of Rochester, Rochester, Tech. Rep. TN96-5
-
-
Ferguson, G.1
Allen, J.2
Miller, B.3
Ringger, E.4
-
15
-
-
33947647913
-
A prosodically labeled database of spontaneous speech
-
Oct.
-
M. Ostendorf, I. Shafran, S. Shattuck-Hufnagel, L. Carmichael, and W. Byrne, "A prosodically labeled database of spontaneous speech," in Proc. ISCA Workshop Prosody in Speech Recognition and Understanding, Oct. 2001, pp. 119-121.
-
(2001)
Proc. ISCA Workshop Prosody in Speech Recognition and Understanding
, pp. 119-121
-
-
Ostendorf, M.1
Shafran, I.2
Shattuck-Hufnagel, S.3
Carmichael, L.4
Byrne, W.5
-
16
-
-
51449117928
-
A novel algorithm for unsupervised prosodic language model adaptation
-
Las Vegas, NV
-
S. Ananthakrishnan and S. Narayanan, "A novel algorithm for unsupervised prosodic language model adaptation," in Proc. Int. Conf. Acoust., Speech, Signal Process., Las Vegas, NV, 2008.
-
(2008)
Proc. Int. Conf. Acoust., Speech, Signal Process
-
-
Ananthakrishnan, S.1
Narayanan, S.2
-
17
-
-
70350481607
-
CSR-II (WSJ1) Complete
-
Philadelphia, PA
-
"CSR-II (WSJ1) Complete," Linguistic Data Consortium, Philadelphia, PA, 1994.
-
(1994)
Linguistic Data Consortium
-
-
-
18
-
-
0141589558
-
SONIC: The University of Colorado continuous speech recognizer
-
Mar.
-
B. Pellom, "SONIC: The University of Colorado continuous speech recognizer," Univ. of Colorado, Boulder, CO, Tech. Rep. TR-CSLR-2001-2101, Mar. 2001.
-
(2001)
Univ. of Colorado, Boulder, CO, Tech. Rep. TR-CSLR-2001-2101
-
-
Pellom, B.1
-
19
-
-
84891308106
-
SRILM-An extensible language modeling toolkit
-
Denver, CO
-
A. Stolcke, "SRILM-An extensible language modeling toolkit," in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, 2002, vol.2, pp. 901-904.
-
(2002)
Proc. Int. Conf. Spoken Lang. Process
, vol.2
, pp. 901-904
-
-
Stolcke, A.1
-
20
-
-
0034296009
-
Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
-
L. Mangu, E. Brill, and A. Stolcke, "Finding consensus in speech recognition: Word error minimization and other applications of confusion networks," Computer, Speech, Lang., vol.14, no.4, pp. 373-400, 2000.
-
(2000)
Computer, Speech, Lang.
, vol.14
, Issue.4
, pp. 373-400
-
-
Mangu, L.1
Brill, E.2
Stolcke, A.3
-
21
-
-
34249306038
-
-
M.S. thesis, Cambridge Univ., Cambridge, U.K.
-
G. Evermann, "Minimum word error rate decoding," M.S. thesis, Cambridge Univ., Cambridge, U.K., 1999.
-
(1999)
Minimum Word Error Rate Decoding
-
-
Evermann, G.1
-
22
-
-
0003857778
-
A gentle tutorial on the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models
-
J. Bilmes, "A gentle tutorial on the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models," Univ. of Berkeley, Berkeley, CA, Tech. Rep. ICSI-TR-97-021, 1997.
-
(1997)
Univ. of Berkeley, Berkeley, CA, Tech. Rep. ICSI-TR-97-021
-
-
Bilmes, J.1
-
23
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
-
Apr.
-
J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol.2, no.2, pp. 291-298, Apr. 1994.
-
(1994)
IEEE Trans. Speech Audio Process
, vol.2
, Issue.2
, pp. 291-298
-
-
Gauvain, J.-L.1
Lee, C.-H.2
-
24
-
-
0040262052
-
Bayesian learning of Gaussian mixture densities for hidden Markov models
-
Pacific Grove, CA, Morgan-Kaufmann
-
J.-L. Gauvain and C.-H. Lee, "Bayesian learning of Gaussian mixture densities for hidden Markov models," in Proc. DARPA Speech and Natural Language Workshop, Pacific Grove, CA, 1991, pp. 272-277, Morgan-Kaufmann.
-
(1991)
Proc. DARPA Speech and Natural Language Workshop
, pp. 272-277
-
-
Gauvain, J.-L.1
Lee, C.-H.2
-
25
-
-
0003822743
-
-
Cambridge, U.K.: Cambridge Univ., Dec.
-
S. Young, G. Evermann, T. Hain, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK Book. Cambridge, U.K.: Cambridge Univ., Dec. 2002.
-
(2002)
The HTK Book
-
-
Young, S.1
Evermann, G.2
Hain, T.3
Kershaw, D.4
Moore, G.5
Odell, J.6
Ollason, D.7
Povey, D.8
Valtchev, V.9
Woodland, P.10