-
1
-
-
79960572268
-
Quantifying temporal speech reduction in French using forced speech alignment
-
10.1016/j.wocn.2010.11.011
-
Adda-Decker, M., and Snoeren, N. D. (2011). " Quantifying temporal speech reduction in French using forced speech alignment.," J. Phonetics 39, 261-270. 10.1016/j.wocn.2010.11.011
-
(2011)
J. Phonetics
, vol.39
, pp. 261-270
-
-
Adda-Decker, M.1
Snoeren, N.D.2
-
3
-
-
79961209587
-
Collecting and evaluating speech recognition corpora for 11 South African languages
-
10.1007/s10579-011-9152-1
-
Badenhorst, J., van Heerden, C., Davel, M., and Barnard, E. (2011). " Collecting and evaluating speech recognition corpora for 11 South African languages.," Lang. Res. Eval. 45 (3), 289-309. 10.1007/s10579-011-9152-1
-
(2011)
Lang. Res. Eval.
, vol.45
, Issue.3
, pp. 289-309
-
-
Badenhorst, J.1
Van Heerden, C.2
Davel, M.3
Barnard, E.4
-
4
-
-
30644460082
-
Fitting linear mixed models in R
-
Bates, D. M. (2005). " Fitting linear mixed models in R.," R News 5, 27-30.
-
(2005)
R News
, vol.5
, pp. 27-30
-
-
Bates, D.M.1
-
5
-
-
0032744759
-
Perception of coarticulatory nasalization by speakers of English and Thai: Evidence for partial compensation
-
10.1121/1.428111
-
Beddor, P. S., and Krakow, R. A. (1999). " Perception of coarticulatory nasalization by speakers of English and Thai: Evidence for partial compensation.," J. Acoust. Soc. Am. 106 (5), 2868-2887. 10.1121/1.428111
-
(1999)
J. Acoust. Soc. Am.
, vol.106
, Issue.5
, pp. 2868-2887
-
-
Beddor, P.S.1
Krakow, R.A.2
-
6
-
-
41149149864
-
Free prefix ordering in Chintang
-
10.1353/lan.2007.0002
-
Bickel, B., Banjade, G., Gaenszle, M., Lieven, E., Paudyal, N. P., Rai, I. P., Rai, M., Rai, N. K., and Stoll, S. (2007). " Free prefix ordering in Chintang.," Language 83 (1), 43-73. 10.1353/lan.2007.0002
-
(2007)
Language
, vol.83
, Issue.1
, pp. 43-73
-
-
Bickel, B.1
Banjade, G.2
Gaenszle, M.3
Lieven, E.4
Paudyal, N.P.5
Rai, I.P.6
Rai, M.7
Rai, N.K.8
Stoll, S.9
-
7
-
-
84883409481
-
-
Praat: Doing phonetics by computer" [computer program], (date last viewed 10/1/12)
-
Boersma, P., and Weenink, D. (2012). "Praat: Doing phonetics by computer" [computer program], www.praat.org (date last viewed 10/1/12).
-
(2012)
-
-
Boersma, P.1
Weenink, D.2
-
8
-
-
84883371125
-
Multi-lingual automatic phoneme clustering
-
in, San Francisco, CA
-
Boula de Mareüil, P., Corredor-Ardoy, C., and Adda-Decker, M. (1999). " Multi-lingual automatic phoneme clustering.," in Proceedings of the 14th International Congress of the Phonetic Sciences, San Francisco, CA, pp. 1209-1212.
-
(1999)
Proceedings of the 14th International Congress of the Phonetic Sciences
, pp. 1209-1212
-
-
Boula De Mareüil, P.1
Corredor-Ardoy, C.2
Adda-Decker, M.3
-
9
-
-
78049394188
-
Multilingual acoustic modeling for speech recognition based on subspace Gaussian mixture models
-
Burget, L., Schwarz, P., Agarwal, M., Akyazi, P., Feng, K., Ghoshal, A., Glembek, O., Goel, N., Karafiát, M., Povey, D., Rastrow, A., Rose, R. C., and Thomas, S. (2010). " Multilingual acoustic modeling for speech recognition based on subspace Gaussian mixture models.," in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 4334-4337.
-
(2010)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, pp. 4334-4337
-
-
Burget, L.1
Schwarz, P.2
Agarwal, M.3
Akyazi, P.4
Feng, K.5
Ghoshal, A.6
Glembek, O.7
Goel, N.8
Karafiát, M.9
Povey, D.10
Rastrow, A.11
Rose, R.C.12
Thomas, S.13
-
10
-
-
0026407260
-
Multi-lingual label alignment using acoustic- phonetic features derived by neural-network technique
-
Vol.
-
Dalsgaard, P., Andersen, O., and Barry, W. (1991). " Multi-lingual label alignment using acoustic- phonetic features derived by neural-network technique.," Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 1, 197-200.
-
(1991)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, vol.1
, pp. 197-200
-
-
Dalsgaard, P.1
Andersen, O.2
Barry, W.3
-
11
-
-
85011436241
-
Czech
-
10.1017/S0025100300005442
-
Dankovičová, J. (1997). " Czech.," J. Int. Phonetic Assoc. 27 (1), 77-80. 10.1017/S0025100300005442
-
(1997)
J. Int. Phonetic Assoc.
, vol.27
, Issue.1
, pp. 77-80
-
-
Dankovičová, J.1
-
12
-
-
84919927894
-
Phonetic alignment in Yoloxóchitl Mixtec tone
-
" talk presented at, Portland, OR
-
DiCanio, C., Amith, J., and Castillo-García, R. (2012). " Phonetic alignment in Yoloxóchitl Mixtec tone.," talk presented at The Society for the Study of the Indigenous Languages of the Americas, Annual Meeting, Portland, OR.
-
(2012)
The Society for the Study of the Indigenous Languages of the Americas, Annual Meeting
-
-
Dicanio, C.1
Amith, J.2
Castillo-García, R.3
-
13
-
-
84935322488
-
The discourse basis of ergativity
-
10.2307/415719
-
Du Bois, J. W. (1987). " The discourse basis of ergativity.," Language 63 (4), 805-855. 10.2307/415719
-
(1987)
Language
, vol.63
, Issue.4
, pp. 805-855
-
-
Du Bois, J.W.1
-
14
-
-
0003548585
-
-
(Linguistic Data Consortium, Philadelphia)
-
Garofolo, J. S., Lamel, L. F., Fisher, W. M., Fiscus, J. G., Pallett, D. S., Dahlgren, N. L., and Zue, V. (1993). TIMIT Acoustic-Phonetic Continuous Speech Corpus (Linguistic Data Consortium, Philadelphia).
-
(1993)
TIMIT Acoustic-Phonetic Continuous Speech Corpus
-
-
Garofolo, J.S.1
Lamel, L.F.2
Fisher, W.M.3
Fiscus, J.G.4
Pallett, D.S.5
Dahlgren, N.L.6
Zue, V.7
-
15
-
-
84874464263
-
Context dependent phone mapping for cross-lingual acoustic modeling
-
" in
-
Hai, D. V., Xiao, X., Chng, E. S., and Li, H. (2012). " Context dependent phone mapping for cross-lingual acoustic modeling.," in Proceedings of ISCSLP, pp. 16-20.
-
(2012)
Proceedings of ISCSLP
, pp. 16-20
-
-
Hai, D.V.1
Xiao, X.2
Chng, E.S.3
Li, H.4
-
16
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
10.1109/MSP.2012.2205597
-
Hinton, G., Deng, L., Yu, D., Dahl, G. E., Mohamed, A., Jaitly, N., Senior, A., Vanhoucke, V., Nguyen, P., Sainath, T. N., and Kingsbury, B. (2012). " Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups.," IEEE Signal Process. Mag. 29 (6), 82-97. 10.1109/MSP.2012.2205597
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
Kingsbury, B.11
-
17
-
-
0036027521
-
Temporal rate change of dialogue speech in prosodic units as compared to read speech
-
10.1016/S0167-6393(01)00028-0
-
Hirose, K., and Kawanami, H. (2002). " Temporal rate change of dialogue speech in prosodic units as compared to read speech.," Speech Commun. 36, 97-111. 10.1016/S0167-6393(01)00028-0
-
(2002)
Speech Commun.
, vol.36
, pp. 97-111
-
-
Hirose, K.1
Kawanami, H.2
-
18
-
-
59649105180
-
Speaker-independent phoneme alignment using transition-dependent states
-
10.1016/j.specom.2008.11.003
-
Hosom, J-P. (2009). " Speaker-independent phoneme alignment using transition-dependent states.," Speech Commun. 51, 352-368. 10.1016/j.specom.2008.11.003
-
(2009)
Speech Commun.
, vol.51
, pp. 352-368
-
-
Hosom, J.-P.1
-
19
-
-
23844450992
-
Segmental and prosodic effects on coda glottalization
-
10.1016/j.wocn.2005.02.004
-
Huffman, M. K. (2005). " Segmental and prosodic effects on coda glottalization.," J. Phonetics 33, 335-362. 10.1016/j.wocn.2005.02.004
-
(2005)
J. Phonetics
, vol.33
, pp. 335-362
-
-
Huffman, M.K.1
-
20
-
-
3643089879
-
On the role of perception in shaping phonological assimilation rules
-
"
-
Hura, S. L., Lindblom, B., and Diehl, R. L. (1992). " On the role of perception in shaping phonological assimilation rules.," Lang. Speech 35 (1-2), 59-72.
-
(1992)
Lang. Speech
, vol.35
, Issue.12
, pp. 59-72
-
-
Hura, S.L.1
Lindblom, B.2
Diehl, R.L.3
-
21
-
-
84883358833
-
Boosting under-resourced speech recognizers by exploiting out of language data - A case study on Afrikaans
-
" in
-
Imseng, D., Bourlard, H., and Garner, P. N. (2012). " Boosting under-resourced speech recognizers by exploiting out of language data-A case study on Afrikaans.," in Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, pp. 60-67.
-
(2012)
Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages
, pp. 60-67
-
-
Imseng, D.1
Bourlard, H.2
Garner, P.N.3
-
22
-
-
35348856844
-
A fusion approach for automatic speech segmentation of large corpora with application to speech synthesis
-
10.1016/j.specom.2007.07.001
-
Jarifi, S., Pastor, D., and Rosec, O. (2008). " A fusion approach for automatic speech segmentation of large corpora with application to speech synthesis.," Speech Commun. 50, 67-80. 10.1016/j.specom.2007.07.001
-
(2008)
Speech Commun.
, vol.50
, pp. 67-80
-
-
Jarifi, S.1
Pastor, D.2
Rosec, O.3
-
24
-
-
84867609577
-
Region dependent linear transforms in multilingual speech recognition
-
" in
-
Karafiát, M., Janda, M., Černocky, J., and Burget, L. (2012). " Region dependent linear transforms in multilingual speech recognition.," in Proceedings from ICASSP 2012, pp. 4885-4888.
-
(2012)
Proceedings from ICASSP 2012
, pp. 4885-4888
-
-
Karafiát, M.1
Janda, M.2
Černocky, J.3
Burget, L.4
-
25
-
-
0001951591
-
The world's languages in crisis
-
10.1353/lan.1992.0075
-
Krauss, M. (1992). " The world's languages in crisis.," Language 68, 4-10. 10.1353/lan.1992.0075
-
(1992)
Language
, vol.68
, pp. 4-10
-
-
Krauss, M.1
-
26
-
-
0031191419
-
The contribution of intonation, segmental durations, and spectral features to the perception of a spontaneous and a read speaking style
-
10.1016/S0167-6393(97)00012-5
-
Laan, G. P. M. (1997). " The contribution of intonation, segmental durations, and spectral features to the perception of a spontaneous and a read speaking style.," Speech Commun. 22, 43-65. 10.1016/S0167-6393(97)00012-5
-
(1997)
Speech Commun.
, vol.22
, pp. 43-65
-
-
Laan, G.P.M.1
-
27
-
-
33745210540
-
Incorporating tone-related MLP posteriors in the feature representation for Mandarin ASR
-
" in, Lisbon, Portugal
-
Lei, X., Hwang, M-Y., and Ostendorf, M. (2005). " Incorporating tone-related MLP posteriors in the feature representation for Mandarin ASR.," in Proceedings of Interspeech-2005, Lisbon, Portugal, pp. 2981-2984.
-
(2005)
Proceedings of Interspeech-2005
, pp. 2981-2984
-
-
Lei, X.1
Hwang, M.-Y.2
Ostendorf, M.3
-
28
-
-
33748865429
-
Automatic segmentation and labeling for Mandarin Chinese speech corpora for concatenation-based TTS
-
"
-
Lin, C-Y., Roger Jang, J-S., Chen, K-T. (2005). " Automatic segmentation and labeling for Mandarin Chinese speech corpora for concatenation-based TTS.," Comput. Ling. Chinese Lang. Process. 10 (2), 145-166.
-
(2005)
Comput. Ling. Chinese Lang. Process.
, vol.10
, Issue.2
, pp. 145-166
-
-
Lin, C.-Y.1
Roger Jang, J.-S.2
Chen, K.-T.3
-
29
-
-
85032775034
-
Subword modeling for automatic speech recognition: Past, present, and emerging approaches
-
Livescu, K., Fosler-Lussier, E., and Metze, F. (2012). " Subword modeling for automatic speech recognition: Past, present, and emerging approaches.," IEEE Signal Process. Mag. November, 44-57.
-
(2012)
IEEE Signal Process. Mag.
, pp. 44-57
-
-
Livescu, K.1
Fosler-Lussier, E.2
Metze, F.3
-
30
-
-
84989382388
-
Prosodic templates in sound change
-
10.1075/dia.14.1.03mac
-
Macken, M. A., and Salmons, J. C. (1997). " Prosodic templates in sound change.," Diachronica 14 (1), 31-66. 10.1075/dia.14.1.03mac
-
(1997)
Diachronica
, vol.14
, Issue.1
, pp. 31-66
-
-
MacKen, M.A.1
Salmons, J.C.2
-
31
-
-
0037850986
-
Phonetic alignment: Speech synthesis-based vs. Viterbi-based
-
10.1016/S0167-6393(02)00131-0
-
Malfrère, F., Deroo, O., Dutoit, T., and Ris, C. (2003). " Phonetic alignment: Speech synthesis-based vs. Viterbi-based.," Speech Commun. 40, 503-515. 10.1016/S0167-6393(02)00131-0
-
(2003)
Speech Commun.
, vol.40
, pp. 503-515
-
-
Malfrère, F.1
Deroo, O.2
Dutoit, T.3
Ris, C.4
-
32
-
-
0008771399
-
Some phonetic bases for the relative malleability of syllable-final versus syllable-initial consonants
-
in, Université de Provence, Aix-en-Provence, Vol. 5
-
Manuel, S. Y. (1991). " Some phonetic bases for the relative malleability of syllable-final versus syllable-initial consonants.," in Proceedings of the 12th International Congress of Phonetic Sciences, Université de Provence, Aix-en-Provence, Vol. 5, pp. 118-121.
-
(1991)
Proceedings of the 12th International Congress of Phonetic Sciences
, pp. 118-121
-
-
Manuel, S.Y.1
-
33
-
-
84970305057
-
Detection of target phonemes in spontaneous and read speech
-
Mehta, G., and Cutler, A. (1988). " Detection of target phonemes in spontaneous and read speech.," Lang. Speech 31 (2), 135-156.
-
(1988)
Lang. Speech
, vol.31
, Issue.2
, pp. 135-156
-
-
Mehta, G.1
Cutler, A.2
-
34
-
-
44949167484
-
-
Ní Chasaide, A., Wogan, J., Ó Raghallaigh, B., Ní Bhriain, Á., Zoerner, E., Berthelsen, H., and Gobl, C. (2006). Speech Technology for Minority Languages: The Case of Irish (Gaelic), in INTERSPEECH-2006, pp. 181-184.
-
(2006)
Speech Technology for Minority Languages: The Case of Irish (Gaelic), in INTERSPEECH-2006
, pp. 181-184
-
-
Ní Chasaide, A.1
Wogan, J.2
Raghallaigh, B.Ó.3
Ní Bhriain, Á.4
Zoerner, E.5
Berthelsen, H.6
Gobl, C.7
-
35
-
-
0003377189
-
The phonetics and phonology of aspects of assimilation
-
Ohala, J. (1990). " The phonetics and phonology of aspects of assimilation.," Papers Lab. Phonol. 1, 258-275
-
(1990)
Papers Lab. Phonol.
, vol.1
, pp. 258-275
-
-
Ohala, J.1
-
36
-
-
84883333361
-
-
R Development Core Team. "R: A language and environment for statistical computing" [computer program], R Foundation for Statistical Computing, Vienna, Austria (date last viewed 10/1/12)
-
R Development Core Team (2012). "R: A language and environment for statistical computing" [computer program], http://www.R-project.org, R Foundation for Statistical Computing, Vienna, Austria (date last viewed 10/1/12).
-
(2012)
-
-
-
37
-
-
0004244302
-
-
(Prentice Hall, Englewood Cliffs, NJ)
-
Rabiner, Lawrence, R., and Juang, B. H. (1993). Fundamentals of Speech Recognition, Prentice-Hall Signal Processing Series (Prentice Hall, Englewood Cliffs, NJ).
-
(1993)
Fundamentals of Speech Recognition, Prentice-Hall Signal Processing Series
-
-
Rabiner1
Lawrence, R.2
Juang, B.H.3
-
39
-
-
0010135126
-
The timing of prenuclear high accents in English
-
in, edited by J. Kingston and M. Beckman, Cascadilla Proceedings Project, Somerville, MA
-
Silverman, K., and Pierrehumbert, J. (1990). " The timing of prenuclear high accents in English.," in Papers in Laboratory Phonology I: Between the Grammar and Physics of Speech, edited by, J. Kingston, and, M. Beckman, Cascadilla Proceedings Project, Somerville, MA, pp. 103-112.
-
(1990)
Papers in Laboratory Phonology I: Between the Grammar and Physics of Speech
, pp. 103-112
-
-
Silverman, K.1
Pierrehumbert, J.2
-
40
-
-
51449101990
-
Robust phone mapping using decision tree clustering for cross-lingual phone recognition
-
in
-
Sim, K. C., and Li, H. (2008a). " Robust phone mapping using decision tree clustering for cross-lingual phone recognition.," in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 4309-4312.
-
(2008)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, pp. 4309-4312
-
-
Sim, K.C.1
Li, H.2
-
41
-
-
84867192907
-
Context-sensitive probabilistic phone mapping model for cross-lingual speech recognition
-
In, International Speech Communication Association (ISCA)
-
Sim, K. C., and Li, H. (2008b). " Context-sensitive probabilistic phone mapping model for cross-lingual speech recognition.," In Proceedings of Interspeech 2008, International Speech Communication Association (ISCA), pp. 2715-2718.
-
(2008)
Proceedings of Interspeech 2008
, pp. 2715-2718
-
-
Sim, K.C.1
Li, H.2
-
42
-
-
84858783740
-
Endangered language families
-
10.1353/lan.2012.0012
-
Whalen, D. H., and Simons, G. F. (2012). " Endangered language families.," Language 88, 155-173. 10.1353/lan.2012.0012
-
(2012)
Language
, vol.88
, pp. 155-173
-
-
Whalen, D.H.1
Simons, G.F.2
-
43
-
-
0026470422
-
Information for Mandarin tones in the amplitude contour and in brief segments
-
10.1159/000261901
-
Whalen, D. H., and Xu, Y. (1992). " Information for Mandarin tones in the amplitude contour and in brief segments.," Phonetica 49, 25-47. 10.1159/000261901
-
(1992)
Phonetica
, vol.49
, pp. 25-47
-
-
Whalen, D.H.1
Xu, Y.2
-
44
-
-
84883363328
-
ModelTalker voice recorder (MTVR) - A system for capturing individual voices for synthetic speech
-
" talk presented at the, Montreal, Canada (August 2-7)
-
Yarrington, D., Pennington, C., Bunnell, H. T., Gray, J., Lilley, J., Nagao, K., and Polikoff, J. (2008). " ModelTalker voice recorder (MTVR)-A system for capturing individual voices for synthetic speech.," talk presented at the ISAAC 13th Biennial Conference, Montreal, Canada (August 2-7).
-
(2008)
ISAAC 13th Biennial Conference
-
-
Yarrington, D.1
Pennington, C.2
Bunnell, H.T.3
Gray, J.4
Lilley, J.5
Nagao, K.6
Polikoff, J.7
-
45
-
-
84937375671
-
-
Cambridge Textbooks in Linguistics (Cambridge University Press, Cambridge, UK)
-
Yip, M. (2002). Tone, Cambridge Textbooks in Linguistics (Cambridge University Press, Cambridge, UK), p. 376.
-
(2002)
Tone
, pp. 376
-
-
Yip, M.1
-
46
-
-
84874902640
-
Speaker identification on the SCOTUS corpus
-
in
-
Yuan, J., and Liberman, M. (2008). " Speaker identification on the SCOTUS corpus.," in Proceedings of Acoustics 2008, pp. 5687-5690.
-
(2008)
Proceedings of Acoustics 2008
, pp. 5687-5690
-
-
Yuan, J.1
Liberman, M.2
-
48
-
-
0342497635
-
Transcription and alignment of the TIMIT database
-
in, edited by H. Fujisaki (Elsevier, Amsterdam)
-
Zue, V. W., and Seneff, S. (1996). " Transcription and alignment of the TIMIT database.," in Recent Research towards Advanced Man-Machine Interface through Spoken Language, edited by, H. Fujisaki, (Elsevier, Amsterdam), pp. 515-525.
-
(1996)
Recent Research Towards Advanced Man-Machine Interface Through Spoken Language
, pp. 515-525
-
-
Zue, V.W.1
Seneff, S.2
-
49
-
-
0025477640
-
Speech database development at MIT: TIMIT and beyond
-
10.1016/0167-6393(90)90010-7
-
Zue, V., Seneff, S., and Glass, J. (1990). " Speech database development at MIT: TIMIT and beyond.," Speech Commun. 9, 351-356. 10.1016/0167-6393(90)90010-7
-
(1990)
Speech Commun.
, vol.9
, pp. 351-356
-
-
Zue, V.1
Seneff, S.2
Glass, J.3
|