메뉴 건너뛰기




Volumn 134, Issue 3, 2013, Pages 2235-2246

Using automatic alignment to analyze endangered language data: Testing the viability of untrained alignment

Author keywords

[No Author keywords available]

Indexed keywords

ALIGNMENT ACCURACY; ALIGNMENT SYSTEM; AUTOMATIC ALIGNMENT; CONTEXTUALIZATION; ENDANGERED LANGUAGES; PHONETIC ANALYSIS; SMALL DATA SET; TARGET LANGUAGE;

EID: 84883392022     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.4816491     Document Type: Article
Times cited : (47)

References (49)
  • 1
    • 79960572268 scopus 로고    scopus 로고
    • Quantifying temporal speech reduction in French using forced speech alignment
    • 10.1016/j.wocn.2010.11.011
    • Adda-Decker, M., and Snoeren, N. D. (2011). " Quantifying temporal speech reduction in French using forced speech alignment.," J. Phonetics 39, 261-270. 10.1016/j.wocn.2010.11.011
    • (2011) J. Phonetics , vol.39 , pp. 261-270
    • Adda-Decker, M.1    Snoeren, N.D.2
  • 3
    • 79961209587 scopus 로고    scopus 로고
    • Collecting and evaluating speech recognition corpora for 11 South African languages
    • 10.1007/s10579-011-9152-1
    • Badenhorst, J., van Heerden, C., Davel, M., and Barnard, E. (2011). " Collecting and evaluating speech recognition corpora for 11 South African languages.," Lang. Res. Eval. 45 (3), 289-309. 10.1007/s10579-011-9152-1
    • (2011) Lang. Res. Eval. , vol.45 , Issue.3 , pp. 289-309
    • Badenhorst, J.1    Van Heerden, C.2    Davel, M.3    Barnard, E.4
  • 4
    • 30644460082 scopus 로고    scopus 로고
    • Fitting linear mixed models in R
    • Bates, D. M. (2005). " Fitting linear mixed models in R.," R News 5, 27-30.
    • (2005) R News , vol.5 , pp. 27-30
    • Bates, D.M.1
  • 5
    • 0032744759 scopus 로고    scopus 로고
    • Perception of coarticulatory nasalization by speakers of English and Thai: Evidence for partial compensation
    • 10.1121/1.428111
    • Beddor, P. S., and Krakow, R. A. (1999). " Perception of coarticulatory nasalization by speakers of English and Thai: Evidence for partial compensation.," J. Acoust. Soc. Am. 106 (5), 2868-2887. 10.1121/1.428111
    • (1999) J. Acoust. Soc. Am. , vol.106 , Issue.5 , pp. 2868-2887
    • Beddor, P.S.1    Krakow, R.A.2
  • 7
    • 84883409481 scopus 로고    scopus 로고
    • Praat: Doing phonetics by computer" [computer program], (date last viewed 10/1/12)
    • Boersma, P., and Weenink, D. (2012). "Praat: Doing phonetics by computer" [computer program], www.praat.org (date last viewed 10/1/12).
    • (2012)
    • Boersma, P.1    Weenink, D.2
  • 11
    • 85011436241 scopus 로고    scopus 로고
    • Czech
    • 10.1017/S0025100300005442
    • Dankovičová, J. (1997). " Czech.," J. Int. Phonetic Assoc. 27 (1), 77-80. 10.1017/S0025100300005442
    • (1997) J. Int. Phonetic Assoc. , vol.27 , Issue.1 , pp. 77-80
    • Dankovičová, J.1
  • 13
    • 84935322488 scopus 로고
    • The discourse basis of ergativity
    • 10.2307/415719
    • Du Bois, J. W. (1987). " The discourse basis of ergativity.," Language 63 (4), 805-855. 10.2307/415719
    • (1987) Language , vol.63 , Issue.4 , pp. 805-855
    • Du Bois, J.W.1
  • 15
    • 84874464263 scopus 로고    scopus 로고
    • Context dependent phone mapping for cross-lingual acoustic modeling
    • " in
    • Hai, D. V., Xiao, X., Chng, E. S., and Li, H. (2012). " Context dependent phone mapping for cross-lingual acoustic modeling.," in Proceedings of ISCSLP, pp. 16-20.
    • (2012) Proceedings of ISCSLP , pp. 16-20
    • Hai, D.V.1    Xiao, X.2    Chng, E.S.3    Li, H.4
  • 17
    • 0036027521 scopus 로고    scopus 로고
    • Temporal rate change of dialogue speech in prosodic units as compared to read speech
    • 10.1016/S0167-6393(01)00028-0
    • Hirose, K., and Kawanami, H. (2002). " Temporal rate change of dialogue speech in prosodic units as compared to read speech.," Speech Commun. 36, 97-111. 10.1016/S0167-6393(01)00028-0
    • (2002) Speech Commun. , vol.36 , pp. 97-111
    • Hirose, K.1    Kawanami, H.2
  • 18
    • 59649105180 scopus 로고    scopus 로고
    • Speaker-independent phoneme alignment using transition-dependent states
    • 10.1016/j.specom.2008.11.003
    • Hosom, J-P. (2009). " Speaker-independent phoneme alignment using transition-dependent states.," Speech Commun. 51, 352-368. 10.1016/j.specom.2008.11.003
    • (2009) Speech Commun. , vol.51 , pp. 352-368
    • Hosom, J.-P.1
  • 19
    • 23844450992 scopus 로고    scopus 로고
    • Segmental and prosodic effects on coda glottalization
    • 10.1016/j.wocn.2005.02.004
    • Huffman, M. K. (2005). " Segmental and prosodic effects on coda glottalization.," J. Phonetics 33, 335-362. 10.1016/j.wocn.2005.02.004
    • (2005) J. Phonetics , vol.33 , pp. 335-362
    • Huffman, M.K.1
  • 20
    • 3643089879 scopus 로고
    • On the role of perception in shaping phonological assimilation rules
    • "
    • Hura, S. L., Lindblom, B., and Diehl, R. L. (1992). " On the role of perception in shaping phonological assimilation rules.," Lang. Speech 35 (1-2), 59-72.
    • (1992) Lang. Speech , vol.35 , Issue.12 , pp. 59-72
    • Hura, S.L.1    Lindblom, B.2    Diehl, R.L.3
  • 22
    • 35348856844 scopus 로고    scopus 로고
    • A fusion approach for automatic speech segmentation of large corpora with application to speech synthesis
    • 10.1016/j.specom.2007.07.001
    • Jarifi, S., Pastor, D., and Rosec, O. (2008). " A fusion approach for automatic speech segmentation of large corpora with application to speech synthesis.," Speech Commun. 50, 67-80. 10.1016/j.specom.2007.07.001
    • (2008) Speech Commun. , vol.50 , pp. 67-80
    • Jarifi, S.1    Pastor, D.2    Rosec, O.3
  • 24
    • 84867609577 scopus 로고    scopus 로고
    • Region dependent linear transforms in multilingual speech recognition
    • " in
    • Karafiát, M., Janda, M., Černocky, J., and Burget, L. (2012). " Region dependent linear transforms in multilingual speech recognition.," in Proceedings from ICASSP 2012, pp. 4885-4888.
    • (2012) Proceedings from ICASSP 2012 , pp. 4885-4888
    • Karafiát, M.1    Janda, M.2    Černocky, J.3    Burget, L.4
  • 25
    • 0001951591 scopus 로고
    • The world's languages in crisis
    • 10.1353/lan.1992.0075
    • Krauss, M. (1992). " The world's languages in crisis.," Language 68, 4-10. 10.1353/lan.1992.0075
    • (1992) Language , vol.68 , pp. 4-10
    • Krauss, M.1
  • 26
    • 0031191419 scopus 로고    scopus 로고
    • The contribution of intonation, segmental durations, and spectral features to the perception of a spontaneous and a read speaking style
    • 10.1016/S0167-6393(97)00012-5
    • Laan, G. P. M. (1997). " The contribution of intonation, segmental durations, and spectral features to the perception of a spontaneous and a read speaking style.," Speech Commun. 22, 43-65. 10.1016/S0167-6393(97)00012-5
    • (1997) Speech Commun. , vol.22 , pp. 43-65
    • Laan, G.P.M.1
  • 27
    • 33745210540 scopus 로고    scopus 로고
    • Incorporating tone-related MLP posteriors in the feature representation for Mandarin ASR
    • " in, Lisbon, Portugal
    • Lei, X., Hwang, M-Y., and Ostendorf, M. (2005). " Incorporating tone-related MLP posteriors in the feature representation for Mandarin ASR.," in Proceedings of Interspeech-2005, Lisbon, Portugal, pp. 2981-2984.
    • (2005) Proceedings of Interspeech-2005 , pp. 2981-2984
    • Lei, X.1    Hwang, M.-Y.2    Ostendorf, M.3
  • 28
    • 33748865429 scopus 로고    scopus 로고
    • Automatic segmentation and labeling for Mandarin Chinese speech corpora for concatenation-based TTS
    • "
    • Lin, C-Y., Roger Jang, J-S., Chen, K-T. (2005). " Automatic segmentation and labeling for Mandarin Chinese speech corpora for concatenation-based TTS.," Comput. Ling. Chinese Lang. Process. 10 (2), 145-166.
    • (2005) Comput. Ling. Chinese Lang. Process. , vol.10 , Issue.2 , pp. 145-166
    • Lin, C.-Y.1    Roger Jang, J.-S.2    Chen, K.-T.3
  • 29
    • 85032775034 scopus 로고    scopus 로고
    • Subword modeling for automatic speech recognition: Past, present, and emerging approaches
    • Livescu, K., Fosler-Lussier, E., and Metze, F. (2012). " Subword modeling for automatic speech recognition: Past, present, and emerging approaches.," IEEE Signal Process. Mag. November, 44-57.
    • (2012) IEEE Signal Process. Mag. , pp. 44-57
    • Livescu, K.1    Fosler-Lussier, E.2    Metze, F.3
  • 30
    • 84989382388 scopus 로고    scopus 로고
    • Prosodic templates in sound change
    • 10.1075/dia.14.1.03mac
    • Macken, M. A., and Salmons, J. C. (1997). " Prosodic templates in sound change.," Diachronica 14 (1), 31-66. 10.1075/dia.14.1.03mac
    • (1997) Diachronica , vol.14 , Issue.1 , pp. 31-66
    • MacKen, M.A.1    Salmons, J.C.2
  • 31
    • 0037850986 scopus 로고    scopus 로고
    • Phonetic alignment: Speech synthesis-based vs. Viterbi-based
    • 10.1016/S0167-6393(02)00131-0
    • Malfrère, F., Deroo, O., Dutoit, T., and Ris, C. (2003). " Phonetic alignment: Speech synthesis-based vs. Viterbi-based.," Speech Commun. 40, 503-515. 10.1016/S0167-6393(02)00131-0
    • (2003) Speech Commun. , vol.40 , pp. 503-515
    • Malfrère, F.1    Deroo, O.2    Dutoit, T.3    Ris, C.4
  • 32
    • 0008771399 scopus 로고
    • Some phonetic bases for the relative malleability of syllable-final versus syllable-initial consonants
    • in, Université de Provence, Aix-en-Provence, Vol. 5
    • Manuel, S. Y. (1991). " Some phonetic bases for the relative malleability of syllable-final versus syllable-initial consonants.," in Proceedings of the 12th International Congress of Phonetic Sciences, Université de Provence, Aix-en-Provence, Vol. 5, pp. 118-121.
    • (1991) Proceedings of the 12th International Congress of Phonetic Sciences , pp. 118-121
    • Manuel, S.Y.1
  • 33
    • 84970305057 scopus 로고
    • Detection of target phonemes in spontaneous and read speech
    • Mehta, G., and Cutler, A. (1988). " Detection of target phonemes in spontaneous and read speech.," Lang. Speech 31 (2), 135-156.
    • (1988) Lang. Speech , vol.31 , Issue.2 , pp. 135-156
    • Mehta, G.1    Cutler, A.2
  • 35
    • 0003377189 scopus 로고
    • The phonetics and phonology of aspects of assimilation
    • Ohala, J. (1990). " The phonetics and phonology of aspects of assimilation.," Papers Lab. Phonol. 1, 258-275
    • (1990) Papers Lab. Phonol. , vol.1 , pp. 258-275
    • Ohala, J.1
  • 36
    • 84883333361 scopus 로고    scopus 로고
    • R Development Core Team. "R: A language and environment for statistical computing" [computer program], R Foundation for Statistical Computing, Vienna, Austria (date last viewed 10/1/12)
    • R Development Core Team (2012). "R: A language and environment for statistical computing" [computer program], http://www.R-project.org, R Foundation for Statistical Computing, Vienna, Austria (date last viewed 10/1/12).
    • (2012)
  • 39
    • 0010135126 scopus 로고
    • The timing of prenuclear high accents in English
    • in, edited by J. Kingston and M. Beckman, Cascadilla Proceedings Project, Somerville, MA
    • Silverman, K., and Pierrehumbert, J. (1990). " The timing of prenuclear high accents in English.," in Papers in Laboratory Phonology I: Between the Grammar and Physics of Speech, edited by, J. Kingston, and, M. Beckman, Cascadilla Proceedings Project, Somerville, MA, pp. 103-112.
    • (1990) Papers in Laboratory Phonology I: Between the Grammar and Physics of Speech , pp. 103-112
    • Silverman, K.1    Pierrehumbert, J.2
  • 41
    • 84867192907 scopus 로고    scopus 로고
    • Context-sensitive probabilistic phone mapping model for cross-lingual speech recognition
    • In, International Speech Communication Association (ISCA)
    • Sim, K. C., and Li, H. (2008b). " Context-sensitive probabilistic phone mapping model for cross-lingual speech recognition.," In Proceedings of Interspeech 2008, International Speech Communication Association (ISCA), pp. 2715-2718.
    • (2008) Proceedings of Interspeech 2008 , pp. 2715-2718
    • Sim, K.C.1    Li, H.2
  • 42
    • 84858783740 scopus 로고    scopus 로고
    • Endangered language families
    • 10.1353/lan.2012.0012
    • Whalen, D. H., and Simons, G. F. (2012). " Endangered language families.," Language 88, 155-173. 10.1353/lan.2012.0012
    • (2012) Language , vol.88 , pp. 155-173
    • Whalen, D.H.1    Simons, G.F.2
  • 43
    • 0026470422 scopus 로고
    • Information for Mandarin tones in the amplitude contour and in brief segments
    • 10.1159/000261901
    • Whalen, D. H., and Xu, Y. (1992). " Information for Mandarin tones in the amplitude contour and in brief segments.," Phonetica 49, 25-47. 10.1159/000261901
    • (1992) Phonetica , vol.49 , pp. 25-47
    • Whalen, D.H.1    Xu, Y.2
  • 44
    • 84883363328 scopus 로고    scopus 로고
    • ModelTalker voice recorder (MTVR) - A system for capturing individual voices for synthetic speech
    • " talk presented at the, Montreal, Canada (August 2-7)
    • Yarrington, D., Pennington, C., Bunnell, H. T., Gray, J., Lilley, J., Nagao, K., and Polikoff, J. (2008). " ModelTalker voice recorder (MTVR)-A system for capturing individual voices for synthetic speech.," talk presented at the ISAAC 13th Biennial Conference, Montreal, Canada (August 2-7).
    • (2008) ISAAC 13th Biennial Conference
    • Yarrington, D.1    Pennington, C.2    Bunnell, H.T.3    Gray, J.4    Lilley, J.5    Nagao, K.6    Polikoff, J.7
  • 45
    • 84937375671 scopus 로고    scopus 로고
    • Cambridge Textbooks in Linguistics (Cambridge University Press, Cambridge, UK)
    • Yip, M. (2002). Tone, Cambridge Textbooks in Linguistics (Cambridge University Press, Cambridge, UK), p. 376.
    • (2002) Tone , pp. 376
    • Yip, M.1
  • 46
    • 84874902640 scopus 로고    scopus 로고
    • Speaker identification on the SCOTUS corpus
    • in
    • Yuan, J., and Liberman, M. (2008). " Speaker identification on the SCOTUS corpus.," in Proceedings of Acoustics 2008, pp. 5687-5690.
    • (2008) Proceedings of Acoustics 2008 , pp. 5687-5690
    • Yuan, J.1    Liberman, M.2
  • 49
    • 0025477640 scopus 로고
    • Speech database development at MIT: TIMIT and beyond
    • 10.1016/0167-6393(90)90010-7
    • Zue, V., Seneff, S., and Glass, J. (1990). " Speech database development at MIT: TIMIT and beyond.," Speech Commun. 9, 351-356. 10.1016/0167-6393(90)90010-7
    • (1990) Speech Commun. , vol.9 , pp. 351-356
    • Zue, V.1    Seneff, S.2    Glass, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.