



Volume 8, Issue 1, 2011, Pages

Speech retrieval from unsegmented Finnish audio using statistical morpheme-like units for segmentation, recognition, and retrieval

Author keywords

Confusion networks; Lattices; Morphemes; Spoken document retrieval; Story segmentation; Subword indexing; Topic segmentation

Indexed keywords

CONFUSION NETWORKS; MORPHEMES; SPOKEN DOCUMENT RETRIEVAL; STORY SEGMENTATION; SUBWORDS; TOPIC SEGMENTATION;

EID: 80455130027     PISSN: 15504875     EISSN: 15504883     Source Type: Journal    
DOI: 10.1145/2036916.2036917     Document Type: Article
Times cited: 12

References (63)
  • 2
    • 0042337326
    • From Plain Character Strings to Meaningful Words: Producing Better Full Text Databases for Inflectional and Compounding Languages with Morphological Analysis Software
    • DOI 10.1023/A:1011942104443
    • ALKULA, R. 2001. From plain character strings to meaningful words: Producing better full text databases for inflectional and compounding languages with morphological analysis software. Inform. Retrieval 4, 195-208. (Pubitemid 33642148)
    • (2001) Information Retrieval , vol.4 , Issue.3-4 , pp. 195-208
    • Alkula, R.1
  • 4
    • 0032674505
    • Statistical models for text segmentation
    • BEEFERMAN, D., BERGER, A., AND LAFFERTY, J. 1999. Statistical models for text segmentation. Machine Learn. 34, 1, 177-210.
    • (1999) Machine Learn. , vol.34 , Issue.1 , pp. 177-210
    • Beeferman, D.1    Berger, A.2    Lafferty, J.3
  • 7
    • 33847607574
    • Soft indexing of speech content for search in spoken documents
    • DOI 10.1016/j.csl.2006.09.001, PII S0885230806000313
    • CHELBA, C., SILVA, J., AND ACERO, A. 2007. Soft indexing of speech content for search in spoken documents. Comput. Speech Lang. 21, 3, 458-478. (Pubitemid 46367509)
    • (2007) Computer Speech and Language , vol.21 , Issue.3 , pp. 458-478
    • Chelba, C.1    Silva, J.2    Acero, A.3
  • 8
    • 79851497439
    • Statistical lattice-based spoken document retrieval
    • CHIA, T. K., SIM, K. C., LI, H., AND NG, H. T. 2010. Statistical lattice-based spoken document retrieval. ACM Trans. Inf. Syst. 28, 1, 1-30.
    • (2010) ACM Trans. Inf. Syst. , vol.28 , Issue.1 , pp. 1-30
    • Chia, T.K.1    Sim, K.C.2    Li, H.3    Ng, H.T.4
  • 10
    • 33750359664
    • Unsupervised morpheme segmentation and morphology induction from text corpora using Morfessor 1.0
    • Publications in Computer and Information Science, Helsinki University of Technology
    • CREUTZ, M. AND LAGUS, K. 2005. Unsupervised morpheme segmentation and morphology induction from text corpora using Morfessor 1.0. Tech. rep. A81, Publications in Computer and Information Science, Helsinki University of Technology. http://www.cis.hut.fi/projects/morpho/.
    • (2005) Tech. rep. , vol.A81
    • Creutz, M.1    Lagus, K.2
  • 11
    • 80455151260
    • CSC TIETEELLINEN LASKENTA OY
    • CSC TIETEELLINEN LASKENTA OY. 2007. Finnish language text bank. http://www.csc.fi/kielipankki/.
    • (2007) Finnish Language Text Bank
  • 15
    • 0001819680
    • TextTiling: Segmenting Text into Multi-paragraph Subtopic Passages
    • HEARST, M. A. 1997. TextTiling: segmenting text into multi-paragraph subtopic passages. Comput. Linguist. 23, 1, 33-64. (Pubitemid 127458657)
    • (1997) Computational Linguistics , vol.23 , Issue.1 , pp. 33-64
    • Hearst, M.A.1
  • 16
    • 33746524944
    • Unlimited vocabulary speech recognition with morph language models applied to Finnish
    • DOI 10.1016/j.csl.2005.07.002, PII S0885230805000331
    • HIRSIMÄKI, T., CREUTZ, M., SIIVOLA, V., KURIMO, M., VIRPIOJA, S., AND PYLKKÖNEN, J. 2006. Unlimited vocabulary speech recognition with morph language models applied to Finnish. Computer Speech Lang. 20, 4, 515-541. (Pubitemid 44142005)
    • (2006) Computer Speech and Language , vol.20 , Issue.4 , pp. 515-541
    • Hirsimäki, T.1    Creutz, M.2    Siivola, V.3    Kurimo, M.4    Virpioja, S.5    Pylkkönen, J.6
  • 17
  • 19
    • 0027725490
    • Using statistical testing in the evaluation of retrieval experiments
    • ACM Press, New York, NY
    • HULL, D. A. 1993. Using statistical testing in the evaluation of retrieval experiments. In Proceedings of SIGIR. ACM Press, New York, NY, 329-338.
    • (1993) Proceedings of SIGIR , pp. 329-338
    • Hull, D.A.1
  • 28
    • 70349845164
    • Morpho Challenge evaluation using a linguistic gold standard
    • (Revised Selected Papers). Lecture Notes in Computer Science, Springer, Berlin
    • KURIMO, M., CREUTZ, M., AND VARJOKALLIO, M. 2008. Morpho Challenge evaluation using a linguistic gold standard. In Proceedings of the 8th Workshop of the Cross-Language Evaluation Forum (CLEF'07). (Revised Selected Papers). Lecture Notes in Computer Science, Vol. 5152, Springer, Berlin, 864-873.
    • (2008) Proceedings of the 8th Workshop of the Cross-Language Evaluation Forum (CLEF'07) , vol.5152 , pp. 864-873
    • Kurimo, M.1    Creutz, M.2    Varjokallio, M.3
  • 29
    • 33745225206
    • To recover from speech recognition errors in spoken document retrieval
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • KURIMO, M. AND TURUNEN, V. 2005. To recover from speech recognition errors in spoken document retrieval. In Proceedings of Interspeech. 605-608. (Pubitemid 43908135)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 605-608
    • Kurimo, M.1    Turunen, V.2
  • 30
    • 51449124319
    • An evaluation of a spoken document retrieval baseline system in Finnish
    • KURIMO, M., TURUNEN, V., AND EKMAN, I. 2004. An evaluation of a spoken document retrieval baseline system in Finnish. In Proceedings of Interspeech.
    • (2004) Proceedings of Interspeech
    • Kurimo, M.1    Turunen, V.2    Ekman, I.3
  • 32
    • 80455123198
    • LINGSOFT INC. 2007 FINTWOL: Finnish morphological analyser [computer software]
    • LINGSOFT, INC. 2007. FINTWOL: Finnish morphological analyser [computer software]. http://www.lingsoft.fi/.
  • 33
    • 33750329509
    • One-sided measures for evaluating ranked retrieval effectiveness with spontaneous conversational speech
    • ACM, New York, NY
    • LIU, B. AND OARD, D. W. 2006. One-sided measures for evaluating ranked retrieval effectiveness with spontaneous conversational speech. In Proceedings of SIGIR. ACM, New York, NY, 673-674.
    • (2006) Proceedings of SIGIR , pp. 673-674
    • Liu, B.1    Oard, D.W.2
  • 36
    • 33750331971
    • Spoken document retrieval from call-center conversations
    • ACM Press, New York, NY
    • MAMOU, J., CARMEL, D., AND HOORY, R. 2006. Spoken document retrieval from call-center conversations. In Proceedings of SIGIR. ACM Press, New York, NY, 51-58.
    • (2006) Proceedings of SIGIR , pp. 51-58
    • Mamou, J.1    Carmel, D.2    Hoory, R.3
  • 37
    • 0034296009
    • Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
    • MANGU, L., BRILL, E., AND STOLCKE, A. 2000. Finding consensus in speech recognition: word error minimization and other applications of confusion networks. Comput. Speech Lang. 14, 373-400.
    • (2000) Comput. Speech Lang. , vol.14 , pp. 373-400
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 40
    • 1542370048
    • Experiments using the Lemur toolkit
    • National Institute of Standards and Technology
    • OGILVIE, P. AND CALLAN, J. 2002. Experiments using the Lemur toolkit. In Proceedings of TREC. National Institute of Standards and Technology. 103-108.
    • (2002) Proceedings of TREC , pp. 103-108
    • Ogilvie, P.1    Callan, J.2
  • 41
    • 43849093302
    • Analytical comparison between position specific posterior lattices and confusion networks based on words and subword units for spoken document indexing
    • PAN, Y. C., CHANG, H. L., AND LEE, L. S. 2007. Analytical comparison between position specific posterior lattices and confusion networks based on words and subword units for spoken document indexing. In Proceedings of the Workshop on Automatic Speech Recognition and Understanding (ASRU). 677-682.
    • (2007) Proceedings of the Workshop on Automatic Speech Recognition and Understanding (ASRU) , pp. 677-682
    • Pan, Y.C.1    Chang, H.L.2    Lee, L.S.3
  • 42
    • 0001409997
    • Discourse Segmentation by Human and Automated Means
    • PASSONNEAU, R. J. AND LITMAN, D. J. 1997. Discourse segmentation by human and automated means. Comput. Linguist. 23, 1, 103-139. (Pubitemid 127458659)
    • (1997) Computational Linguistics , vol.23 , Issue.1 , pp. 103-139
    • Passonneau, R.J.1    Litman, D.J.2
  • 44
    • 84993016582
    • Morphological typology of languages for IR
    • DOI 10.1108/EUM0000000007085
    • PIRKOLA, A. 2001. Morphological typology of languages for IR. J. Document. 57, 3, 330-348. (Pubitemid 33259201)
    • (2001) Journal of Documentation , vol.57 , Issue.3 , pp. 330-348
    • Pirkola, A.1
  • 47
    • 85050187568
    • Lattice-based search for spoken utterance retrieval
    • SARAÇLAR, M. AND SPROAT, R. 2004. Lattice-based search for spoken utterance retrieval. In Proceedings of HLT-NAACL. 129-136.
    • (2004) Proceedings of HLT-NAACL , pp. 129-136
    • Saraçlar, M.1    Sproat, R.2
  • 48
    • 0034275920
    • Prosody-based automatic segmentation of speech into sentences and topics
    • SHRIBERG, E., STOLCKE, A., HAKKANI-TÜR, D., AND TÜR, G. 2000. Prosody-based automatic segmentation of speech into sentences and topics. Speech Comm. 32, 1-2, 127-154.
    • (2000) Speech Comm. , vol.32 , Issue.1-2 , pp. 127-154
    • Shriberg, E.1    Stolcke, A.2    Hakkani-Tür, D.3    Tür, G.4
  • 50
    • 85086132755
    • Morfessor and VariKN machine learning tools for speech and language technology
    • SIIVOLA, V., CREUTZ, M., AND KURIMO, M. 2007. Morfessor and VariKN machine learning tools for speech and language technology. In Proceedings of Interspeech.
    • (2007) Proceedings of Interspeech
    • Siivola, V.1    Creutz, M.2    Kurimo, M.3
  • 53
    • 1842479362
    • Select: A lexical cohesion based news story segmentation system
    • STOKES, N. 2004. Select: a lexical cohesion based news story segmentation system. AI Comm. 17, 1, 3-12.
    • (2004) AI Comm. , vol.17 , Issue.1 , pp. 3-12
    • Stokes, N.1
  • 55
    • 18044403021
    • Integrating prosodic and lexical cues for automatic topic segmentation
    • DOI 10.1162/089120101300346796
    • TÜR, G., HAKKANI-TÜR, D., STOLCKE, A., AND SHRIBERG, E. 2001. Integrating prosodic and lexical cues for automatic topic segmentation. Computat. Ling. 27, 1, 31-57. (Pubitemid 33597441)
    • (2001) Computational Linguistics , vol.27 , Issue.1 , pp. 31-57
    • Tür, G.1    Stolcke, A.2    Hakkani-Tür, D.3    Shriberg, E.4
  • 56
    • 84867205123
    • Reducing the effect of OOV query words by using morph-based spoken document retrieval
    • TURUNEN, V. T. 2008. Reducing the effect of OOV query words by using morph-based spoken document retrieval. In Proceedings of Interspeech. 2158-2161.
    • (2008) Proceedings of Interspeech. , pp. 2158-2161
    • Turunen, V.T.1
  • 57
    • 36448944395
    • Using latent semantic indexing for morph-based spoken document retrieval
    • TURUNEN, V. T. AND KURIMO, M. 2006. Using latent semantic indexing for morph-based spoken document retrieval. In Proceedings of Interspeech. 341-344.
    • (2006) Proceedings of Interspeech. , pp. 341-344
    • Turunen, V.T.1    Kurimo, M.2
  • 60
    • 45449098251
    • Multi-Scale TextTiling for automatic story segmentation in Chinese broadcast news
    • Springer, Berlin
    • XIE, L., ZENG, J., AND FENG, W. 2008. Multi-Scale TextTiling for automatic story segmentation in Chinese broadcast news. In Information Retrieval Technology, Springer, Berlin, 345-355.
    • (2008) Information Retrieval Technology , pp. 345-355
    • Xie, L.1    Zeng, J.2    Feng, W.3
  • 62
    • 85009089367
    • A hybrid-word/phoneme-based approach for improved vocabulary-independent search in spontaneous speech
    • YU, P. AND SEIDE, F. 2004. A hybrid-word/phoneme-based approach for improved vocabulary-independent search in spontaneous speech. In Proceedings of Interspeech. 293-296.
    • (2004) Proceedings of Interspeech. , pp. 293-296
    • Yu, P.1    Seide, F.2
  • 63
    • 84863337904
    • Towards spoken-document retrieval for the internet: Lattice indexing for large-scale web-search architectures
    • ACL, New York, New York
    • ZHOU, Z., YU, P., CHELBA, C., AND SEIDE, F. 2006. Towards spoken-document retrieval for the internet: lattice indexing for large-scale web-search architectures. In Proceedings of HLT-NAACL. ACL, New York, New York, 415-422.
    • (2006) Proceedings of HLT-NAACL , pp. 415-422
    • Zhou, Z.1    Yu, P.2    Chelba, C.3    Seide, F.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.