Volume 19, Issue 4, 2011, Pages 688-698

Stochastic pronunciation modeling for out-of-vocabulary spoken term detection

Author keywords

Letter to sound; out of vocabulary (OOV); pronunciation modeling; speech recognition; spoken term detection (STD)

Indexed keywords

DEGREE OF UNCERTAINTY; FALSE ALARM PROBABILITY; HARD TASK; LETTER-TO-SOUND; OPERATING REGIONS; OUT-OF-VOCABULARY (OOV); PERFORMANCE GAIN; PROBABILISTIC MODELS; PRONUNCIATION MODELING; SEARCH TERMS; SPECIAL PROPERTIES; SPOKEN TERM DETECTION (STD); SUBWORD UNITS;

EID: 79951661301     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2058800     Document Type: Article
Times cited : (12)

References (56)
  • 1
    • 84872321262
    • The Spoken term detection (STD) 2006 evaluation plan
    • Gaithersburg, MD, Sep.
    • NIST, "The Spoken term detection (STD) 2006 evaluation plan," 10th ed. National Institute of Standards and Technology (NIST). Gaithersburg, MD, Sep. 2006 [Online]. Available: http://www.nist.gov/speech/tests/std
    • (2006) 10th Ed. National Institute of Standards and Technology (NIST)
  • 2
    • 84867228727
    • Phonetic query expansion for spoken document retrieval
    • Brisbane, Australia, Sep.
    • J. Mamou and B. Ramabhadran, "Phonetic query expansion for spoken document retrieval," in Proc. Interspeech'08, Brisbane, Australia, Sep. 2008, pp. 2106-2109.
    • (2008) Proc. Interspeech'08 , pp. 2106-2109
    • Mamou, J.1    Ramabhadran, B.2
  • 3
    • 70349211775
    • Effect of pronunciations on OOV queries in spoken term detection
    • Taipei, Taiwan, Apr.
    • D. Can, E. Cooper, A. Sethy, C. White, B. Ramabhadran, and M. Saraclar, "Effect of pronunciations on OOV queries in spoken term detection," in Proc. ICASSP'09, Taipei, Taiwan, Apr. 2009, pp. 3957-3960.
    • (2009) Proc. ICASSP'09 , pp. 3957-3960
    • Can, D.1    Cooper, E.2    Sethy, A.3    White, C.4    Ramabhadran, B.5    Saraclar, M.6
  • 6
    • 51449101963
    • Open-vocabulary spoken term detection using graphone-based hybrid recognition systems
    • Las Vegas, NV, Mar.
    • M. Akbacak, D. Vergyri, and A. Stolcke, "Open-vocabulary spoken term detection using graphone-based hybrid recognition systems," in Proc. ICASSP'08, Las Vegas, NV, Mar. 2008, pp. 5240-5243.
    • (2008) Proc. ICASSP'08 , pp. 5240-5243
    • Akbacak, M.1    Vergyri, D.2    Stolcke, A.3
  • 10
    • 51449122583
    • Fusing multiple systems into a compact lattice index for Chinese spoken term detection
    • Las Vegas, NV, Mar.
    • S. Meng, P. Yu, J. Liu, and F. Seide, "Fusing multiple systems into a compact lattice index for Chinese spoken term detection," in Proc. ICASSP'08, Las Vegas, NV, Mar. 2008, pp. 4345-4348.
    • (2008) Proc. ICASSP'08 , pp. 4345-4348
    • Meng, S.1    Yu, P.2    Liu, J.3    Seide, F.4
  • 11
    • 54249103198
    • Rapid yet accurate speech indexing using dynamic match lattice spotting
    • Jan.
    • K. Thambiratnam and S. Sridharan, "Rapid yet accurate speech indexing using dynamic match lattice spotting," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 346-357, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 346-357
    • Thambiratnam, K.1    Sridharan, S.2
  • 12
    • 0031630644
    • Using word probabilities as confidence measures
    • Seattle, WA, May
    • F. Wessel, K. Macherey, and R. Schlüter, "Using word probabilities as confidence measures," in Proc. ICASSP'98, Seattle, WA, May 1998, vol. 1, pp. 225-228.
    • (1998) Proc. ICASSP'98 , vol.1 , pp. 225-228
    • Wessel, F.1    Macherey, K.2    Schlüter, R.3
  • 16
    • 85046873967
    • The DET curve in assessment of detection task performance
    • Rhodes, Greece, Sep.
    • A. Martin, G. Doddington, T. Kamm, M. Ordowski, and M. Przybocki, "The DET curve in assessment of detection task performance," in Proc. Eurospeech'97, Rhodes, Greece, Sep. 1997, vol. 4, pp. 1895-1898.
    • (1997) Proc. Eurospeech'97 , vol.4 , pp. 1895-1898
    • Martin, A.1    Doddington, G.2    Kamm, T.3    Ordowski, M.4    Przybocki, M.5
  • 19
    • 35348912517
    • An experimental study of an audio indexing system for the web
    • Beijing, China, Oct.
    • B. Logan, P. Moreno, J.-M. V. Thong, and E. Whittaker, "An experimental study of an audio indexing system for the web," in Proc. ICSLP'00, Beijing, China, Oct. 2000, vol. 2, pp. 676-679.
    • (2000) Proc. ICSLP'00 , vol.2 , pp. 676-679
    • Logan, B.1    Moreno, P.2    Thong, J.-M.V.3    Whittaker, E.4
  • 22
    • 34547521512
    • Keyword search using modified minimum edit distance measure
    • Honolulu, HI, Apr.
    • K. Audhkhasi and A. Verma, "Keyword search using modified minimum edit distance measure," in Proc. ICASSP'07, Honolulu, HI, Apr. 2007, vol. 4, pp. 929-932.
    • (2007) Proc. ICASSP'07 , vol.4 , pp. 929-932
    • Audhkhasi, K.1    Verma, A.2
  • 23
    • 70450153437
    • Fast approximate spoken term detection from sequence of phonemes
    • Singapore, Jul., Assoc. for Computing Machinery
    • J. Pinto, I. Szöke, S. Prasanna, and H. Heřmanský, "Fast approximate spoken term detection from sequence of phonemes," in Proc. 31st Annu. Int. ACM SIGIR Conf., Singapore, Jul. 2008, pp. 28-33, Assoc. for Computing Machinery.
    • (2008) Proc. 31st Annu. Int. ACM SIGIR Conf. , pp. 28-33
    • Pinto, J.1    Szöke, I.2    Prasanna, S.3    Heřmanský, H.4
  • 24
    • 70349215651
    • Spoken term detection using fast phonetic decoding
    • Taipei, Taiwan, Apr.
    • R. Wallace, R. Vogt, and S. Sridharan, "Spoken term detection using fast phonetic decoding," in Proc. ICASSP'09, Taipei, Taiwan, Apr. 2009, pp. 4881-4884.
    • (2009) Proc. ICASSP'09 , pp. 4881-4884
    • Wallace, R.1    Vogt, R.2    Sridharan, S.3
  • 25
    • 70450204389
    • Stochastic pronunciation modelling for spoken term detection
    • Brighton, U.K., Sep.
    • D. Wang, S. King, and J. Frankel, "Stochastic pronunciation modelling for spoken term detection," in Proc. Interspeech'09, Brighton, U.K., Sep. 2009, pp. 2135-2138.
    • (2009) Proc. Interspeech'09 , pp. 2135-2138
    • Wang, D.1    King, S.2    Frankel, J.3
  • 26
    • 78049355862
    • Stochastic pronunciation modelling and soft match for out-of-vocabulary spoken term detection
    • Dallas, TX, Mar.
    • D. Wang, S. King, J. Frankel, and P. Bell, "Stochastic pronunciation modelling and soft match for out-of-vocabulary spoken term detection," in Proc. ICASSP'10, Dallas, TX, Mar. 2010, pp. 5294-5297.
    • (2010) Proc. ICASSP'10 , pp. 5294-5297
    • Wang, D.1    King, S.2    Frankel, J.3    Bell, P.4
  • 27
    • 0039666139
    • Pronunciation by analogy: Impact of implementational choices on performance
    • R. Damper and J. Eastmond, "Pronunciation by analogy: Impact of implementational choices on performance," Lang. Speech, vol. 40, no. 1, pp. 1-23, 1997. (Pubitemid 127459061)
    • (1997) Language and Speech , vol.40 , Issue.1 , pp. 1-23
    • Damper, R.I.1    Eastmond, J.F.G.2
  • 28
    • 84989849732
    • Issues in building general letter to sound rules
    • Jenolan Caves, Australia
    • A. W. Black, K. Lenzo, and V. Pagel, "Issues in building general letter to sound rules," in Proc. 3rd ESCA Workshop on Speech Synthesis, Jenolan Caves, Australia, 1998, pp. 77-80.
    • (1998) Proc. 3rd ESCA Workshop on Speech Synthesis , pp. 77-80
    • Black, A.W.1    Lenzo, K.2    Pagel, V.3
  • 29
    • 0032624182
    • Forgetting exceptions is harmful in language learning
    • W. Daelemans, A. van den Bosch, and J. Zavrel, "Forgetting exceptions is harmful in language learning," Mach. Learn., vol. 34, no. 1-3, pp. 11-41, 1999.
    • (1999) Mach. Learn. , vol.34 , Issue.1-3 , pp. 11-41
    • Daelemans, W.1    Van Den Bosch, A.2    Zavrel, J.3
  • 30
    • 33745189126
    • Hidden Markov models for grapheme to phoneme conversion
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • P. Taylor, "Hidden Markov models for grapheme to phoneme conversion," in Proc. Interspeech'05, Lisbon, Portugal, Sep. 2005, pp. 1973-1976. (Pubitemid 43908476)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 1973-1976
    • Taylor, P.1
  • 31
    • 41049105254
    • Joint-sequence models for grapheme-to-phoneme conversion
    • May
    • M. Bisani and H. Ney, "Joint-sequence models for grapheme-to-phoneme conversion," Speech Commun., vol. 50, no. 5, pp. 434-451, May 2008.
    • (2008) Speech Commun. , vol.50 , Issue.5 , pp. 434-451
    • Bisani, M.1    Ney, H.2
  • 32
    • 85012973695
    • A fast lattice-based approach to vocabulary independent wordspotting
    • Adelaide, Australia, Apr.
    • D. A. James and S. J. Young, "A fast lattice-based approach to vocabulary independent wordspotting," in Proc. ICASSP'94, Adelaide, Australia, Apr. 1994, pp. 377-380.
    • (1994) Proc. ICASSP'94 , pp. 377-380
    • James, D.A.1    Young, S.J.2
  • 34
    • 0030711143
    • Acoustic indexing for multimedia retrieval and browsing
    • Munich, Germany, Apr.
    • S. J. Young, M. Brown, J. T. Foote, G. J. F. Jones, and K. Spärck Jones, "Acoustic indexing for multimedia retrieval and browsing," in Proc. ICASSP'97, Munich, Germany, Apr. 1997, vol. 1, pp. 199-202.
    • (1997) Proc. ICASSP'97 , vol.1 , pp. 199-202
    • Young, S.J.1    Brown, M.2    Foote, J.T.3    Jones, G.J.F.4    Jones, K.S.5
  • 35
    • 4544257924
    • Vocabulary-independent search in spontaneous speech
    • Montreal, QC, Canada, May
    • F. Seide, P. Yu, C. Ma, and E. Chang, "Vocabulary-independent search in spontaneous speech," in Proc. ICASSP'04, Montreal, QC, Canada, May 2004, vol. 1, pp. 253-256.
    • (2004) Proc. ICASSP'04 , vol.1 , pp. 253-256
    • Seide, F.1    Yu, P.2    Ma, C.3    Chang, E.4
  • 36
    • 33646808085
    • Dynamic match phone-lattice searches for very fast and accurate unrestricted vocabulary keyword spotting
    • Philadelphia, PA, Mar.
    • K. Thambiratnam and S. Sridharan, "Dynamic match phone-lattice searches for very fast and accurate unrestricted vocabulary keyword spotting," in Proc. ICASSP'05, Philadelphia, PA, Mar. 2005, vol. 1, pp. 465-468.
    • (2005) Proc. ICASSP'05 , vol.1 , pp. 465-468
    • Thambiratnam, K.1    Sridharan, S.2
  • 38
    • 84867217757
    • Discriminative graph training for ultra-fast low-footprint speech indexing
    • Brisbane, Australia, Sep.
    • U. Chaudhari, H.-K. J. Kuo, and B. Kingsbury, "Discriminative graph training for ultra-fast low-footprint speech indexing," in Proc. Interspeech 2008, Brisbane, Australia, Sep. 2008, pp. 2175-2178.
    • (2008) Proc. Interspeech 2008 , pp. 2175-2178
    • Chaudhari, U.1    Kuo, H.-K.J.2    Kingsbury, B.3
  • 39
    • 44949128324
    • Two-stage vocabulary-free spoken document retrieval-subword identification and re-recognition of the identified sections
    • Pittsburgh, PA, Sep.
    • Y. Itoh, T. Otake, K. Iwata, K. Kojima, M. Ishigame, K. Tanaka, and S. wook Lee, "Two-stage vocabulary-free spoken document retrieval-subword identification and re-recognition of the identified sections," in Proc. ICSLP'06, Pittsburgh, PA, Sep. 2006, pp. 1161-1164.
    • (2006) Proc. ICSLP'06 , pp. 1161-1164
    • Itoh, Y.1    Otake, T.2    Iwata, K.3    Kojima, K.4    Ishigame, M.5    Tanaka, K.6    Lee, S.W.7
  • 40
    • 84867226720
    • Robust spoken term detection using combination of phone-based and word-based recognition
    • Brisbane, Australia, Sep.
    • K. Iwata, K. Shinoda, and S. Furui, "Robust spoken term detection using combination of phone-based and word-based recognition," in Proc. Interspeech'08, Brisbane, Australia, Sep. 2008, pp. 2195-2198.
    • (2008) Proc. Interspeech'08 , pp. 2195-2198
    • Iwata, K.1    Shinoda, K.2    Furui, S.3
  • 41
    • 0030363039
    • Dictionary learning for spontaneous speech recognition
    • Philadelphia, PA, Oct.
    • T. Sloboda and A. Waibel, "Dictionary learning for spontaneous speech recognition," in Proc. ICSLP'96, Philadelphia, PA, Oct. 1996, pp. 2328-2331.
    • (1996) Proc. ICSLP'96 , pp. 2328-2331
    • Sloboda, T.1    Waibel, A.2
  • 42
    • 0033329496
    • In search of better pronunciation models for speech recognition
    • DOI 10.1016/S0167-6393(99)00034-5
    • N. Cremelie and J.-P. Martens, "In search of better pronunciation models for speech recognition," Speech Commun., vol. 29, no. 2-4, pp. 115-136, 1999. (Pubitemid 30514827)
    • (1999) Speech Communication , vol.29 , Issue.2-4 , pp. 115-136
    • Cremelie, N.1    Martens, J.-P.2
  • 43
    • 0033335618
    • Modeling pronunciation variation for ASR: A survey of the literature
    • DOI 10.1016/S0167-6393(99)00038-2
    • H. Strik and C. Cucchiarini, "Modeling pronunciation variation for ASR: A survey of the literature," Speech Commun., vol. 29, no. 2-4, pp. 225-246, 1999. (Pubitemid 30514833)
    • (1999) Speech Communication , vol.29 , Issue.2-4 , pp. 225-246
    • Strik, H.1    Cucchiarini, C.2
  • 44
    • 33845875676
    • Acoustic model adaptation based on pronunciation variability analysis for non-native speech recognition
    • DOI 10.1016/j.specom.2006.10.006, PII S0167639306001440
    • Y. R. Oh, J. S. Yoon, and H. K. Kim, "Acoustic model adaptation based on pronunciation variability analysis for non-native speech recognition," Speech Commun., vol. 49, no. 1, pp. 59-70, 2007. (Pubitemid 46026867)
    • (2007) Speech Communication , vol.49 , Issue.1 , pp. 59-70
    • Oh, Y.R.1    Yoon, J.S.2    Kim, H.K.3
  • 46
    • 70450167643
    • Term-dependent confidence for out-of-vocabulary term detection
    • Brighton, U.K., Sep.
    • D. Wang, S. King, J. Frankel, and P. Bell, "Term-dependent confidence for out-of-vocabulary term detection," in Proc. Interspeech'09, Brighton, U.K., Sep. 2009, pp. 2139-2142.
    • (2009) Proc. Interspeech'09 , pp. 2139-2142
    • Wang, D.1    King, S.2    Frankel, J.3    Bell, P.4
  • 47
    • 0038576501
    • Towards robust methods for spoken document retrieval
    • Sydney, Australia, Nov.
    • K. Ng, "Towards robust methods for spoken document retrieval," in Proc. ICSLP'98, Sydney, Australia, Nov. 1998, pp. 939-942.
    • (1998) Proc. ICSLP'98 , pp. 939-942
    • Ng, K.1
  • 48
    • 0032282577
    • New techniques for open-vocabulary spoken document retrieval
    • Melbourne, Australia, Aug.
    • M. Wechsler, E. Munteanu, and P. Schäuble, "New techniques for open-vocabulary spoken document retrieval," in Proc. ACM SIGIR 1998, Melbourne, Australia, Aug. 1998, pp. 20-27.
    • (1998) Proc. ACM SIGIR 1998 , pp. 20-27
    • Wechsler, M.1    Munteanu, E.2    Schäuble, P.3
  • 51
    • 85135195552
    • Variable-length sequence matching for phonetic transcription using joint multigrams
    • Madrid, Spain, Sep.
    • S. Deligne, F. Yvon, and F. Bimbot, "Variable-length sequence matching for phonetic transcription using joint multigrams," in Proc. Eurospeech'95, Madrid, Spain, Sep. 1995, pp. 2243-2246.
    • (1995) Proc. Eurospeech'95 , pp. 2243-2246
    • Deligne, S.1    Yvon, F.2    Bimbot, F.3
  • 52
    • 85009227369
    • Conditional and joint models for grapheme-to-phoneme conversion
    • Geneva, Switzerland, Sep.
    • S. F. Chen, "Conditional and joint models for grapheme-to-phoneme conversion," in Proc. Eurospeech'03, Geneva, Switzerland, Sep. 2003, pp. 2033-2036.
    • (2003) Proc. Eurospeech'03 , pp. 2033-2036
    • Chen, S.F.1
  • 53
    • 85009250482
    • Investigations on joint-multigram models for grapheme-to-phoneme conversion
    • Denver, CO, Sep.
    • M. Bisani and H. Ney, "Investigations on joint-multigram models for grapheme-to-phoneme conversion," in Proc. ICSLP'02, Denver, CO, Sep. 2002, pp. 105-108.
    • (2002) Proc. ICSLP'02 , pp. 105-108
    • Bisani, M.1    Ney, H.2
  • 54
    • 78650990390
    • Ph.D. dissertation, Center for Speech Technol. Res., Edinburgh Univ., Edinburgh, U.K., Dec.
    • D. Wang, "Out-of-vocabulary spoken term detection," Ph.D. dissertation, Center for Speech Technol. Res., Edinburgh Univ., Edinburgh, U.K., Dec. 2009.
    • (2009) Out-of-vocabulary Spoken Term Detection
    • Wang, D.1
  • 56
    • 34047123652
    • Multisyn: Open-domain unit selection for the Festival speech synthesis system
    • DOI 10.1016/j.specom.2007.01.014, PII S0167639307000398
    • R. A. J. Clark, K. Richmond, and S. King, "Multisyn: Open-domain unit selection for the Festival speech synthesis system," Speech Commun., vol. 49, no. 4, pp. 317-330, 2007. (Pubitemid 46517714)
    • (2007) Speech Communication , vol.49 , Issue.4 , pp. 317-330
    • Clark, R.A.J.1    Richmond, K.2    King, S.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.