SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2013, Pages 8081-8085

Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization

(3) Chung, Cheng Tao a Chan, Chun An a Lee, Lin Shan a

a NATIONAL TAIWAN UNIVERSITY (Taiwan)

Author keywords

hidden Markov models; iterative optimization; spoken term detection; unsupervised learning; zero resource speech recognition

Indexed keywords

INITIALIZATION STEP; ITERATIVE OPTIMIZATION; LARGE VOCABULARY; LINGUISTIC STRUCTURE; MANUAL ANNOTATION; MODEL PARAMETERS; N-GRAM LANGUAGE MODELS; SPOKEN TERM DETECTIONS;

COMPUTATIONAL LINGUISTICS; HIDDEN MARKOV MODELS; OPTIMIZATION; SIGNAL PROCESSING; SPEECH; SPEECH RECOGNITION; UNSUPERVISED LEARNING;

ITERATIVE DECODING;

EID: 84890479779 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2013.6639239 Document Type: Conference Paper

Times cited : (28)

References (35)

1
- 84865770260
- Towards unsupervised training of speaker independent acoustic models
- A. Jansen and K. Church "Towards Unsupervised Training of Speaker Independent Acoustic Models" in InterSpeech, 2011, pp. 1693-1696.
- (2011) InterSpeech , pp. 1693-1696
- Jansen, A.¹ Church, K.²

2
- 84867809023
- A nonparametric bayesian approach to acoustic model discovery
- C. Lee and J. Glass, "A Nonparametric Bayesian Approach to Acoustic Model Discovery" in Proc. The Association for Computer Linguistics, 2012, vol. 1, pp. 40-49.
- (2012) Proc. The Association for Computer Linguistics , vol.1 , pp. 40-49
- Lee, C.¹ Glass, J.²

3
- 70450158585
- Unsupervised training of an hmm-based speech recognizer for topic classification
- H. Gish, M. Siu, A. Chan, and B. Belfield, "Unsupervised training of an HMM-based Speech Recognizer for Topic Classification" in InterSpeech, 2009, pp. 1935-1938.
- (2009) InterSpeech , pp. 1935-1938
- Gish, H.¹ Siu, M.² Chan, A.³ Belfield, B.⁴

4
- 79959819374
- Improved topic classification and keyword discovery using an hmm-based speech recognizer trained without supervision
- M. Siu, H. Gish, A. Chan, and W. Belfield, "Improved Topic Classification and Keyword Discovery using an HMM-based Speech Recognizer Trained without Supervision" in Inter-Speech, 2010, pp. 2838-2841.
- (2010) Inter-Speech , pp. 2838-2841
- Siu, M.¹ Gish, H.² Chan, A.³ Belfield, W.⁴

5
- 80051626575
- Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection
- M. Huijbregts, M. McLaren, and D. van Leeuwen, "Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection," in ICASSP, 2011, pp. 4436-4439.
- (2011) ICASSP , pp. 4436-4439
- Huijbregts, M.¹ McLaren, M.² Van Leeuwen, D.³

6
- 70349210894
- Unsupervised acoustic and language model training with small amounts of labelled data
- S. Novotney, R. Schwartz, and J. Ma, "Unsupervised acoustic and language model training with small amounts of labelled data," in ICASSP, 2009, pp. 4297-4300.
- (2009) ICASSP , pp. 4297-4300
- Novotney, S.¹ Schwartz, R.² Ma, J.³

7
- 84865757470
- Unsupervised hidden markov modeling of spoken queries for spoken term detection without speech recognition
- C. Chan and L. Lee, "Unsupervised Hidden Markov Modeling of Spoken Queries for Spoken Term Detection without Speech Recognition" in InterSpeech, 2011, pp. 2141-2144.
- (2011) InterSpeech , pp. 2141-2144
- Chan, C.¹ Lee, L.²

8
- 84867209590
- Computational language acquisition by statistical bottom-up processing
- O. J. Rasanen, U. K. Laine, T. Altosaar "Computational language acquisition by statistical bottom-up processing," in Inter-Speech, 2008, pp. 1980-1983.
- (2008) Inter-Speech , pp. 1980-1983
- Rasanen, O.J.¹ Laine, U.K.² Altosaar, T.³

9
- 70450212196
- A noise robust method for pattern discovery in quantized time series: The concept matrix approach
- O. J. Rasanen, U. K. Laine, T. Altosaar "A noise robust method for pattern discovery in quantized time series: the concept matrix approach," in InterSpeech, 2009, pp. 3035-3038.
- (2009) InterSpeech , pp. 3035-3038
- Rasanen, O.J.¹ Laine, U.K.² Altosaar, T.³

10
- 70450191104
- Self-learning vector quantization for pattern discovery from speech
- O. J. Rasanen, U. K. Laine, T. Altosaar, "Self-learning vector quantization for pattern discovery from speech," in InterSpeech, 2009, pp. 852-855.
- (2009) InterSpeech , pp. 852-855
- Rasanen, O.J.¹ Laine, U.K.² Altosaar, T.³

11
- 79959846026
- O. J. Rasanen, Fully unsupervised word learning from continuous speech using transitional probabilities of atomic acoustic events in InterSpeech, 2010, pp. 2922-2925.
- (2010) Fully Unsupervised Word Learning from Continuous Speech Using Transitional Probabilities of Atomic Acoustic Events in InterSpeech , pp. 2922-2925
- Rasanen, O.J.¹

12
- 84890539993
- Unsupervised spoken term detection with spoken queries
- C. Chan, "Unsupervised Spoken Term Detection with Spoken Queries " Ph.D dissertation, National Taiwan University, July, 2012.
- (2012) Ph.D Dissertation, National Taiwan University, July
- Chan, C.¹

13
- 51449096712
- Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons
- Y. Qiao, N. Shimomura, and N. Minematsu, "Unsupervised optimal phoneme segmentation: objectives, algorithm and comparisons," in ICASSP, 2008, pp. 3989-3992.
- (2008) ICASSP , pp. 3989-3992
- Qiao, Y.¹ Shimomura, N.² Minematsu, N.³

14
- 80051622244
- Integrating frame-based and segmentbased dynamic time warping for unsupervised spoken term detection with spoken queries
- C. Chan and L. Lee, "Integrating Frame-based and Segmentbased Dynamic Time Warping for Unsupervised Spoken Term Detection with Spoken Queries" in ICASSP, 2011, pp. 5652-5655.
- (2011) ICASSP , pp. 5652-5655
- Chan, C.¹ Lee, L.²

15
- 84890526441
- Toward unsupervised model-based spoken term detection with spoken queries without annotated data
- C. Chan, C. Chung, Y. Kuo and L. Lee, "Toward Unsupervised Model-based Spoken Term Detection with Spoken Queries without Annotated Data" in ICASSP, 2013
- (2013) ICASSP
- Chan, C.¹ Chung, C.² Kuo, Y.³ Lee, L.⁴

16
- 85013744934
- A successive state splitting algorithm for efficient allophone modeling
- J. Takami and S. Sagayama, "A successive state splitting algorithm for efficient allophone modeling," in ICASSP, 1992, vol. 1, pp. 573-576.
- (1992) ICASSP , vol.1 , pp. 573-576
- Takami, J.¹ Sagayama, S.²

17
- 0029745231
- Maximum likelihood successive state splitting
- H. Singer and M. Ostendorf, "Maximum likelihood successive state splitting," in ICASSP, 1997, vol. 2, pp. 601-604.
- (1997) ICASSP , vol.2 , pp. 601-604
- Singer, H.¹ Ostendorf, M.²

18
- 84867221475
- Automatically learning speaker-independent acoustic subword units
- B. Varadarajan and S. Khudanpur, "Automatically Learning Speaker-independent Acoustic Subword Units," in InterSpeech, 2008.
- (2008) InterSpeech
- Varadarajan, B.¹ Khudanpur, S.²

19
- 77955759248
- Performance analysis for lattice-based speech indexing approaches using word and subword units
- August
- Y.-c. Pan and L.-s. Lee, "Performance analysis for lattice-based speech indexing approaches using word and subword units," IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 6, August 2010, pp. 1562-1574.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , Issue.6 , pp. 1562-1574
- Pan, Y.-C.¹ Lee, L.-S.²

20
- 79959851706
- Towards spoken term discovery at scalewith zero resources
- A. Jansen, K. Church, and H. Hermansky, "Towards Spoken Term Discovery At ScaleWith Zero Resources" in InterSpeech, 2010, pp. 1676-1679.
- (2010) InterSpeech , pp. 1676-1679
- Jansen, A.¹ Church, K.² Hermansky, H.³

21
- 0007493259
- Topological gray-scale watershed transform
- M. Couprie, G. Bertrand, "Topological gray-scale watershed transform," in Proc. of SPIE Vision Geometry V, 1997, vol. 3168, pp. 136-146.
- (1997) Proc. of SPIE Vision Geometry v , vol.3168 , pp. 136-146
- Couprie, M.¹ Bertrand, G.²

22
- 0013281072
- Updateable pat-tree approach to chinese key phrase extraction using mutual information: A linguistic foundation for knowledge management
- T. Ong and H. Chen, "Updateable PAT-Tree Approach to Chinese Key Phrase Extraction using Mutual Information: A Linguistic Foundation for Knowledge Management," in Proc. The Second Asian Digital Library Conference, 1999, pp. 63-84.
- (1999) Proc. The Second Asian Digital Library Conference , pp. 63-84
- Ong, T.¹ Chen, H.²

23
- 84890511750
- Enhancing query expansion for semantic retrieval of spoken content with automatically discovered acoustic patterns
- H. Lee, Y. Li, C. Chung, and L. Lee, "Enhancing Query Expansion for Semantic Retrieval of Spoken Content with Automatically Discovered Acoustic Patterns," in ICASSP, 2013
- (2013) ICASSP
- Lee, H.¹ Li, Y.² Chung, C.³ Lee, L.⁴

24
- 34547516258
- Approximating the kullback liebler divergence between gaussain mixture models
- J. Hershey and P. Olsen, "Approximating the Kullback Liebler Divergence between Gaussain Mixture Models" in ICASSP, 2007, vol. 4, pp. 317-320.
- (2007) ICASSP , vol.4 , pp. 317-320
- Hershey, J.¹ Olsen, P.²

25
- 84867316017
- The spoken web search task at mediaeval 2011
- F. Metze, N. Rajput et al., "The spoken web search task at Mediaeval 2011," in ICASSP, 2012, pp. 5165-5168.
- (2012) ICASSP , pp. 5165-5168
- Metze, F.¹ Rajput, N.²

26
- 84867600320
- An acoustic segment modeling approach to query-by-example spoken term detection
- H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "An acoustic segment modeling approach to query-by-example spoken term detection," in ICASSP, 2012, pp. 5157-5160.
- (2012) ICASSP , pp. 5157-5160
- Wang, H.¹ Leung, C.-C.² Lee, T.³ Ma, B.⁴ Li, H.⁵

27
- 33947644326
- Keyword spotting of arbitrary words using minimal speech resources
- A. Garcia and H. Gish, "Keyword spotting of arbitrary words using minimal speech resources," in ICASSP, 2006.
- (2006) ICASSP
- Garcia, A.¹ Gish, H.²

28
- 58049100564
- A phonetic search approach to the 2006 NIST spoken term detection evaluation
- R. Wallace, R. Vogt, and S. Sridharan, "A phonetic search approach to the 2006 NIST spoken term detection evaluation," in InterSpeech, 2007, pp. 2385-2388.
- (2007) InterSpeech , pp. 2385-2388
- Wallace, R.¹ Vogt, R.² Sridharan, S.³

29
- 84865709671
- Open vocabulary spoken-document retrieval based on query expansion using related web documents
- M. Terao, T. Koshinaka, S. Ando, R. Isotani, and A. Okumura, "Open vocabulary spoken-document retrieval based on query expansion using related web documents," in InterSpeech, 2008, pp. 2171-2174.
- (2008) InterSpeech , pp. 2171-2174
- Terao, M.¹ Koshinaka, T.² Ando, S.³ Isotani, R.⁴ Okumura, A.⁵

30
- 70450160623
- A comparison of queryby example methods for spoken term detection
- W. Shen, C. M. White, and T. J. Hazen, "A comparison of queryby example methods for spoken term detection," in InterSpeech, 2009, pp. 2143-2146.
- (2009) InterSpeech , pp. 2143-2146
- Shen, W.¹ White, C.M.² Hazen, T.J.³

31
- 0030245363
- From hmms to segment models: A unified view of stochastic modeling for speech recognition
- M. Ostendorf, V. Digalakis, and O. A. Kimball, "From hmms to segment models: A unified view of stochastic modeling for speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 4, pp. 360V-378, 1995.
- (1995) IEEE Transactions on Speech and Audio Processing , vol.4
- Ostendorf, M.¹ Digalakis, V.² Kimball, O.A.³

32
- 84865770619
- A piecewise aggregate approximation lowerbound estimate for posteriorgram-based dynamic time warping
- Y. Zhang and J. R. Glass, "A piecewise aggregate approximation lowerbound estimate for posteriorgram-based dynamic time warping," in InterSpeech, 2011, pp. 1909-1912.
- (2011) InterSpeech , pp. 1909-1912
- Zhang, Y.¹ Glass, J.R.²

33
- 0035509488
- Speech recognition and utterance verification based on a generalized confidence score
- M.-W. Koo, C.-H. Lee, and B.-H. Juang, "Speech recognition and utterance verification based on a generalized confidence score," IEEE Transactions on Speech and Audio Processing, vol. 9, no. 8, pp. 821V-832, 2001.
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.8
- Koo, M.-W.¹ Lee, C.-H.² Juang, B.-H.³

34
- 78049411640
- An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition
- Y. Tsao, H. Sun, H. Li, and C.-H. Lee, "An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition," in ICASSP, 2010, pp. 4422-4425.
- (2010) ICASSP , pp. 4422-4425
- Tsao, Y.¹ Sun, H.² Li, H.³ Lee, C.-H.⁴

35
- 33750319706
- Overview of TREC 2006
- E. Voorhees, "Overview of TREC 2006," in TREC, 2006.
- (2006) TREC
- Voorhees, E.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.