SCOPUS Information Retrieval Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume , Issue , 2013, Pages 2297-2301

Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams

(5) Wang, Haipeng a; Lee, Tan a; Leung, Cheung Chi b; Ma, Bin b; Li, Haizhou b

a CHINESE UNIVERSITY OF HONG KONG (Hong Kong)

b INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

Author keywords

Gaussian by segment matrix; Non negative matrix factorization; Normalized cut; Segment level posteriorgrams; Unsupervised acoustic unit mining

Indexed keywords

CLUSTERING ALGORITHMS; GAUSSIAN DISTRIBUTION; IMAGE SEGMENTATION;

CLUSTERING RESULTS; GAUSSIAN MIXTURE MODEL; LABELING METHODS; MATRIX FACTORIZATIONS; NONNEGATIVE MATRIX FACTORIZATION; NORMALIZED CUTS; SEGMENT-LEVEL POSTERIORGRAMS; UNSUPERVISED SEGMENTATION;

MATRIX ALGEBRA;

EID: 84906281211 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (25)

References (29)

1
- 0036476255
- Automatic generation of subword units for speech recognition systems
- R. Singh, B. Raj, and R. Stern, "Automatic generation of subword units for speech recognition systems," IEEE Trans. SAP, vol. 10, no. 2, pp. 89-99, 2002.
- (2002) IEEE Trans. SAP , vol.10 , Issue.2 , pp. 89-99
- Singh, R.¹ Raj, B.² Stern, R.³

2
- 79959819374
- Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision
- M.-H. Siu, H. Gish, A. Chan, and W. Belfield, "Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision," in Proc. INTERSPEECH, 2010, pp. 2838-2841.
- (2010) Proc. INTERSPEECH , pp. 2838-2841
- Siu, M.-H.¹ Gish, H.² Chan, A.³ Belfield, W.⁴

3
- 84890452240
- Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection
- H. Wang, T. Lee, C.-C. Leung, B. Ma, and H. Li, "Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection," in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Wang, H.¹ Lee, T.² Leung, C.-C.³ Ma, B.⁴ Li, H.⁵

4
- 0023800699
- A segment model based approach to speech recognition
- C.-H. Lee, F. Soong, and B.-H. Juang, "A segment model based approach to speech recognition," in Proc. ICASSP, 1988, pp. 501-541.
- (1988) Proc. ICASSP , pp. 501-541
- Lee, C.-H.¹ Soong, F.² Juang, B.-H.³

5
- 0033353287
- Joint lexicon, acoustic unit inventory and model design
- M. Bacchiani and M. Ostendorf, "Joint lexicon, acoustic unit inventory and model design," Speech Communication, vol. 29, no. 2, pp. 99-114, 1999.
- (1999) Speech Communication , vol.29 , Issue.2 , pp. 99-114
- Bacchiani, M.¹ Ostendorf, M.²

6
- 0034244751
- Normalized cuts and image segmentation
- J. Shi and J. Malik, "Normalized cuts and image segmentation," IEEE Trans. PAMI, vol. 22, no. 8, pp. 888-905, 2000.
- (2000) IEEE Trans. PAMI , vol.22 , Issue.8 , pp. 888-905
- Shi, J.¹ Malik, J.²

7
- 84898964201
- Algorithms for non-negative matrix factorization
- D. Seung and L. Lee, "Algorithms for non-negative matrix factorization," Advances in Neural Information Processing Systems, vol. 13, pp. 556-562, 2001.
- (2001) Advances in Neural Information Processing Systems , vol.13 , pp. 556-562
- Seung, D.¹ Lee, L.²

8
- 77949473673
- Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams
- Y. Zhang and J. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams," in Proc. ASRU, 2009, pp. 398-403.
- (2009) Proc. ASRU , pp. 398-403
- Zhang, Y.¹ Glass, J.²

9
- 70449646765
- Acoustic segment modeling for speaker recognition
- B. Ma, D. Zhu, and H. Li, "Acoustic segment modeling for speaker recognition," in Proc. ICME, 2009, pp. 1668-1671.
- (2009) Proc. ICME , pp. 1668-1671
- Ma, B.¹ Zhu, D.² Li, H.³

10
- 84873444148
- A study on music genre classification based on universal acoustic models
- J. Reed and C.-H. Lee, "A study on music genre classification based on universal acoustic models," in Proc. ISMIR, 2006, pp. 89-94.
- (2006) Proc. ISMIR , pp. 89-94
- Reed, J.¹ Lee, C.-H.²

11
- 0027277742
- A segmental speech model with applications to word spotting
- H. Gish and K. Ng, "A segmental speech model with applications to word spotting," in Proc. ICASSP, vol. 2, 1993, pp. 447-450.
- (1993) Proc. ICASSP , vol.2 , pp. 447-450
- Gish, H.¹ Ng, K.²

12
- 84867600320
- An acoustic segment modeling approach to query-by-example spoken term detection
- H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "An acoustic segment modeling approach to query-by-example spoken term detection," in Proc. ICASSP, 2012, pp. 5157-5160.
- (2012) Proc. ICASSP , pp. 5157-5160
- Wang, H.¹ Leung, C.-C.² Lee, T.³ Ma, B.⁴ Li, H.⁵

13
- 84858975943
- Topic modeling for spoken documents using only phonetic information
- T. Hazen, M.-H. Siu, H. Gish, S. Lowe, and A. Chan, "Topic modeling for spoken documents using only phonetic information," in Proc. ASRU, 2011, pp. 395-400.
- (2011) Proc. ASRU , pp. 395-400
- Hazen, T.¹ Siu, M.-H.² Gish, H.³ Lowe, S.⁴ Chan, A.⁵

14
- 84867221475
- Automatically learning speaker-independent acoustic subword units
- B. Varadarajan and S. Khudanpur, "Automatically learning speaker-independent acoustic subword units," in Proc. INTERSPEECH, 2008, pp. 1333-1336.
- (2008) Proc. INTERSPEECH , pp. 1333-1336
- Varadarajan, B.¹ Khudanpur, S.²

15
- 0029745231
- Maximum likelihood successive state splitting
- H. Singer and M. Ostendorf, "Maximum likelihood successive state splitting," in Proc. ICASSP, vol. 2, 1996, pp. 601-604.
- (1996) Proc. ICASSP , vol.2 , pp. 601-604
- Singer, H.¹ Ostendorf, M.²

16
- 84867809023
- A nonparametric Bayesian approach to acoustic model discovery
- C. Lee and J. Glass, "A nonparametric Bayesian approach to acoustic model discovery," in Proc. ACL, 2012.
- (2012) Proc. ACL
- Lee, C.¹ Glass, J.²

17
- 84865770260
- Towards unsupervised training of speaker independent acoustic models
- A. Jansen and K. Church, "Towards unsupervised training of speaker independent acoustic models," in Proc. INTERSPEECH, 2011, pp. 1693-1696.
- (2011) Proc. INTERSPEECH , pp. 1693-1696
- Jansen, A.¹ Church, K.²

18
- 84890467020
- Weak top-down constraints for unsupervised acoustic model training
- A. Jansen, S. Thomas, and H. Hermansky, "Weak top-down constraints for unsupervised acoustic model training," in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Jansen, A.¹ Thomas, S.² Hermansky, H.³

19
- 44949165663
- Using posterior-based features in template matching for speech recognition
- G. Aradilla, J. Vepa, and H. Bourlard, "Using posterior-based features in template matching for speech recognition," in Proc. INTERSPEECH, 2006, pp. 1186-1189.
- (2006) Proc. INTERSPEECH , pp. 1186-1189
- Aradilla, G.¹ Vepa, J.² Bourlard, H.³

20
- 77949351968
- Query-by-example spoken term detection using phonetic posteriorgram templates
- T. Hazen, W. Shen, and C. White, "Query-by-example spoken term detection using phonetic posteriorgram templates," in Proc. ASRU, 2009, pp. 421-426.
- (2009) Proc. ASRU , pp. 421-426
- Hazen, T.¹ Shen, W.² White, C.³

21
- 84989525001
- Indexing by latent semantic analysis
- S. Deerwester, S. Dumais, G. Furnas, T. Landauer, and R. Harshman, "Indexing by latent semantic analysis," Journal of the American Society for Information Science, vol. 41, no. 6, pp. 391-407, 1990.
- (1990) Journal of the American Society for Information Science , vol.41 , Issue.6 , pp. 391-407
- Deerwester, S.¹ Dumais, S.² Furnas, G.³ Landauer, T.⁴ Harshman, R.⁵

22
- 84899013108
- On spectral clustering: Analysis and an algorithm
- A. Ng, M. Jordan, Y. Weiss et al., "On spectral clustering: Analysis and an algorithm," Advances in Neural Information Processing Systems, vol. 2, pp. 849-856, 2002.
- (2002) Advances in Neural Information Processing Systems , vol.2 , pp. 849-856
- Ng, A.¹ Jordan, M.² Weiss, Y.³

23
- 1542347778
- Document clustering based on non-negative matrix factorization
- W. Xu, X. Liu, and Y. Gong, "Document clustering based on non-negative matrix factorization," in Proc. 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2003, pp. 267-273.
- (2003) Proc. 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval , pp. 267-273
- Xu, W.¹ Liu, X.² Gong, Y.³

24
- 33749575326
- Orthogonal non-negative matrix T-factorizations for clustering
- C. Ding, T. Li, W. Peng, and H. Park, "Orthogonal non-negative matrix T-factorizations for clustering," in Proc. 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006, pp. 126-135.
- (2006) Proc. 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , pp. 126-135
- Ding, C.¹ Li, T.² Peng, W.³ Park, H.⁴

25
- 41249089920
- On the equivalence between non-negative matrix factorization and probabilistic latent semantic indexing
- C. Ding, T. Li, and W. Peng, "On the equivalence between non-negative matrix factorization and probabilistic latent semantic indexing," Computational Statistics and Data Analysis, vol. 52, no. 8, pp. 3913-3927, 2008.
- (2008) Computational Statistics and Data Analysis , vol.52 , Issue.8 , pp. 3913-3927
- Ding, C.¹ Li, T.² Peng, W.³

26
- 84906255885
- Y. Muthusamy, R. Cole, and B. Oshika, The OGI multi-language telephone speech corpus, 1994.
- (1994) The OGI Multi-Language Telephone Speech Corpus
- Muthusamy, Y.¹ Cole, R.² Oshika, B.³

27
- 84862931515
- Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data
- S. Siniscalchi, D.-C. Lyu, T. Svendsen, and C.-H. Lee, "Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data," IEEE Trans. ASLP, vol. 20, no. 3, pp. 875-887, 2012.
- (2012) IEEE Trans. ASLP , vol.20 , Issue.3 , pp. 875-887
- Siniscalchi, S.¹ Lyu, D.-C.² Svendsen, T.³ Lee, C.-H.⁴

28
- 34548080780
- Introduction to information retrieval
- C. Manning, P. Raghavan, and H. Schütze, Introduction to information retrieval. Cambridge: Cambridge University Press, 2008.
- (2008) Introduction to Information Retrieval
- Manning, C.¹ Raghavan, P.² Schütze, H.³

29
- 44849118234
- Broad phonetic class recognition in a hidden Markov model framework using extended Baum-Welch transformations
- T. Sainath, D. Kanevsky, and B. Ramabhadran, "Broad phonetic class recognition in a hidden Markov model framework using extended Baum-Welch transformations," in Proc. ASRU, 2007, pp. 306-311.
- (2007) Proc. ASRU , pp. 306-311
- Sainath, T.¹ Kanevsky, D.² Ramabhadran, B.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.