SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2014, Pages 875-879

A graph-based Gaussian component clustering approach to unsupervised acoustic modeling

(5) Wang, Haipeng a Lee, Tan a Leung, Cheung Chi b Ma, Bin b Li, Haizhou b

a CHINESE UNIVERSITY OF HONG KONG (Hong Kong)

b INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

Author keywords

Gaussian component clustering; Graph based clustering algorithms; Unsupervised acoustic modeling

Indexed keywords

GAUSSIAN DISTRIBUTION; GRAPHIC METHODS; SPEECH COMMUNICATION;

ACOUSTIC MODEL; CLUSTERING APPROACH; GAUSSIAN COMPONENTS; GRAPH-BASED; GRAPH-BASED CLUSTERING; NEW APPROACHES; SIMILARITY MEASURE; SUB-WORD UNITS;

CLUSTERING ALGORITHMS;

EID: 84910084383 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (31)

1
- 84868541271
- Towards unsupervised speech processing
- J. Glass, "Towards unsupervised speech processing, " in Proc. ISSPA, 2012, pp. 1-4.
- (2012) Proc. ISSPA , pp. 1-4
- Glass, J.¹

2
- 0003857778
- A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models
- J. Bilmes, "A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models, " Technical Report ICSI-TR-97-02, 1997.
- (1997) Technical Report ICSI-TR-97-02
- Bilmes, J.¹

3
- 0023800699
- A segment model based approach to speech recognition
- C.-H. Lee, F. Soong, and B.-H. Juang, "A segment model based approach to speech recognition, " in Proc. ICASSP, 1988, pp. 501- 541.
- (1988) Proc. ICASSP , pp. 501-541
- Lee, C.-H.¹ Soong, F.² Juang, B.-H.³

4
- 0027277742
- A segmental speech model with applications to word spotting
- H. Gish and K. Ng, "A segmental speech model with applications to word spotting, " in Proc. ICASSP, vol. 2, 1993, pp. 447-450.
- (1993) Proc. ICASSP , vol.2 , pp. 447-450
- Gish, H.¹ Ng, K.²

5
- 84873444148
- A study on music genre classification based on universal acoustic models
- J. Reed and C.-H. Lee, "A study on music genre classification based on universal acoustic models, " in Proc. ISMIR, 2006, pp. 89-94.
- (2006) Proc. ISMIR , pp. 89-94
- Reed, J.¹ Lee, C.-H.²

6
- 34547502608
- A vector space modeling approach to spoken language identification
- H. Li, B. Ma, and C. Lee, "A vector space modeling approach to spoken language identification, " IEEE Trans. ASLP, vol. 15, no. 1, pp. 271-284, 2007.
- (2007) IEEE Trans. ASLP , vol.15 , Issue.1 , pp. 271-284
- Li, H.¹ Ma, B.² Lee, C.³

7
- 70449646765
- Acoustic segment modeling for speaker recognition
- B. Ma, D. Zhu, and H. Li, "Acoustic segment modeling for speaker recognition, " in Proc. ICME, 2009, pp. 1668-1671.
- (2009) Proc. ICME , pp. 1668-1671
- Ma, B.¹ Zhu, D.² Li, H.³

8
- 84858975943
- Topic modeling for spoken documents using only phonetic information
- T. Hazen, M.-H. Siu, H. Gish, S. Lowe, and A. Chan, "Topic modeling for spoken documents using only phonetic information, " in Proc. ASRU, 2011, pp. 395-400.
- (2011) Proc. ASRU , pp. 395-400
- Hazen, T.¹ Siu, M.-H.² Gish, H.³ Lowe, S.⁴ Chan, A.⁵

9
- 84885423493
- Unsupervised training of an HMM-based self-organizing unit recognizer with applications to topic classification and keyword discovery
- M.-H. Siu, H. Gish, A. Chan, W. Belfield, and S. Lowe, "Unsupervised training of an HMM-based self-organizing unit recognizer with applications to topic classification and keyword discovery, " Computer Speech & Language, vol. 28, no. 1, pp. 210-223, 2014.
- (2014) Computer Speech & Language , vol.28 , Issue.1 , pp. 210-223
- Siu, M.-H.¹ Gish, H.² Chan, A.³ Belfield, W.⁴ Lowe, S.⁵

10
- 84890479779
- Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization
- C.-T. Chung, C.-A. Chan, and L.-S. Lee, "Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization, " in Proc. ICASSP, 2013, pp. 8081-8085.
- (2013) Proc. ICASSP , pp. 8081-8085
- Chung, C.-T.¹ Chan, C.-A.² Lee, L.-S.³

11
- 84867221475
- Automatically learning speaker-independent acoustic subword units
- B. Varadarajan and S. Khudanpur, "Automatically learning speaker-independent acoustic subword units, " in Proc. INTERSPEECH, 2008, pp. 1333-1336.
- (2008) Proc. INTERSPEECH , pp. 1333-1336
- Varadarajan, B.¹ Khudanpur, S.²

12
- 84867809023
- A nonparametric Bayesian approach to acoustic model discovery
- C. Lee and J. Glass, "A nonparametric Bayesian approach to acoustic model discovery, " in Proc. ACL, 2012, pp. 40-49.
- (2012) Proc. ACL , pp. 40-49
- Lee, C.¹ Glass, J.²

13
- 84890467020
- Weak top-down constraints for unsupervised acoustic model training
- A. Jansen, S. Thomas, and H. Hermansky, "Weak top-down constraints for unsupervised acoustic model training, " in Proc. ICASSP, 2013, pp. 8091-8095.
- (2013) Proc. ICASSP , pp. 8091-8095
- Jansen, A.¹ Thomas, S.² Hermansky, H.³

14
- 0034244751
- Normalized cuts and image segmentation
- J. Shi and J. Malik, "Normalized cuts and image segmentation, " IEEE Trans. PAMI, vol. 22, no. 8, pp. 888-905, 2000.
- (2000) IEEE Trans. PAMI , vol.22 , Issue.8 , pp. 888-905
- Shi, J.¹ Malik, J.²

15
- 25844488029
- Document clustering using nonnegative matrix factorization
- F. Shahnaz, M. Berry, V. Pauca, and R. Plemmons, "Document clustering using nonnegative matrix factorization, " Information Processing and Management, vol. 42, no. 2, pp. 373-386, 2006.
- (2006) Information Processing and Management , vol.42 , Issue.2 , pp. 373-386
- Shahnaz, F.¹ Berry, M.² Pauca, V.³ Plemmons, R.⁴

16
- 33749255098
- On the equivalence of nonnegative matrix factorization and spectral clustering
- C. H. Ding, X. He, and H. D. Simon, "On the equivalence of nonnegative matrix factorization and spectral clustering." in SDM, vol. 5, 2005, pp. 606-610.
- (2005) SDM , vol.5 , pp. 606-610
- Ding, C.H.¹ He, X.² Simon, H.D.³

17
- 84906281211
- Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams
- H. Wang, T. Lee, C.-C. Leung, B. Ma, and H. Li, "Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams, " in Proc. Interspeech, 2013, pp. 2297-2301.
- (2013) Proc. Interspeech , pp. 2297-2301
- Wang, H.¹ Lee, T.² Leung, C.-C.³ Ma, B.⁴ Li, H.⁵

18
- 79959819374
- Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision
- M.-H. Siu, H. Gish, A. Chan, and W. Belfield, "Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision, " in Proc. INTERSPEECH, 2010, pp. 2838-2841.
- (2010) Proc. INTERSPEECH , pp. 2838-2841
- Siu, M.-H.¹ Gish, H.² Chan, A.³ Belfield, W.⁴

19
- 33745185321
- Using MLP features in SRI's conversational speech recognition system
- Q. Zhu, A. Stolcke, B. Chen, and N. Morgan, "Using MLP features in SRI's conversational speech recognition system, " in Proc. INTERSPEECH, 2005, pp. 2141-2144.
- (2005) Proc. INTERSPEECH , pp. 2141-2144
- Zhu, Q.¹ Stolcke, A.² Chen, B.³ Morgan, N.⁴

20
- 77949473673
- Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams
- Y. Zhang and J. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams, " in Proc. ASRU, 2009, pp. 398-403.
- (2009) Proc. ASRU , pp. 398-403
- Zhang, Y.¹ Glass, J.²

21
- 84870264929
- Shifted-delta MLP features for spoken language recognition
- IEEE
- H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "Shifted-delta MLP features for spoken language recognition, " Signal Processing Letters, IEEE, vol. 20, no. 1, pp. 15-18, 2013.
- (2013) Signal Processing Letters , vol.20 , Issue.1 , pp. 15-18
- Wang, H.¹ Leung, C.-C.² Lee, T.³ Ma, B.⁴ Li, H.⁵

22
- 84890452240
- Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection
- H. Wang, T. Lee, C.-C. Leung, B. Ma, and H. Li, "Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection, " in Proc. ICASSP, 2013, pp. 8545-8949.
- (2013) Proc. ICASSP , pp. 8545-8949
- Wang, H.¹ Lee, T.² Leung, C.-C.³ Ma, B.⁴ Li, H.⁵

23
- 84867596539
- Acoustic texttiling for story segmentation of spoken documents
- L. Zheng, C.-C. Leung, L. Xie, B. Ma, and H. Li, "Acoustic texttiling for story segmentation of spoken documents, " in Proc. ICASSP, 2012, pp. 5121-5124.
- (2012) Proc. ICASSP , pp. 5121-5124
- Zheng, L.¹ Leung, C.-C.² Xie, L.³ Ma, B.⁴ Li, H.⁵

24
- 84899013108
- On spectral clustering: Analysis and an algorithm
- A. Ng, M. Jordan, Y. Weiss et al., "On spectral clustering: Analysis and an algorithm, " Advances in neural information processing systems, vol. 2, pp. 849-856, 2002.
- (2002) Advances in Neural Information Processing Systems , vol.2 , pp. 849-856
- Ng, A.¹ Jordan, M.² Weiss, Y.³

25
- 33746044621
- Keyword extraction from a single document using word co-occurrence statistical information
- Y. Matsuo and M. Ishizuka, "Keyword extraction from a single document using word co-occurrence statistical information, " International Journal on Artificial Intelligence Tools, vol. 13, no. 1, pp. 157-169, 2004.
- (2004) International Journal on Artificial Intelligence Tools , vol.13 , Issue.1 , pp. 157-169
- Matsuo, Y.¹ Ishizuka, M.²

26
- 34548583274
- A tutorial on spectral clustering
- U. Von Luxburg, "A tutorial on spectral clustering, " Statistics and computing, vol. 17, no. 4, pp. 395-416, 2007.
- (2007) Statistics and Computing , vol.17 , Issue.4 , pp. 395-416
- Von Luxburg, U.¹

27
- 84898964201
- Algorithms for non-negative matrix factorization
- D. Seung and L. Lee, "Algorithms for non-negative matrix factorization, " Advances in neural information processing systems, vol. 13, pp. 556-562, 2001.
- (2001) Advances in Neural Information Processing Systems , vol.13 , pp. 556-562
- Seung, D.¹ Lee, L.²

28
- 34547844077
- Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis
- H. Kim and H. Park, "Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis, " Bioinformatics, vol. 23, no. 12, pp. 1495- 1502, 2007.
- (2007) Bioinformatics , vol.23 , Issue.12 , pp. 1495-1502
- Kim, H.¹ Park, H.²

29
- 3042518464
- TIMIT: Acoustic-phonetic continuous speech corpus
- J. Garofolo et al., "TIMIT: Acoustic-phonetic continuous speech corpus, " Linguistic Data Consortium, 1993.
- (1993) Linguistic Data Consortium
- Garofolo, J.¹

30
- 84867600320
- An acoustic segment modeling approach to query-by-example spoken term detection
- H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "An acoustic segment modeling approach to query-by-example spoken term detection, " in Proc. ICASSP, 2012, pp. 5157-5160.
- (2012) Proc. ICASSP , pp. 5157-5160
- Wang, H.¹ Leung, C.-C.² Lee, T.³ Ma, B.⁴ Li, H.⁵

31
- 34548080780
- Cambridge University Press Cambridge
- C. Manning, P. Raghavan, and H. Schütze, Introduction to information retrieval. Cambridge University Press Cambridge, 2008.
- (2008) Introduction to Information Retrieval
- Manning, C.¹ Raghavan, P.² Schütze, H.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.