메뉴 건너뛰기




Volumn , Issue , 2014, Pages 875-879

A graph-based Gaussian component clustering approach to unsupervised acoustic modeling

Author keywords

Gaussian component clustering; Graph based clustering algorithms; Unsupervised acoustic modeling

Indexed keywords

GAUSSIAN DISTRIBUTION; GRAPHIC METHODS; SPEECH COMMUNICATION;

EID: 84910084383     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (31)
  • 1
    • 84868541271 scopus 로고    scopus 로고
    • Towards unsupervised speech processing
    • J. Glass, "Towards unsupervised speech processing, " in Proc. ISSPA, 2012, pp. 1-4.
    • (2012) Proc. ISSPA , pp. 1-4
    • Glass, J.1
  • 2
    • 0003857778 scopus 로고    scopus 로고
    • A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models
    • J. Bilmes, "A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models, " Technical Report ICSI-TR-97-02, 1997.
    • (1997) Technical Report ICSI-TR-97-02
    • Bilmes, J.1
  • 3
    • 0023800699 scopus 로고
    • A segment model based approach to speech recognition
    • C.-H. Lee, F. Soong, and B.-H. Juang, "A segment model based approach to speech recognition, " in Proc. ICASSP, 1988, pp. 501- 541.
    • (1988) Proc. ICASSP , pp. 501-541
    • Lee, C.-H.1    Soong, F.2    Juang, B.-H.3
  • 4
    • 0027277742 scopus 로고
    • A segmental speech model with applications to word spotting
    • H. Gish and K. Ng, "A segmental speech model with applications to word spotting, " in Proc. ICASSP, vol. 2, 1993, pp. 447-450.
    • (1993) Proc. ICASSP , vol.2 , pp. 447-450
    • Gish, H.1    Ng, K.2
  • 5
    • 84873444148 scopus 로고    scopus 로고
    • A study on music genre classification based on universal acoustic models
    • J. Reed and C.-H. Lee, "A study on music genre classification based on universal acoustic models, " in Proc. ISMIR, 2006, pp. 89-94.
    • (2006) Proc. ISMIR , pp. 89-94
    • Reed, J.1    Lee, C.-H.2
  • 6
    • 34547502608 scopus 로고    scopus 로고
    • A vector space modeling approach to spoken language identification
    • H. Li, B. Ma, and C. Lee, "A vector space modeling approach to spoken language identification, " IEEE Trans. ASLP, vol. 15, no. 1, pp. 271-284, 2007.
    • (2007) IEEE Trans. ASLP , vol.15 , Issue.1 , pp. 271-284
    • Li, H.1    Ma, B.2    Lee, C.3
  • 7
    • 70449646765 scopus 로고    scopus 로고
    • Acoustic segment modeling for speaker recognition
    • B. Ma, D. Zhu, and H. Li, "Acoustic segment modeling for speaker recognition, " in Proc. ICME, 2009, pp. 1668-1671.
    • (2009) Proc. ICME , pp. 1668-1671
    • Ma, B.1    Zhu, D.2    Li, H.3
  • 8
    • 84858975943 scopus 로고    scopus 로고
    • Topic modeling for spoken documents using only phonetic information
    • T. Hazen, M.-H. Siu, H. Gish, S. Lowe, and A. Chan, "Topic modeling for spoken documents using only phonetic information, " in Proc. ASRU, 2011, pp. 395-400.
    • (2011) Proc. ASRU , pp. 395-400
    • Hazen, T.1    Siu, M.-H.2    Gish, H.3    Lowe, S.4    Chan, A.5
  • 9
    • 84885423493 scopus 로고    scopus 로고
    • Unsupervised training of an HMM-based self-organizing unit recognizer with applications to topic classification and keyword discovery
    • M.-H. Siu, H. Gish, A. Chan, W. Belfield, and S. Lowe, "Unsupervised training of an HMM-based self-organizing unit recognizer with applications to topic classification and keyword discovery, " Computer Speech & Language, vol. 28, no. 1, pp. 210-223, 2014.
    • (2014) Computer Speech & Language , vol.28 , Issue.1 , pp. 210-223
    • Siu, M.-H.1    Gish, H.2    Chan, A.3    Belfield, W.4    Lowe, S.5
  • 10
    • 84890479779 scopus 로고    scopus 로고
    • Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization
    • C.-T. Chung, C.-A. Chan, and L.-S. Lee, "Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization, " in Proc. ICASSP, 2013, pp. 8081-8085.
    • (2013) Proc. ICASSP , pp. 8081-8085
    • Chung, C.-T.1    Chan, C.-A.2    Lee, L.-S.3
  • 11
    • 84867221475 scopus 로고    scopus 로고
    • Automatically learning speaker-independent acoustic subword units
    • B. Varadarajan and S. Khudanpur, "Automatically learning speaker-independent acoustic subword units, " in Proc. INTERSPEECH, 2008, pp. 1333-1336.
    • (2008) Proc. INTERSPEECH , pp. 1333-1336
    • Varadarajan, B.1    Khudanpur, S.2
  • 12
    • 84867809023 scopus 로고    scopus 로고
    • A nonparametric Bayesian approach to acoustic model discovery
    • C. Lee and J. Glass, "A nonparametric Bayesian approach to acoustic model discovery, " in Proc. ACL, 2012, pp. 40-49.
    • (2012) Proc. ACL , pp. 40-49
    • Lee, C.1    Glass, J.2
  • 13
    • 84890467020 scopus 로고    scopus 로고
    • Weak top-down constraints for unsupervised acoustic model training
    • A. Jansen, S. Thomas, and H. Hermansky, "Weak top-down constraints for unsupervised acoustic model training, " in Proc. ICASSP, 2013, pp. 8091-8095.
    • (2013) Proc. ICASSP , pp. 8091-8095
    • Jansen, A.1    Thomas, S.2    Hermansky, H.3
  • 14
    • 0034244751 scopus 로고    scopus 로고
    • Normalized cuts and image segmentation
    • J. Shi and J. Malik, "Normalized cuts and image segmentation, " IEEE Trans. PAMI, vol. 22, no. 8, pp. 888-905, 2000.
    • (2000) IEEE Trans. PAMI , vol.22 , Issue.8 , pp. 888-905
    • Shi, J.1    Malik, J.2
  • 16
    • 33749255098 scopus 로고    scopus 로고
    • On the equivalence of nonnegative matrix factorization and spectral clustering
    • C. H. Ding, X. He, and H. D. Simon, "On the equivalence of nonnegative matrix factorization and spectral clustering." in SDM, vol. 5, 2005, pp. 606-610.
    • (2005) SDM , vol.5 , pp. 606-610
    • Ding, C.H.1    He, X.2    Simon, H.D.3
  • 17
    • 84906281211 scopus 로고    scopus 로고
    • Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams
    • H. Wang, T. Lee, C.-C. Leung, B. Ma, and H. Li, "Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams, " in Proc. Interspeech, 2013, pp. 2297-2301.
    • (2013) Proc. Interspeech , pp. 2297-2301
    • Wang, H.1    Lee, T.2    Leung, C.-C.3    Ma, B.4    Li, H.5
  • 18
    • 79959819374 scopus 로고    scopus 로고
    • Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision
    • M.-H. Siu, H. Gish, A. Chan, and W. Belfield, "Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision, " in Proc. INTERSPEECH, 2010, pp. 2838-2841.
    • (2010) Proc. INTERSPEECH , pp. 2838-2841
    • Siu, M.-H.1    Gish, H.2    Chan, A.3    Belfield, W.4
  • 19
    • 33745185321 scopus 로고    scopus 로고
    • Using MLP features in SRI's conversational speech recognition system
    • Q. Zhu, A. Stolcke, B. Chen, and N. Morgan, "Using MLP features in SRI's conversational speech recognition system, " in Proc. INTERSPEECH, 2005, pp. 2141-2144.
    • (2005) Proc. INTERSPEECH , pp. 2141-2144
    • Zhu, Q.1    Stolcke, A.2    Chen, B.3    Morgan, N.4
  • 20
    • 77949473673 scopus 로고    scopus 로고
    • Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams
    • Y. Zhang and J. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams, " in Proc. ASRU, 2009, pp. 398-403.
    • (2009) Proc. ASRU , pp. 398-403
    • Zhang, Y.1    Glass, J.2
  • 21
    • 84870264929 scopus 로고    scopus 로고
    • Shifted-delta MLP features for spoken language recognition
    • IEEE
    • H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "Shifted-delta MLP features for spoken language recognition, " Signal Processing Letters, IEEE, vol. 20, no. 1, pp. 15-18, 2013.
    • (2013) Signal Processing Letters , vol.20 , Issue.1 , pp. 15-18
    • Wang, H.1    Leung, C.-C.2    Lee, T.3    Ma, B.4    Li, H.5
  • 22
    • 84890452240 scopus 로고    scopus 로고
    • Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection
    • H. Wang, T. Lee, C.-C. Leung, B. Ma, and H. Li, "Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection, " in Proc. ICASSP, 2013, pp. 8545-8949.
    • (2013) Proc. ICASSP , pp. 8545-8949
    • Wang, H.1    Lee, T.2    Leung, C.-C.3    Ma, B.4    Li, H.5
  • 23
    • 84867596539 scopus 로고    scopus 로고
    • Acoustic texttiling for story segmentation of spoken documents
    • L. Zheng, C.-C. Leung, L. Xie, B. Ma, and H. Li, "Acoustic texttiling for story segmentation of spoken documents, " in Proc. ICASSP, 2012, pp. 5121-5124.
    • (2012) Proc. ICASSP , pp. 5121-5124
    • Zheng, L.1    Leung, C.-C.2    Xie, L.3    Ma, B.4    Li, H.5
  • 25
    • 33746044621 scopus 로고    scopus 로고
    • Keyword extraction from a single document using word co-occurrence statistical information
    • Y. Matsuo and M. Ishizuka, "Keyword extraction from a single document using word co-occurrence statistical information, " International Journal on Artificial Intelligence Tools, vol. 13, no. 1, pp. 157-169, 2004.
    • (2004) International Journal on Artificial Intelligence Tools , vol.13 , Issue.1 , pp. 157-169
    • Matsuo, Y.1    Ishizuka, M.2
  • 26
    • 34548583274 scopus 로고    scopus 로고
    • A tutorial on spectral clustering
    • U. Von Luxburg, "A tutorial on spectral clustering, " Statistics and computing, vol. 17, no. 4, pp. 395-416, 2007.
    • (2007) Statistics and Computing , vol.17 , Issue.4 , pp. 395-416
    • Von Luxburg, U.1
  • 28
    • 34547844077 scopus 로고    scopus 로고
    • Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis
    • H. Kim and H. Park, "Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis, " Bioinformatics, vol. 23, no. 12, pp. 1495- 1502, 2007.
    • (2007) Bioinformatics , vol.23 , Issue.12 , pp. 1495-1502
    • Kim, H.1    Park, H.2
  • 29
    • 3042518464 scopus 로고
    • TIMIT: Acoustic-phonetic continuous speech corpus
    • J. Garofolo et al., "TIMIT: Acoustic-phonetic continuous speech corpus, " Linguistic Data Consortium, 1993.
    • (1993) Linguistic Data Consortium
    • Garofolo, J.1
  • 30
    • 84867600320 scopus 로고    scopus 로고
    • An acoustic segment modeling approach to query-by-example spoken term detection
    • H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "An acoustic segment modeling approach to query-by-example spoken term detection, " in Proc. ICASSP, 2012, pp. 5157-5160.
    • (2012) Proc. ICASSP , pp. 5157-5160
    • Wang, H.1    Leung, C.-C.2    Lee, T.3    Ma, B.4    Li, H.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.