메뉴 건너뛰기




Volumn , Issue , 2013, Pages 2297-2301

Unsupervised mining of acoustic subword units with segment-level gaussian posteriorgrams

Author keywords

Gaussian by segment matrix; Non negative matrix factorization; Normalized cut; Segment level posteriorgrams; Unsupervised acoustic unit mining

Indexed keywords

CLUSTERING ALGORITHMS; GAUSSIAN DISTRIBUTION; IMAGE SEGMENTATION;

EID: 84906281211     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (25)

References (29)
  • 1
    • 0036476255 scopus 로고    scopus 로고
    • Automatic generation of subword units for speech recognition systems
    • R. Singh, B. Raj, and R. Stern, "Automatic generation of subword units for speech recognition systems, " IEEE Trans. SAP, vol. 10, no. 2, pp. 89-99, 2002.
    • (2002) IEEE Trans. SAP , vol.10 , Issue.2 , pp. 89-99
    • Singh, R.1    Raj, B.2    Stern, R.3
  • 2
    • 79959819374 scopus 로고    scopus 로고
    • Im- proved topic classification and keyword discovery using an HMM-based speech recognizer trained without super- vision
    • M.-H. Siu, H. Gish, A. Chan, and W. Belfield, "Im- proved topic classification and keyword discovery using an HMM-based speech recognizer trained without super- vision, " in Proc. INTERSPEECH, 2010, pp. 2838-2841.
    • (2010) Proc. INTERSPEECH , pp. 2838-2841
    • Siu, M.-H.1    Gish, H.2    Chan, A.3    Belfield, W.4
  • 3
    • 84890452240 scopus 로고    scopus 로고
    • Using parallel tokenizers with DTW matrix combination for low- resource spoken term detection
    • H. Wang, T. Lee, C.-C. Leung, B. Ma, and H. Li, "Using parallel tokenizers with DTW matrix combination for low- resource spoken term detection, " in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Wang, H.1    Lee, T.2    Leung, C.-C.3    Ma, B.4    Li, H.5
  • 4
    • 0023800699 scopus 로고
    • A segment model based approach to speech recognition
    • C.-H. Lee, F. Soong, and B.-H. Juang, "A segment model based approach to speech recognition, " in Proc. ICASSP, 1988, pp. 501-541.
    • (1988) Proc. ICASSP , pp. 501-541
    • Lee, C.-H.1    Soong, F.2    Juang, B.-H.3
  • 5
    • 0033353287 scopus 로고    scopus 로고
    • Joint lexicon, acoustic unit inventory and model design
    • M. Bacchiani and M. Ostendorf, "Joint lexicon, acous- Tic unit inventory and model design, " Speech Communication, vol. 29, no. 2, pp. 99-114, 1999.
    • (1999) Speech Communication , vol.29 , Issue.2 , pp. 99-114
    • Bacchiani, M.1    Ostendorf, M.2
  • 6
    • 0034244751 scopus 로고    scopus 로고
    • Normalized cuts and image segmentation
    • J. Shi and J. Malik, "Normalized cuts and image segmentation, " IEEE Trans. PAMI, vol. 22, no. 8, pp. 888-905, 2000.
    • (2000) IEEE Trans. PAMI , vol.22 , Issue.8 , pp. 888-905
    • Shi, J.1    Malik, J.2
  • 8
    • 77949473673 scopus 로고    scopus 로고
    • Unsupervised spoken keyword spotting via segmental DTW on Gaussian posterior- grams
    • Y. Zhang and J. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posterior- grams, " in Proc. ASRU, 2009, pp. 398-403.
    • (2009) Proc. ASRU , pp. 398-403
    • Zhang, Y.1    Glass, J.2
  • 9
    • 70449646765 scopus 로고    scopus 로고
    • Acoustic segment modeling for speaker recognition
    • B. Ma, D. Zhu, and H. Li, "Acoustic segment modeling for speaker recognition, " in Proc. ICME, 2009, pp. 1668- 1671.
    • (2009) Proc. ICME , pp. 1668-1671
    • Ma, B.1    Zhu, D.2    Li, H.3
  • 10
    • 84873444148 scopus 로고    scopus 로고
    • A study on music genre classifica- Tion based on universal acoustic models
    • J. Reed and C.-H. Lee, "A study on music genre classifica- Tion based on universal acoustic models, " in Proc. ISMIR, 2006, pp. 89-94.
    • (2006) Proc. ISMIR , pp. 89-94
    • Reed, J.1    Lee, C.-H.2
  • 11
    • 0027277742 scopus 로고
    • A segmental speech model with applications to word spotting
    • H. Gish and K. Ng, "A segmental speech model with applications to word spotting, " in Proc. ICASSP, vol. 2, 1993, pp. 447-450.
    • (1993) Proc. ICASSP , vol.2 , pp. 447-450
    • Gish, H.1    Ng, K.2
  • 12
    • 84867600320 scopus 로고    scopus 로고
    • An acoustic segment modeling approach to query-by- example spoken term detection
    • H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "An acoustic segment modeling approach to query-by- example spoken term detection, " in Proc. ICASSP, 2012, pp. 5157-5160.
    • (2012) Proc. ICASSP , pp. 5157-5160
    • Wang, H.1    Leung, C.-C.2    Lee, T.3    Ma, B.4    Li, H.5
  • 13
    • 84858975943 scopus 로고    scopus 로고
    • Topic modeling for spoken documents using only pho- netic information
    • T. Hazen, M.-H. Siu, H. Gish, S. Lowe, and A. Chan, "Topic modeling for spoken documents using only pho- netic information, " in Proc. ASRU, 2011, pp. 395-400.
    • (2011) Proc. ASRU , pp. 395-400
    • Hazen, T.1    Siu, M.-H.2    Gish, H.3    Lowe, S.4    Chan, A.5
  • 14
    • 84867221475 scopus 로고    scopus 로고
    • Automatically learning speaker-independent acoustic subword units
    • B. Varadarajan and S. Khudanpur, "Automatically learn- ing speaker-independent acoustic subword units, " in Proc. INTERSPEECH, 2008, pp. 1333-1336.
    • (2008) Proc. INTERSPEECH , pp. 1333-1336
    • Varadarajan, B.1    Khudanpur, S.2
  • 15
    • 0029745231 scopus 로고    scopus 로고
    • Maximum likelihood successive state splitting
    • H. Singer and M. Ostendorf, "Maximum likelihood suc- cessive state splitting, " in Proc. ICASSP, vol. 2, 1996, pp. 601-604.
    • (1996) Proc. ICASSP , vol.2 , pp. 601-604
    • Singer, H.1    Ostendorf, M.2
  • 16
    • 84867809023 scopus 로고    scopus 로고
    • A nonparametric bayesian approach to acoustic model discovery
    • C. Lee and J. Glass, "A nonparametric bayesian approach to acoustic model discovery, " in Proc. ACL, 2012.
    • (2012) Proc. ACL
    • Lee, C.1    Glass, J.2
  • 17
    • 84865770260 scopus 로고    scopus 로고
    • Towards unsupervised training of speaker independent acoustic models
    • A. Jansen and K. Church, "Towards unsupervised training of speaker independent acoustic models, " in Proc. INTER- SPEECH, 2011, pp. 1693-1696.
    • (2011) Proc. INTER-SPEECH , pp. 1693-1696
    • Jansen, A.1    Church, K.2
  • 18
    • 84890467020 scopus 로고    scopus 로고
    • Weak top- down constraints for unsupervised acoustic model training
    • A. Jansen, S. Thomas, and H. Hermansky, "Weak top- down constraints for unsupervised acoustic model training, " in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Jansen, A.1    Thomas, S.2    Hermansky, H.3
  • 19
    • 44949165663 scopus 로고    scopus 로고
    • Using posterior- based features in template matching for speech recognition
    • G. Aradilla, J. Vepa, and H. Bourlard, "Using posterior- based features in template matching for speech recognition, " in Proc. INTERSPEECH, 2006, pp. 1186-1189.
    • (2006) Proc. INTERSPEECH , pp. 1186-1189
    • Aradilla, G.1    Vepa, J.2    Bourlard, H.3
  • 20
    • 77949351968 scopus 로고    scopus 로고
    • Query-by-example spoken term detection using phonetic posteriorgram templates
    • T. Hazen, W. Shen, and C. White, "Query-by-example spoken term detection using phonetic posteriorgram templates, " in Proc. ASRU, 2009, pp. 421-426.
    • (2009) Proc. ASRU , pp. 421-426
    • Hazen, T.1    Shen, W.2    White, C.3
  • 25
    • 41249089920 scopus 로고    scopus 로고
    • On the equivalence between non-negative matrix factorization and probabilistic latent semantic indexing
    • C. Ding, T. Li, and W. Peng, "On the equivalence be- Tween non-negative matrix factorization and probabilistic latent semantic indexing, " Computational Statistics and Data Analysis, vol. 52, no. 8, pp. 3913-3927, 2008.
    • (2008) Computational Statistics and Data Analysis , vol.52 , Issue.8 , pp. 3913-3927
    • Ding, C.1    Li, T.2    Peng, W.3
  • 27
    • 84862931515 scopus 로고    scopus 로고
    • Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data
    • S. Siniscalchi, D.-C. Lyu, T. Svendsen, and C.-H. Lee, "Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data, " IEEE Trans. ASLP, vol. 20, no. 3, pp. 875-887, 2012.
    • (2012) IEEE Trans. ASLP , vol.20 , Issue.3 , pp. 875-887
    • Siniscalchi, S.1    Lyu, D.-C.2    Svendsen, T.3    Lee, C.-H.4
  • 29
    • 44849118234 scopus 로고    scopus 로고
    • Broad phonetic class recognition in a hidden Markov model framework using extended Baum-Welch transformations
    • T. Sainath, D. Kanevsky, and B. Ramabhadran, "Broad phonetic class recognition in a hidden Markov model framework using extended Baum-Welch transformations, " in Proc. ASRU, 2007, pp. 306-311.
    • (2007) Proc. ASRU , pp. 306-311
    • Sainath, T.1    Kanevsky, D.2    Ramabhadran, B.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.