Volume , Issue , 2013, Pages 410-415

Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings

Author keywords

Fixed dimensional embedding; query by example search; segmental acoustic modeling; speech indexing

Indexed keywords

ACOUSTIC MODEL; ACOUSTIC SIMILARITIES; APPLICATION OF STANDARDS; FIXED-DIMENSIONAL EMBEDDING; QUERY-BY-EXAMPLE; SPEECH INDEXING; UNSUPERVISED APPROACHES; VARIABLE-LENGTH SEGMENTS;

EID: 84893657609     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2013.6707765     Document Type: Conference Paper
Times cited: 167

References (34)
  • 1
    • 84858952478
    • Don't multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition
    • D. Gillick, L. Gillick, and S. Wegmann, "Don't multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition," in Proc. ASRU, 2011.
    • (2011) Proc. ASRU
    • Gillick, D.1    Gillick, L.2    Wegmann, S.3
  • 2
    • 85032752215
    • Exemplar-based processing for speech recognition: An overview
    • T. N. Sainath et al., "Exemplar-based processing for speech recognition: An overview," IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 98-113, 2012.
    • (2012) IEEE Signal Processing Magazine , vol.29 , Issue.6 , pp. 98-113
    • Sainath, T.N.1
  • 4
    • 84867584256
    • Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data
    • G. Heigold, P. Nguyen, M. Weintraub, and V. Vanhoucke, "Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data," in Proc. ICASSP, 2012.
    • (2012) Proc. ICASSP
    • Heigold, G.1    Nguyen, P.2    Weintraub, M.3    Vanhoucke, V.4
  • 6
    • 84865772542
    • Nearest neighbors with learned distances for phonetic frame classification
    • J. Labiak and K. Livescu, "Nearest neighbors with learned distances for phonetic frame classification," in Proc. Interspeech, 2011.
    • (2011) Proc. Interspeech
    • Labiak, J.1    Livescu, K.2
  • 8
    • 79959851706
    • Towards spoken term discovery at scale with zero resources
    • A. Jansen, K. Church, and H. Hermansky, "Towards spoken term discovery at scale with zero resources," in Proc. Interspeech, 2010.
    • (2010) Proc. Interspeech
    • Jansen, A.1    Church, K.2    Hermansky, H.3
  • 9
    • 84858987768
    • Efficient spoken term discovery using randomized algorithms
    • A. Jansen and B. Van Durme, "Efficient spoken term discovery using randomized algorithms," in Proc. ASRU, 2011.
    • (2011) Proc. ASRU
    • Jansen, A.1    Van Durme, B.2
  • 10
    • 84878566254
    • Indexing raw acoustic features for scalable zero resource search
    • A. Jansen and B. Van Durme, "Indexing raw acoustic features for scalable zero resource search," in Proc. Interspeech, 2012.
    • (2012) Proc. Interspeech
    • Jansen, A.1    Van Durme, B.2
  • 11
    • 77949473673
    • Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams
    • Y. Zhang and J. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams," in Proc. ASRU, 2009.
    • (2009) Proc. ASRU
    • Zhang, Y.1    Glass, J.2
  • 12
    • 84890478910
    • The spoken web search task at MediaEval 2012
    • F. Metze et al., "The spoken web search task at MediaEval 2012," in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Metze, F.1
  • 14
    • 0038359548
    • A probabilistic framework for segment-based speech recognition
    • J. R. Glass, "A probabilistic framework for segment-based speech recognition," Speech Communication, vol. 17, pp. 137-152, 2003.
    • (2003) Speech Communication , vol.17 , pp. 137-152
    • Glass, J.R.1
  • 15
    • 0012315045
    • From HMMs to segment models: Stochastic modelling for CSR
    • M. Ostendorf, "From HMMs to segment models: Stochastic modelling for CSR," in Automatic Speech and Speaker Recognition: Advanced Topics (C.-H. Lee, F. K. Soong, and K. K. Paliwal, Eds.), chapter 8, pp. 185-209. Springer, 1996.
    • (1996) Automatic Speech and Speaker Recognition: Advanced Topics , pp. 185-209
    • Ostendorf, M.1
  • 17
    • 80051659716
    • Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop
    • G. Zweig et al., "Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop," in Proc. ICASSP, 2011.
    • (2011) Proc. ICASSP
    • Zweig, G.1
  • 18
    • 33645789558
    • Acoustic modelling using continuous rational kernels
    • M. I. Layton and M. J. F. Gales, "Acoustic modelling using continuous rational kernels," in Proc. MLSP, 2005.
    • (2005) Proc. MLSP
    • Layton, M.I.1    Gales, M.J.F.2
  • 20
    • 77953988140
    • A novel vector representation of stochastic signals based on adapted ergodic HMMs
    • H. Tang, M. Hasegawa-Johnson, and T. Huang, "A novel vector representation of stochastic signals based on adapted ergodic HMMs," IEEE Signal Processing Letters, vol. 17, no. 8, pp. 715-718, 2010.
    • (2010) IEEE Signal Processing Letters , vol.17 , Issue.8 , pp. 715-718
    • Tang, H.1    Hasegawa-Johnson, M.2    Huang, T.3
  • 22
    • 0001907042
    • Approximate nearest neighbors: Towards removing the curse of dimensionality
    • P. Indyk and R. Motwani, "Approximate nearest neighbors: Towards removing the curse of dimensionality," in Proc. STOC, 1998.
    • (1998) Proc. STOC
    • Indyk, P.1    Motwani, R.2
  • 25
    • 0034704222
    • Nonlinear dimensionality reduction by locally linear embedding
    • S. Roweis and L. Saul, "Nonlinear dimensionality reduction by locally linear embedding," Science, vol. 290, no. 5500, 2000.
    • (2000) Science , vol.290 , Issue.5500
    • Roweis, S.1    Saul, L.2
  • 26
    • 84898964829
    • Stochastic neighbor embedding
    • G. Hinton and S. T. Roweis, "Stochastic neighbor embedding," in NIPS, 2003.
    • (2003) NIPS
    • Hinton, G.1    Roweis, S.T.2
  • 27
    • 0042378381
    • Laplacian eigenmaps for dimensionality reduction and data representation
    • DOI 10.1162/089976603321780317
    • M. Belkin and P. Niyogi, "Laplacian Eigenmaps for Dimensionality Reduction and Data Representation," Neural Computation, vol. 15, pp. 1373-1396, 2003. (Pubitemid 37049796)
    • (2003) Neural Computation , vol.15 , Issue.6 , pp. 1373-1396
    • Belkin, M.1    Niyogi, P.2
  • 28
    • 33750729556
    • Manifold regularization: A geometric framework for learning from labeled and unlabeled examples
    • M. Belkin, P. Niyogi, and V. Sindhwani, "Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples," Journal of Machine Learning Research, vol. 7, pp. 2399-2434, 2006. (Pubitemid 44708005)
    • (2006) Journal of Machine Learning Research , vol.7 , pp. 2399-2434
    • Belkin, M.1    Niyogi, P.2    Sindhwani, V.3
  • 29
    • 84868553158
    • Application of a locality preserving discriminant analysis approach to ASR
    • V. S. Tomar and R. C. Rose, "Application of a locality preserving discriminant analysis approach to ASR," in Proc. ISSPA, 2012.
    • (2012) Proc. ISSPA
    • Tomar, V.S.1    Rose, R.C.2
  • 32
    • 84858969928
    • Rapid evaluation of speech representations for spoken term discovery
    • M. Carlin, S. Thomas, A. Jansen, and H. Hermansky, "Rapid evaluation of speech representations for spoken term discovery," in Proc. ICASSP, 2011.
    • (2011) Proc. ICASSP
    • Carlin, M.1    Thomas, S.2    Jansen, A.3    Hermansky, H.4
  • 33
    • 70349212558
    • Phoneme recognition using spectral envelope and modulation frequency features
    • S. Thomas, S. Ganapathy, and H. Hermansky, "Phoneme recognition using spectral envelope and modulation frequency features," in Proc. ICASSP, 2009.
    • (2009) Proc. ICASSP
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 34
    • 84890488932
    • A summary of the 2012 CLSP workshop on zero resource speech technologies and models of early language acquisition
    • A. Jansen et al., "A summary of the 2012 CLSP workshop on zero resource speech technologies and models of early language acquisition," in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Jansen, A.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.