SCOPUS 정보 검색 플랫폼

2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Proceedings

Volumn , Issue , 2013, Pages 410-415

Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings

(4) Levin, Keith a Henry, Katharine b Jansen, Aren a Livescu, Karen c

a Johns Hopkins University (United States)

b UNIVERSITY OF CHICAGO (United States)

c TOYOTA TECHNOLOGICAL INSTITUTE AT CHICAGO (United States)

Author keywords

Fixed dimensional embedding; query by example search; segmental acoustic modeling; speech indexing

Indexed keywords

ACOUSTIC MODEL; ACOUSTIC SIMILARITIES; APPLICATION OF STANDARDS; FIXED-DIMENSIONAL EMBEDDING; QUERY-BY-EXAMPLE; SPEECH INDEXING; UNSUPERVISED APPROACHES; VARIABLE-LENGTH SEGMENTS;

DISTANCE EDUCATION; LINGUISTICS;

INDEXING (OF INFORMATION);

EID: 84893657609 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASRU.2013.6707765 Document Type: Conference Paper

Times cited : (167)

References (34)

1
- 84858952478
- Don't multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition
- D. Gillick, L. Gillick, and S. Wegmann, "Don't multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition, " in Proc. ASRU, 2011.
- (2011) Proc. ASRU
- Gillick, D.¹ Gillick, L.² Wegmann, S.³

2
- 85032752215
- Exemplar-based processing for speech recognition: An overview
- T. N. Sainath et al., "Exemplar-based processing for speech recognition: An overview, " IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 98-113, 2012.
- (2012) IEEE Signal Processing Magazine , vol.29 , Issue.6 , pp. 98-113
- Sainath, T.N.¹

3
- 45549086638
- Template-based continuous speech recognition
- De Wachter et al., "Template-based continuous speech recognition, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 4, pp. 1377-1390, 2007.
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.4 , pp. 1377-1390
- De Wachter¹

4
- 84867584256
- Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data
- G. Heigold, P. Nguyen, M. Weintraub, and V. Vanhoucke, "Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data, " in Proc. ICASSP, 2012.
- (2012) Proc. ICASSP
- Heigold, G.¹ Nguyen, P.² Weintraub, M.³ Vanhoucke, V.⁴

5
- 35148876898
- Distance metric learning: A comprehensive survey
- L. Yang and R. Jin, "Distance metric learning: A comprehensive survey, " Tech. Rep., Michigan State Universiy, 2006.
- (2006) Tech. Rep., Michigan State Universiy
- Yang, L.¹ Jin, R.²

6
- 84865772542
- Nearest neighbors with learned distances for phonetic frame classification
- J. Labiak and K. Livescu, "Nearest neighbors with learned distances for phonetic frame classification, " in Proc. Interspeech, 2011.
- (2011) Proc. Interspeech
- Labiak, J.¹ Livescu, K.²

7
- 64849085294
- Unsupervised pattern discovery in speech
- A. Park and J. R. Glass, "Unsupervised pattern discovery in speech, " IEEE Transcations on Audio, Speech, and Language Processing, vol. 16, no. 1, pp. 186-197, 2008.
- (2008) IEEE Transcations on Audio, Speech, and Language Processing , vol.16 , Issue.1 , pp. 186-197
- Park, A.¹ Glass, J.R.²

8
- 79959851706
- Towards spoken term discovery at scale with zero resources
- A. Jansen, K. Church, and H. Hermansky, "Towards spoken term discovery at scale with zero resources, " in Proc. Interspeech, 2010.
- (2010) Proc. Interspeech
- Jansen, A.¹ Church, K.² Hermansky, H.³

9
- 84858987768
- Efficient spoken term discovery using randomized algorithms
- A. Jansen and B. Van Durme, "Efficient spoken term discovery using randomized algorithms, " in Proc. ASRU, 2011.
- (2011) Proc. ASRU
- Jansen, A.¹ Van Durme, B.²

10
- 84878566254
- Indexing raw acoustic features for scalable zero resource search
- A. Jansen and B. Van Durme, "Indexing raw acoustic features for scalable zero resource search, " in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Jansen, A.¹ Van Durme, B.²

11
- 77949473673
- Unsupervised spoken keyword spotting via segmental dtw on gaussian posteriorgrams
- Y. Zhang and J. Glass, "Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams, " in Proc. ASRU, 2009.
- (2009) Proc. ASRU
- Zhang, Y.¹ Glass, J.²

12
- 84890478910
- The spoken web search task at mediaeval 2012
- F. Metze et al., "The spoken web search task at MediaEval 2012, " in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Metze, F.¹

13
- 85121123643
- The mit summit speech recognition system: A progress report
- V. Zue, J. Glass, M. Phillips, and S. Seneff, "The MIT SUMMIT speech recognition system: A progress report, " in Workshop on Speech and Natural Language, 1989.
- (1989) Workshop on Speech and Natural Language
- Zue, V.¹ Glass, J.² Phillips, M.³ Seneff, S.⁴

14
- 0038359548
- A probabilistic framework for segment-based speech recognition
- J. R. Glass, "A probabilistic framework for segment-based speech recognition, " Speech Communication, vol. 17, pp. 137- 152, 2003.
- (2003) Speech Communication , vol.17 , pp. 137-152
- Glass, J.R.¹

15
- 0012315045
- From hmms to segment models: Stochastic modelling for CSR
- (C.-H. Lee, F. K. Soong, and K. K. Paliwal, Eds.), chapter 8, Springer
- M. Ostendorf, "From HMMs to segment models: Stochastic modelling for CSR, " in Automatic Speech and Speaker Recognition: Advanced Topics (C.-H. Lee, F. K. Soong, and K. K. Paliwal, Eds.), chapter 8, pp. 185-209. Springer, 1996.
- (1996) Automatic Speech and Speaker Recognition: Advanced Topics , pp. 185-209
- Ostendorf, M.¹

16
- 84906282118
- Deep segmental neural networks for speech recognition
- O. Abdel-Hamid, L. Deng, D. Yu, and H. Jiang, "Deep segmental neural networks for speech recognition, " in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Abdel-Hamid, O.¹ Deng, L.² Yu, D.³ Jiang, H.⁴

17
- 80051659716
- Speech recognition with segmental conditional random fields: A summary of the jhu clsp 2010 summer workshop
- G. Zweig et al., "Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop, " in Proc. ICASSP, 2011.
- (2011) Proc. ICASSP
- Zweig, G.¹

18
- 33645789558
- Acoustic modelling using continuous rational kernels
- M. I. Layton and M. J. F. Gales, "Acoustic modelling using continuous rational kernels, " in Proc. MLSP, 2005.
- (2005) Proc. MLSP
- Layton, M.I.¹ Gales, M.J.F.²

19
- 84893638889
- Wordlevel acoustic modeling with convolutional vector regression
- A. Maas, S. Miller, T. O'Neil, A. Ng, and P. Nguyen, "Wordlevel acoustic modeling with convolutional vector regression, " in Proc. ICML Workshop on Representation Learning, 2012.
- (2012) Proc. ICML Workshop on Representation Learning
- Maas, A.¹ Miller, S.² O'Neil, T.³ Ng, A.⁴ Nguyen, P.⁵

20
- 77953988140
- A novel vector representation of stochastic signals based on adapted ergodic hmms
- H. Tang, M. Hasegawa-Johnson, and T. Huang, "A novel vector representation of stochastic signals based on adapted ergodic HMMs, " IEEE Signal Processing Letters, vol. 17, no. 8, pp. 715-718, 2010.
- (2010) IEEE Signal Processing Letters , vol.17 , Issue.8 , pp. 715-718
- Tang, H.¹ Hasegawa-Johnson, M.² Huang, T.³

21
- 0038633576
- Properties of embedding methods for similarity searching in metric spaces
- G. R. Hjaltason and H. Samet, "Properties of embedding methods for similarity searching in metric spaces, " IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 5, pp. 530-549, 2003.
- (2003) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.25 , Issue.5 , pp. 530-549
- Hjaltason, G.R.¹ Samet, H.²

22
- 0001907042
- Approximate nearest neighbors: Towards removing the curse of dimensionality
- P. Indyk and R. Motwani, "Approximate nearest neighbors: Towards removing the curse of dimensionality, " in Proc. STOC, 1998.
- (1998) Proc. STOC
- Indyk, P.¹ Motwani, R.²

23
- 0010011718
- Cluster-preserving embedding of proteins
- G. Hristescu and M. Farach-Colton, "Cluster-preserving embedding of proteins, " Tech. Rep. 99-50, Rutgers University, 1999.
- (1999) Tech. Rep. 99-50, Rutgers University
- Hristescu, G.¹ Farach-Colton, M.²

24
- 77956528213
- Metric learning to rank
- B. McFee and G. R. Lanckriet, "Metric learning to rank, " in Proc. ICML, 2010.
- (2010) Proc. ICML
- McFee, B.¹ Lanckriet, G.R.²

25
- 0034704222
- Nonlinear dimensionality reduction by locally linear embedding
- S. Roweis and L. Saul, "Nonlinear dimensionality reduction by locally linear embedding, " Science, vol. 290, no. 5500, 2000.
- (2000) Science , vol.290 , Issue.5500
- Roweis, S.¹ Saul, L.²

26
- 84898964829
- Stochastic neighbor embedding
- G. Hinton and S. T. Roweis, "Stochastic neighbor embedding, " in NIPS, 2003.
- (2003) NIPS
- Hinton, G.¹ Roweis, S.T.²

27
- 0042378381
- Laplacian eigenmaps for dimensionality reduction and data representation
- DOI 10.1162/089976603321780317
- M. Belkin and P. Niyogi, "Laplacian Eigenmaps for Dimensionality Reduction and Data Represenation, " Neural Computation, vol. 16, pp. 1373-1396, 2003. (Pubitemid 37049796)
- (2003) Neural Computation , vol.15 , Issue.6 , pp. 1373-1396
- Belkin, M.¹ Niyogi, P.²

28
- 33750729556
- Manifold regularization: A geometric framework for learning from labeled and unlabeled examples
- M. Belkin, P. Niyogi, and V. Sindhwani, "Manifold Regularization: A Geometric Framework for Learning from Examples, " Journal of Machine Learning Research, vol. 7, pp. 2399-2434, 2006. (Pubitemid 44708005)
- (2006) Journal of Machine Learning Research , vol.7 , pp. 2399-2434
- Belkin, M.¹ Niyogi, P.² Sindhwani, V.³

29
- 84868553158
- Application of a locality preserving discriminant analysis approach to asr
- V. S. Tomar and R. C. Rose, "Application of a locality preserving discriminant analysis approach to ASR, " in Proc. ISSPA, 2012.
- (2012) Proc. ISSPA
- Tomar, V.S.¹ Rose, R.C.²

30
- 39449085279
- Locality sensitive discriminant analysis
- D. Cai, J. Han, X. He, K. Zhou, and H. Bao, "Locality sensitive discriminant analysis, " in Proc. IJCAI, 2007.
- (2007) Proc. IJCAI
- Cai, D.¹ Han, J.² He, X.³ Zhou, K.⁴ Bao, H.⁵

31
- 33947194180
- Graph embedding and extensions: A general framework for dimensionality reduction
- DOI 10.1109/TPAMI.2007.250598
- S. Yan et al., "Graph embedding and extensions: A general framework for dimensionality reduction, " IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 29, no. 1, pp. 40-51, 2007. (Pubitemid 46415944)
- (2007) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.29 , Issue.1 , pp. 40-51
- Yan, S.¹ Xu, D.² Zhang, B.³ Zhang, H.-J.⁴ Yang, Q.⁵ Lin, S.⁶

32
- 84858969928
- Rapid evaluation of speech representations for spoken term discovery
- M. Carlin, S. Thomas, A. Jansen, and H. Hermansky, "Rapid evaluation of speech representations for spoken term discovery, " in Proc. ICASSP, 2011.
- (2011) Proc. ICASSP
- Carlin, M.¹ Thomas, S.² Jansen, A.³ Hermansky, H.⁴

33
- 70349212558
- Phoneme recognition using spectral envelope and modulation frequency features
- S. Thomas, S. Ganapathy, and H. Hermansky, "Phoneme recognition using spectral envelope and modulation frequency features, " in Proc. ICASSP, 2009.
- (2009) Proc. ICASSP
- Thomas, S.¹ Ganapathy, S.² Hermansky, H.³

34
- 84890488932
- A summary of the 2012 CLSP workshop on zero resource speech technologies and models of early language acquisition
- A. Jansen et al., "A summary of the 2012 CLSP workshop on zero resource speech technologies and models of early language acquisition, " in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Jansen, A.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.