-
1
-
-
77955558847
-
Real-world acoustic event detection
-
X. Zhuang, X. Zhou, M. A. Hasegawa-Johnson, and T. S. Huang, "Real-world acoustic event detection, " Pattern Recognition Letters, vol. 31, no. 12, pp. 1543-1551, 2010.
-
(2010)
Pattern Recognition Letters
, vol.31
, Issue.12
, pp. 1543-1551
-
-
Zhuang, X.1
Zhou, X.2
Hasegawa-Johnson, M.A.3
Huang, T.S.4
-
2
-
-
84905269591
-
2011 Multimedia event detection: Late- fusion approaches to combine multiple audio-visual features
-
A. G. A. Perera, S. Oh, M. Leotta, I. Kim, B. Byun, C.-H. Lee, S. McCloskey, J. Liu, B. Miller, Z. F. Huan, A. Vahdat, W. Yang, G. Mori, K. Tang, D. Koller, L. Fei-Fei, K. Li, G. Chen, J. Corso, Y. Fu, and R. Srihari, "2011 Multimedia Event Detection: Late- Fusion Approaches to Combine Multiple Audio-Visual features, " in Proc. NIST TRECVID Workshop, 2011.
-
(2011)
Proc. NIST TRECVID Workshop
-
-
Perera, A.G.A.1
Oh, S.2
Leotta, M.3
Kim, I.4
Byun, B.5
Lee, C.-H.6
McCloskey, S.7
Liu, J.8
Miller, B.9
Huan, Z.F.10
Vahdat, A.11
Yang, W.12
Mori, G.13
Tang, K.14
Koller, D.15
Fei-Fei, L.16
Li, K.17
Chen, G.18
Corso, J.19
Fu, Y.20
Srihari, R.21
more..
-
3
-
-
84905233993
-
Tokyotech+ canon at TRECVID 2011
-
N. Inoue, Y. Kamishima, T. Wada, K. Shinoda, and S. Sato, "TokyoTech+ Canon at TRECVID 2011, " in Proc. NIST TRECVID Workshop, 2011.
-
(2011)
Proc. NIST TRECVID Workshop
-
-
Inoue, N.1
Kamishima, Y.2
Wada, T.3
Shinoda, K.4
Sato, S.5
-
4
-
-
84867614198
-
Audio event detection from acoustic unit occurrence patterns
-
IEEE
-
A. Kumar, P. Dighe, R. Singh, S. Chaudhuri, and B. Raj, "Audio event detection from acoustic unit occurrence patterns, " in Proc. ICASSP. IEEE, 2012, pp. 489-492.
-
(2012)
Proc. ICASSP
, pp. 489-492
-
-
Kumar, A.1
Dighe, P.2
Singh, R.3
Chaudhuri, S.4
Raj, B.5
-
5
-
-
11244272075
-
Highlight sound effects detection in audio stream
-
IEEE
-
R. Cai, L. Lu, H.-J. Zhang, and L.-H. Cai, "Highlight sound effects detection in audio stream, " in Proc. ICME, vol. 3. IEEE, 2003, pp. III-37.
-
(2003)
Proc. ICME
, vol.3
-
-
Cai, R.1
Lu, L.2
Zhang, H.-J.3
Cai, L.-H.4
-
6
-
-
51449101221
-
Feature analysis and selection for acoustic event detection
-
X. Zhuang, X. Zhou, T. S. Huang, and M. Hasegawa-Johnson, "Feature analysis and selection for acoustic event detection, " in in Proc. ICASSP. IEEE, 2008, pp. 17-20.
-
(2008)
Proc. ICASSP. IEEE
, pp. 17-20
-
-
Zhuang, X.1
Zhou, X.2
Huang, T.S.3
Hasegawa-Johnson, M.4
-
7
-
-
84878582006
-
Consumerlevel multimedia event detection through unsupervised audio signal modeling
-
B. Byun, I. Kim, S. M. Siniscalchi, and C.-H. Lee, "Consumerlevel multimedia event detection through unsupervised audio signal modeling, " in Proc. INTERSPEECH, 2012.
-
(2012)
Proc. INTERSPEECH
-
-
Byun, B.1
Kim, I.2
Siniscalchi, S.M.3
Lee, C.-H.4
-
8
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition, " Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, 1989.
-
(1989)
Proceedings of the IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
9
-
-
33947632983
-
Automatic image annotation through multi-topic text categorization
-
IEEE
-
S. Gao, D.-H.Wang, and C.-H. Lee, "Automatic image annotation through multi-topic text categorization, " in Proc. ICASSP, vol. 2. IEEE, 2006, pp. II-II.
-
(2006)
Proc. ICASSP
, vol.2
-
-
Gao, S.1
Wang, D.-h.2
Lee, C.-H.3
-
10
-
-
77951957024
-
An incremental learning framework combining sample confidence and discrimination with an application to automatic image annotation
-
IEEE
-
B. Byun and C.-H. Lee, "An incremental learning framework combining sample confidence and discrimination with an application to automatic image annotation, " in Proc. ICIP. IEEE, 2009, pp. 1441-1444.
-
(2009)
Proc. ICIP
, pp. 1441-1444
-
-
Byun, B.1
Lee, C.-H.2
-
11
-
-
70349213510
-
A hierarchical grid feature representation framework for automatic image annotation
-
IEEE
-
I. Kim and C.-H. Lee, "A hierarchical grid feature representation framework for automatic image annotation, " in Proc. ICASSP. IEEE, 2009, pp. 1125-1128.
-
(2009)
Proc. ICASSP
, pp. 1125-1128
-
-
Kim, I.1
Lee, C.-H.2
-
12
-
-
34547502608
-
A vector space modeling approach to spoken language identification
-
H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 1, pp. 271-284, 2007.
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, Issue.1
, pp. 271-284
-
-
Li, H.1
Ma, B.2
Lee, C.-H.3
-
13
-
-
14344255188
-
A MFoM learning approach to robust multiclass multi-label text categorization
-
ACM
-
S. Gao, W. Wu, C.-H. Lee, and T.-S. Chua, "A MFoM learning approach to robust multiclass multi-label text categorization, " in Proc. ICML. ACM, 2004, p. 42.
-
(2004)
Proc. ICML
, pp. 42
-
-
Gao, S.1
Wu, W.2
Lee, C.-H.3
Chua, T.-S.4
-
14
-
-
79956286980
-
A regularized maximum figure-of-merit (rmfom) approach to supervised and semi-supervised learning
-
C. Ma and C.-H. Lee, "A Regularized Maximum Figure-of-Merit (rMFoM) Approach to Supervised and Semi-Supervised Learning, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 5, pp. 1316-1327, 2011.
-
(2011)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.19
, Issue.5
, pp. 1316-1327
-
-
Ma, C.1
Lee, C.-H.2
-
15
-
-
50249170027
-
Joint factor analysis versus eigenchannels in speaker recognition
-
P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Joint factor analysis versus eigenchannels in speaker recognition, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 4, pp. 1435-1447, 2007.
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, Issue.4
, pp. 1435-1447
-
-
Kenny, P.1
Boulianne, G.2
Ouellet, P.3
Dumouchel, P.4
-
16
-
-
70450180849
-
Support vector machines versus fast scoring in the lowdimensional total variability space for speaker verification
-
N. Dehak, R. Dehak, P. Kenny, N. Brummer, P. Ouellet, and P. Dumouchel, "Support vector machines versus fast scoring in the lowdimensional total variability space for speaker verification, " in Proc. Interspeech, 2009, pp. 1559-1562.
-
(2009)
Proc. Interspeech
, pp. 1559-1562
-
-
Dehak, N.1
Dehak, R.2
Kenny, P.3
Brummer, N.4
Ouellet, P.5
Dumouchel, P.6
-
17
-
-
85073247582
-
Variational bayes logistic regression as regularized fusion for NIST sre 2010
-
V. Hautamäki, K. A. Lee, A. Larcher, T. Kinnunen, B. Ma, and H. Li, "Variational bayes logistic regression as regularized fusion for NIST sre 2010, " in Proc. Odyssey: The Speaker and Language Recognition Workshop, 2012.
-
(2012)
Proc. Odyssey: The Speaker and Language Recognition Workshop
-
-
Hautamäki, V.1
Lee, K.A.2
Larcher, A.3
Kinnunen, T.4
Ma, B.5
Li, H.6
-
18
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
N. Dehak, P. J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 4, pp. 788-798, 2011.
-
(2011)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.19
, Issue.4
, pp. 788-798
-
-
Dehak, N.1
Kenny, P.J.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
19
-
-
0038959172
-
Probabilistic principal component analysis
-
M. E. Tipping and C. M. Bishop, "Probabilistic principal component analysis, " Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol. 61, no. 3, pp. 611-622, 1999.
-
(1999)
Journal of the Royal Statistical Society: Series B (Statistical Methodology)
, vol.61
, Issue.3
, pp. 611-622
-
-
Tipping, M.E.1
Bishop, C.M.2
-
21
-
-
18744386134
-
Eigenvoice modeling with sparse training data
-
P. Kenny, G. Boulianne, and P. Dumouchel, "Eigenvoice modeling with sparse training data, " IEEE Transactions on Speech and Audio Processing, vol. 13, no. 3, pp. 345-354, 2005.
-
(2005)
IEEE Transactions on Speech and Audio Processing
, vol.13
, Issue.3
, pp. 345-354
-
-
Kenny, P.1
Boulianne, G.2
Dumouchel, P.3
-
23
-
-
0002629270
-
Maximum likelihood from incomplete data via the em algorithm
-
Series B (Methodological
-
A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the em algorithm, " Journal of the Royal Statistical Society. Series B (Methodological), pp. 1-38, 1977.
-
(1977)
Journal of the Royal Statistical Society
, pp. 1-38
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
24
-
-
85084012167
-
ALIZE/spkdet: A state-of-the-art open source software for speaker recognition
-
J.-F. Bonastre, N. Scheffer, D. Matrouf, C. Fredouille, A. Larcher, A. Preti, G. Pouchoulin, N. Evans, B. Fauve, and J. Mason, "ALIZE/SpkDet: A state-of-the-art open source software for speaker recognition, " in Proc. Odyssey: The Speaker and Language Recognition Workshop, 2008.
-
(2008)
Proc. Odyssey: The Speaker and Language Recognition Workshop
-
-
Bonastre, J.-F.1
Scheffer, N.2
Matrouf, D.3
Fredouille, C.4
Larcher, A.5
Preti, A.6
Pouchoulin, G.7
Evans, N.8
Fauve, B.9
Mason, J.10
-
25
-
-
34249753618
-
Support-vector networks
-
C. Cortes and V. Vapnik, "Support-vector networks, " Machine learning, vol. 20, no. 3, pp. 273-297, 1995.
-
(1995)
Machine Learning
, vol.20
, Issue.3
, pp. 273-297
-
-
Cortes, C.1
Vapnik, V.2
-
26
-
-
84906260904
-
TRECVID 2012 genie: Multimedia event detection and recounting
-
A. Vahdat, K. Cannons, H. Hajimirsadeghi, G. Mori, S. Mc-Closkey, B. Miller, S. Venkatesha, P. Davalos, P. Das, C. Xu et al., "TRECVID 2012 GENIE: Multimedia event detection and recounting, " in Proc. NIST TRECVID Workshop, 2012.
-
(2012)
Proc. NIST TRECVID Workshop
-
-
Vahdat, A.1
Cannons, K.2
Hajimirsadeghi, H.3
Mori, G.4
Mc-Closkey, S.5
Miller, B.6
Venkatesha, S.7
Davalos, P.8
Das, P.9
Xu, C.10
-
27
-
-
69949113988
-
An experimental study on discriminative concept classifier combination for trecvid high-level feature extraction
-
B. Byun, C. Ma, and C.-H. Lee, "An experimental study on discriminative concept classifier combination for trecvid high-level feature extraction, " in Proc. ICIP. IEEE, 2008, pp. 2532-2535.
-
(2008)
Proc. ICIP. IEEE
, pp. 2532-2535
-
-
Byun, B.1
Ma, C.2
Lee, C.-H.3
-
28
-
-
79955702502
-
LIBSVM: A library for support vector machines
-
27:1-27:27, software available at
-
C.-C. Chang and C.-J. Lin, "LIBSVM: A library for support vector machines, " ACM Transactions on Intelligent Systems and Technology, vol. 2, pp. 27:1-27:27, 2011, software available at http://www.csie.ntu.edu.tw/cjlin/ libsvm.
-
(2011)
ACM Transactions on Intelligent Systems and Technology
, vol.2
-
-
Chang, C.-C.1
Lin, C.-J.2
-
29
-
-
0003822743
-
-
Cambridge University Engineering Department
-
S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, "The HTK book, " Cambridge University Engineering Department, vol. 3, 2002.
-
(2002)
The HTK Book
, vol.3
-
-
Young, S.1
Evermann, G.2
Kershaw, D.3
Moore, G.4
Odell, J.5
Ollason, D.6
Valtchev, V.7
Woodland, P.8
-
30
-
-
0035506942
-
Comparison of different implementations of mfcc
-
F. Zheng, G. Zhang, and Z. Song, "Comparison of different implementations of mfcc, " Journal of Computer Science and Technology, vol. 16, no. 6, pp. 582-589, 2001.
-
(2001)
Journal of Computer Science and Technology
, vol.16
, Issue.6
, pp. 582-589
-
-
Zheng, F.1
Zhang, G.2
Song, Z.3
-
31
-
-
84866410482
-
Searching for sounds: A demonstration of findsounds. Com and findsounds palette
-
S. V. Rice and S. M. Bailey, "Searching for sounds: A demonstration of Findsounds. com and Findsounds palette, " in Proc. the International Computer Music Conference, 2004, pp. 215-218.
-
(2004)
Proc. The International Computer Music Conference
, pp. 215-218
-
-
Rice, S.V.1
Bailey, S.M.2
|