-
1
-
-
0037401304
-
Speech/music discrimination using entropy and dynamism features in a HMM classification framework
-
Ajmera J., McCowan I., and Bourlard H. Speech/music discrimination using entropy and dynamism features in a HMM classification framework. Speech Communication 40 (2003) 351-363
-
(2003)
Speech Communication
, vol.40
, pp. 351-363
-
-
Ajmera, J.1
McCowan, I.2
Bourlard, H.3
-
2
-
-
33947319848
-
Application of fisher linear discriminant analysis to speech/music classification
-
Alexandre-Cortizo, E., Rosa-Zurera, M., Lopez-Ferreras, F., 2005. Application of fisher linear discriminant analysis to speech/music classification. In: IEEE Eurocon, pp. 1666-1669.
-
(2005)
IEEE Eurocon
, pp. 1666-1669
-
-
Alexandre-Cortizo, E.1
Rosa-Zurera, M.2
Lopez-Ferreras, F.3
-
4
-
-
84987906938
-
ANTS: Le systFme de transcription automatique du LORIA
-
Brun, A., Cerisara, C., Fohr, D., Illina, I., Langlois, D., Mella, O., Smaili, K., 2004. ANTS: le systFme de transcription automatique du LORIA. In: JournTes d'Etude sur la Parole - JEP'04.
-
(2004)
JournTes d'Etude sur la Parole - JEP'04
-
-
Brun, A.1
Cerisara, C.2
Fohr, D.3
Illina, I.4
Langlois, D.5
Mella, O.6
Smaili, K.7
-
5
-
-
0032638667
-
A comparison of features for speech, music discrimination
-
ICASSP, pp
-
Carey, M.J., Parris, E.S., Lloyd-Thomas, H., 1999. A comparison of features for speech, music discrimination. In: Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, ICASSP, pp. 149-152.
-
(1999)
Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing
, pp. 149-152
-
-
Carey, M.J.1
Parris, E.S.2
Lloyd-Thomas, H.3
-
6
-
-
0034853025
-
Robust singing detection in speech/music discriminator design
-
ICASSP, pp
-
Chou, W., Gu, L., 2001. Robust singing detection in speech/music discriminator design. In: Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, ICASSP, pp. 865-868.
-
(2001)
Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing
, pp. 865-868
-
-
Chou, W.1
Gu, L.2
-
9
-
-
44949087900
-
A wavelet-based parametrization for speech/music segmentation
-
Didiot, E., Illina, I., Mella, O., Haton, J.-P., Fohr, D., 2006. A wavelet-based parametrization for speech/music segmentation. In: Proc. Int. Conf. on Spoken Language Processing, ICSLP, pp. 653-656.
-
(2006)
Proc. Int. Conf. on Spoken Language Processing, ICSLP
, pp. 653-656
-
-
Didiot, E.1
Illina, I.2
Mella, O.3
Haton, J.-P.4
Fohr, D.5
-
10
-
-
70349241267
-
Speech/music discrimination based on wavelets for broadcast programs
-
Didiot, E., Illina, I., Mella, O., Haton, J.-P., Fohr, D., 2006. Speech/music discrimination based on wavelets for broadcast programs. In: IEEE International Conference on Signal Processing and Multimedia Applications, pp. 151-156.
-
(2006)
IEEE International Conference on Signal Processing and Multimedia Applications
, pp. 151-156
-
-
Didiot, E.1
Illina, I.2
Mella, O.3
Haton, J.-P.4
Fohr, D.5
-
12
-
-
33745182925
-
Automatic music genre classification using second-order statistical measures for the prescriptive approach
-
Ezzaidi, H., Rouat, J., 2005. Automatic music genre classification using second-order statistical measures for the prescriptive approach. In: Proc. European Conf. on Speech Communication and Technology, pp. 141-144.
-
(2005)
Proc. European Conf. on Speech Communication and Technology
, pp. 141-144
-
-
Ezzaidi, H.1
Rouat, J.2
-
13
-
-
70349263032
-
Segmentation en macro-classes acoustiques d'Tmissions radiophoniques dans le cadre d'ESTER
-
Fredouille, C., Matrouf, D., Linares, G., Nocera, P., 2004. Segmentation en macro-classes acoustiques d'Tmissions radiophoniques dans le cadre d'ESTER. In: JournTes d'Etude sur la Parole - JEP04.
-
(2004)
JournTes d'Etude sur la Parole
-
-
Fredouille, C.1
Matrouf, D.2
Linares, G.3
Nocera, P.4
-
15
-
-
0036567851
-
The LIMSI broadcast news transcription system
-
Gauvain J.-L., Lamel L., and Adda G. The LIMSI broadcast news transcription system. Speech Communication 37 1 (2002) 89-108
-
(2002)
Speech Communication
, vol.37
, Issue.1
, pp. 89-108
-
-
Gauvain, J.-L.1
Lamel, L.2
Adda, G.3
-
16
-
-
0002725741
-
The LIMSI 1998 Hub-4E transcription system
-
February, pp
-
Gauvain, J.L., Lamel, L., Adda, G., Jardino, M., 1999. The LIMSI 1998 Hub-4E transcription system. In: Proc. DARPA Broadcast News Transcription Workshop, February, pp. 99-104.
-
(1999)
Proc. DARPA Broadcast News Transcription Workshop
, pp. 99-104
-
-
Gauvain, J.L.1
Lamel, L.2
Adda, G.3
Jardino, M.4
-
17
-
-
0034842305
-
Integration of fixed and multiple resolution analysis in a speech recognition system
-
ICASSP, pp
-
Gemello, R., Albesano, D., Moisa, L., De Mori, R., 2001. Integration of fixed and multiple resolution analysis in a speech recognition system. In: Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, ICASSP, pp. 121-124.
-
(2001)
Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing
, pp. 121-124
-
-
Gemello, R.1
Albesano, D.2
Moisa, L.3
De Mori, R.4
-
18
-
-
70349230534
-
ESTER, une campagne d'Tvaluation des systFmes d'indexation automatique d'Tmissions radiophoniques en francais
-
Gravier, G., Bonastre, J.F., Geoffrois, E., Galliano, S., Mc Tait, K., Choukri, K., 2004. ESTER, une campagne d'Tvaluation des systFmes d'indexation automatique d'Tmissions radiophoniques en francais. In: JournTes d'Etude sur la Parole - JEP04.
-
(2004)
JournTes d'Etude sur la Parole
-
-
Gravier, G.1
Bonastre, J.F.2
Geoffrois, E.3
Galliano, S.4
Mc Tait, K.5
Choukri, K.6
-
20
-
-
0032629671
-
The teager energy based feature parameters for robust speech recognition in car noise
-
ICASSP, pp
-
Jabloun, F., Enis Cetin, A., 1999. The teager energy based feature parameters for robust speech recognition in car noise. In: Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, ICASSP, pp. 273-276.
-
(1999)
Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing
, pp. 273-276
-
-
Jabloun, F.1
Enis Cetin, A.2
-
22
-
-
20444491279
-
Automatic classification of speech and music using neural networks
-
Kahn, M., Al-Khatib, W., Moinuddin, M., 2004. Automatic classification of speech and music using neural networks. In: Proc. ACM Int. Workshop on Multimedia Databases, pp. 94-99.
-
(2004)
Proc. ACM Int. Workshop on Multimedia Databases
, pp. 94-99
-
-
Kahn, M.1
Al-Khatib, W.2
Moinuddin, M.3
-
24
-
-
34247239858
-
Speech/music discrimination based on spectral peak analysis and multi-layer perceptron
-
Keum, J.S., Lee, H.S., 2006. Speech/music discrimination based on spectral peak analysis and multi-layer perceptron. In: International Conference on Hybrid Information Technology, vol. 2, pp. 56-61.
-
(2006)
International Conference on Hybrid Information Technology
, vol.2
, pp. 56-61
-
-
Keum, J.S.1
Lee, H.S.2
-
25
-
-
33746879922
-
Machine learning-based classification of speech and music
-
Khan M., and Al-Khatib W.G. Machine learning-based classification of speech and music. Multi-Media Systems 12 (2006) 55-67
-
(2006)
Multi-Media Systems
, vol.12
, pp. 55-67
-
-
Khan, M.1
Al-Khatib, W.G.2
-
26
-
-
0034845044
-
-
Kim, I.J., Yang, S.I., Kwon, Y., 2001. Speech Enhancement using Adaptive Wavelet Shrinkage. In: ISIE-2001, 1, pp. 501-504.
-
Kim, I.J., Yang, S.I., Kwon, Y., 2001. Speech Enhancement using Adaptive Wavelet Shrinkage. In: ISIE-2001, vol. 1, pp. 501-504.
-
-
-
-
30
-
-
0036816475
-
Content analysis for audio classification and segmentation
-
Lu, L., Zhang, H.-J., Jiang, H., 2002. Content analysis for audio classification and segmentation. In: IEEE Transactions on Speech and Audio Processing, vol. 10(7), pp. 504-516.
-
(2002)
IEEE Transactions on Speech and Audio Processing
, vol.10
, Issue.7
, pp. 504-516
-
-
Lu, L.1
Zhang, H.-J.2
Jiang, H.3
-
34
-
-
13144306118
-
A speech/music discriminator based on RMS and zero-crossings
-
Panagiotakis, C., Tziritas, G., 2005. A speech/music discriminator based on RMS and zero-crossings. In: IEEE Transaction on Multimedia, vol. 7(1), pp. 155-166.
-
(2005)
IEEE Transaction on Multimedia
, vol.7
, Issue.1
, pp. 155-166
-
-
Panagiotakis, C.1
Tziritas, G.2
-
35
-
-
0036288612
-
Speech and music classification in audio documents
-
ICASSP, pp
-
Pinquier, J., Senac, C., Andre-Obrecht, R., 2002. Speech and music classification in audio documents. In: Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, ICASSP, pp. 4164-4167.
-
(2002)
Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing
, pp. 4164-4167
-
-
Pinquier, J.1
Senac, C.2
Andre-Obrecht, R.3
-
37
-
-
78651558871
-
Segmentation Parole/Musique pour la transcription automatique
-
Razik, J., Fohr, D., Mella, O., Parlangeau-VallFs, N., 2004. Segmentation Parole/Musique pour la transcription automatique. In: JournTes d'Etudes sur la Parole.
-
(2004)
JournTes d'Etudes sur la Parole
-
-
Razik, J.1
Fohr, D.2
Mella, O.3
Parlangeau-VallFs, N.4
-
38
-
-
70349244394
-
Comparison of two speech/music segmentation systems for audio indexing on the web
-
Razik, J., Senac, C., Fohr, D., Mella, O., Parlangeau-Valles, N., 2003. Comparison of two speech/music segmentation systems for audio indexing on the web. In: Proc. Multi Conference on Systemics, Cybernetics and Informatics.
-
(2003)
Proc. Multi Conference on Systemics, Cybernetics and Informatics
-
-
Razik, J.1
Senac, C.2
Fohr, D.3
Mella, O.4
Parlangeau-Valles, N.5
-
39
-
-
27644502441
-
Image compression from DCT to wavelets: a review
-
Saha S. Image compression from DCT to wavelets: a review. ACM Crossroads 6 3 (2000) 644-651
-
(2000)
ACM Crossroads
, vol.6
, Issue.3
, pp. 644-651
-
-
Saha, S.1
-
40
-
-
0033688848
-
High resolution speech feature parameterization for monophone-based stressed speech recognition
-
Sarikaya R., and Hansen J.H.L. High resolution speech feature parameterization for monophone-based stressed speech recognition. IEEE Signal Processing Letters 7 7 (2000) 182-185
-
(2000)
IEEE Signal Processing Letters
, vol.7
, Issue.7
, pp. 182-185
-
-
Sarikaya, R.1
Hansen, J.H.L.2
-
42
-
-
0030648077
-
Construction and evaluation of a robust multifeature speech/music discriminator
-
ICASSP, pp
-
Scheirer, E., Slaney, M., 1997. Construction and evaluation of a robust multifeature speech/music discriminator. In: Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, ICASSP, pp. 1331-1334.
-
(1997)
Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing
, pp. 1331-1334
-
-
Scheirer, E.1
Slaney, M.2
-
43
-
-
45849121392
-
Detection of speech and music based on spectral tracking
-
Taniguchi, T., Tohyama, M., Katsuhiko, S., 2008. Detection of speech and music based on spectral tracking. In: Speech Communication, vol. 50, pp. 547-563.
-
(2008)
Speech Communication
, vol.50
, pp. 547-563
-
-
Taniguchi, T.1
Tohyama, M.2
Katsuhiko, S.3
-
44
-
-
0002751623
-
Segment generation and clustering in the HTK broadcast news transcription system
-
Hain, T., Johnson, S.E., Tuerk, A., Woodland, P.C., Young, S.J., 1998. Segment generation and clustering in the HTK broadcast news transcription system. In: Proc. 1998 DARPA Broadcast News Transcription and Understanding Workshop, pp. 133-137.
-
(1998)
Proc. 1998 DARPA Broadcast News Transcription and Understanding Workshop
, pp. 133-137
-
-
Hain, T.1
Johnson, S.E.2
Tuerk, A.3
Woodland, P.C.4
Young, S.J.5
-
45
-
-
84892175859
-
Automatic speech recognition based on ceptral coefficients and a mel-based discrete energy operator
-
ICASSP, pp
-
Tolba, H., O'Shaughnessy, D., 1998. Automatic speech recognition based on ceptral coefficients and a mel-based discrete energy operator. In: Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, ICASSP, pp. 973-976.
-
(1998)
Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing
, pp. 973-976
-
-
Tolba, H.1
O'Shaughnessy, D.2
-
47
-
-
16244420091
-
Multigroup classification of audio signals using time-frequency parameters
-
Umapathy K., Krishnan S., and Jimaa S. Multigroup classification of audio signals using time-frequency parameters. IEEE Transaction on Multimedia 7 2 (2005) 308-315
-
(2005)
IEEE Transaction on Multimedia
, vol.7
, Issue.2
, pp. 308-315
-
-
Umapathy, K.1
Krishnan, S.2
Jimaa, S.3
-
49
-
-
0343697653
-
-
CRC Press, LLC
-
Wold E., Blum T., Keislar D., and Wheater J. Classification, Search and Retireval of Audio. CRC Handbook of Multimedia Computing (1999), CRC Press, LLC
-
(1999)
Classification, Search and Retireval of Audio. CRC Handbook of Multimedia Computing
-
-
Wold, E.1
Blum, T.2
Keislar, D.3
Wheater, J.4
-
50
-
-
0003822743
-
-
Cambridge, England, Entropic Ltd, Microsoft
-
Young, S.J., Kershaw, D., Odell, J., Ollason, D., Valtchev, V., Woodland, P., 1995. The HTK Book. Cambridge, England, Entropic Ltd., Microsoft.
-
(1995)
The HTK Book
-
-
Young, S.J.1
Kershaw, D.2
Odell, J.3
Ollason, D.4
Valtchev, V.5
Woodland, P.6
-
51
-
-
0035340677
-
Audio content analysis for online audiovisual data segmentation and classification
-
Zhang, T., Kuo, C.-C.J., 2001. Audio content analysis for online audiovisual data segmentation and classification. In: IEEE Transactions on Speech and Audio Processing, vol. 9(4), pp. 441-457.
-
(2001)
IEEE Transactions on Speech and Audio Processing
, vol.9
, Issue.4
, pp. 441-457
-
-
Zhang, T.1
Kuo, C.-C.J.2
|