-
1
-
-
13444270474
-
LyricAlly: Automatic synchronization of acoustic musical signals and textual lyrics
-
ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
-
Y. Wang, M.-Y. Kan, T. L. Nwe, A. Shenoy, and J. Yin, "LyricAlly: Automatic synchronization of acoustic musical signals and textual lyrics," in Proc. 12th ACM Int. Conf. Multimedia, 2004, pp. 212-219. (Pubitemid 40211749)
-
(2004)
ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
, pp. 212-219
-
-
Wang, Y.1
Kan, M.-Y.2
Nwe, T.L.3
Shenoy, A.4
Yin, J.5
-
2
-
-
33846989449
-
Automatic lyrics alignment for Cantonese popular music
-
C. H. Wong, W. M. Szeto, and K. H. Wong, "Automatic lyrics alignment for Cantonese popular music," Multimedia Syst., vol. 12, no. 4-5, pp. 307-323, 2007.
-
(2007)
Multimedia Syst.
, vol.12
, Issue.4-5
, pp. 307-323
-
-
Wong, C.H.1
Szeto, W.M.2
Wong, K.H.3
-
4
-
-
85009187525
-
An automatic singing transcription system with multilingual singing lyric recognizer and robust melody tracker
-
C.-K. Wang, R.-Y. Lyu, and Y.-C. Chiang, "An automatic singing transcription system with multilingual singing lyric recognizer and robust melody tracker," in Proc. 8th Euro. Conf. Speech Commun. Technol. (Eurospeech'03), 2003, pp. 1197-1200.
-
(2003)
Proc. 8th Euro. Conf. Speech Commun. Technol. (Eurospeech'03)
, pp. 1197-1200
-
-
Wang, C.-K.1
Lyu, R.-Y.2
Chiang, Y.-C.3
-
5
-
-
84873581013
-
Phoneme recognition in popular music
-
M. Gruhne, K. Schmidt, and C. Dittmar, "Phoneme recognition in popular music," in Proc. 8th Int. Conf. Music Inf. Retrieval (ISMIR'07), 2007, pp. 369-370.
-
(2007)
Proc. 8th Int. Conf. Music Inf. Retrieval (ISMIR'07)
, pp. 369-370
-
-
Gruhne, M.1
Schmidt, K.2
Dittmar, C.3
-
6
-
-
76949102667
-
A modeling of singing voice robust to accompaniment sounds and its application to singer identification and vocal-timbre-similarity based music information retrieval
-
Mar
-
H. Fujihara, M. Goto, T. Kitahara, and H. G. Okuno, "A modeling of singing voice robust to accompaniment sounds and its application to singer identification and vocal-timbre-similarity based music information retrieval," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 638-648, Mar. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.3
, pp. 638-648
-
-
Fujihara, H.1
Goto, M.2
Kitahara, T.3
Okuno, H.G.4
-
7
-
-
4644242508
-
A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals
-
M. Goto, "A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals," Speech Commun., vol. 43, no. 4, pp. 311-329, 2004.
-
(2004)
Speech Commun.
, vol.43
, Issue.4
, pp. 311-329
-
-
Goto, M.1
-
8
-
-
84873632675
-
Evaluation of multiple-F0 estimation and tracking systems
-
M. Bay, A. F. Ehmann, and J. S. Downie, "Evaluation of multiple-F0 estimation and tracking systems," in Proc. 10th Int. Soc. Music Inf. Retrieval Conf. (ISMIR'09), 2009, pp. 315-320.
-
(2009)
Proc. 10th Int. Soc. Music Inf. Retrieval Conf. (ISMIR'09)
, pp. 315-320
-
-
Bay, M.1
Ehmann, A.F.2
Downie, J.S.3
-
9
-
-
48849095345
-
Melody transcription from music audio: Approaches and evaluation
-
May
-
G. E. Poliner, D. P. Ellis, A. F. Ehmann, E. Gómez, S. Streich, and B. Ong, "Melody transcription from music audio: Approaches and evaluation," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1247-1256, May 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.4
, pp. 1247-1256
-
-
Poliner, G.E.1
Ellis, D.P.2
Ehmann, A.F.3
Gómez, E.4
Streich, S.5
Ong, B.6
-
11
-
-
4544255234
-
Automatic detection and tracking of target singer in multi-singer music recordings
-
W.-H. Tsai and H.-M. Wang, "Automatic detection and tracking of target singer in multi-singer music recordings," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04), 2004, pp. 221-224.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04)
, pp. 221-224
-
-
Tsai, W.-H.1
Wang, H.-M.2
-
13
-
-
0018306059
-
A threshold selection method from gray-level histograms
-
N. Otsu, "A threshold selection method from gray-level histograms," IEEE Trans. Syst., Man, Cybern., vol. SMC-9, no. 1, pp. 62-66, Jan. 1979. (Pubitemid 9413341)
-
(1979)
IEEE Trans. Syst., Man, Cybern.
, vol.SMC-9
, Issue.1
, pp. 62-66
-
-
Otsu, N.1
-
15
-
-
0029406853
-
Adaptive cepstral analysis of speech
-
Nov
-
K. Tokuda, T. Kobayashi, and S. Imai, "Adaptive cepstral analysis of speech," IEEE Trans. Speech Audio Process., vol. 3, no. 6, pp. 481-489, Nov. 1995.
-
(1995)
IEEE Trans. Speech Audio Process.
, vol.3
, Issue.6
, pp. 481-489
-
-
Tokuda, K.1
Kobayashi, T.2
Imai, S.3
-
16
-
-
29144452868
-
Pitch-dependent identification of musical instrument sounds
-
DOI 10.1007/s10489-005-4612-1
-
T. Kitahara, M. Goto, and H. G. Okuno, "Pitch-dependent identification of musical instrument sounds," Appl. Intell., vol. 23, no. 3, pp. 267-275, 2005. (Pubitemid 41801688)
-
(2005)
Applied Intelligence
, vol.23
, Issue.3
, pp. 267-275
-
-
Kitahara, T.1
Goto, M.2
Okuno, H.G.3
-
17
-
-
51449085941
-
-
IPSJ SIG Tech. Rep. no. 90
-
H. Kameoka, M. Goto, and S. Sagayama, "Selective amplifier of periodic and non-periodic components in concurrent audio signals with spectral control envelopes," 2006, vol. 2006, pp. 77-84, IPSJ SIG Tech. Rep., no. 90.
-
(2006)
Selective Amplifier of Periodic and Non-Periodic Components in Concurrent Audio Signals with Spectral Control Envelopes
, vol.2006
, pp. 77-84
-
-
Kameoka, H.1
Goto, M.2
Sagayama, S.3
-
18
-
-
0030676908
-
Accurate keyword spotting using strictly lexical fillers
-
R. E. Méliani, "Accurate keyword spotting using strictly lexical fillers," in Proc. 1997 IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'97), 1997, pp. 907-910.
-
(1997)
Proc. 1997 IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'97)
, pp. 907-910
-
-
Méliani, R.E.1
-
19
-
-
0030715925
-
A segment-based wordspotter using phonetic filler models
-
A. S. Manos and V. W. Zue, "A segment-based wordspotter using phonetic filler models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'97), 1997, pp. 899-902.
-
(1997)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'97)
, pp. 899-902
-
-
Manos, A.S.1
Zue, V.W.2
-
20
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
-
Apr
-
J. L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.2
, pp. 291-298
-
-
Gauvain, J.L.1
Lee, C.-H.2
-
21
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
-
(1995)
Comput. Speech Lang.
, vol.9
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
22
-
-
0019053271
-
Comparison of parametric representation for monosyllabic word recognition
-
S. B. Davis and P. Mermelstein, "Comparison of parametric representation for monosyllabic word recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980. (Pubitemid 11464930)
-
(1980)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.ASSP-28
, Issue.4
, pp. 357-366
-
-
Davis, S.B.1
Mermelstein, P.2
-
23
-
-
0141623871
-
RWC music database: Popular, classical, and jazz music databases
-
Oct
-
M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, "RWC music database: Popular, classical, and jazz music databases," in Proc. 3rd Int. Conf. Music Inf. Retrieval (ISMIR'02), Oct. 2002, pp. 287-288.
-
(2002)
Proc. 3rd Int. Conf. Music Inf. Retrieval (ISMIR'02)
, pp. 287-288
-
-
Goto, M.1
Hashiguchi, H.2
Nishimura, T.3
Oka, R.4
-
24
-
-
85009067482
-
Recent progress of open-source LVCSR engine Julius and Japanese model repository: Software of continuous speech recognition consortium
-
T. Kawahara, A. Lee, K. Takeda, and K. Shikano, "Recent progress of open-source LVCSR engine Julius and Japanese model repository: Software of continuous speech recognition consortium," in Proc. 6th Int. Conf. Spoken Lang. Process. (Interspeech'04 ICSLP), 2004.
-
(2004)
Proc. 6th Int. Conf. Spoken Lang. Process. (Interspeech'04 ICSLP)
-
-
Kawahara, T.1
Lee, A.2
Takeda, K.3
Shikano, K.4
-
25
-
-
85111262831
-
Applying conditional random fields to Japanese morphological analysis
-
T. Kudo, K. Yamamoto, and Y. Matsumoto, "Applying conditional random fields to Japanese morphological analysis," in Proc. Conf. Empirical Methods in Natural Lang. Process., 2004, pp. 230-237.
-
(2004)
Proc. Conf. Empirical Methods in Natural Lang. Process.
, pp. 230-237
-
-
Kudo, T.1
Yamamoto, K.2
Matsumoto, Y.3
-
26
-
-
33646802604
-
An auto-regressive, non-stationary excited signal parameter estimation method and an evaluation of a singing-voice recognition
-
DOI 10.1109/ICASSP.2005.1415094, 1415094, 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Speech Processing
-
A. Sasou, M. Goto, S. Hayamizu, and K. Tanaka, "An auto-regressive, non-stationary excited signal parameter estimation method and an evaluation of a singing-voice recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'05), 2005, pp. I-237-I-240. (Pubitemid 43761134)
-
(2005)
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
, vol.I
-
-
Sasou, A.1
Goto, M.2
Hayamizu, S.3
Tanaka, K.4
-
27
-
-
34547519365
-
Active music listening interfaces based on signal processing
-
DOI 10.1109/ICASSP.2007.367351, 4218382, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
-
M. Goto, "Active music listening interfaces based on signal processing," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'07), 2007, pp. IV-1441-IV-1444. (Pubitemid 47178651)
-
(2007)
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
, vol.4
-
-
Goto, M.1