-
1
-
-
0002161311
-
The quefrency alanysis of time series for echos: Cepstrum, pseudo-autocovariance, cross-cepstrum. and saphe-cracking
-
B. P. Bogert, M. J. R. Healry, and J. W. Tukey, "The quefrency alanysis of time series for echos: Cepstrum, pseudo-autocovariance, cross-cepstrum. and saphe-cracking," in Proc. Symp. Time Series Analysis. 1963, pp. 209-243.
-
(1963)
Proc. Symp. Time Series Analysis
, pp. 209-243
-
-
Bogert, B.P.1
Healry, M.J.R.2
Tukey, J.W.3
-
2
-
-
84953653667
-
Short-time spectrum and 'cepstrum' techniques for vocal-pitch detection
-
Feb
-
A. M. Noll, "Short-time spectrum and 'cepstrum' techniques for vocal-pitch detection,"J. Acoust. Soc. Amer., vol. 36, no. 2, pp. 296-302, Feb. 1964.
-
(1964)
J. Acoust. Soc. Amer
, vol.36
, Issue.2
, pp. 296-302
-
-
Noll, A.M.1
-
3
-
-
34548108312
-
On individuality in a dynamic measure of speech
-
in Japanese, Jul
-
S. Sagayama and F. Itakura, "On individuality in a dynamic measure of speech," in Proc. ASJ Conf. (in Japanese), Jul. 1979, pp. 589-590.
-
(1979)
Proc. ASJ Conf
, pp. 589-590
-
-
Sagayama, S.1
Itakura, F.2
-
4
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Aug
-
S. E. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust, Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
-
(1980)
IEEE Trans. Acoust, Speech, Signal Process
, vol.ASSP-28
, Issue.4
, pp. 357-366
-
-
Davis, S.E.1
Mermelstein, P.2
-
5
-
-
0007958098
-
Speech analysis synthesis system using log magnitude approximation filter, (in Japanese)
-
S. Imai and T. Kitamura, "Speech analysis synthesis system using log magnitude approximation filter," (in Japanese) Trans. IEICE Japan, vol. J61-A, no. 6, pp. 527-534, 1978.
-
(1978)
Trans. IEICE Japan
, vol.J61-A
, Issue.6
, pp. 527-534
-
-
Imai, S.1
Kitamura, T.2
-
6
-
-
0347337997
-
Multiple fundamental frequency estimation based on harmonicitv and spectral smoothness
-
Nov
-
A. Klapuri, "Multiple fundamental frequency estimation based on harmonicitv and spectral smoothness," IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 806-816, Nov. 2003.
-
(2003)
IEEE Trans. Speech Audio Process
, vol.11
, Issue.6
, pp. 806-816
-
-
Klapuri, A.1
-
7
-
-
0003182324
-
Organization of hierarchical perceptual sounds: Music scene analysis with autonomous processing modules and a quantitative information integration mechanism
-
K. Kashino, K. Nakadai, T. Kinoshita, and H. Tanaka, "Organization of hierarchical perceptual sounds: Music scene analysis with autonomous processing modules and a quantitative information integration mechanism," in Proc. Int. Joint Conf. Artif. Intell., 1995, vol. 1, pp. 158-164.
-
(1995)
Proc. Int. Joint Conf. Artif. Intell
, vol.1
, pp. 158-164
-
-
Kashino, K.1
Nakadai, K.2
Kinoshita, T.3
Tanaka, H.4
-
8
-
-
0026744657
-
Musical fundamental frequency tracking using a pattern recognition method
-
J. C. Brown, "Musical fundamental frequency tracking using a pattern recognition method," J. Acoust. Soc. Amer., vol. 92-3, pp. 1394-1402. 1992.
-
(1992)
J. Acoust. Soc. Amer
, vol.92 -3
, pp. 1394-1402
-
-
Brown, J.C.1
-
9
-
-
0033677009
-
A robust predominant-FO estimation method for real-time detection of melody and bass lines in CD recordings
-
Jun
-
M. Goto, "A robust predominant-FO estimation method for real-time detection of melody and bass lines in CD recordings," in Proc. IEEE Int. Conf. Acoust, Speech, Signal Process., Jun. 2000. vol. 2, pp. 757-760.
-
(2000)
Proc. IEEE Int. Conf. Acoust, Speech, Signal Process
, vol.2
, pp. 757-760
-
-
Goto, M.1
-
10
-
-
0034848863
-
A predominant-F0 estimation method for CD recordings: MAP estimation using EM algorithm for adaptive tone models
-
Sep
-
M. Goto. "A predominant-F0 estimation method for CD recordings: MAP estimation using EM algorithm for adaptive tone models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Sep. 2001, vol. 5, pp. 3365-3368.
-
(2001)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, vol.5
, pp. 3365-3368
-
-
Goto, M.1
-
11
-
-
4644242508
-
A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals
-
M. Goto, "A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals," Speech Commun., vol. 43, no. 4, pp. 311-329, 2004.
-
(2004)
Speech Commun
, vol.43
, Issue.4
, pp. 311-329
-
-
Goto, M.1
-
12
-
-
33947644951
-
Real-time pitch determination of one or more voices bv nonnegative matrix factorisation
-
F. Sha and F. Saul, "Real-time pitch determination of one or more voices bv nonnegative matrix factorisation," in Proc. Neural Inf. Process. Syst, 2004, pp. 1233-1240.
-
(2004)
Proc. Neural Inf. Process. Syst
, pp. 1233-1240
-
-
Sha, F.1
Saul, F.2
-
13
-
-
33947100466
-
Shifted non-negative matrix factorization for sound source separation
-
D. FitzGerald, M. Cranitch, and E. Coyle, "Shifted non-negative matrix factorization for sound source separation," in IEEE Workshop Statist. Signal Process., 2005, pp. 1132-1137.
-
(2005)
IEEE Workshop Statist. Signal Process
, pp. 1132-1137
-
-
FitzGerald, D.1
Cranitch, M.2
Coyle, E.3
-
14
-
-
33144463127
-
Unsupervised analysis of polyphonic music by sparse coding
-
Jan
-
S. A. Abdallah and M. D. Plumbley, "Unsupervised analysis of polyphonic music by sparse coding," IEEE Trans. Neural Netw., vol. 17, no. 1, pp. 179-196, Jan. 2006.
-
(2006)
IEEE Trans. Neural Netw
, vol.17
, Issue.1
, pp. 179-196
-
-
Abdallah, S.A.1
Plumbley, M.D.2
-
15
-
-
33744987389
-
Sparse and shift-invariant representations of music
-
Jan
-
T. Blumensath and M. Davies, "Sparse and shift-invariant representations of music," IEEE Trans. Audio, Speech, Lang. Process., vol. 14. no. 1, pp. 50-57, Jan. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.1
, pp. 50-57
-
-
Blumensath, T.1
Davies, M.2
-
16
-
-
17344378590
-
Bayesian harmonic models for musical pitch estimation and analysis
-
S. Godsill and M. Davy, "Bayesian harmonic models for musical pitch estimation and analysis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2002, vol. 2, pp. 1769-1772.
-
(2002)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, vol.2
, pp. 1769-1772
-
-
Godsill, S.1
Davy, M.2
-
17
-
-
0036297211
-
Separation of harmonic sounds using linear models for the overtone series
-
T. Virtanen and A. Klapuri, "Separation of harmonic sounds using linear models for the overtone series," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2002, vol. 2, pp. 1757-1760.
-
(2002)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, vol.2
, pp. 1757-1760
-
-
Virtanen, T.1
Klapuri, A.2
-
18
-
-
84872690534
-
Robust multipitch estimation for the analysis and manipulation of polyphonic musical signals
-
A. Klapuri, T. Virtanen, and J. Holm, "Robust multipitch estimation for the analysis and manipulation of polyphonic musical signals," in Proc. COST-G6 Conf. Digital Audio Effects, 2000, pp. 233-236.
-
(2000)
Proc. COST-G6 Conf. Digital Audio Effects
, pp. 233-236
-
-
Klapuri, A.1
Virtanen, T.2
Holm, J.3
-
19
-
-
19944419547
-
Extraction of multiple fundamental frequencies from polyphonic music
-
H. Kameoka, T. Nishimoto, and S. Sagayama, "Extraction of multiple fundamental frequencies from polyphonic music," Proc. Int. Congr. Acoust., pp. 59-62, 2004.
-
(2004)
Proc. Int. Congr. Acoust
, pp. 59-62
-
-
Kameoka, H.1
Nishimoto, T.2
Sagayama, S.3
-
20
-
-
4544303298
-
Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds
-
May
-
H. Kameoka, T. Nishimoto, and S. Sagayama, "Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., May 2004, vol. 4, pp. 297-300.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, vol.4
, pp. 297-300
-
-
Kameoka, H.1
Nishimoto, T.2
Sagayama, S.3
-
21
-
-
0036497684
-
Segregating information about the size and shape of the vocal tract using a time-domain auditory model: The stabilised wavelet-Mellin transform
-
T. Irino and R. D. Patterson, "Segregating information about the size and shape of the vocal tract using a time-domain auditory model: The stabilised wavelet-Mellin transform," Speech Commun., vol. 36, no. 3. pp. 181-203, 2002.
-
(2002)
Speech Commun
, vol.36
, Issue.3
, pp. 181-203
-
-
Irino, T.1
Patterson, R.D.2
-
22
-
-
85131821539
-
Mel-generalized cepstral analysis - A unified approach to speech spectral estimation
-
K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Mel-generalized cepstral analysis - A unified approach to speech spectral estimation," in Proc. Int. Conf. Spoken Lang. Process., 1994, pp. 1043-1046.
-
(1994)
Proc. Int. Conf. Spoken Lang. Process
, pp. 1043-1046
-
-
Tokuda, K.1
Kobayashi, T.2
Masuko, T.3
Imai, S.4
-
23
-
-
0043087956
-
-
in Japanese Elec. Commun. Lab, NTT, Tokyo, Japan, Tech. Rep. 3107
-
S. Saito and F. ltakura, "The theoretical consideration of statistically optimum methods for speech spectral density," (in Japanese) Elec. Commun. Lab., NTT, Tokyo, Japan, 1966, Tech. Rep. 3107.
-
(1966)
The theoretical consideration of statistically optimum methods for speech spectral density
-
-
Saito, S.1
ltakura, F.2
-
25
-
-
2442437071
-
RWC music database: Music genre database and musical instrument sound database
-
Oct
-
M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, "RWC music database: Music genre database and musical instrument sound database," in Proc. Int. Conf. Music Inf. Retrieval, Oct. 2003, pp. 229-230.
-
(2003)
Proc. Int. Conf. Music Inf. Retrieval
, pp. 229-230
-
-
Goto, M.1
Hashiguchi, H.2
Nishimura, T.3
Oka, R.4
-
26
-
-
64849094083
-
Iterative multipitch estimation algorithm for MAP specmurt analvsis,
-
Aug. 2006, 2006-MUS-66, pp
-
S. Saito, H. Kameoka, N. Ono, and S. Sagayama, "Iterative multipitch estimation algorithm for MAP specmurt analvsis," (in Japanese) IPSJ SIG Tech. Rep., Aug. 2006, vol. 2006-MUS-66, pp. 85-92.
-
in Japanese) IPSJ SIG Tech. Rep
, pp. 85-92
-
-
Saito, S.1
Kameoka, H.2
Ono, N.3
Sagayama, S.4
-
27
-
-
0141623871
-
RWC music database: Popular, classical, and jazz music database
-
Oct
-
M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, "RWC music database: Popular, classical, and jazz music database," in Proc. Int. Svmp. Music Inf. Retrieval, Oct. 2002, pp. 287-288.
-
(2002)
Proc. Int. Svmp. Music Inf. Retrieval
, pp. 287-288
-
-
Goto, M.1
Hashiguchi, H.2
Nishimura, T.3
Oka, R.4
-
28
-
-
50249173884
-
A multipitch analyzer based on harmonic temporal structured clustering
-
Mar
-
H. Kameoka, T. Nishimoto, and S. Sagayama, "A multipitch analyzer based on harmonic temporal structured clustering," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 982-994, Mar. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process
, vol.15
, Issue.3
, pp. 982-994
-
-
Kameoka, H.1
Nishimoto, T.2
Sagayama, S.3
-
29
-
-
11144277258
-
Automatic rhythm transcription from multiphonic MIDI signals
-
Oct
-
H. Takeda, T. Nishimoto, and S. Sagayama, "Automatic rhythm transcription from multiphonic MIDI signals," in Proc. Int. Conf. Music Inf. Retrieval, Oct. 2003, pp. 263-264.
-
(2003)
Proc. Int. Conf. Music Inf. Retrieval
, pp. 263-264
-
-
Takeda, H.1
Nishimoto, T.2
Sagayama, S.3
|